The personal disquiet of

Mark Boulton

February 6th, 2006

Five Simple Steps to Typesetting on the web: Dashes

{title}In this install­ment I’ll be talk­ing about three dashes which are often used, but fre­quently mis­used. The Hyphen, the En Dash and the Em Dash.

The Hyphen

{title}The Hyphen, or the ‘hyphen-minus’, is what you get when you press the key next to zero (stand­ard qwerty key­board, well mine any­way, for all those ped­ants out there). It’s the shortest of the three and is often used incor­rectly, I’ll look at the most com­mon cor­rect uses of the hyphen first before mov­ing on to the dashes it is often used, incor­rectly, to replace.

There are two types of Hyphen: The ‘soft’ hyphen and the ‘hard’ hyphen. Some­times they are dif­fer­ent lengths, but this depends on the typeface.

Hard hyphen.

The hard hyphen joins two words together any­where they are posi­tioned on the same line. For example, ‘run-of-the-mill’. It’s set closed up (which means no space either side).

Soft hyphen.

The soft hyphen indic­ates where a word has been split at the end of a line. Argu­ably, there’s very little use for the soft hyphen on the web when the user has so much con­trol over the present­a­tion of the type.

There are many gram­mat­ical rules asso­ci­ated with hyphens, which dif­fer greatly from lan­guage to lan­guage. For Brit­ish type­set­ting, and the Eng­lish Lan­guage, I’d recom­mend get­ting your­self a copy of the Oxford Guide to Style (the old Hart’s Rules).

The En Dash or En Rule

{title}The en dash is one en in length. It’s slightly longer than a hyphen and half the width of an em dash. Em and en are typo­graphic meas­ures based on point size. An em is a equal to the size of the set type (eg. 12pt) and an en is half that.

  1. An en dash is used closed up in-between ele­ments that show a range. Eg. Monday–Sunday, 1985–2005. It is also used when the end ele­ment is not known: Joe Bloggs (1984–)*.
  2. The en dash can be used to show the mean­ing of to and from. Eg. on–off switch.
  3. The en dash can also be used to join com­pound adject­ives which include mul­tiple words or hyphens already. In this case the en dash cla­ri­fies what is grouped with what. Eg. high–priority–high–pressure tasks.
  4. In Uni­code, the en dash is U+2013 (decimal 8211). In HTML, the numeric forms are – and –. The HTML entity is –

The Em Dash or Em Rule

{title}The em dash, as it’s name sug­gests, is one em in width. The em dash is per­haps the char­ac­ter that has suffered most over recent years. Fre­quently replaced by the hyphen, or the that relic from type­writer days, the double hyphen ( — )**, I think it’s about time we give this little fella the time of day.

Once again, there are dif­fer­ing gram­mat­ical usages depend­ing on lan­guage. In Brit­ish and North Amer­ican type­set­ting there are a few simple rules:

  1. Use the em dash closed up in writ­ten dia­logue to indic­ate an inter­rup­tion. Eg. ‘What a load of—’, but his words were lost on her.
  2. It can also be used either side of an interruption—like this one—and is set closed up.
  3. In Uni­code, the em dash is U+2014 (decimal 8212). In HTML, the numeric forms are — and —. The HTML entity is —.

It’s worth not­ing that em dash usage is incon­sist­ent, not only across lan­guages, but also across house styles. The most com­mon replace­ments are an en dash and the hyphen, both set with a space (or a hair space) either side.

* It’s com­mon prac­tice in North Amer­ican type­set­ting to use an em dash for this purpose.

** The usage of this is of course valid on a type­writer where, as with most mono­spaced fonts, the hyphens, em and en dashes all are sim­ilar length.

I want to thank Phil Wright for his help on this art­icle. The man knows his type.

The next steps

When does print design mat­ter on the web? When you use a print stylesheet of course. The next three art­icles will doc­u­ment pro­duc­tion of a print stylesheet from a print design perspective.

Grid, meas­ure, type size and lead­ing, orphans and wid­ows. The lot.

24 Responses to “Five Simple Steps to Typesetting on the web: Dashes”

  1. Phil said on: February 6th, 2006 at 2:16 pm

    Tread­ing the fine line between type­set­ting and copy­writ­ing here. Which I think is a good thing. As a designer/typesetter/jack-of-all-trades, I have found the time spent learn­ing proper gram­mar and style has been invaluable. 

    Punc­tu­ation and other typo­graphic marks are often neg­lected in web design but they are there to cre­ate land­marks and give dir­ec­tion to the reader. 

    When you con­sider how bad most copy provided to web design­ers is?from a view­point of on-screen use?it is a real strength if web design­ers are able to edit, repunc­tu­ate and if need be rewrite­copy for our medium. 

    Hope­fully these art­icles will encour­age people to look at the words a little more closely. 

    Mark refers to The Oxford Guide to Style, if you’re based in the UK this is a must have, if you’re in the US look at the Chicago Manual. Along­side these it is well worth look­ing at ?The Ele­ments of Style? by Strunck & White and ?The Ele­ments of Typo­graphic Style? by Robert Bring­hurst. All excel­lent, all eye-opening, all essen­tial reading.

  2. paul haine said on: February 6th, 2006 at 3:08 pm

    There’s a good art­icle on A List Apart on this sub­ject that I’d recom­mend along­side what’s already been writ­ten here: The Trouble With EM ‘n EN (and Other Shady Char­ac­ters)

    Soft hyphens could be use­ful if you’re writ­ing long words in nar­row places, but sadly browser sup­port isn’t quite there yet.

  3. Paul D said on: February 6th, 2006 at 3:18 pm

    “Tread­ing the fine line between type­set­ting and copy­writ­ing here.” 

    That’s the trick, isn’t it? To be a good online writer, you also have to be a good editor and typo­grapher. Magazine and book writers rely on oth­ers to know such details, so self-publishing online requires quite a bit more knowledge.

  4. Chris said on: February 6th, 2006 at 3:23 pm

    For those that want a punc­tu­ation geek’s laugh at these mat­ters while still man­aging to learn and remem­ber all these rules I can’t recom­mend Lynne Truss’ book, Eats, Shoots & Leaves highly enough.

    Hand­ily, she’s a Brit so the book is good for those folk but the US edi­tion cov­ers both styles. 

    Lastly, while I do my level best to remem­ber and prop­erly imple­ment all of these rules—and many more—when I writeit’s good to remem­ber that the Eng­lish lan­guage is one of the most mal­le­able of the world’s lan­guages and that, more than likely, with the pas­sage of time and dying away of stick­lers many of our favor­ite gram­mar and typo­graphic con­ven­tions will go the way of the dodo. 

    Of course, know­ing that any­one read­ing this is of a stick­ler nature I’m poten­tially embar­rased by the num­ber of mis­takes I made above.  Such is life.

  5. gochess said on: February 6th, 2006 at 6:56 pm

    How those entit­ies appear on vari­ous brow­ers using dif­fer­ent ISO cod­ings is very prob­lem­atic.  Ditto for trans­fer­ing such char­ac­ters from PDF doc­u­ments to other applications. 

    Appendix F, Char­ac­ter Entit­ies, con­tains a com­plete table of other obscure HTML codes [in J. Niederst, WEB DESIGN in a NUTSHELL, second edi­tion, see esp. p. 583]. 

    Also, using a double hypen, instead of em, in some wikis, may cause the “strike out” of words.

  6. 1981 said on: February 6th, 2006 at 8:01 pm

    Help­ful places are news­pa­per Style Guides too:

    Guard­ian Style Guide

    Times Style Guide

  7. Edward Clarke said on: February 6th, 2006 at 8:54 pm

    Good to see some thought going into print­ing off the web. Inter­est­ing stuff.

  8. Kim Siever said on: February 7th, 2006 at 12:28 am

    I think you for­got an en dash, Phil. :)

  9. Phil said on: February 7th, 2006 at 10:24 am

    Where? I know I missed put­ting com­mas around a clause. Typ­ing too fast.  :)

  10. Joe said on: February 7th, 2006 at 12:10 pm

    “In Brit­ish and North Amer­ican type­set­ting there are a few simple rules: […] 2. It can also be used either side of an interruption?like this one?and is set closed up.” 

    I think it’s false to call the closing-up a rule since, as you say later on, it var­ies enorm­ously across Brit­ish and US house styles. Last night I put it to the test by look­ing at around a dozen books from main­stream pub­lish­ers (Pen­guin, Virago, Faber, Ran­dom House) taken ran­domly from my book­shelves. The vast major­ity put spaces ? like this ? around the inter­rup­tion dashes. 

    In my straw poll, the only two books I found which fol­lowed the Oxford style guide’s rule on clos­ing up were Ellmann’s bio­graphy of James Joyce and Vir­ginia Woolf’s Mrs Dal­lo­way. Both were, unsur­pris­ingly, pub­lished by Oxford Uni­ver­sity Press. 

    All the oth­ers I looked at put spaces around the dashes, apart from Sterne’s Tris­tram Shandy (Pen­guin) and Pynchon’s Gravity’s Rain­bow (Virago), which have their own rather crazy idio­syn­crasies. These were post/modernist works which took, shall we say, a relaxed view towards lit­er­ary rules. The Sterne book???notably???used what looked to me like ememem dashes!

    While con­sist­ent style is clearly valu­able within a pub­lic­a­tion or pub­lish­ing house, I’d be wary of align­ing one­self too closely to the dogma of another ? even a well-respected source such as the Oxford rules ? par­tic­u­larly when it’s rejec­ted by the vast major­ity of other book and news­pa­per pub­lish­ers (includ­ing the Guard­ian and Times, men­tioned above). The fact that one author­ity uses a mode doesn’t make it a rule.

  11. Mark Boulton said on: February 7th, 2006 at 12:20 pm

    Good point Joe. I noticed this last night on going through some early Pen­guin books, some of which dated to the early sixties. 

    Like you say, con­sist­ent style is per­haps more important.

  12. Phil said on: February 7th, 2006 at 1:28 pm

    The prob­lem is that many of the style guides out there are press spe­cific and, as such, have their own industry quirks. The OUP style guide is, at the very least, far-reaching and applic­able to many situ­ations. Pen­guin do a series of style guides and writers manu­als but none to the same degree of depth. 

    Internal con­sist­ency is key, which is why these things are called house styles. You should be dog­matic, just be con­sist­ent with your dogma.  :)

  13. Chris said on: February 7th, 2006 at 3:53 pm

    just be con­sist­ent with your dogma.  :)

    Else, it will just piss all over the place.

  14. Hugh said on: February 7th, 2006 at 11:48 pm

    You seem to have omit­ted the fol­low­ing: A throw­back to the days of type­writers, two hyphens–like these–were used to make a dash because true dash char­ac­ters are not avail­able on a type­writer. But this is a major no-no in type­set­ting and desktop pub­lish­ing, where em dashes?like these?should be used instead. An em is a unit of meas­ure equal to the point size you are using. For example, using 10-point type, an em dash would be approx­im­ately 10 points (approx. 0.14 inches) wide, but this is depend­ent on the indi­vidual typeface. Actu­ally, this is prob­ably the widest it would be. Many typefaces have em dashes that are slightly nar­rower than a full em, but still con­sid­er­ably wider than a hyphen. 

    Hyphens are used to hyphen­ate words and sep­ar­ate phone num­bers. They should never be used as dashes. A dash, more spe­cific­ally, an em dash, is a form of punc­tu­ation used to off­set clauses in a sentence.

    An en dash is typ­ic­ally half the length of an em dash (some­times slightly wider than half, depend­ing on the typeface) but still longer than a hyphen. En dashes are primar­ily used to denote dur­a­tion, as in 8:00?5:00, or August 12?14, or Aardvark?Adelaide. Some people use them to sep­ar­ate phone num­bers, but I think they are too large and look awk­ward for this. 

    The period is prefer­able to the space, but this is purely a mat­ter of per­sonal taste. Phone num­bers sep­ar­ated by spaces are quite com­mon in Europe, less so in the U.S.

    When cre­at­ing em and en dashes, you can add space before and after the dash, or not. I prefer to add either a small amount of space (usu­ally via the application?s kern­ing com­mands), or no space at all. Page lay­out pro­grams such as Adobe Page­Maker and QuarkX­Press let you adjust space between char­ac­ters at a micro level (i.e., kern­ing), as do illus­tra­tion pro­grams such as Illus­trator and Free­Hand, but many word pro­cessors are lim­ited in this regard and only allow you to add space via the Space Bar. The nor­mal space cre­ated with the Space Bar seems a bit too wide for my tastes, but you may find it accept­able. Gen­er­ally speak­ing, the wider the column of text, the more space you can insert before and after dashes (up to a full space). In a typ­ical word pro­cessed doc­u­ment­such as a memo or let­ter, for example, where your column width might be as great as 5 or 6 inches, insert­ing a nor­mal space before and after a dash looks just fine. But in a doc­u­ment­with nar­rower columns, say a news­let­ter with three columns of text, this much space will stand out, and your dashes will resemble diving boards. In this case, I would sug­gest adding no space at all, and simply use the program?s kern­ing com­mands (if avail­able) to tweak the space as neces­sary.

    In Page­Maker, press the Cmd key in con­junc­tion with the Left and Right Arrow keys to decrease and increase kern­ing (the amount of space between char­ac­ters) respect­ively. Hold down the Shift and Cmd keys if you want to increase or decrease kern­ing in smal­ler units. In QuarkX­Press, press Cmd-Shift in con­junc­tion with the Left and Right Bracket keys to kern text. Hold down the Cmd, Shift, and Option keys if you want to kern in smal­ler units. In Page­Maker, a value of about 0.1 should suf­fice; in QuarkX­Press, con­sider a value of about 20 before and after a dash.

    If you do add a full space by press­ing the Space Bar, it is import­ant to add it before and after the dash. Some­times people will add a space after a dash to break a line. Then, if they edit the text or change the lay­out, the dash with a space after it but no space before will appear rather awk­ward look­ing.

    To cre­ate an em dash in most Mac applic­a­tions, press Shift-Option-hyphen. To cre­ate an en dash, press Option-hyphen.

    To cre­ate an em dash in most Win­dows applic­a­tions, press Alt-0151. To cre­ate an en dash, press Alt-0150.

    Some expert font sets con­tain a three-quarter em dash, but in real­ity, most em dashes are about this wide any­way. That is, most em dashes are not one em in width, and depend on the indi­vidual typeface. The three-quarter em dash can be sub­sti­tuted for the em dash. The two are inter­change­able. But it is too wide to be used when you would nor­mally use an en dash.

    It is also accept­able to use an en dash instead of an em dash to set off clauses in text. I don?t like the prac­tice, but it?s not incor­rect to do so. If you do sub­sti­tute en dashes, con­sider adding space before and after them. 

    Keep up the good work —-

  15. Mark Boulton said on: February 8th, 2006 at 4:23 pm

    Hugh: I really don’t know where to start. What a great com­ment. I’ll be sure to update the art­icle with some of these thoughts. 

    There are quite a few dif­fer­ent dashes, such as the fig­ure dash for example, which I pur­pose­fully left out of this art­icle because, well, it would become very com­plic­ated, very quickly.

    You raised some really inter­est­ing points though.

  16. gsf said on: February 8th, 2006 at 6:49 pm

    I became far more aware of the en dash and em dash, and their proper use, when I began using LaTeX, which parses two dashes (–) as an en dash and three (—) as as em dash.  

    I agree with Paul Haine that the art­icle on A List Apart is another fine resource. Your art­icle is nice and easy to digest, I think in large part due to the rule­mark gifs that quickly and clearly identify the con­tent of each sec­tion. Great use of visuals. 

    Two gripes I have regard­ing the way web browsers handle these dandy punc­tu­ation marks: first, it’s a shame fire­fox still doesn’t recog­nize the soft hyphen (­)?really, it’s a shame that no browsers have the abil­ity to break words between syl­lables the way a decent text pro­cessor can; second, in a num­ber of browser fonts, includ­ing the ones this page dis­plays in on both fire­fox and ie, one can barely dis­tin­guish between the hyphen and the en dash?if any­thing, the hyphen is actu­ally wider.

  17. gsf said on: February 8th, 2006 at 6:55 pm

    There should be a ‘­’ between the par­en­theses above fol­low­ing ‘soft hyphen.’ Sorry, the pre­view doesn’t seem to be work­ing for me and I didn’t know that the ‘­’ would be translated.

  18. Jimmy said on: February 9th, 2006 at 6:19 pm

    Thank you so much for post­ing some type­set­ting tips– so hard to find on the web!

  19. Nils T. Devine said on: February 10th, 2006 at 1:03 am

    I like to pull my favour­ite Bring­hurst quote when this dis­cus­sion comes up:

    5.2.1 Use spaced en dashes – rather than em dashes or hyphens — to set off phrases.

    The em dash is the nineteenth-century stand­ard, still pre­scribed by many edit­or­ial style books, but the em dash is too long for use with the best text faces. Like the over­sized space between sen­tences, it belongs to the pad­ded and cor­seted aes­thetic of Vic­torian typography.

    The web being my nat­ive design medium, I did some exper­i­ment­ing with browser sup­port for vari­ous dashes and spaces. I hope this is use­ful to someone.

  20. Fabian De Rango said on: February 12th, 2006 at 9:50 am

    This is all well and good but the prob­lem is that many people are to lazy to fix up their gram­mer on their sent­ances, like this one way to long! 

    Also their are some con­tent man­ag­ment sys­tems which con­vert these auto­mat­ic­ally but I hon­estly couldn’t remem­ber all the html code for them, and I simply am to lazy to bother.

  21. Khoi Vinh said on: February 13th, 2006 at 4:31 am

    Thanks Mark! I learned a lot from your ori­ginal post and from the com­ments here — and I kind of thought I knew everything already. Two comments: 

    First, it’s per­haps wrong to say that there’s little use for the soft hyphen. I’d be very keen to use a soft-hyphen along with smart, auto­mated hyphen­a­tion in a Web browser to achieve aes­thet­ic­ally pleas­ing right-rag effects with bod­ies of type. That’s some­thing that has been lost entirely online.

    Second, I think Hugh really out­lines the use of the em dash very well; my per­sonal taste is that an em dash deserves a space before and after, espe­cially in browsers, in order to avoid effect­ively join­ing two long words together with an extra long dash and caus­ing egre­giously awk­ward right-rag effects. That said, I wish I could spe­cify a shorter dis­tance than a full space, as that amount does look odd. But it’s an accept­able com­prom­ise, at least for now.

  22. James denny said on: March 2nd, 2006 at 3:11 pm

    You can use a thin space on both sides of an em-dash to avoid link­ing the words and so you can allow them to wrap bet­ter. This could also be a solu­tion to open em or en dashes that cause wide open spaces in nar­row columns.

    The char­ac­ter entity is shown on the HTML 4 ref­er­ence at W3C site: <block­quote title=“W3C”><!ENTITY thinsp CDATA “ ”—thin space, U+2009 ISO­pub –></blockquote>http://www.w3.org/TR/1999/REC-html401-19991224/sgml/entities.html#h-24.4.1

    This is right under the entry for em and en spaces. 

    It worked on Fire­Fox 1.5.0.1 but I’ve not tried any other browser.

  23. James denny said on: March 2nd, 2006 at 3:21 pm

    erm block­quote errors not sure if that was me or the soft­ware — the “live pre­view” is not work­ing and there’s no backup! 

    quoted:

    <!– Gen­eral Punc­tu­ation –>

    <!ENTITY ensp CDATA “ ”—en space, U+2002 ISO­pub –>

    <!ENTITY emsp CDATA “ ”—em space, U+2003 ISO­pub –>

    <!ENTITY thinsp CDATA “ ”—thin space, U+2009 ISO­pub –>



    <!ENTITY ndash CDATA “–”—en dash, U+2013 ISO­pub –>

    <!ENTITY mdash CDATA “—”—em dash, U+2014 ISO­pub –>

  24. James denny said on: March 2nd, 2006 at 6:54 pm

    ok looks like I need to remove encode it as some char­ac­ters that seem to slip though the com­ment­ing sytem’s net! 

    quoted:

    – Gen­eral Punc­tu­ation –

    ENTITY ensp CDATA “ ”—en space, U+2002 ISO­pub –

    ENTITY emsp CDATA “ ”—em space, U+2003 ISOpub –

    ENTITY thinsp CDATA “ ”—thin space, U+2009 ISO­pub –



    ENTITY ndash CDATA “–”—en dash, U+2013 ISO­pub –

    ENTITY mdash CDATA “—”—em dash, U+2014 ISOpub –

  • Me

    Hello. My name is Mark Boulton. I’m a designer, an author, a speaker and I run a small design agency where we work with lovely cli­ents and pub­lish books as we go. This is my blog.

  • More of me

  • Publications

    Design­ing for the Web
    Start­ing from £19 + VAT for a PDF Down­load. £29 for a full col­our paperback.
  • Where I work

    Mark Boulton Design
    A small design stu­dio doing good things for nice clients.
    Five Simple Steps
    Pub­lish­ing easy to read design books.
  • See me speak

    @Media 2010
    June 8th — 11th, Lon­don, UK.
    Drupal­Con 2010
    August 23th — 27th, Copen­ha­gen, Denmark.
    dCon­struct 2010: Design 1010 workshop
    Septem­ber 1st, Brighton, UK.
    Web­d­a­gene
    Septem­ber 29th — Octo­ber 1st, Oslo, Norway
    Web Developers Conference
    Octo­ber 27th, Bris­tol, UK.
    New Adven­tures in Web Design
    Janu­ary 20th 2011, Not­ting­ham, UK.
  • Copyright © 1999–2010 Mark Boulton. Made with an Apple Mac in Wales. Running on WordPress and VPS.net.