Quotation-Marks.com

Get a NetFlix Subscription

 

Learn How to Use Quotation Marks in English

English curved quotes, also called “book quotes” or “curly quotes”, look like small figures six and nine with the counters filled. They are preferred in formal writing and printed typography. In e-mail and on Usenet they can only be used by using a MIME type with a character set outside of the ISO-8859 series such as a Unicode encoding or one of the Windows-125x series. Whilst not a problem for most modern mail clients this does increase the size of the message and makes the raw message text harder to follow and so some believe it is bad practice (in much the same way that some think that HTML e-mail is a bad thing). A few mail clients send curved quotes using the windows-1252 codes but mark the text as ISO-8859-1 causing problems for decoders that do not make the dubious assumption that C1 control codes in ISO-8859-1 text were meant to be windows-1252 printable characters.

Curved and straight quotes are also sometimes referred to as “smart quotes” and “dumb quotes” respectively; these names are in reference to the name of a function found in word processors like Microsoft Word that automatically converts straight quotes typed by the user into curved quotes. This function is necessary because keyboards lack separate quotation marks, due to the fact the ASCII character set didn't have distinct opening and closing quotation marks. A quote followed by a letter generally becomes an opening quote, whereas a quote with a letter or period (full stop) preceding it and a space after it becomes a closing quote. This function is usually referred to as “educating quotes”.

Samples Unicode (decimal) HTML Description
‘O’ U+2018 (8216), U+2019 (8217) ‘ ’ Single quotes (left and right)
“O” U+201C (8220), U+201D (8221) “ ” Double quotes (left and right)

Variants of ' and " are:

' - U+201B (HTML: ‛) – single high-reversed-9, or single reversed comma, quotation mark (This is sometimes used to show dropped sounds at the end of words, such as goin' instead of using goin', goin', goin`, or goin')
 - U+201F (HTML: ‟) – double high-reversed-9, or double reversed comma, quotation mark

Supporting curved quotes has been a problem in information technology, primarily because the widely-used ASCII character set did not include a representation for them (as discussed above).

Word processors have traditionally offered curved quotes to users, because in printed documents curved quotes are preferred to straight ones. Before Unicode was widely accepted and supported, this meant representing the curved quotes in whatever 8-bit encoding the software and underlying operating system were using — but the character sets for Windows and Macintosh used two different pairs of values for curved quotes, and ISO 8859-1 (typically the default character set for the Unices and Linux) had no curved quotes, making cross-platform compatibility a nightmare.

Compounding the problem is the “smart quotes” feature mentioned above, which some word processors (including Microsoft Word and OpenOffice.org) use by default. With this feature turned on, users may not have realized that the ASCII-compatible straight quotes they were typing on their keyboards ended up as something entirely different.

Unicode support has since become the norm for operating systems. Thus, in at least some cases, transferring content containing curved quotes (or any other non-ASCII characters) from a word processor to another application or platform has sometimes been less troublesome, provided all steps in the process (including the clipboard if applicable) are Unicode-aware. But there are many applications which still use the older character sets, or output data using them, and thus problems still occur.

There are other considerations for including curved quotes in the widely used markup languages HTML, XML, and SGML. If the encoding of the document supports direct representation of the characters, they can be used, but doing so can result in difficulties if the document needs to be edited by someone who is using an editor that cannot support the encoding. For example, many simple text editors only handle a few encodings or assume that the encoding of any file opened is a platform default, so the quote characters may appear as “garbage”. HTML includes a set of entities for curved quotes ‘ (left single), ’ (right single), ‚ (low 9 single), “ (left double) ” (right double) &dbquo; (low 9 double). XML does not define these by default, but specifications based on it can do so, and XHTML does. In addition, while the HTML 4, XHTML and XML specifications allow specifying numeric character references in either hexadecimal or decimal, SGML and older versions of HTML (and many old implementations) only support decimal references. Thus, to represent curly quotes in XML and SGML, it is safest to use the decimal numeric character references. That is, to represent the double curly quotes use “ and ”, and to represent single curly quotes use ‘ and ’. In HTML, it is safest to use the named entity references (“, etc.), although decimal numeric character references are processable by most web browsers (Netscape 4 being a notable exception).

There has been some argument in recent years about the appropriateness of book quotes, since they are perceived by some as distracting. Editors who are against book quotes generally argue for ASCII-style straight quotes.

Copyright © 2005-2012 Quotation-Marks.com. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the section entitled “GNU Free Documentation License”.

 
 
  ©2005-2012 Quotation-Marks.com. All Rights Reserved