Borked Unicode: Tips for journalists on Unicode and writing clean copy

Tim Ahrens:

It is very tempting to put special ligatures or alternate glyphs onto rarely-used character positions in order to make them available, or to use private-use Unicode values for accessing small caps, for example. However, although it would look fine in the browser, this would make the text “unreadable” for robots, it would give you flawed plain text when copying from a Web site, and display incorrectly where Webfonts are not supported.

So keeping the underlying text strictly semantic is the key virtue, and for that it looks like we will have to use separate fonts, at least under the hood. The devil will be in the details, however. Are small caps semantically to be treated as lower case? What if someone prefers to use them without a capital letter at the beginning? Should this be treated as semantically all-caps, or should only the first letter be encoded as upper case?

I have been using capital letters with fake-small-caps sizing until such time as legitimate small caps become usable, when two lines of CSS will enable such use.

The foregoing posting appeared on Joe Clark’s personal Weblog on 2011.06.27 14:46. This presentation was designed for printing and omits components that make sense only onscreen. (If you are seeing this on a screen, then the page stylesheet was not loaded or not loaded properly.) The permanent link is:
http://blog.fawny.org/2011/06/27/ahrens-smallcaps/

Values you enter are stored and may be published.

  

Search for very early blog entries, and for anything else on fawny.org:

  

Information

Other reading

FriendFeed puts everything in one place.

Popular topics

Photographs to look at · Typography; graphic design; the death of design criticism · Leslieville · TTC · Canadian English · Accessibility

Archives by date

Just add /year/month/day/ to the end of site’s URL, blog.fawny.org. You can add just /year/month/, or just /year/, if you wish. Years are four-digit, month and day two-digit (with padding zero below 10). For example:

Very old archives are still available.

Archives by category

Copyright © Joe Clark 2004–2012. All rights reserved.

You enjoy fawny.blog