View Single Post
Old 08-24-2009, 03:56 AM   #3
Jellby
frumious Bandersnatch
Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.
 
Jellby's Avatar
 
Posts: 6,150
Karma: 4792399
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
Quote:
Originally Posted by ectoplasm View Post
First, it took much longer than what I first thought. This book in particular may have been an especially poor choice to attempt such a conversion. Dickens uses a lot of punctuation such as:
“So th’ are, so th’ are!” cried Ham. “Well said! So th’ are. Mas’r Davy bor—gent’lmen growed—so th’ are!”
Welcome to the amazing world of "Why regular expresions tend to fail when converting straight quotes to curly quotes"

May I suggest you use different markup for right single quote and apostrophe (both displayed as ’ )? That way, it would be fairly easy to change from US- to UK-style or vice-versa afterwards. In my files I use "’" for right single quotes and "& #8217;" (no space) for apostrophes, they encode the exact same character, however.
Jellby is offline   Reply With Quote