View Single Post
Old 02-23-2010, 06:54 PM   #1482
nickredding
onlinenewsreader.net
nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'
 
Posts: 328
Karma: 10143
Join Date: Dec 2009
Location: Phoenix, AZ & Victoria, BC
Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire
Handling emdash

I'm having a problem with a news feed that has emdash's included literally (instead of using — ) and they are being handled as follows: the emdash is recognized as such, and translated into a unicode emdash (u2014) which then turns up in the output as the UTF-8 equivalent (0xE2 0x80 0x94) and is displayed as — which is the CP1252 interpretation of those three character codes. I can't figure out how to fix this -- preprocess_regexps doesn't work. Can anyone help?
nickredding is offline