08-20-2011, 07:16 PM | #1 |
Member
Posts: 18
Karma: 10
Join Date: Aug 2011
Device: Nook
|
Converting non-ASCII characters
I wrote a recipe for Madison.com (content from Cap Times and Wisconsin State Journal) which works well except that all apostrophes appear to have been replaced with 'â ', both in titles and in articles. I've tried using preprocess_regexps to correct this, which seems to work for about the first 75 pages of news. After that, although the titles listed under 'Contents' have apostrophes, the title at the start of the article and the article content have 'â 's. Any ideas? Is there a better way to replace all instances of certain characters or phrases in downloaded content? Here is the recipe:
Spoiler:
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Non-ASCII characters in recipe titles show as ü | bubak | Recipes | 2 | 11-30-2011 07:49 AM |
advanced text search and non-ascii characters | msz59 | General Discussions | 0 | 05-05-2011 09:47 AM |
non-ASCII characters show up as question marks on my Reader (from FAQ) | Candoumi | Sigil | 2 | 04-07-2011 08:44 PM |
Typing non-ASCII characters with the keyboard | Edmundo | Amazon Kindle | 5 | 01-20-2011 01:18 PM |
Is it possible to sent books to device with filename in non-ascii characters? | flyisland | Calibre | 8 | 10-16-2010 05:35 AM |