08-20-2014, 12:30 PM | #1 |
Member
Posts: 12
Karma: 10
Join Date: Aug 2014
Device: kindle
|
Recipe turning some punctuation marks as non-printable characters
Hi
I am trying to make a recipe for downloading news from govt site i.e. pib.nic.in so was trying to do some tweaking aroung and bumped into this problem.. When ebook-convert downloads the news, it turns some punctuation into junk characters.. Here is the code Spoiler:
Original HTML had Spoiler:
after running the recipe Spoiler:
How can i fix it? |
08-20-2014, 12:58 PM | #2 |
creator of calibre
Posts: 44,346
Karma: 23661992
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Use the correct value of encoding in your recipe.
|
08-20-2014, 11:09 PM | #3 |
Member
Posts: 12
Karma: 10
Join Date: Aug 2014
Device: kindle
|
|
08-20-2014, 11:13 PM | #4 |
creator of calibre
Posts: 44,346
Karma: 23661992
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
You need to fiure out what the encoding for the html pages you are scraping is. Common choices, latin1, cp1252, utf-8
|
08-20-2014, 11:47 PM | #5 | |
Member
Posts: 12
Karma: 10
Join Date: Aug 2014
Device: kindle
|
Quote:
PHP Code:
PHP Code:
PHP Code:
Last edited by knowledgecrawler; 08-20-2014 at 11:59 PM. |
|
Tags |
lang, punctuation |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
13 Little-Known Punctuation Marks We Should Be Using | VydorScope | Writers' Corner | 16 | 11-16-2012 02:40 PM |
Question marks instead of most special characters - HTML->mobi | vermontcathy | Conversion | 3 | 09-29-2012 11:42 AM |
Strange text characters and missing chapter marks on Kindle 3 | Grahamk | Conversion | 7 | 02-28-2011 02:14 AM |
Loss of Punctuation Marks | AllyBally | Calibre | 2 | 12-30-2010 03:03 PM |
Extra punctuation marks in epub after loading from SRL to PRS-600 | planters | Sony Reader | 8 | 03-12-2010 11:38 AM |