#1
Member
Posts: 12
Karma: 10
Join Date: Aug 2014
Device: kindle
Recipe turning some punctuation marks into non-printable characters
Hi,
I am trying to write a recipe that downloads news from a government site, pib.nic.in. While tweaking it I ran into this problem: when ebook-convert downloads the news, it turns some punctuation marks into junk characters. Here is the code: Spoiler:
The original HTML had: Spoiler:
After running the recipe: Spoiler:
How can I fix it?
#2
creator of calibre
Posts: 45,634
Karma: 28549046
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Use the correct value of encoding in your recipe.
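To see why the encoding value matters, here is a minimal sketch of the failure. The sample string is made up (it is not text from pib.nic.in), and the direction of the mismatch is an assumption: the page is taken to be UTF-8 while it gets decoded as cp1252.

```python
# Hypothetical sample text containing typographic punctuation.
original = u'India\u2019s exports \u2013 \u201cprovisional\u201d figures'

# Bytes as a UTF-8 page would send them (assumption: the page is UTF-8).
raw = original.encode('utf-8')

# Decoding with a mismatched encoding turns each punctuation mark into junk.
# errors='replace' is needed because some bytes are undefined in cp1252.
wrong = raw.decode('cp1252', errors='replace')

# Decoding with the matching encoding restores the text.
right = raw.decode('utf-8')

print(repr(wrong))
print(repr(right))
```

Only the punctuation is damaged because plain ASCII letters are encoded identically in all three common encodings, which matches the symptom described above.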
#3
Member
Posts: 12
Karma: 10
Join Date: Aug 2014
Device: kindle
#4
creator of calibre
Posts: 45,634
Karma: 28549046
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
You need to figure out what the encoding of the HTML pages you are scraping is. Common choices are latin1, cp1252 and utf-8.
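One rough way to narrow it down is to decode a chunk of the raw page bytes with each candidate and look at the result. The helper name `try_encodings` and the sample bytes below are hypothetical, for illustration only; in practice the bytes would come from the downloaded page.

```python
def try_encodings(raw, candidates=('utf-8', 'cp1252', 'latin1')):
    """Return a list of (encoding, decoded text or None) for each candidate."""
    results = []
    for name in candidates:
        try:
            results.append((name, raw.decode(name)))
        except UnicodeDecodeError:
            # The bytes are not valid in this encoding at all.
            results.append((name, None))
    return results


# Hypothetical bytes: a right single quote stored as the single byte 0x92,
# which is how cp1252 encodes it.
sample = b'India\x92s exports'

for name, text in try_encodings(sample):
    print(name, repr(text))
```

With this sample, utf-8 fails outright, latin1 decodes without error but yields a non-printable control character, and cp1252 gives the intended apostrophe. A decode that succeeds is therefore not proof of the right encoding, so the output still needs a look. The page's Content-Type header or meta charset tag, if present, is also worth checking. Whichever value renders the punctuation correctly is the one to set as `encoding` in the recipe.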
#5
Member
Posts: 12
Karma: 10
Join Date: Aug 2014
Device: kindle
Quote:
PHP Code:
PHP Code:
PHP Code:
Last edited by knowledgecrawler; 08-21-2014 at 12:59 AM.