08-14-2009, 07:39 AM | #1 |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
encoding problem with mobi converted to epub
Hi,
I'm seeing a problem with a .mobi file I picked up from Amazon. I converted it to epub for use on the Sony, and accented characters are coming out garbled. For example:
Code:
ProvenÃ§al
The original mobi appears to be fine: it looks OK in Stanza and the iPhone Kindle app (but not in the Calibre viewer).
What's really weird is that when I extracted the epub to copy/paste the garbled text, I discovered that Calibre had actually converted the text correctly, at least at the individual xhtml level. The xhtml files appear to be saved as UTF-8 with no BOM, and the <head> section specifies UTF-8 as the encoding as well. So it seems something else in the epub isn't being set correctly, causing the final encoding output to go awry. Any ideas as to what it could be?
Edit: The garbled text does actually display correctly on Desktop ADE as well, but not in the Calibre viewer or mobile ADE on the 505.
Last edited by ldolse; 08-14-2009 at 07:42 AM. |
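For anyone who wants to repeat the check described above (xhtml files saved as UTF-8, no BOM), here is a rough Python sketch. It is not part of calibre; the function name `check_epub_encoding` is made up for illustration, and it only assumes that an epub is a zip archive containing (X)HTML files.

```python
import zipfile

UTF8_BOM = b"\xef\xbb\xbf"

def check_epub_encoding(epub_file):
    """Hypothetical helper: for each (X)HTML file inside the epub,
    report (has_bom, decodes_as_utf8)."""
    results = {}
    with zipfile.ZipFile(epub_file) as zf:
        for name in zf.namelist():
            if not name.lower().endswith((".xhtml", ".html", ".htm")):
                continue
            data = zf.read(name)
            has_bom = data.startswith(UTF8_BOM)
            try:
                data.decode("utf-8")
                valid_utf8 = True
            except UnicodeDecodeError:
                valid_utf8 = False
            results[name] = (has_bom, valid_utf8)
    return results
```

Calling `check_epub_encoding("book.epub")` (the path is a placeholder) returns a dict keyed by file name. If every file comes back as `(False, True)`, the bytes inside the epub are probably fine and the garbling is more likely happening when the reader decodes or renders them.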
08-14-2009, 10:41 AM | #2 | |
Guru
Posts: 800
Karma: 194644
Join Date: Dec 2007
Location: Argentina
Device: Kindle Voyage
|
See this post https://www.mobileread.com/forums/sho...88&postcount=7 ADE fonts are Latin-1 only, so you need to point the epub to use the LRF fonts. You will see the explanation in the above post. BUT it will not work with the current version of calibre due to a bug ( http://calibre.kovidgoyal.net/ticket/3150 ). You need to use 0.5.1.4 to produce a correct epub. |
08-14-2009, 12:30 PM | #3 |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
Ok, well I can try editing the CSS to use the LRF fonts, but this doesn't look to me like a font issue. The other poster mentioned question marks, which can happen with fonts, but in this case it really is gibberish. 'ç' looks to be stored as Unicode, and is rendered as two bizarre characters (so I assume two bytes, each shown as its own character), which is why I was thinking it was encoding related.
Note the calibre book viewer screws it up in the same way. |
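To illustrate the two-byte theory, here is a minimal Python sketch. It assumes the reader is decoding the UTF-8 bytes as Latin-1; that is only a guess at the cause, not something confirmed in this thread.

```python
# Illustration only: assumes UTF-8 bytes are being mis-decoded as Latin-1.
original = "Provençal"

# 'ç' (U+00E7) becomes two bytes in UTF-8: 0xC3 0xA7.
utf8_bytes = original.encode("utf-8")

# Reading those bytes as Latin-1 turns each byte into its own character.
garbled = utf8_bytes.decode("latin-1")
print(garbled)  # ProvenÃ§al

# Reversing the mistake recovers the original text.
repaired = garbled.encode("latin-1").decode("utf-8")
print(repaired)  # Provençal
```

The garbled output matches what shows up on the reader, which is why this looks like a decoding problem and not a missing glyph (a missing glyph would normally show up as a question mark or an empty box).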
08-14-2009, 12:39 PM | #4 |
creator of calibre
Posts: 44,344
Karma: 23661992
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Interesting, can you open a ticket and post the MOBI and EPUB files?
|
08-14-2009, 12:48 PM | #5 |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
Will do - note earlier today I couldn't create a ticket - did this get fixed?
|
08-14-2009, 12:55 PM | #6 |
creator of calibre
Posts: 44,344
Karma: 23661992
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
There was a problem with creating tickets a day or so ago (I was upgrading server software), but it should be fine now.
|