11-27-2022, 09:41 PM | #1 |
Connoisseur
Posts: 84
Karma: 335288
Join Date: Nov 2012
Device: Kindle
|
Amazon's flawed conversion of epubs
I purchased an ebook from Kobo (epub format), imported it into Calibre (using a plugin to remove the DRM), then sideloaded it to my Kindle using my Send To Kindle email address.
The book displayed oddly on my Kindle. All of the apostrophes and quotation marks were replaced by a series of symbols: ’ (a-circumflex, Euro, superscript trademark). For example, the sentence: "I'm not sure why you don't run away." displayed as: ’I’m not sure why you don’t run away.’ I took the same file I'd emailed my Kindle and opened it with Calibre's native viewer. The punctuation displayed correctly there. That suggested the original file is fine; the problem occurs when Amazon converts this epub to awz3. As a test, I created a mobi version with Calibre and sent it to my Kindle using the desktop Send To Kindle app. (Send To Kindle email no longer allows uploads of mobi files, but the desktop app does.) This time the book displayed properly, confirming the error was an artefact of Amazon's converting the epub. Bottom line, I found a solution, so I'm not posting here to ask for help. I just want to share my experience in case anyone else runs into this issue. If Amazon screws up your epub, trying sending a mobi version with the desktop app (as long as that remains as possiblity... I believe the desktop app will also eventually drop mobi support). Out of curiosity, has anyone else come across this? I've had a book's formatting display differently on a Kindle compared to its appearance in Calibre, but this is the first time I've had actual text get screwed up. |
11-27-2022, 10:41 PM | #2 |
Bibliophagist
Posts: 35,464
Karma: 145525534
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Forma, Clara HD, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
I thought this topic had been beaten to death with multiple threads about the topic. The most likely problem is that your epubs are not marked as using UTF-8. You can use calibre's Modify Epub plugin to correct this before sending the epubs to Amazon (turn off all the options except Encode HTML in UTF-8). BTW, the epub standard requires an epub to use either UTF-8 or UTF-16 encoding though I've only seen 2 epubs that used UTF-16.
One other item is that calibre's ebook-viewer is chromium based and like most web browsers tends to hide many errors. It is not a good test bed for how an epub will display on an eInk reader or most apps. Last edited by DNSB; 11-27-2022 at 10:44 PM. |
Advert | |
|
11-27-2022, 10:46 PM | #3 |
Groupie
Posts: 169
Karma: 200002
Join Date: Feb 2009
Location: Bangkok, Thailand
Device: K3, KV, KOA3
|
In short, your source ePub file is not in a strictly well-formed format.
Amazon's Send to Kindle service supports EPUB files; however, for historical reasons, it still assumes ISO-8859-1 encoding if no encoding is specified, while most "assume" as UTF-8 and continue. This creates malformed formatting errors for special characters, especially "smart" punctuation. Last edited by tekkasit; 11-27-2022 at 10:50 PM. |
11-28-2022, 02:00 AM | #4 |
Wizard
Posts: 1,738
Karma: 26006874
Join Date: Sep 2017
Device: PW3, Fire HD8 Gen7, Moto G7, Sansa Clip v2, Ruizu X26
|
Is there a good test bed to check for errors, other than putting the book on an eReader and doing manual observation? Something that can be run on a desktop computer?
|
11-28-2022, 02:06 AM | #5 | |
Zealot
Posts: 114
Karma: 2691856
Join Date: Feb 2022
Device: PPW5, Quaderno A4, Supernote A6X, Leaf 2
|
Quote:
Note that Kindle Previewer 3 is more strict than the current Amazon processing pipeline, files that failed to convert with Kindle Previewer may be converted with Amazon's Send to Kindle. But it is a good place to start nonetheless. |
|
Advert | |
|
11-28-2022, 04:53 AM | #6 | |
Resident Curmudgeon
Posts: 74,015
Karma: 129333114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
|
|
11-28-2022, 08:46 AM | #7 | |
Zealot
Posts: 114
Karma: 2691856
Join Date: Feb 2022
Device: PPW5, Quaderno A4, Supernote A6X, Leaf 2
|
Quote:
It works very well as a testbed to check for errors, which was the question I answered. Also can you give example on where it is more forgiving, in relation to EPUB conversion error? Last I checked Kindle Previewer always error on out of order TOC, while it will now convert with Send to Kindle (with TOC stripped). |
|
11-28-2022, 09:47 AM | #8 |
Right, Except When Wrong
Posts: 353
Karma: 3968525
Join Date: Aug 2007
Location: Indianapolis
Device: Kindle Oasis 3 (sometimes iPad Mini).
|
Query: So let's assume that I've acquired an EPUB formatted book that displays properly in Calibre's viewer but not on the Kindle (because of the oddly formatted special characters). What is the correct workflow to be sure that the EPUB is encoded as UTF-8 (and converting it if necessary) before sending it to the Kindle?
|
11-28-2022, 09:52 AM | #9 |
Zealot
Posts: 114
Karma: 2691856
Join Date: Feb 2022
Device: PPW5, Quaderno A4, Supernote A6X, Leaf 2
|
If you are already using calibre then using the Modify EPUB plugin to add UTF-8.shoukd fix the problem.
If you are not using calibre then using https://kindle-epub-fix.netlify.app/ is probably easier. (Disclaimer: I built that). |
11-28-2022, 11:56 AM | #10 | |
Resident Curmudgeon
Posts: 74,015
Karma: 129333114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
|
|
11-28-2022, 12:38 PM | #11 |
Zealot
Posts: 114
Karma: 2691856
Join Date: Feb 2022
Device: PPW5, Quaderno A4, Supernote A6X, Leaf 2
|
You are entirely missing the point. The topic is about mis-encoded characters (and maybe other Send to Kindle errors), which Kindle Previewer is perfectly capable of checking.
|
11-28-2022, 01:29 PM | #12 |
Resident Curmudgeon
Posts: 74,015
Karma: 129333114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Why bother? It's a waste of time. It's just easier to use Modify ePub to make sure the ePubs are set to UTF-8 and then send them to Amazon to convert.
|
11-28-2022, 05:20 PM | #13 | ||
Connoisseur
Posts: 84
Karma: 335288
Join Date: Nov 2012
Device: Kindle
|
Quote:
Quote:
Thanks. |
||
11-28-2022, 06:00 PM | #14 |
Resident Curmudgeon
Posts: 74,015
Karma: 129333114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
|
11-28-2022, 06:11 PM | #15 |
Bibliophagist
Posts: 35,464
Karma: 145525534
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Forma, Clara HD, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Or if you are enthused, you can add the EpubCheck plugin to Sigil or calibre's editor. For most places, if it doesn't pass epubcheck, they won't even try to publish it. If you don't want to use Sigil or calibre, you can run epubcheck from a command line though I generally reserve that for pre-processing a mass of epubs.
For the last few years it's been better but the memory of how many "professionally" produced epubs failed epubcheck, often in a rather spectacular display of error messages still drives my lack of enthusiasm for most publishing houses ability to produce a half decently formatted ebook. Last edited by DNSB; 11-28-2022 at 06:13 PM. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Is pw4 firmware 5.12.1 flawed | Ruskie_it | Amazon Kindle | 19 | 07-30-2019 01:47 PM |
Looking for a Browser that isn't Deeply Flawed | Rizla | Android Devices | 21 | 09-17-2014 01:27 PM |
ePUBs multilevel TOC gets flattened in conversion to MOBI | damnated | Conversion | 14 | 09-02-2011 02:25 PM |
PRS-600 eink is flawed under the sun | goaspy | Sony Reader | 18 | 06-02-2010 01:33 AM |
eReader output SERIOUSLY flawed | JSWolf | Calibre | 16 | 03-21-2010 07:50 AM |