|10-23-2009, 12:11 AM||#1|
Join Date: Jul 2009
Device: Sony PRS-505
Bad character breaking conversion?
I was going nuts trying to figure out why I was only getting half a book converted from html to epub, and discovered that the textfile ahd a few odd characters. Hex 1C and 1F .
The thing that was troubling, though, was that Calibre didn't give an error, it just acted like it was done, and created an epub that had pages 1-99 of a 240 page book.
I didn't even discover the issue until I was reading the book, and hit 'next page', and .. um... huh?
Mac OSX 10.6, Calibre 6.19
Html is just vanilla html, converted from a textfile. I have a small regular expression set I use.
Any reason you can think of why Calibre wouldn't report an error, but just created a truncated epub?
|10-23-2009, 07:32 AM||#2|
Sigil & calibre developer
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
Most likely it did convert file and if you open the EPUB and look at the HTML inside of it you will find those strange characters. I'll bet that what is happening is when the viewer gets to them it chokes and can't render the rest of the book.
|Thread Tools||Search this Thread|
|Thread||Thread Starter||Forum||Replies||Last Post|
|Non-Breaking Hyphen||DTM||Sigil||11||08-20-2010 01:09 PM|
|PDF -> AZW conversion, weird character spacing||beacher||Amazon Kindle||7||08-17-2010 09:54 PM|
|Character conversion: "—" --> "â€”"||Ivo||Sigil||21||07-08-2010 04:38 PM|
|Non breaking spaces?||troymc||Sigil||6||05-22-2010 07:47 AM|
|Character conversion issue||Auron||Calibre||1||03-05-2009 12:27 PM|