08-03-2013, 03:45 PM | #1 |
Country0129
Posts: 55
Karma: 506306
Join Date: Apr 2012
Location: Louisiana
Device: Kindle, Kindle Fire, PC
|
HTML code Appearing in Text
I recently downloaded several Epub's, polished them, and converted them to Mobi's. One of the books included, it seems, all of the html code in the text at the beginning of the book and, it seems, at the beginning of the chapters.
I have a recipe for processing my books: 1. Download books to a dedicated folder. 2. Import the books to Calibre. 3. Update/download metadata 4. Resize covers 5. Polish books (Epub, AZW) 6. Convert books to Mobi 7. Save to external back-up disk I did notice that this particular book did not complete the polishing procedure, for I should normally have Epub,Mobi,Original Epub as file types. This one retained the Epub format, but I received no error message while polishing the book. Is there something I'm missing? How can I correct the formatting of the book. Thanks for your help, and please send email to country0129@yahoo.com with any solutions. |
08-03-2013, 03:54 PM | #2 |
Well trained by Cats
Posts: 29,812
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
If you see HTML like code:
Notice I said: Like. Something was malformed and HTML tags probably got converted to entities (ie <P> somewhere along the way. The page may still be malformed, which is why a conversion and/or polish failed to complete. |
Advert | |
|
08-03-2013, 03:56 PM | #3 |
Country0129
Posts: 55
Karma: 506306
Join Date: Apr 2012
Location: Louisiana
Device: Kindle, Kindle Fire, PC
|
The text still appears in proper order after the HTML code; so I can still read it, albeit somewhat interrupted sorting through the extraneous code. So, no way to fix it, just endure it?
|
08-03-2013, 03:58 PM | #4 | |
Well trained by Cats
Posts: 29,812
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Sigil and some crafty REGEX to remove sets of entities if the rest is fine. Why not ask the publisher for a clean copy? |
|
08-03-2013, 04:02 PM | #5 |
Country0129
Posts: 55
Karma: 506306
Join Date: Apr 2012
Location: Louisiana
Device: Kindle, Kindle Fire, PC
|
I'm not advanced enough in my registry edit procedure nor knowledgeable with the python usage to do all that. 15,662 books and counting managed wonderfully by Calibre. I'd say it's an amazing software! Thank you, Mr. Ducks. I think I will endure.
|
Advert | |
|
08-03-2013, 06:31 PM | #6 |
Well trained by Cats
Posts: 29,812
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
REGEX
is Regular Expressions, a search and replace form 1 book I would understand having been damaged All (15K) books , something else has had its evil ways (and I have no clue as to what or even if it is fixable). |
08-05-2013, 07:26 AM | #7 |
Country0129
Posts: 55
Karma: 506306
Join Date: Apr 2012
Location: Louisiana
Device: Kindle, Kindle Fire, PC
|
Dunno what happened.
Uploaded to my Kindle KB, and the corruption remained. Hit a wrong key, and from the middle of the book, it went back to the beginning, frustrating me in having to muddle through all the code again to return to my reading place; so I closed the book out and did something else.
Sometime later, next day or so, I opened Calibre, decided to read the book in the native reader, for I could use the scroll bar to return to where I was reading, and, lo and behold! It got well. All the code had disappeared. This is probably the most unusual thing I've seen Calibre do, but it was a good thing. I still maintain that Calibre is the most innovative and useful software on the planet. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Converting from HTML to Mobi, Images not appearing | Rena02 | Conversion | 1 | 07-22-2013 03:44 AM |
HTML input plugin stripping text within toc tags in child html file | nimblebooks | Conversion | 3 | 02-21-2012 03:24 PM |
Troubleshooting HTML code in text files | LeoBloom | Amazon Kindle | 1 | 12-12-2010 02:25 PM |
Strange  character appearing throughout e-book text | mag1 | ePub | 21 | 02-01-2010 07:01 AM |