View Single Post
Old 12-26-2014, 04:08 PM   #7
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 80,083
Karma: 147983159
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Quote:
Originally Posted by burnafterreading View Post
thanks for the pointers thus far! i'll try these with some of the more nasty epubs (seems The Cutting Room was pretty clean to begin with, if i judge based on how many XHTML files show up when opened in Sigil). some of the Agatha Christie stuff i have is downright nasty - 70+ XHTML files, some nearly blank or with just a heading, etc...
That sounds like a Calibre conversion with incorrect conversion options (or default which are incorrect for those eBooks). So what you'll need to do is join together the XML that should not have been split.

Quote:
i had a thought - if i use a well-assembled epub (like The Cutting Room or something similar with enough chapters) and replace its text with a new book's text and then save-as with the appropriate name... excessive? i'd only do 10-25 books every 6 months since that's my current reading pace. or maybe just extract the raw text, and re-create an epub cleanly.
That won't work. You need to have the correct CSS for the job. And if you grab the text from another eBook, then you could end up with classes in the XML that do not exist in the CSS and it's then even more work to fix things.
JSWolf is offline   Reply With Quote