thanks for the pointers thus far! i'll try these with some of the more nasty epubs (seems The Cutting Room was pretty clean to begin with, if i judge based on how many XHTML files show up when opened in Sigil). some of the Agatha Christie stuff i have is downright nasty - 70+ XHTML files, some nearly blank or with just a heading, etc...
i'll try the PolishBooks function in Calibre. you're right, the Convert is bad form - it actually makes things worse! i sideloaded the same book with / without the Convert function being used, and the WITHOUT actually let me adjust the font/line settings albeit it had no cover.
i had a thought - if i use a well-assembled epub (like The Cutting Room or something similar with enough chapters) and replace its text with a new book's text and then save-as with the appropriate name... excessive? i'd only do 10-25 books every 6 months since that's my current reading pace. or maybe just extract the raw text, and re-create an epub cleanly.
i clearly have a lot of stuff to learn about how these things are put together - it not just a fancy .txt file like i thought!
|