Quote:
Originally Posted by j.p.s
Sounds like that uploader and maybe Project Gutenberg in general have serious toolchain/workflow issues.
That's not unusual for Gutenberg. Their ebook structure generally sucks and their CSS is idiosyncratic.
The EPUB3 version has five HTML files for the text of the ebook, with chapters split across different files. I merged all the text files into one, used a bit of regex to insert split markers, cleaned up the CSS, etc. Sigil's saved searches come in very handy for this type of repetitive cleanup (Gutenberg's one saving grace is that they are pretty consistent in how they screw up the structure and CSS).
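For anyone who would rather script the merge-and-mark step than do it in the editor, here is a rough Python sketch of the idea. It is not what I actually ran (I did it with Sigil's search and replace). It assumes every chapter starts with an `<h2>` heading, which you would need to check against the actual book, and the function names `extract_body` and `merge_and_mark` are made up for illustration. The marker string is the one Sigil's "Split At Markers" looks for.

```python
import re

# Marker that Sigil's "Split At Markers" command acts on.
SPLIT_MARKER = '<hr class="sigil_split_marker" />'

# Grabs everything between <body ...> and </body>.
BODY_RE = re.compile(r"<body[^>]*>(.*?)</body>", re.DOTALL | re.IGNORECASE)

# ASSUMPTION: chapters begin with an <h2> heading. Adjust for the real book.
CHAPTER_RE = re.compile(r"<h2\b", re.IGNORECASE)


def extract_body(xhtml):
    """Return the inner body of one XHTML file (hypothetical helper)."""
    match = BODY_RE.search(xhtml)
    return match.group(1).strip() if match else xhtml.strip()


def merge_and_mark(files):
    """Merge body contents in order and put a split marker before
    every chapter heading except the first (hypothetical helper)."""
    merged = "\n".join(extract_body(text) for text in files)
    seen = 0

    def add_marker(match):
        nonlocal seen
        seen += 1
        # The first chapter already starts the file, so no marker there.
        if seen == 1:
            return match.group(0)
        return SPLIT_MARKER + "\n" + match.group(0)

    return CHAPTER_RE.sub(add_marker, merged)


if __name__ == "__main__":
    parts = [
        "<html><body><h2>One</h2><p>a</p></body></html>",
        "<html><body><p>b</p><h2>Two</h2><p>c</p></body></html>",
    ]
    print(merge_and_mark(parts))
```

Paste the result into a single file in Sigil and run Split At Markers; each chapter should then land in its own file regardless of how the original five files were cut.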