Quote:
Originally Posted by DrLimu
I just tried it and it's based on chapters. Although in this book the split between books isn't perfect, the title pages of book 2 & 3 are placed at the end of the previous chapter, but that's easy to fix using the split function in Calibre.
Calibre creates a stylesheet, it's not pretty, but better than before. I managed to remove 80 of the 114 classes by a simple regular expression (search and replace -aw-bookmark-end: _Toc[0-9]+) and the "Remove unused CSS"-function.
|
What I would do is split/join as needed. Then I would proceed to clean up the CSS. I would rename all the HTML files so you know what they are by the name. I would also losslessly compress any graphics. If you have multiple CSS, I would merge them into one. I would delete any HTML ToC and any other HTML file you don't need such as a review page, or list of other books the author has written. eBooks have an advertising page that uses embedded fonts that you can dump as well as the fonts. And there are other HTML that can go.
It sounds a lot harder then it is, but it's not all that hard to do. For example, some of the cleaning up of the HTML can be done easily with search/replace.