View Single Post
Old 10-24-2019, 08:51 AM   #3285
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 80,151
Karma: 148951761
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Quote:
Originally Posted by DrLimu View Post
I just tried it and it's based on chapters. Although in this book the split between books isn't perfect, the title pages of book 2 & 3 are placed at the end of the previous chapter, but that's easy to fix using the split function in Calibre.

Calibre creates a stylesheet, it's not pretty, but better than before. I managed to remove 80 of the 114 classes by a simple regular expression (search and replace -aw-bookmark-end: _Toc[0-9]+) and the "Remove unused CSS"-function.
What I would do is split/join as needed. Then I would proceed to clean up the CSS. I would rename all the HTML files so you know what they are by the name. I would also losslessly compress any graphics. If you have multiple CSS, I would merge them into one. I would delete any HTML ToC and any other HTML file you don't need such as a review page, or list of other books the author has written. eBooks have an advertising page that uses embedded fonts that you can dump as well as the fonts. And there are other HTML that can go.

It sounds a lot harder then it is, but it's not all that hard to do. For example, some of the cleaning up of the HTML can be done easily with search/replace.
JSWolf is offline   Reply With Quote