MobileRead Forums - View Single Post

Shohreh · 05-13-2023, 02:46 PM

The book I'm playing with converts just fine to EPUB.

The only thing I'd need for a near-perfect EPUB is to remove the headers and footers… which is the perfect occasion to dig in and understand why people bother with regexes on the HTML at all, if you can just remove that data from the source PDF before running Calibre.

There's got to be a way to either remove everything that's outside the mediabox, or mark some sections as redaction annotations and remove them all.
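For what it's worth, here is a rough sketch of the redaction idea, assuming the third-party PyMuPDF library (imported as `fitz`) and assuming the headers and footers sit in fixed-height bands at the top and bottom of every page. The names `margin_bands` and `strip_headers_footers` and the 50-point defaults are made up for illustration; the right band heights depend on the book, and I haven't tuned them against any particular PDF.

```python
from typing import NamedTuple, Tuple


class Box(NamedTuple):
    """A rectangle in points, origin at the top-left of the page (PyMuPDF's convention)."""
    x0: float
    y0: float
    x1: float
    y1: float


def margin_bands(page_width: float, page_height: float,
                 header: float = 50.0, footer: float = 50.0) -> Tuple[Box, Box]:
    """Return (header_box, footer_box) spanning the full page width.

    The band heights are assumptions: measure them on the actual book.
    """
    if header < 0 or footer < 0:
        raise ValueError("band heights must be non-negative")
    if header + footer > page_height:
        raise ValueError("bands are taller than the page")
    top = Box(0, 0, page_width, header)
    bottom = Box(0, page_height - footer, page_width, page_height)
    return top, bottom


def strip_headers_footers(src: str, dst: str,
                          header: float = 50.0, footer: float = 50.0) -> None:
    """Redact the top and bottom bands of every page and save a new PDF."""
    import fitz  # PyMuPDF, third-party; imported here so the helper above works without it

    doc = fitz.open(src)
    for page in doc:
        rect = page.rect  # visible page area, top-left at (0, 0)
        for box in margin_bands(rect.width, rect.height, header, footer):
            if box.y1 > box.y0:  # skip zero-height bands
                page.add_redact_annot(fitz.Rect(*box))
        # Actually deletes the text under the redaction annotations.
        page.apply_redactions()
    doc.save(dst)
    doc.close()
```

Something like `strip_headers_footers("book.pdf", "book_clean.pdf", header=40, footer=30)` would then produce the file to feed to Calibre (the file names are placeholders). Redaction removes the text itself rather than just hiding it, which is the point here, so run it on a copy.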
