The book I'm playing with converts just fine to EPUB.
The only thing I'd need to get a near perfect EPUB is removing headers and footers… which is the perfect occasion to dig and understand why people bother with regex in the HTML at all if you can just remove the data from the source PDF before running Calibre.
There's got to be a way to either remove everything that's outside the mediabox, or mark some sections as redaction annotions and remove them all.
|