epubs are much easier to work with. I bought Acrobat full version a couple months ago so I could edit headers and footers and page numbers out of pdf formats. Little did I know that it's so difficult and confusing to use and doesn't handle most of my h, f, pn problems anyway, though it does have intriguing batch functions. It's much easier for me to convert pdf to epub, see what's wrong with it, tag appropriately, convert to rtf, mess with search/replace in Word, save as docx to get rid of a lot of extraneous MS RTF format garbage, which usually reduces size a lot, then run the docx through open office into odt format to further clean up MS garbage and reduce size again, and add back in to calibre. That sequence sounds like a lot, but it works really well for me. Eventually I'll know enough regex to handle stripping h, f, pn 's directly from calibre search/replace - but until I do, the process I described works remarkably well.
|