I'm starting work on a new PDF conversion engine for calibre that will hopefully handle header and footer extraction and multiple column extraction as well.
I'm asking for a few sample PDF files that I can use as a test corpus. I'd appreciate it if you could just extract a few pages with different typographical features and make a new PDF file with them.
Note that this new engine will not handle mathematics/tables/vector diagrams, etc. so don't provide samples for those.
Also this is a bit of a long term project, so don't expect results too quickly.