View Single Post
Old 09-14-2014, 01:13 AM   #2
eschwartz
Ex-Helpdesk Junkie
eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.
 
eschwartz's Avatar
 
Posts: 19,421
Karma: 85400180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
It will require a bit of work:

Sticky: Read this before Posting PDF Questions
Quote:
There are page numbers, headers, or footers in my output
You need to use Calibre's Search and Replace feature when converting from pdf in order to remove any text you don't want. These require the use of a search syntax called regular expressions. If you are intimidated by regular expressions, many Windows users have reported that Mobipocket creator is a good alternative to use to do the initial pdf conversion. Use Mobipocket Creator to convert the pdf to the .mobi format, and then use Calibre to convert from mobi to your final desired format.

I cropped the headers/footers from my pdf with another tool, but Calibre still converts them
Most pdf cropping utilities only change the visible page boundaries of the pdf, they don't actually eliminate the text data.

You need to find a utility which both crops AND deletes hidden text. Very few tools do this - Adobe Acrobat has an option to 'remove hidden text' while optimizing pdfs which can facilitate this. The alternative is to use Calibre's search and replace function to delete the headers, or use Sigil after conversion to epub.
eschwartz is offline   Reply With Quote