I am trying to convert a PDF file which has the title and page number information in the header and footer. These are being converted when I use pdftohtml and although I can crop the PDF in pdftohtml the text is to small.
Is there any way to find and replace a chunk of text several lines long? In Word as the header contains the books Title it goes and removes those words from within the text as well. For the footer I suppose it would have to support wildcards for the page numbers.
Its driving me nuts and I reckon there must be a nice simple solution!