Quote:
Originally Posted by pdurrant
At 6,000 pages, 45MB is less than 8KB per page, so I expect that your PDF is a proper text-only PDF, not (as I'd previously thought) images.
In which case conversion is going to be a lot easier. I suspect that you'll get best results with a converter that knows to trim headers/footers/page numbers from each page. I hope someone with more direct knowledge of PDF conversion will be along shortly to suggest programs.
But it's unlikely that even a very good program will always get paragraphs right. And if there's any complicated formatting (e.g. poetry or, worse, equations or chemical formulae), probably no tool will work very well.
PDF, alas, was always intended as a 'write only' format. Converting from PDF to anything else is error-prone and time-consuming. Good luck!
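For anyone wondering what "trimming headers/footers/page numbers" could look like in practice, here is a rough sketch of one possible heuristic, not what any particular converter actually does. It assumes the text of each page has already been extracted by some other tool into one string per page. The function name `trim_headers_footers` and the `min_share` cutoff are made up for illustration.

```python
import re
from collections import Counter


def normalize(line):
    # Replace digit runs so "Page 12" and "Page 13" compare as the same line.
    return re.sub(r"\d+", "#", line.strip())


def trim_headers_footers(pages, min_share=0.6):
    """Drop first/last lines that repeat on most pages (hypothetical helper).

    pages: list of strings, one per page, already extracted from the PDF.
    min_share: assumed cutoff; a line counts as a header or footer if it
               opens (or closes) at least this share of the pages.
    Note: blank lines inside a page are discarded by this sketch.
    """
    split = [[ln for ln in p.splitlines() if ln.strip()] for p in pages]
    first = Counter(normalize(ls[0]) for ls in split if ls)
    last = Counter(normalize(ls[-1]) for ls in split if ls)
    threshold = min_share * len(pages)
    headers = {k for k, n in first.items() if n >= threshold}
    footers = {k for k, n in last.items() if n >= threshold}

    trimmed = []
    for ls in split:
        if ls and normalize(ls[0]) in headers:
            ls = ls[1:]
        if ls and normalize(ls[-1]) in footers:
            ls = ls[:-1]
        trimmed.append("\n".join(ls))
    return trimmed


if __name__ == "__main__":
    sample = [
        "My Book\nFirst paragraph.\nPage 1",
        "My Book\nSecond paragraph.\nPage 2",
        "My Book\nThird paragraph.\nPage 3",
    ]
    for page in trim_headers_footers(sample):
        print(page)
```

This only looks at the very first and last non-blank line of each page, so multi-line headers, or headers that alternate between odd and even pages, would need something smarter. It also does nothing about rejoining paragraphs that are split across pages.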
Thanks for your comments, pdurrant. I will try Harry's OCR suggestion.
If anyone else has any other ideas, please feel free to chime in!
Thanks guys!