View Single Post
Old 05-26-2016, 06:50 AM   #12
pdurrant
The Grand Mouse 高貴的老鼠
pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.
 
pdurrant's Avatar
 
Posts: 74,097
Karma: 315558332
Join Date: Jul 2007
Location: Norfolk, England
Device: Kindle Oasis
Quote:
Originally Posted by jodad View Post
Ok, just to confirm, my understanding is that if the PDF is formatted as a bunch of images, when I open the file on an e reader on my computer, I wouldn’t be able to select the text itself. Am I correct here?
If this is the case then no, my PDF is not formatted as an image.

By the way, thanks for your comments so far guys!
At 6,000 pages, 45MB is less than 8KB per page, so I expect that your PDF is a proper text-only PDF, not (as I'd previously thought) images.

In which case conversion is going to be a lot easier. I suspect that you'll get best results with a converter that knows to trim headers/footers/page numbers from each page. I hope someone with more direct knowledge of PDF conversion will be along shortly to suggest programs.

But it's unlikely that even a very good program will always get paragraphs right. And if there's any complicated formatting (e.g. poetry or (worse) equations or chemical formulae) probably no tool will work very well.

PDF, alas, was always intended as a 'write only' format. Converting from PDF to anything else is error prone and time consuming. Good luck!
pdurrant is offline   Reply With Quote