Quote:
Originally Posted by joedevon
Does this still work? I gmailed myself a PDF, it let's me "view" as html, but not save it...and it just converts a text based PDF into images on the actual html page...so it isn't really converting the text into html....
Any ideas?
|
Depends on how the PDF was made. PDFs made from normal text--like a Word doc--should convert back to full-text HTML. (I checked. It works.)
PDFs made from searchable images convert differently sometimes; I've had some of them convert fine, and some have every other page come out as an image, and some come across entirely as an image. (Saving out the text as *text* should still work; the images will be dropped and the searchable text comes through. But you lose all formatting, and are stuck with the OCR errors that the images hid.)
These kinds of problems are part of why the ebook community doesn't generally like PDFs, and often doesn't think of them as a "real" ebook format. There's no quick-and-simple way to tell how well a PDF will convert to another format, or how editable it is.