Converting to HTML... hmm... that sounds good.
I followed the other link about PDFs and OCR. It's very thorough.
If I only want to get the OCR'ed text out of the document, is Cuneiform enough? I figure I can always get any images I need in there somehow anyway (though maybe going to HTML is better).
Tesseract not having a GUI turns me off from it. I'm not a command-line junkie.
Google Docs (and other web-based OCR sites) have a size limit. I've got some pretty hefty PDFs.
OK... I'll look at converting to HTML now.
Thanks!