MobileRead Forums - View Single Post

heychadwick · 01-03-2011, 05:20 PM

Converting to HTML....hmmm.......that sounds good.

I went to the other link about .pdf and OCR. It's very thorough.

If I only want to get the OCR'ed text out of the document, is Cuneiform enough? I figure I can always get any images that I need in there somehow anyways (though maybe going to html is better).

Tessaract not having a GUI turns me off from it. I'm not a line junkie.

Google docs (and other web based OCR sites) have a size limit. I've got some pretty hefty .pdf's.

OK....will now look at converting to .HTML....

thanks!

01-03-2011, 05:20 PM	#8
heychadwick Member Posts: 20 Karma: 386 Join Date: Dec 2010 Device: PRS-300	Converting to HTML....hmmm.......that sounds good. I went to the other link about .pdf and OCR. It's very thorough. If I only want to get the OCR'ed text out of the document, is Cuneiform enough? I figure I can always get any images that I need in there somehow anyways (though maybe going to html is better). Tessaract not having a GUI turns me off from it. I'm not a line junkie. Google docs (and other web based OCR sites) have a size limit. I've got some pretty hefty .pdf's. OK....will now look at converting to .HTML.... thanks!