View Single Post
Old 10-13-2009, 04:58 PM   #88
igorsk
Wizard
igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.
 
Posts: 3,442
Karma: 300001
Join Date: Sep 2006
Location: Belgium
Device: PRS-500/505/700, Kindle, Cybook Gen3, Words Gear
Quote:
Originally Posted by Daithi View Post
Actually, I think my biggest problem was not the OCR conversion, but was poor scanning that resulted in poor OCR conversion. I was using a flatbed scanner and anywhere that the page wasn't laying flat resulted in tons of OCR errors.

In my case, I'm scanning old books that I don't want to destroy. This means cutting off the binding and using a flatbed scanner is out of the question. I don't even want to press real hard on the book to get it to lie flat, because I'm afraid I will break the binding and have my pages falling out.
There are two kinds of flatbed scanners.
Contact Image Sensor (CIS) ones are usually cheaper since they mount sensors directly on the scan head. That gets rid of some optics but results in what you describe: anything that's more than two millimeters away from the glass is basically not registered.
Charge-Coupled Device (CCD) scanners use some optics to direct the image from the head to the fixed sensor and thus can pick up pretty much anything above the glass.
The latest versions of FineReader have some sophisticated algorithms to straighten the lines of two-page book scans, so if you can get a scanner to at least register the part close to binding, it should do a fair job.
igorsk is offline   Reply With Quote