Thread: OCR engine
View Single Post
Old 04-04-2014, 05:20 PM   #33
Hamlet53
Nameless Being
 
Quote:
Originally Posted by rkomar View Post
The problem with scanning both pages at once occurs with books where the print gets too close to the gutter. The characters get distorted on the part of the page that pulls up from the platen, and can get hidden in the shadows in the gutter. Some books have this problem, and some don't. The closer the characters get to the gutter, the more you have to squash the book down. Cutting the book is often the only way to get good scans for extreme cases.

.
Yes, that's one of the problems I encountered when I tried to scan an intact book with a flatbed scanner. That and if the book orientation was even slightly off the OCR software I was using (not Abbyy FineReader that I am using now) would do a lousy job converting to text.

That led me to purchase the scanner I have now with automatic feed of sheets and double side scanning. I have encountered one problem though that maybe people here can offer help on. When the page includes page numbers and either chapter or book title this is incorporated into the text. Is there any way to prevent this? As it is I have to edit it out when making other corrections.
  Reply With Quote