View Single Post
Old 04-10-2009, 04:46 PM   #3
Steven Lyle Jordan
Grand Sorcerer
Steven Lyle Jordan ought to be getting tired of karma fortunes by now.Steven Lyle Jordan ought to be getting tired of karma fortunes by now.Steven Lyle Jordan ought to be getting tired of karma fortunes by now.Steven Lyle Jordan ought to be getting tired of karma fortunes by now.Steven Lyle Jordan ought to be getting tired of karma fortunes by now.Steven Lyle Jordan ought to be getting tired of karma fortunes by now.Steven Lyle Jordan ought to be getting tired of karma fortunes by now.Steven Lyle Jordan ought to be getting tired of karma fortunes by now.Steven Lyle Jordan ought to be getting tired of karma fortunes by now.Steven Lyle Jordan ought to be getting tired of karma fortunes by now.Steven Lyle Jordan ought to be getting tired of karma fortunes by now.
 
Steven Lyle Jordan's Avatar
 
Posts: 8,478
Karma: 5171130
Join Date: Jan 2006
Device: none
Guys, a method I've found to improve the quality of the OCR process involves photocopying of the book's pages first, expanding the page image up to letter/A4 size. Then scan those letter-sized pages into an OCR scanner. Many of the better OCR scanners can allow standard-sized paper to be fed into them and read at high-speed, removing the need to manually scan each page (though you'll still end up doing that at the earlier copier stage). And the expanded letters will be easier for the OCR program to read, resulting in fewer errors.

Personally, I feel the 2-step photocopy-scan process is worth the creation of scanned pages with fewer errors.

Occasionally, you luck out and discover that a particular error happens regularly, and you can fix it with a "find-and-replace all" process. But you should still go through every page manually.

Last edited by Steven Lyle Jordan; 04-10-2009 at 04:49 PM.
Steven Lyle Jordan is offline   Reply With Quote