Thread: OCR engine
View Single Post
Old 04-03-2014, 06:49 PM   #30
markom
Banned
markom ought to be getting tired of karma fortunes by now.markom ought to be getting tired of karma fortunes by now.markom ought to be getting tired of karma fortunes by now.markom ought to be getting tired of karma fortunes by now.markom ought to be getting tired of karma fortunes by now.markom ought to be getting tired of karma fortunes by now.markom ought to be getting tired of karma fortunes by now.markom ought to be getting tired of karma fortunes by now.markom ought to be getting tired of karma fortunes by now.markom ought to be getting tired of karma fortunes by now.markom ought to be getting tired of karma fortunes by now.
 
Posts: 488
Karma: 1080260
Join Date: Sep 2012
Device: sony prs t1 kindle dx ipad
Quote:
Originally Posted by Hamlet53 View Post
I made my first attempt at digitizing a book using a flatbed scanner. I really can't imagine anyone doing it that way. Especially if one attempts to do it while leaving the book intact. Trying to get a scan that will yield even reasonably accurate result from the OCR process while having to press down on the book during the scan and at the same time being sure that it is correctly aligned, how anyone could manage this at any reasonable production rate is beyond me.
I scan at 3-4 passes/minute at 300 dpi grayscale or color(6-8 pages/minute if book fits on the glass double sided ) on Canon 9000 F without any problems, listening to the music even watching films if book is of A5 or smaller format.

I usually scan one or two books per month and was able to scan at that rate (6-8 pages/min) after just a couple of books scanned.

For correct alignment I completely rely on scanner's raised edges pushing(sliding) the book automatically as far as it goes already knowing where approx. the center of the book (spine) should meet the raised edge, because I put some adhesive tape there to mark the place.

I never press down spine too hard, never would lower the scanner's lid (it's always up), usually scanning in the dark room (light coming from computer screen), always manually clicking the mouse to scan a current page (with mouse pointer centered on the scan button), lifting the book and flipping for a next page the moment ccd mechanism starts coming back after finished scanning, so that by the time returning ccd mechanism stops my book is usually already fixed for scanning.

After every 20-30 pages I would automatically use some soft cloth (usually some T-shirt at hand ) to quickly clean the glass from possible hairs, dust particles etc.

I don't care much about OCR precision though, because I always use pdf with OCR layer in the background (exact image in Abbyy or clearscan in Acrobat).


There are also affordable contactless scanners(document cameras), for those who would like to save their books from cutting for automatic document feeders.

https://www.mobileread.com/forums/sho...hlight=scanner

Last edited by markom; 04-03-2014 at 09:04 PM.
markom is offline   Reply With Quote