View Single Post
Old 03-08-2011, 04:08 PM   #2
pholy
Booklegger
pholy ought to be getting tired of karma fortunes by now.pholy ought to be getting tired of karma fortunes by now.pholy ought to be getting tired of karma fortunes by now.pholy ought to be getting tired of karma fortunes by now.pholy ought to be getting tired of karma fortunes by now.pholy ought to be getting tired of karma fortunes by now.pholy ought to be getting tired of karma fortunes by now.pholy ought to be getting tired of karma fortunes by now.pholy ought to be getting tired of karma fortunes by now.pholy ought to be getting tired of karma fortunes by now.pholy ought to be getting tired of karma fortunes by now.
 
pholy's Avatar
 
Posts: 1,801
Karma: 7999816
Join Date: Jun 2009
Location: Toronto, Ontario, Canada
Device: BeBook(1 & 2010), PEZ, PRS-505, Kobo BT, PRS-T1, Playbook, Kobo Touch
My Opticbook3600 software saves page images in png format. I think that or tiff is recommended for OCR purposes, as jpg is a lossy format that doesn't do well with sharp edges, if I recall correctly. It takes me about an hour to scan a 250 page book, and less than five more minutes to do the OCR. I save both so that when I come back and proof the OCR text I can look at the page image to figure out the bad conversions. I've found that ABBYY cheap edition works best with grey-scale images, not b/w - I've never felt the need to make any manual adjustments to the scanned images.
pholy is offline   Reply With Quote