Thread: OCR engine
03-20-2014, 10:31 PM   #9
AJ Starr
Guru
Posts: 815
Karma: 1029784
Join Date: May 2008
Location: Nebraska, USA
Device: PEZ, Color Libre, 2@Sony T1, Onyx i62HD
Quote:
Originally Posted by rkomar
Maybe you're just guesstimating the accuracy, but 95% is not good. 95% for characters is terrible, and 95% for words is marginally acceptable. A typical printed page has something like 50 characters per line and 40 lines per page, so about 2000 characters per page. A 95% success rate per character would result in about 100 bad characters per page. A 95% success rate per word would bring that down to about 20 or 25 bad words per page. Even 99% accuracy produces more errors than most people like. You'd have to get to about 99.9% accuracy before you could think about not proofing the text afterwards.
Yes, I was guesstimating, based on an entire novel. I scan 1960s-era paperbacks that are yellowed and abused. (Though I took very good care of my PBs.)

I often got "1" instead of "I" or "l"; "m" instead of "rn" (an r followed by an n); and odd hard returns on the last line of a paragraph instead of soft returns. So for an entire novel, 95% or better is more than acceptable to me.
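For anyone who wants a number rather than a guesstimate, here is a rough sketch of how character accuracy could be measured by comparing a proofread passage against the raw OCR output. The function names and the sample sentence are made up for illustration; this is plain edit distance, not what any particular OCR package reports.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions, or
    substitutions needed to turn string a into string b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute (or match)
        prev = curr
    return prev[-1]


def char_accuracy(reference: str, ocr_text: str) -> float:
    """Fraction of reference characters the OCR got right, estimated as
    1 - (edit distance / reference length)."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return 1.0 - levenshtein(reference, ocr_text) / len(reference)


# Hypothetical sample showing the "1 for I" and "m for rn" confusions.
reference = "I turned the corner"
ocr_text = "1 tumed the comer"
print(levenshtein(reference, ocr_text))
print(f"{char_accuracy(reference, ocr_text):.1%}")
```

Note that an "rn" read as "m" counts as two character errors here (one substitution plus one deletion), so a handful of those confusions drags the character score down faster than it looks on the page.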

(My epubs come out great!)
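For reference, the per-page arithmetic in the quoted post can be sketched as below. The 50 characters per line and 40 lines per page come from the quote; the 5 characters per word (including the trailing space) is my own assumption, picked because it lands in the quoted "20 or 25 bad words" range.

```python
CHARS_PER_LINE = 50   # from the quoted post
LINES_PER_PAGE = 40   # from the quoted post
CHARS_PER_WORD = 5    # assumption: average word length including the space


def bad_units_per_page(accuracy: float, units_per_page: float) -> float:
    """Expected number of wrong units (characters or words) on one page."""
    return (1.0 - accuracy) * units_per_page


chars_per_page = CHARS_PER_LINE * LINES_PER_PAGE   # 2000
words_per_page = chars_per_page / CHARS_PER_WORD   # 400.0

for accuracy in (0.95, 0.99, 0.999):
    bad_chars = bad_units_per_page(accuracy, chars_per_page)
    bad_words = bad_units_per_page(accuracy, words_per_page)
    print(f"{accuracy:.1%}: about {bad_chars:.0f} bad characters "
          f"or {bad_words:.1f} bad words per page")
```

At 99.9% per character this works out to roughly 2 bad characters per page, which is where the quoted post says proofing might become optional.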