Thread: OCR engine
04-08-2014, 10:53 PM   #48
cadele
Addict
Posts: 372
Karma: 3710372
Join Date: Feb 2010
Device: Kindles, Sony 650
Quote:
Originally Posted by Hamlet53 View Post
The scanning and OCR process to produce a text file is fast. I can get that done for a ~400 page book in less than an hour. It's the proofing that takes me time. Then I want everything to match the original, even quotation marks and apostrophes.
I am the same. I like the book to be exactly like the print version.

Now that I have ABBYY to do the OCR, the proofing has been cut down enormously, but it still takes ages. I make a special point of not calculating how many hours it takes me.

What I really need (after a good duplex scanner) is a regex cheat sheet to cut down the proofing. Unfortunately I struggle with that: my mind is Teflon when it comes to regex.
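
For illustration only, a cheat sheet of that kind might start out like the Python sketch below. The patterns, and the names OCR_FIXES and apply_fixes, are assumptions made up for this example, not something taken from this thread or from any OCR package, and each rule can misfire (the dehyphenation rule would also join a genuinely hyphenated word such as "well-\nknown"), so the output still needs a human check.

```python
import re

# Hypothetical cheat sheet: (description, compiled pattern, replacement).
# Illustrative guesses at common OCR slips, not a vetted list.
OCR_FIXES = [
    # Rejoin a word split by a hyphen at a line break: "proof-\ning" -> "proofing"
    ("dehyphenate", re.compile(r"(\w)-\n(\w)"), r"\1\2"),
    # Collapse runs of spaces or tabs into a single space
    ("collapse spaces", re.compile(r"[ \t]{2,}"), " "),
    # Drop stray spaces before punctuation: "ages ," -> "ages,"
    ("space before punctuation", re.compile(r" +([,.;:!?])"), r"\1"),
    # Straight apostrophe between two word characters -> typographic apostrophe
    ("apostrophe", re.compile(r"(?<=\w)'(?=\w)"), "\u2019"),
]


def apply_fixes(text):
    """Apply every rule in OCR_FIXES to text, in list order."""
    for _description, pattern, replacement in OCR_FIXES:
        text = pattern.sub(replacement, text)
    return text


if __name__ == "__main__":
    sample = "The proof-\ning  took ages , didn't it ?"
    print(apply_fixes(sample))
```

Keeping the rules in one list means a new pattern can be added as a single line, and running them in a fixed order makes the result repeatable from one book to the next.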