MobileRead Forums - View Single Post

cadele · 04-08-2014, 11:53 PM

Quote:

Originally Posted by Hamlet53

The scanning and OCR process to produce a text file is fast. I can get that done for a ~400 page book in less than an hour. It's the proofing that takes me time. Then I want everything to match the original, even quotation marks and apostrophes.

I am the same. I like the book to be exactly as the print version.

Now that I have Abbyy to do the OCR it has cut down enormously on the proofing, but it still takes ages. I make a special point not to calculate how many hours this takes me.

What I really need (after a good duplex scanner) is a cheat sheet of regex to cut down the proofing. Unfortunately I struggle with that - my mind is Teflon when it comes to regex