View Single Post
Old 07-06-2021, 07:11 AM   #11
rcentros
eReader Wrangler
rcentros ought to be getting tired of karma fortunes by now.rcentros ought to be getting tired of karma fortunes by now.rcentros ought to be getting tired of karma fortunes by now.rcentros ought to be getting tired of karma fortunes by now.rcentros ought to be getting tired of karma fortunes by now.rcentros ought to be getting tired of karma fortunes by now.rcentros ought to be getting tired of karma fortunes by now.rcentros ought to be getting tired of karma fortunes by now.rcentros ought to be getting tired of karma fortunes by now.rcentros ought to be getting tired of karma fortunes by now.rcentros ought to be getting tired of karma fortunes by now.
 
rcentros's Avatar
 
Posts: 7,894
Karma: 52566355
Join Date: Mar 2013
Location: Boise, ID
Device: PB HD3, GL3, Voyage, Clara HD
Quote:
Originally Posted by Sarmat89 View Post
An average book contains about 1500 italics fragments; adding them manually will take days. Also, without uncertain characters, orthography checking and interactive control there is no quality recognition possible.
I can't imaging what quality Tesseract produces...
Tesseract works well, especially when using it with gImageReader. As for italics, bold, etc., just mark up your text as you go. Then when you move your text into your word processor, search for the codes and make your changes. If you do it at a chapter a time it's not that big of a burden. Especially with novels, where's there's hardly any italics or bold fonts anyhow.

As for headers and footers, just exclude them when you choose your block of text. I'm guessing it's not as sophisticated as FineReader (which I've never seen) but it's still pretty good.
rcentros is offline   Reply With Quote