View Single Post
Old 10-13-2009, 12:10 PM   #80
igorsk
Wizard
igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.
 
Posts: 3,442
Karma: 300001
Join Date: Sep 2006
Location: Belgium
Device: PRS-500/505/700, Kindle, Cybook Gen3, Words Gear
Quote:
Originally Posted by ahi View Post
How is ABBYY (What the hell kind of name is that?!) for OCR-ing older books filled with long s characters and other such delights?
ABBYY is the company name, the OCR tool is called FineReader.
It has a pattern training tool ("user patterns") which can be quite effective. There is also a special version for old texts:
Quote:
On top of FineReader's basic OCR functions, FineReader XIX is capable of reading old texts that feature elaborate type prints. This includes text with ornamental curls that break the continuous line of the word and roman type characters no longer in use such as the elongated “s” used in early English or French text. FineReader XIX support for Fraktur includes:
Languages:
German, English, French, Italian, and Spanish
Fonts:
Fraktur, Schwabacher, and a majority of Textura (Gothic) fonts
igorsk is offline   Reply With Quote