Quote:
Originally Posted by ahi
How is ABBYY (What the hell kind of name is that?!) for OCR-ing older books filled with long s characters and other such delights?
|
ABBYY is the company name, the OCR tool is called FineReader.
It has a pattern training tool ("user patterns") which can be quite effective. There is also a
special version for old texts:
Quote:
On top of FineReader's basic OCR functions, FineReader XIX is capable of reading old texts that feature elaborate type prints. This includes text with ornamental curls that break the continuous line of the word and roman type characters no longer in use such as the elongated “s” used in early English or French text. FineReader XIX support for Fraktur includes:
Languages:
German, English, French, Italian, and Spanish
Fonts:
Fraktur, Schwabacher, and a majority of Textura (Gothic) fonts
|