Quote:
Originally Posted by Semwize
It's still sadder there ... Someone tried to convert these dictionaries using FineReader, which was not configured for Arabic. Of course, the result was a complete mess. And for some reason someone laid out this garbage (mobi and epub).
There need to convert and correct the text, which of course is very time-consuming and requires excellent knowledge of all these languages.
|
FineReader may fail pretty well both for the
arabic language and the
unusual page layout, even in the best lucky case, the stuff would need a massive revising.
Tesseract works better but the layout issue withstands