Quote:
Originally Posted by dgatwood
Given that these are presumably generated with a standard Arabic font [...]
|
They were generated with a standard Arabic font, however, the font contains several (optional) ligatures that mimic traditional Arabic handwriting, which makes Arabic OCR of these particular images rather difficult.
For an example, have a look at the attached screenshot. It shows the same word in two different fonts (with and without ligatures).
To give you an idea of the difficulties, here's the same word split into the 5 letters that the OCR program would have to detect:
ﺟ ﺮ ـﻳ ـﻣ ﺔ