Thanks for you input.
By running a few simple tests found that the OCR output is affected by the higher quality of details. BMP having the greater details, picks up everything. This includes marks, smudges, text which has been tipexed out. As these are typewritten the letters are not printed sharply, so rn would be shown as m or l shown as "i etc. Hence more errors.
|