View Single Post
Old 10-07-2007, 11:56 AM   #58
ereszet
Zealot
ereszet has a complete set of Star Wars action figures.ereszet has a complete set of Star Wars action figures.ereszet has a complete set of Star Wars action figures.ereszet has a complete set of Star Wars action figures.
 
ereszet's Avatar
 
Posts: 118
Karma: 306
Join Date: Sep 2007
Device: Sony PRS-500 Archos 704 wifi
Quote:
Originally Posted by user View Post
this is very interesting, but I cant imagine what kind of logic it has, since manipulating image will make OCR easier

perhaps, ABBY recommends this because finereader already does the manipulation by its own? but in this case, I can't see how pre-OCR manipulation can harm

as for these photos:
http://www.mobileread.com/forums/sho...2&postcount=28
what improved the second/right image and made it perfectly OCR-able?
Yes, finereader does in batch: rotating, correcting resolution, straightening text lines (poor algorithm), despeckling, and of course recognition of blocks and text. On input it allows to detect image orientation, split dual pages and convert to black and white.

As for preprocessing they especially advice not to fatten or otherwise manipulate the pixels around the text.

The last question I already answered in the previous post.
ereszet is offline   Reply With Quote