09-24-2007, 05:29 PM   #191
ereszet
Zealot
Posts: 118
Karma: 306
Join Date: Sep 2007
Device: Sony PRS-500 Archos 704 wifi
whitespace and preprocessing

Quote:
Originally Posted by cacapee
Fixing white space marks can be done by using the Trim% feature. Click preview and move the left, top, bottom, right markers so that you crop out what you don't need. You don't have to be very accurate since pdflrf's whitespace removal takes care of the rest.
A poor-quality original JPG is attached. Pdflrf can do little to improve it unless the black margins, which differ from page to page, are trimmed manually; see the attached LRF file.
After preprocessing with ClearImage (demo version) and FineReader 8 (commercial), the JPG is much improved, although some black lines remain at the margins (more aggressive preprocessing would remove parts of the text as well). Pdflrf can then be used more effectively.
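
To make the margin problem concrete, here is a rough Python sketch of automatic dark-margin trimming. It is only an illustration of the idea, not what ClearImage, FineReader or pdflrf actually do. The function names, the `dark_threshold` and `max_dark_fraction` parameters and the tiny test page are all my own assumptions, and the image is treated as a plain list of grayscale rows (0 = black, 255 = white) so that no imaging library is needed.

```python
def trim_dark_margins(pixels, dark_threshold=64, max_dark_fraction=0.5):
    """Return half-open bounds (top, bottom, left, right) of the area that
    remains after dropping border rows and columns that are mostly dark.

    pixels: non-empty list of equal-length rows of grayscale values.
    Both parameters are hypothetical tuning knobs, not taken from any tool.
    """
    def mostly_dark(values):
        dark = sum(1 for v in values if v < dark_threshold)
        return dark > max_dark_fraction * len(values)

    height, width = len(pixels), len(pixels[0])
    top, bottom, left, right = 0, height, 0, width

    # Peel dark rows off the top and the bottom.
    while top < bottom and mostly_dark(pixels[top]):
        top += 1
    while bottom > top and mostly_dark(pixels[bottom - 1]):
        bottom -= 1

    # Judge columns only on the rows that survived the step above.
    def column(x):
        return [pixels[y][x] for y in range(top, bottom)]

    while left < right and mostly_dark(column(left)):
        left += 1
    while right > left and mostly_dark(column(right - 1)):
        right -= 1
    return top, bottom, left, right


def crop(pixels, bounds):
    """Cut the image down to the given half-open bounds."""
    top, bottom, left, right = bounds
    return [row[left:right] for row in pixels[top:bottom]]


# Tiny made-up page: black strip on top, black band on the left,
# one dark "text" pixel inside the white area.
page = [
    [0, 0,   0,   0,   0,   0],
    [0, 0, 255, 255, 255, 255],
    [0, 0, 255,  10, 255, 255],
    [0, 0, 255, 255, 255, 255],
]
bounds = trim_dark_margins(page)
trimmed = crop(page, bounds)
```

The dark pixel inside the page survives because a row or column is dropped only when more than half of it is dark. The same rule shows why this is hard on real scans: raise the sensitivity enough to catch thin black lines at the edge and dense text lines start to look like margin too.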

BTW, the image is a photo scan of a 1914 Russian calendar. I could take a better photo by zooming in on individual pages rather than double-page spreads, putting white paper behind each page to avoid show-through, etc., but the photo in this example is just meant to make my point about the need for preprocessing.
Attached Thumbnails
russian original.jpg (204.8 KB, 638 views, ID 5805)
russian preprocessed.jpg (103.6 KB, 662 views, ID 5806)
Attached Files
File Type: lrf russian.lrf (39.2 KB, 419 views)
File Type: lrf russian preprocessed.lrf (35.5 KB, 424 views)