Old 01-26-2012, 01:29 PM   #2
DSpider
Evangelist
Posts: 450
Karma: 343115
Join Date: Nov 2009
Location: Romania
Device: PW2 2014
You're doing it wrong! Don't scan straight to PDF.

Try to get as much raw data out of the scanner as possible, meaning just the page images. Processing comes later. Sure, you can get away with JPG straight from the scanner if, say, you're only interested in OCR (extracting the text), for instance with ABBYY FineReader, and you proofread the result either in FineReader or side by side with the physical book. But don't use JPG if you're aiming for Scan Tailor processing: most scanners default to something like 85% quality for JPG compression, which can introduce unnecessary grain or fuzzy text.

If the book uses images sparingly, I usually just scan everything as uncompressed TIFF. I don't bother switching to JPG for the text-only pages to save a few megabytes, although sometimes it helps if you're not planning to process the scans right away: it leaves more room for other projects.


Edit: Oh, I forgot to mention that Scan Tailor finished very quickly because there was a lot less data to work with. The images were either down-sampled by the scanner's software during the PDF packaging (to save space), or IrfanView defaulted to 96 DPI (look for a checkbox or an option there).
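
To put a rough number on "a lot less data", here is a back-of-the-envelope sketch in Python. The 6 x 9 inch page, the 300 DPI comparison value and 24-bit color are my own assumptions for illustration, not settings from this thread; only the 96 DPI figure comes from the post above. It counts raw pixel data only and ignores file headers and compression.

```python
def raw_scan_bytes(width_in, height_in, dpi, bytes_per_pixel=3):
    """Uncompressed pixel data for one scanned page, ignoring file headers."""
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    return width_px * height_px * bytes_per_pixel

# Hypothetical page size in inches (assumption, pick your book's real size).
PAGE_W_IN, PAGE_H_IN = 6, 9

at_300 = raw_scan_bytes(PAGE_W_IN, PAGE_H_IN, 300)  # assumed "full" scan resolution
at_96 = raw_scan_bytes(PAGE_W_IN, PAGE_H_IN, 96)    # the 96 DPI mentioned above

print(f"300 DPI: {at_300 / 1e6:.1f} MB per page")   # 300 DPI: 14.6 MB per page
print(f" 96 DPI: {at_96 / 1e6:.1f} MB per page")    #  96 DPI: 1.5 MB per page
print(f"ratio:   {at_300 / at_96:.1f}x")            # ratio:   9.8x
```

The pixel count scales with the square of the DPI, so under these assumptions a 96 DPI page carries roughly a tenth of the data of a 300 DPI one. That is why the processing flies, and also why the output has so little detail to work with.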

Last edited by DSpider; 01-26-2012 at 01:45 PM.