View Single Post
Old 10-04-2014, 02:24 AM   #31
rkomar
Wizard
rkomar ought to be getting tired of karma fortunes by now.rkomar ought to be getting tired of karma fortunes by now.rkomar ought to be getting tired of karma fortunes by now.rkomar ought to be getting tired of karma fortunes by now.rkomar ought to be getting tired of karma fortunes by now.rkomar ought to be getting tired of karma fortunes by now.rkomar ought to be getting tired of karma fortunes by now.rkomar ought to be getting tired of karma fortunes by now.rkomar ought to be getting tired of karma fortunes by now.rkomar ought to be getting tired of karma fortunes by now.rkomar ought to be getting tired of karma fortunes by now.
 
Posts: 3,058
Karma: 18821071
Join Date: Oct 2010
Location: Sudbury, ON, Canada
Device: PRS-505, PB 902, PRS-T1, PB 623, PB 840, PB 633
Quote:
Originally Posted by adrenaline View Post
Thanks a ton, rkomar. You may seen this in the OP but just in case, here's a sample of the 600dpi scan:

https://www.dropbox.com/s/j18r16ed7t...0Page.pdf?dl=0

I feel that this book isn't too dense. Would love to hear your thoughts on this.

Thanks again.
That page should be fine at 600 dpi. You might have to clean it up with a denoising filter to remove the specks away from the text. I know that the open source OCR program "tesseract" would have a problem with those specks. Perhaps the commercial OCR programs aren't as badly affected by noise. If you do denoise the pages, be careful that the accents aren't removed, as well.
rkomar is offline   Reply With Quote