View Single Post
Old 07-17-2013, 01:28 PM   #14
tuxor
Addict
tuxor has a thesaurus and is not afraid to use it!tuxor has a thesaurus and is not afraid to use it!tuxor has a thesaurus and is not afraid to use it!tuxor has a thesaurus and is not afraid to use it!tuxor has a thesaurus and is not afraid to use it!tuxor has a thesaurus and is not afraid to use it!tuxor has a thesaurus and is not afraid to use it!tuxor has a thesaurus and is not afraid to use it!tuxor has a thesaurus and is not afraid to use it!tuxor has a thesaurus and is not afraid to use it!tuxor has a thesaurus and is not afraid to use it!
 
Posts: 320
Karma: 99999
Join Date: Oct 2011
Location: Germany
Device: Onyx Boox M92, Icarus Illumina E653
Double page scans can be separated e.g. using Scan Tailor: http://scantailor.sourceforge.net/ (you don't get a pdf in the end, but a whole bunch of tiff images - converting those to a pdf is really easy though)

You can OCR your books using google's tesseract-ocr: http://code.google.com/p/tesseract-ocr/ I use gscan2pdf whenever I want to OCR something using tesseract. But I guess, you are on Windows, so that's not an option. Unfortunately, I don't use Windows, so I have no idea how to use tesseract on a Windows machine to get decent results. (But maybe this works for you: http://www.paperfile.net/ - found via google...)

Last edited by tuxor; 07-17-2013 at 01:30 PM.
tuxor is offline   Reply With Quote