Quote:
Originally Posted by schmolch
That sounds like alot of work.
Scanning individual book-pages is a very tedious thing to do because you cant do anything between the scans and so you are forced to waste alot of time.
Then you want to ocr and proof-read every book, that sounds like even more work.
I personally dont care about the physical books, i just cut the whole thing and put it into the document feeder. I also dont bother about ocr and just leave it as picture and make a pdf out of it. The disadvantage is a bigger filesize and the troubles that come with PDF (on small readers) but it saves a ton of time.
Since you are looking for electronic versions of books you already own, it would probably be legal (at least using common sense) if you look for these books on the internet.
|
That's actually a good idea because ABBYY can scan a PDF into HTML/TXT/RTF etc. So if you have a sheetfeeder the above suggestion is sound. Whip it through the sheetfeeder, output a PDF, then run it through ABBYY at the end
Now if only I could find an OCR on 'buntu or mac that worked as well as ABBYY.
EDIT: Or find someone online whose sharing the book (a lot of what I'm scanning there's no chance of that) and download from them as above poster stated