Over the past few months I have been digitizing many of my old books. To remove the bindings I use a setup similar to the one the fellow in the attached video uses; it yields uniformly sized, smoothly cut pages. For those who do not have a professional service to do it for them and who do not want to cut a few pages at a time, this works well if you have the equipment. I also purchased an auto-feed scanner that came with very good OCR software; that set me back only about $250. I can scan and OCR about 10 pages a minute. Proofing is definitely required to catch missed text, fix character-recognition errors (interspersed italics are a particular problem), and get all the paragraph breaks correct. I can proof about 20-40 pages an hour, depending on how much text is on a page. I find that the larger the font, the more accurate the OCR.
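
For anyone planning a similar project, here is a rough back-of-the-envelope sketch in Python of what those rates imply for a whole book. The function name and the 300-page example are only my illustration; the rates (10 pages a minute for scanning, 20-40 pages an hour for proofing) are the figures I quoted, and your own numbers will differ.

```python
def estimate_time(pages, scan_ppm=10, proof_pph_slow=20, proof_pph_fast=40):
    """Rough time estimate for digitizing one book.

    Default rates are the ones quoted in the post; the function itself
    is a hypothetical helper, not part of any scanner or OCR software.
    Returns (scan_minutes, proof_hours_best, proof_hours_worst).
    """
    scan_minutes = pages / scan_ppm              # scanning + OCR, in minutes
    proof_hours_best = pages / proof_pph_fast    # sparse pages, fast proofing
    proof_hours_worst = pages / proof_pph_slow   # dense pages, slow proofing
    return scan_minutes, proof_hours_best, proof_hours_worst


# Example: a hypothetical 300-page book.
scan_min, best, worst = estimate_time(300)
print(f"Scanning: {scan_min:.0f} min; proofing: {best:.1f} to {worst:.1f} hours")
# prints "Scanning: 30 min; proofing: 7.5 to 15.0 hours"
```

In other words, at these rates the scanning is the quick part; the proofing is where nearly all of the time goes.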