Quote:
Originally Posted by joehunt
3. I buy the books to read, so my workflow is to convert the pdf to html in Abbyy Finereader and then to convert the html to epub using Sigil. I have a pretty good idea of what to look for now, so the whole process is not that tedious and time consuming. My accuracy rate is about 95%, which is sufficient for me since I do the conversion for my own use only (I'd rather spend the time reading instead of comparing every single character).
|
An OCR accuracy rate of 95% is an error in one character in 20, or about 1 in every 4 words, which is pretty appalling. Decent OCR should give you accuracy of about 99.9%, or 1 character in 1000, or about 1 in every 200 words (roughly 1 error per page).