Quote:
Originally Posted by charleski
You left out: stripping out page numbers/headers, spell-checking (even good OCR programs can still produce howlers), tagging chapters, removing errant paragraph marks and general clean-up.
ABBYY does a reasonable job of pouring the pages into raw text, but transforming that into a properly-formatted eBook that can be read without glaring errors every page or two still requires a lot of manual editing.
They claim that FR10 has improved automatic detection of page structure: chapter headings, page numbers, etc. A trial version is available now.