View Single Post
Old 10-07-2009, 06:16 AM   #11
igorsk
Wizard
igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.igorsk ought to be getting tired of karma fortunes by now.
 
Posts: 3,442
Karma: 300001
Join Date: Sep 2006
Location: Belgium
Device: PRS-500/505/700, Kindle, Cybook Gen3, Words Gear
Quote:
Originally Posted by charleski View Post
You left out: stripping out page numbers/headers, spell-checking (even good OCR programs can still produce howlers), tagging chapters, removing errant paragraph marks and general clean-up.

ABBYY does a reasonable job of pouring the pages into raw text, but transforming that into a properly-formatted eBook that can be read without glaring errors every page or two still requires a lot of manual editing.
They claim that FR10 has improved automatic detection of page structure: chapter headings, page numbers etc. Trial version is available now.
igorsk is offline   Reply With Quote