View Single Post
Old 12-28-2009, 05:32 PM   #116
calvin-c
Guru
calvin-c ought to be getting tired of karma fortunes by now.calvin-c ought to be getting tired of karma fortunes by now.calvin-c ought to be getting tired of karma fortunes by now.calvin-c ought to be getting tired of karma fortunes by now.calvin-c ought to be getting tired of karma fortunes by now.calvin-c ought to be getting tired of karma fortunes by now.calvin-c ought to be getting tired of karma fortunes by now.calvin-c ought to be getting tired of karma fortunes by now.calvin-c ought to be getting tired of karma fortunes by now.calvin-c ought to be getting tired of karma fortunes by now.calvin-c ought to be getting tired of karma fortunes by now.
 
Posts: 787
Karma: 1575310
Join Date: Jul 2009
Device: Moon+ Pro
Quote:
Originally Posted by ascherjim View Post
In all my ebook scanning experimentation, using Word, WordPerfect and other editing formats and means, I never encountered "text boxes." One thing you might do is to download from the ABBYY site the free trial version of FineReader Pro 10 and see how well that works for you. It is limited in what you can do with it as a trial version (vis a vis the paid-for version) but it should at least resolve your uncertainty regarding the text boxes. Good luck.
I have. IIRC it occurred on pages with mixed text & images. Don't remember what software I was using (it was at least 4 years ago) but I seem to recall disabling 'regions' to get around that. Of course then it didn't import the images (the whole purpose of the regions was to define which areas of the document contained text, and which contained images) but in that case all that was wanted was the text anyway. IIRC we had to do quite a bit of cleanup on the text, probably because without the regions it was trying to OCR the images into the middle of the text.

I think. All I really remember for sure is that the text came out in text boxes that I was able to get rid of by fiddling with the settings, and that the result (post-fiddling) still required a lot of work.
calvin-c is offline   Reply With Quote