Old 12-28-2009, 08:00 AM   #106
kazbates
Wizard
Posts: 2,627
Karma: 406616
Join Date: Dec 2008
Location: Northern Virginia
Device: SurfacePro, SurfaceBook 2
I started a thread here in the Workshop forum before I found this thread.

I received a Canon DR-2510 scanner for Christmas that works very well but came with Omnipage's OCR software in a very limited version. The scan turns out great and very few errors turn up in the OCR process. The problem I'm encountering is that if I save the OCRed file in doc or rtf formats, the text is saved inside of textboxes which hinders additional editing within MS Word. My thought was to open the OCRed file in Word to ultimately save it as an html file (my limited version of Omnipage does not allow me to save it as html) as per HarryT's suggestion of format of choice. If I save the file as txt, all formatting is lost. I would prefer to not have to spend $200 to upgrade to Omnipage 16 if at all possible. Any suggestions?
Old 12-28-2009, 08:40 AM   #107
ardeegee
Maratus speciosus butt
Posts: 3,292
Karma: 1162698
Join Date: Sep 2009
Device: PRS-350
Quote:
Originally Posted by ahi View Post
How is ABBYY (What the hell kind of name is that?!) for OCR-ing older books filled with long s characters and other such delights?
Bit of an old post, but I enjoy the adolescent humor of Google Books' OCR's inability to correctly interpret the "s" in sucking pig in older documents.
Old 12-28-2009, 10:00 AM   #108
ascherjim
Addict
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
Quote:
Originally Posted by kazbates View Post
I started a thread here in the Workshop forum before I found this thread.

I received a Canon DR-2510 scanner for Christmas that works very well but came with Omnipage's OCR software in a very limited version. The scan turns out great and very few errors turn up in the OCR process. The problem I'm encountering is that if I save the OCRed file in doc or rtf formats, the text is saved inside of textboxes which hinders additional editing within MS Word. My thought was to open the OCRed file in Word to ultimately save it as an html file (my limited version of Omnipage does not allow me to save it as html) as per HarryT's suggestion of format of choice. If I save the file as txt, all formatting is lost. I would prefer to not have to spend $200 to upgrade to Omnipage 16 if at all possible. Any suggestions?
I believe another member in this thread tested them and preferred ABBYY FineReader 10 to the Omnipage 16 upgrade, on both performance and cost. There is (or was) a great sale offering the FineReader upgrade at half price. Didn't the disk that came with Omnipage also contain ABBYY Sprint 6, from which you could upgrade to Pro 10? Pro 10 handles Word-to-HTML conversion fine (although I myself stick with txt conversion and edit that).
Old 12-28-2009, 10:35 AM   #109
kazbates
Quote:
Originally Posted by ascherjim View Post
I believe another member in this thread tested them and preferred ABBYY FineReader 10 to the Omnipage 16 upgrade, on both performance and cost. There is (or was) a great sale offering the FineReader upgrade at half price. Didn't the disk that came with Omnipage also contain ABBYY Sprint 6, from which you could upgrade to Pro 10? Pro 10 handles Word-to-HTML conversion fine (although I myself stick with txt conversion and edit that).
When my husband ordered the scanner, he thought it included the ABBYY software, but I only saw the Omnipage. I will double check, though.

Is the reason you prefer using txt because of the issue of the textboxes? I'm wondering if this is a common practice of the OCR software and if it would even make a difference if I were to upgrade.
Old 12-28-2009, 11:11 AM   #110
ascherjim
Quote:
Originally Posted by kazbates View Post
When my husband ordered the scanner, he thought it included the ABBYY software, but I only saw the Omnipage. I will double check, though.

Is the reason you prefer using txt because of the issue of the textboxes? I'm wondering if this is a common practice of the OCR software and if it would even make a difference if I were to upgrade.
In all my ebook scanning experimentation, using Word, WordPerfect and other editing formats and means, I never encountered "text boxes." One thing you might do is to download from the ABBYY site the free trial version of FineReader Pro 10 and see how well that works for you. It is limited in what you can do with it as a trial version (vis a vis the paid-for version) but it should at least resolve your uncertainty regarding the text boxes. Good luck.
Old 12-28-2009, 11:29 AM   #111
kazbates
Thanks! I'm definitely going to try the free trial. My husband had also made that same suggestion but I was afraid I would like it too much and want to spend the extra money. I definitely did not get the ABBYY limited version with my scanner.

My husband had also suggested boxing the scanner up and sending it back for another scanner that included the ABBYY software. I would hate to do that as the scanner works well, but it may come to that if I can't get the results I want.
Old 12-28-2009, 11:37 AM   #112
ascherjim
Quote:
Originally Posted by kazbates View Post
Thanks! I'm definitely going to try the free trial. My husband had also made that same suggestion but I was afraid I would like it too much and want to spend the extra money. I definitely did not get the ABBYY limited version with my scanner.

My husband had also suggested boxing the scanner up and sending it back for another scanner that included the ABBYY software. I would hate to do that as the scanner works well, but it may come to that if I can't get the results I want.
My scanner is the OpticBook 3600, with which I am very pleased and which receives other favorable comment in this thread. However, I would not jump to returning your scanner if it works well. Just sort out the software problem. When you go into the ABBYY site, see how much they charge for their Sprint 6 version, which has worked very well for me. I only upgraded to the Pro 10 version recently because of their great half-price upgrade offer.
Old 12-28-2009, 11:38 AM   #113
kennyc
The Dank Side of the Moon
Posts: 35,872
Karma: 118716293
Join Date: Sep 2009
Location: Denver, CO
Device: Kindle2; Kindle Fire
Quote:
Originally Posted by kazbates View Post
Thanks! I'm definitely going to try the free trial. My husband had also made that same suggestion but I was afraid I would like it too much and want to spend the extra money. I definitely did not get the ABBYY limited version with my scanner.

My husband had also suggested boxing the scanner up and sending it back for another scanner that included the ABBYY software. I would hate to do that as the scanner works well, but it may come to that if I can't get the results I want.
Definitely an option....
Old 12-28-2009, 11:47 AM   #114
kazbates
Quote:
Originally Posted by ascherjim View Post
My scanner is the OpticBook 3600, with which I am very pleased and which receives other favorable comment in this thread. However, I would not jump to returning your scanner if it works well. Just sort out the software problem. When you go into the ABBYY site, see how much they charge for their Sprint 6 version, which has worked very well for me. I only upgraded to the Pro 10 version recently because of their great half-price upgrade offer.
I will check into it. I know that David did a lot of research before he chose the scanner he gave me. We wanted to be able to scan in all our old photographs, too, and this one met that criteria as well.

Quote:
Originally Posted by kennyc View Post
Definitely an option....
Sometimes, you are SO not helpful!!
Old 12-28-2009, 11:55 AM   #115
kennyc
Quote:
Originally Posted by kazbates View Post
I will check into it. I know that David did a lot of research before he chose the scanner he gave me. We wanted to be able to scan in all our old photographs, too, and this one met that criteria as well.



Sometimes, you are SO not helpful!!
I try.
Old 12-28-2009, 05:32 PM   #116
calvin-c
Guru
Posts: 787
Karma: 1575310
Join Date: Jul 2009
Device: Moon+ Pro
Quote:
Originally Posted by ascherjim View Post
In all my ebook scanning experimentation, using Word, WordPerfect and other editing formats and means, I never encountered "text boxes." One thing you might do is to download from the ABBYY site the free trial version of FineReader Pro 10 and see how well that works for you. It is limited in what you can do with it as a trial version (vis a vis the paid-for version) but it should at least resolve your uncertainty regarding the text boxes. Good luck.
I have. IIRC it occurred on pages with mixed text & images. Don't remember what software I was using (it was at least 4 years ago) but I seem to recall disabling 'regions' to get around that. Of course then it didn't import the images (the whole purpose of the regions was to define which areas of the document contained text, and which contained images) but in that case all that was wanted was the text anyway. IIRC we had to do quite a bit of cleanup on the text, probably because without the regions it was trying to OCR the images into the middle of the text.

I think. All I really remember for sure is that the text came out in text boxes that I was able to get rid of by fiddling with the settings, and that the result (post-fiddling) still required a lot of work.
Old 01-07-2016, 01:56 AM   #117
timofonic
Zealot
Posts: 123
Karma: 18554
Join Date: Jan 2008
Location: Spain
Device: Onyx Boox M96+
Hello.

Is necroposting legal?

How does FineReader compare with Omnipage 16? I see Nuance has turned it into a very bloated app.

Proofreading is very tiresome, and I still have many issues with tables; the software is unable to recognize an index properly.

Is it just me, or is OCR not progressing much these days?

Someone should make something like reCAPTCHA but for users, so people could get into groups and help each other improve small amounts of text and formatting.
Old 01-10-2016, 03:07 PM   #118
eschwartz
Ex-Helpdesk Junkie
Posts: 19,422
Karma: 85397180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
On the plus side, 6 years has got to be one of the longest resurrection times.
You *might* have broken a record.


Most people seem to swear by ABBYY FineReader -- still!
I suspect that OCR would progress more if we weren't moving in the direction of fewer things that need to be OCRed.
Old 01-11-2016, 04:52 PM   #119
Katsunami
Grand Sorcerer
Posts: 6,111
Karma: 34000001
Join Date: Mar 2008
Device: KPW1, KA1
Woah... old thread. I'm still of the same opinion as I was a few years ago, even more so with lower prices, Kobo codes, and much more availability.

Scanning, OCR-ing and proofreading is way too much work. Say a book costs €7. Even if it takes only an hour (and it will take MUCH longer), it's not worth it. Would you want to work for €7 an hour, gross? I wouldn't, if I can help it.
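
To put rough numbers on that, here is a minimal sketch of the arithmetic. The €7 price is the figure from above; the 4-hour and 10-hour entries are made-up placeholders, not measurements, and the function name is just for illustration.

```python
def effective_hourly_rate(book_price_eur: float, hours_spent: float) -> float:
    """Money saved per hour of scanning/OCR/proofreading work, in euros."""
    if hours_spent <= 0:
        raise ValueError("hours_spent must be positive")
    return book_price_eur / hours_spent


# 1 hour is the optimistic case; 4 and 10 hours are hypothetical guesses.
for hours in (1, 4, 10):
    rate = effective_hourly_rate(7.0, hours)
    print(f"{hours:>2} h of work -> EUR {rate:.2f} saved per hour")
```

The longer the cleanup takes, the further the "wage" drops below that €7, which is the whole point.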
Old 01-11-2016, 11:21 PM   #120
harriska2
Addict
Posts: 272
Karma: 8000000
Join Date: Oct 2010
Location: Corvallis, OR
Device: Kindle PW2, iPad Pro
Some books are not available in e format. And if you want to highlight, write on, and annotate a book, e format is the easiest. Some of us refuse to have paper books....
All times are GMT -4.

