#1 |
Chasing Butterflies
Posts: 3,132
Karma: 5074169
Join Date: Mar 2011
Location: American Southwest
Device: Uses batteries.
|
ABBYY FineReader Sale
The ABBYY FineReader OCR software came up in a thread I posted about converting paper books to e-books. I happened to notice when I was checking out their site that there's a sale going on until July 19th: FineReader Express 9 is $50 (50% off) and FineReader Professional 10 is $170 (regularly $400).
Has anyone used this program before? I'm wondering what level of accuracy the program provides for a "regular" book (i.e., text from top to bottom, with no fancy layouts). I think $170 would be a steal if we were talking 99% conversion accuracy from image to text, but anything less than that would entail a LOT of proofing. Since e-books from publishers come with OCR mistakes, I'm assuming this software probably doesn't boast 99% accuracy or anything comparable...? Just wondering if anyone has used this.
#2 |
Wizard
Posts: 2,888
Karma: 5875940
Join Date: Dec 2007
Device: PRS505, 600, 350, 650, Nexus 7, Note III, iPad 4 etc
|
FineReader is generally recognised as one of the best OCR programs around... it still needs some proofing but should get 99.9%... trouble is the 0.1%
#3 |
Grand Sorcerer
Posts: 5,187
Karma: 25133758
Join Date: Nov 2008
Location: SF Bay Area, California, USA
Device: Pocketbook Touch HD3 (Past: Kobo Mini, PEZ, PRS-505, Clié)
|
For regular, clean-font books, it's over 99% accurate; often 2-4 pages between errors if the scans are good, and those tend to be things like names that aren't in its internal dictionaries. And that's FR 7; 10 should be more accurate because OCR tech has improved in the last six years.
Note: 99% accuracy is one character wrong every couple of sentences. FineReader does a lot better than that. Even 99.9% is still roughly 1 character wrong per page.
I'm attaching a sample from a public-domain book that I'm in the process of converting. (I'm done with the main text; am trying to decide how/whether to deal with the index.) This was scanned at 600 dpi, so the scans are good, but the font is older than in current books & has tighter line spacing than a lot of modern books. The attachment is a 10-page PDF extract, plus the Word output from FineReader 7 with no corrections & keeping all the auto-detected formatting.
Normally, I wouldn't keep the line breaks. I'd also try to remove the headers & page numbers before OCR; I've got various ways to do that, but FR 10 might have better ones. And I'd run through FineReader's internal correction process, which is easier to deal with than comparing the Word export to the PDF and making changes that way.
Last edited by Elfwreck; 06-30-2011 at 05:31 PM.
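For anyone who wants to put numbers on those percentages, here is a rough Python sketch of the arithmetic. The 1,800 characters-per-page figure is an assumption for a typical paperback page, not something measured from the sample, and the function names are made up for illustration.

```python
# Rough arithmetic relating per-character OCR accuracy to errors per page.
# CHARS_PER_PAGE is an assumed figure for a typical paperback page.
CHARS_PER_PAGE = 1800


def errors_per_page(accuracy: float, chars_per_page: int = CHARS_PER_PAGE) -> float:
    """Expected wrong characters per page at a given per-character accuracy (0..1)."""
    if not 0.0 <= accuracy <= 1.0:
        raise ValueError("accuracy must be between 0 and 1")
    return (1.0 - accuracy) * chars_per_page


def accuracy_from_pages_between_errors(pages: float, chars_per_page: int = CHARS_PER_PAGE) -> float:
    """Per-character accuracy implied by one wrong character every `pages` pages."""
    if pages <= 0:
        raise ValueError("pages must be positive")
    return 1.0 - 1.0 / (pages * chars_per_page)


if __name__ == "__main__":
    for acc in (0.99, 0.999):
        print(f"{acc:.1%} accurate: about {errors_per_page(acc):.1f} wrong characters per page")
    for pages in (2, 4):
        acc = accuracy_from_pages_between_errors(pages)
        print(f"1 error every {pages} pages: about {acc:.4%} accurate")
```

Under that assumed page size, 99% works out to about 18 wrong characters per page and 99.9% to about 1.8, while one error every 2 to 4 pages implies roughly 99.97% to 99.99% per-character accuracy. A denser or sparser page shifts the numbers proportionally.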
#4 |
Feral Underclass
Posts: 3,622
Karma: 26821535
Join Date: Jan 2010
Location: Yorkshire, tha noz
Device: 2nd hand paperback
|
I use version 8, which came free with a scanner that died years ago. It works very well but sometimes struggles with italics. But it does keep them as italics in the end document, which is a bonus. You can have a split view with the page image on one side and the resulting text on the other, and it highlights words it's not too sure about.
#5 |
Chasing Butterflies
Posts: 3,132
Karma: 5074169
Join Date: Mar 2011
Location: American Southwest
Device: Uses batteries.
|
I went ahead and tried their trial version last night and was hooked. Purchased the full version and started ripping up an old Asimov backlist book. Now I just need a better paper cutter....
#6 | |
Grand Sorcerer
Posts: 5,187
Karma: 25133758
Join Date: Nov 2008
Location: SF Bay Area, California, USA
Device: Pocketbook Touch HD3 (Past: Kobo Mini, PEZ, PRS-505, Clié)
|
Quote:
If you get access to one of those, pick up a few throwaway books from thrift stores to practice cutting the spines off without cropping into the words.
#7 |
Grand Sorcerer
Posts: 11,492
Karma: 37057604
Join Date: Jan 2008
Device: Pocketbook
|
I use the full version of Finereader 9, with an Optiscan 3600. The error rate for me is around 1 in 5 pages for straight text. Since I am one of those "zero-defect tolerance" people, it doesn't save me any proofing time, but it does mean fewer changes.
Finereader 10 has "cloud" features, so I am not interested. I use Finereader on my "sterile" machine...
#8 | |
Grand Sorcerer
Posts: 5,187
Karma: 25133758
Join Date: Nov 2008
Location: SF Bay Area, California, USA
Device: Pocketbook Touch HD3 (Past: Kobo Mini, PEZ, PRS-505, Clié)
|
Quote:
Also: Do you know what resolutions 9 allows you to save PDF images at? 7 lets me save out at a range of sizes from 72 dpi to 600 dpi; 8 just has low-medium-high, and I think high is 300 dpi. (Maybe if I'm lucky, "high" is "match original." That's what happens when I save 400 dpi scans at 600 on FR7.) I prefer to work with 400 dpi scans, and save out as 300 dpi when I'm keeping images, but sometimes I want to keep 600 dpi scans as original.
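To get a feel for what those resolution settings mean in pixels, here is a small Python sketch. The 6 x 9 inch page size is an assumed example, not a figure from this thread, and the helper names are hypothetical; it only does the arithmetic and does not touch FineReader itself.

```python
def pixel_dimensions(width_in: float, height_in: float, dpi: int) -> tuple[int, int]:
    """Pixel width and height of a page scanned (or saved) at the given dpi."""
    if dpi <= 0:
        raise ValueError("dpi must be positive")
    return round(width_in * dpi), round(height_in * dpi)


def resample_scale(source_dpi: int, target_dpi: int) -> float:
    """Linear scale factor when re-saving a source_dpi image at target_dpi."""
    if source_dpi <= 0 or target_dpi <= 0:
        raise ValueError("dpi values must be positive")
    return target_dpi / source_dpi


if __name__ == "__main__":
    page = (6.0, 9.0)  # assumed page size in inches
    for dpi in (72, 300, 400, 600):
        w, h = pixel_dimensions(*page, dpi)
        print(f"{dpi:>3} dpi: {w} x {h} px")
    print("400 dpi saved at 300 dpi scales each side by", resample_scale(400, 300))
```

A 400 dpi scan saved at 300 dpi keeps 75% of the pixels along each side, or about 56% of the total pixel count.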
#9 | |
Feral Underclass
Posts: 3,622
Karma: 26821535
Join Date: Jan 2010
Location: Yorkshire, tha noz
Device: 2nd hand paperback
|
Quote:
#10 | |
Chasing Butterflies
Posts: 3,132
Karma: 5074169
Join Date: Mar 2011
Location: American Southwest
Device: Uses batteries.
|
Quote:
#11 | |
Grand Sorcerer
Posts: 11,492
Karma: 37057604
Join Date: Jan 2008
Device: Pocketbook
|
Quote:
I scan straight into the Finereader OCR program. I don't save intermediate files.
#12 |
Grand Sorcerer
Posts: 5,187
Karma: 25133758
Join Date: Nov 2008
Location: SF Bay Area, California, USA
Device: Pocketbook Touch HD3 (Past: Kobo Mini, PEZ, PRS-505, Clié)
|
Work has high-speed scanners that I can sometimes use, so I can scan at work & then import the pages into FR. (You'd think that someone would make a nice simple portable TWAIN scanning program that scans to TIFF or, sigh, PDF. Haven't found one yet.)
#13 |
Wizard
Posts: 1,581
Karma: 11380098
Join Date: Aug 2010
Location: NE Oregon
Device: Kobo Sage, Pocketbook Era, Kobo Forma, Kindle Oasis 2
|
$130.99 at Amazon
Thanks for posting the sale! I was thinking about buying at the sale price above, but then today had the notion to check Amazon.com before I pulled the trigger "just in case". Glad I did!
Amazon.com has ABBYY Finereader 10 at $130.99 w/free shipping. Only 7 left in stock now tho. I caved.
#14 |
Serpent Rider
Posts: 1,123
Karma: 10219804
Join Date: Jun 2009
Device: Sony 350; Nook STR; Oasis
|
I've got an old version kicking around. Used it to convert the Corum books. A new version sure would be nice though...
#15 |
Addict
Posts: 272
Karma: 8000000
Join Date: Oct 2010
Location: Corvallis, OR
Device: Kindle PW2, iPad Pro
|
I got a slightly used come 9770 19" guillotine stack paper cutter for $399 on eBay. I used the new blade it came with, had it sharpened 2 times, and purchased another blade for $140. We have just figured out that you can sharpen the blade yourself with a whetstone. It takes a couple of minutes to take off the blade, and then you sharpen it as best you can (about 4 minutes). We sharpen one side, then slightly break the edge on the other side. This method saves a ton of money and time (shipping or driving to a specialty place, and $18 per sharpening). Plus, when they sharpen, they take a *bunch* of blade off. I've cut about 400 books and am pretty much done with my library. It cut so well that I was able to reglue several of the books back together.