01-26-2012, 02:50 PM | #1 |
eWanderer
Posts: 523
Karma: 1441998
Join Date: Jul 2010
Location: NC, USA
Device: iMac,iPad3,iPhone5-Kindle Fire,Touch,PaperWhite
|
Report on Abbyy FineReader OCR Software w/ Canon Lide 60
SubTitle: In dealing with Books, it's not the scanner that's important, it's the OCR SOFTWARE!
I have experience creating ebooks (PDF, MOBI, EPUB), but my source has, to date, been nice text files supplied to me by clients. I got the urge the other day to try creating suitable text by scanning a pBook with my old Canon Lide 60 scanner, so I obtained a copy of ABBYY FineReader Express for Mac and installed it on my iMac (Lion, OS X 10.7.2).
I fired up ABBYY FineReader and within minutes I was scanning. After less than 20 minutes of "learning time" I was scanning multiple pages (cut from the book). The software is intuitive enough that I did not have to consult the instructions or help.
With the Lide 60 it takes about 11-12 seconds to scan a 5.5x8 page. After the initial setup and scan adjustment, you can just hit "scan" for every page, and the scanned image for each page appears in a pane (see attached pic). Scan as many pages as you like, then hit the "convert" button in FineReader Express and the scans are OCR'd to text. The output from the OCR conversion is a single RTF file.
My six-page scan came to 2066 words. I then opened the RTF file in Pages (Apple's Word-compatible word processor). I have looked through the entire text and have yet to find one error. The text even had proper paragraph indents and proper blockquote indents (left and right margins/padding)! Perfect multipage scans loaded into my favorite text editor with almost no "learning curve" at all!
Lesson: A good OCR program is worth the cost!
Attached is a screen shot of FineReader (Mac version) with six pages scanned. (Pic is reduced 40%.)
Last edited by 1611mac; 01-26-2012 at 02:52 PM. |
01-27-2012, 04:00 AM | #2 |
Enthusiast
Posts: 49
Karma: 14
Join Date: Jul 2010
Location: Harrogate, England
Device: iPad
|
FineReader
FineReader is indeed a good piece of software.
However, the results depend greatly on the source material. My scans (around 500 books) have been of normal-format paperbacks, and the quality ranges from near perfect to near unreadable. A number of errors happen frequently, especially with small fonts: 'I' being replaced by '1', 'tl' being replaced by 'd', and exclamation marks being replaced by various glyphs. Additionally, if you have accented characters (and select languages in addition to English), you find rather more accented characters in the OCR output than in the original! This is not a criticism, since most books are more than readable; it is simply an acknowledgement that the product is not perfect! |
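For anyone proofreading OCR output by hand, two of the confusions listed above ('I' read as '1', and 'tl' read as 'd') can be roughly flagged with a small script. This is only a sketch under stated assumptions: the function names and the tiny `KNOWN_WORDS` set are hypothetical, it has nothing to do with how FineReader itself works, and a real pass would need a full dictionary and a human checking every suggestion.

```python
import re

# A digit 1 touching a letter is rarely intentional in running prose.
ONE_NEXT_TO_LETTER = re.compile(r"(?<=[A-Za-z])1|1(?=[A-Za-z])")


def flag_suspect_ones(text):
    """Return tokens that are a bare '1' or have a '1' next to a letter.

    These are candidates for 'I' misread as '1'; a bare '1' may also be
    a genuine numeral, so the result is a review list, not a fix list.
    """
    suspects = []
    for token in text.split():
        core = token.strip(".,;:!?\"'()")
        if core == "1" or ONE_NEXT_TO_LETTER.search(core):
            suspects.append(token)
    return suspects


def suggest_tl_fixes(token, known_words):
    """Suggest words formed by replacing one 'd' with 'tl'.

    Only unknown tokens are examined, and a suggestion is kept only if
    the repaired spelling appears in known_words.
    """
    if token.lower() in known_words:
        return []
    suggestions = []
    for i, ch in enumerate(token):
        if ch == "d":
            candidate = token[:i] + "tl" + token[i + 1:]
            if candidate.lower() in known_words:
                suggestions.append(candidate)
    return suggestions


# Hypothetical stand-in for a real dictionary file.
KNOWN_WORDS = {"little", "bottle", "made"}

print(flag_suspect_ones("1 think 1t was 1999"))
print(suggest_tl_fixes("litde", KNOWN_WORDS))
```

The year 1999 is left alone because its 1 is not next to a letter, and a correctly spelled word such as "made" is never touched because it is already in the word list.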
|
01-27-2012, 11:13 AM | #3 | |
eWanderer
Posts: 523
Karma: 1441998
Join Date: Jul 2010
Location: NC, USA
Device: iMac,iPad3,iPhone5-Kindle Fire,Touch,PaperWhite
|
Quote:
What I intended to convey was that the product is CAPABLE of reproducing original copy, that with good clean original copy good results are possible (i.e., fast workflow, intuitive interface, etc.), and that paid software can sometimes be worth it (as opposed to freeware/shareware). As info, my original text was printed a bit heavy (ink) and was blurry; it was far from being a "perfect" page to scan. Last edited by 1611mac; 01-27-2012 at 11:16 AM. |
|
01-27-2012, 11:23 AM | #4 |
eWanderer
Posts: 523
Karma: 1441998
Join Date: Jul 2010
Location: NC, USA
Device: iMac,iPad3,iPhone5-Kindle Fire,Touch,PaperWhite
|
Yes, I read a lot of 18th-century material, and I think my scans of it won't turn out too well (v's for u's, f's for s's, etc.).
|
01-27-2012, 03:46 PM | #5 |
Zealot
Posts: 103
Karma: 57138
Join Date: May 2010
Device: Sony 505, iPad 1 & 3, Galaxy Note 8.1
|
11-12 seconds to scan a page? Ouch! I'd never get a book done.
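To put the 11-12 seconds per page figure in perspective, here is a quick back-of-the-envelope calculation. The 300-page book length is purely an assumed example, not a number from this thread, and the estimate covers scanner time only (no page handling, setup, or OCR conversion).

```python
def scan_minutes(pages, seconds_per_page):
    """Pure scanner time in minutes for a given page count."""
    return pages * seconds_per_page / 60


ASSUMED_PAGES = 300  # hypothetical book length, chosen for illustration

low = scan_minutes(ASSUMED_PAGES, 11)
high = scan_minutes(ASSUMED_PAGES, 12)
print(f"{ASSUMED_PAGES} pages: {low:.0f} to {high:.0f} minutes of scanning")
```

Under those assumptions the scanner itself is busy for roughly an hour per book, before counting the time spent swapping pages by hand.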
|
01-27-2012, 05:30 PM | #6 |
eWanderer
Posts: 523
Karma: 1441998
Join Date: Jul 2010
Location: NC, USA
Device: iMac,iPad3,iPhone5-Kindle Fire,Touch,PaperWhite
|
|
01-27-2012, 06:05 PM | #7 |
Guru
Posts: 860
Karma: 4380
Join Date: Feb 2008
Location: Almada, Portugal
Device: Cybook Gen3, Sony PRS 505, Kindle DXG and Samsung Galaxy Note
|
Hello
Agree, one plays with the cards one has… |
|