MobileRead Forums
Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Formats > Workshop

Welcome to the MobileRead Forums.

You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community today, you will have fewer ads, access to post topics, communicate privately with other members, respond to polls, upload content and access many other special features.

If you have any problems with the registration process or your account login, please contact us.

Hint: Don't have time to visit us daily? Subscribe to our main RSS feed to receive our frontpage posts at your convenience.

Notices

Workshop Scanning your first book? Need general conversion tips? Or just confused over the plethora of formats? Let us know and we'll do our best to help

Reply
 
Thread Tools Search this Thread Display Modes
Old 11-04-2009, 01:21 PM   #1
Elfwreck
Wizard
Elfwreck got an A in P-Chem.Elfwreck got an A in P-Chem.Elfwreck got an A in P-Chem.Elfwreck got an A in P-Chem.Elfwreck got an A in P-Chem.Elfwreck got an A in P-Chem.Elfwreck got an A in P-Chem.Elfwreck got an A in P-Chem.Elfwreck got an A in P-Chem.Elfwreck got an A in P-Chem.Elfwreck got an A in P-Chem.
 
Elfwreck's Avatar
 
Posts: 1,182
Karma: 6337
Join Date: Nov 2008
Location: California, USA
Device: Clié; PRS-505
Quote:
Originally Posted by Rootman View Post
I hesitate to add that converting a physical book to etext may also violate the publishers and authors copyrights as well. I do not beleive that the "fair use" doctrine applies to text scanned from a copyright source.

Not knowing what country you are in and what texts you want of course YMMV
In the US, format-shifting for personal use is almost certainly fair use. And I only say "almost" because there's no court cases to base that on, because nobody's been stupid enough to try to sue their customers for it. Nobody's been sued for making cassette tapes of their own albums for their own uses, either. Nor for reading aloud a book and recording it for their child.

Format-shifting for research purposes is somewhat *more* acceptable; that moves the purpose of copying into the "transformative" realm. Note that Google's cache function was ruled to be non-infringing.

Trying to keep from hijacking this into yet another copyright debate... hmm.

Jim should keep in mind that no OCR software is perfect, especially with books with nonstandard language. (Legal, medical, scientific, religious, sf/fantasy, etc.) With an autofeed scanner, the OCR checking becomes the most time-consuming part of conversion. Even with a flatbed scanner, the OCR checking is the most *annoying* part of conversion, because you can scan while talking or listening to music.
__________________
When I'm not here, I'm somewhere else. Mind the rainbows.
Elfwreck is offline   Reply With Quote
Old 11-04-2009, 09:59 PM   #2
Jim Thompson
Member
Jim Thompson began at the beginning.
 
Posts: 16
Karma: 10
Join Date: Nov 2009
Device: iPhone
Quote:
Originally Posted by Elfwreck View Post
the OCR checking becomes the most time-consuming part of conversion
Any other thoughts on best OCR software/scanner combination. I had been thinking a reliable ADF might be the most challenging part, but I now suspect reliable OCR is key. Which has few errors and makes correcting errors easiest? For example, if the OCR doesn't know if a word is "fire" or "flre", it might be nice if it could guess that it's "fire" by use of a dictionary. Any suggestions on ways to reduce the "annoyance" factor?
Jim Thompson is offline   Reply With Quote
Old 11-05-2009, 02:18 PM   #3
Jim Thompson
Member
Jim Thompson began at the beginning.
 
Posts: 16
Karma: 10
Join Date: Nov 2009
Device: iPhone
I have learned (I think):

1. PCMag review in (I think it was '08) suggested ABBYY or OmniPage depending upon ones needs.

2. People tend to prefer interface and support for ABBYY.

3. OmniPage may be better for people who want more customized automation or larger batches. For example it will permit creating a zone template that will remove standard headers/footers (page #s) from every document in a batch fed through the ADF -- whereas that would have to be done a page at a time in ABBYY.

4. ABBYY seems to have a relationship wit Fujitsu, so if buying a scanner and software, there might be less frustration with getting them to work well together.

I'm looking into OmniPage for my purposes. Trying to determine if I need the professional version and which ADF scanners seem most used with it. I'll report again when I'm smarter, or if you have insight, please enlighten me.
Jim Thompson is offline   Reply With Quote
Reply

Tags
adf, crop, dehypenization, ocr, scanner

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
"Online Novels" - FREE, legal novels available on the Internet Dr. Drib Deals, Freebies, and Resources 8 05-22-2009 10:32 PM
Suggest a Story (Round 1) Moejoe Writers' Corner 110 05-17-2009 11:18 PM
Suggest your own eBook Reader dj_modus_ponens News and Commentary 27 12-03-2007 04:58 AM
Google Suggest - search with autocomplete Alexander Turcic Lounge 4 12-18-2004 07:00 PM
Suggest CNN replacement? starbuckk Mobile Sites 1 12-11-2003 09:51 AM


All times are GMT -4. The time now is 05:13 PM.


MobileRead.com is a privately owned, operated and funded community.