Old 01-12-2016, 06:37 AM   #121
Ravensknight
Serpent Rider
Posts: 1,123
Karma: 10219804
Join Date: Jun 2009
Device: Sony 350; Nook STR; Oasis
Quote:
Originally Posted by harriska2 View Post
Some books are not available in e format.
This is why I scanned about 10 books a couple of years ago and proofed them. Of course, now they're out in ebook form.
What I have learned from that is that all the books will eventually get the e-treatment. Even if not while I'm alive ;-)
Old 01-12-2016, 07:12 AM   #122
Katsunami
Grand Sorcerer
Posts: 6,111
Karma: 34000001
Join Date: Mar 2008
Device: KPW1, KA1
Quote:
Originally Posted by harriska2 View Post
Some books are not available in e format. And if you want to use it to highlight and write on it and annotate, e format is the easiest. Some of us refuse to have paper books....
In that case, I would just buy a cheap all-in-one, copy the pages I want to annotate, and bundle the copies with the book or put them into a binder. If it's not a study book, I would never mark the book itself.
Old 01-12-2016, 08:38 AM   #123
markom
Banned
Posts: 488
Karma: 1080260
Join Date: Sep 2012
Device: sony prs t1 kindle dx ipad
Quote:
Originally Posted by Katsunami View Post
Woah... old thread. I'm still of the same opinion as I was a few years ago, even more so with lower prices, Kobo codes, and much more availability.

Scanning, OCR-ing and proofreading is way too much work. If a book costs €7 and it takes even only an hour (and it will take MUCH longer), it's not worth it. Would you want to work for €7, gross? I wouldn't, if I can help it.
There is really no need for proofreading, though, because any A5-format page can be read easily on a 6" reader in landscape, if not in portrait.

Abbyy FineReader's OCR (PDF or EPUB) is pretty good for fiction (no need for 100% accuracy), and Abbyy's exact PDF image for scientific PDFs (100% accuracy).

So, in my case, an OCR-ed PDF is all it takes for reading, highlighting, scribbling, searching, etc.

For an average 500-page A5 book, scanning takes me 1-1.5 hours (average flatbed scanner), and the computer needs an additional 1-1.5 hours to create an under-10 MB PDF using Acrobat's ClearScan OCR for an exact PDF image.

Last edited by markom; 01-12-2016 at 09:57 AM.
Old 01-13-2016, 04:46 AM   #124
Wolfrott
Member
Posts: 23
Karma: 10
Join Date: Dec 2013
Device: iPad Mini / Voyage
I do this! I am bedridden and have mobility issues that stop me from holding hard copies.

For years now, what I do is:

1. Either scan an intact book two pages at a time on a flatbed scanner, OR rip it up into sheets to feed through a duplex document scanner.

The time difference between the two is days; the duplex scanner beats the flatbed on ease, speed, and quality. If the book is OOP, rare, sentimental, etc., I recommend you don't rip it; scan it flattened on the flatbed.

2. OCR via Adobe DC Pro or through Microsoft OneNote. It depends on the scans: Adobe hates darker pages with fancier fonts and images like borders and chapter art breaks, but its speed at batch OCR may persuade you to stick with it despite the extra time needed for proofreading. OneNote takes longer because you grab the OCR text manually, but it is nearly flawless on spelling errors compared with Adobe.

3. Use an online line break tool to fix paragraphs and indents for you. Lifesaver, I swear.

4. Proofread by comparing the text output file (PDF, Word, TXT, etc.) to the scans. The most common mistakes are ff, tt, mm, and rr being mistaken for each other.

6. Put it together in Word, polish it off, run it through Calibre, and you're done!

I can finish a 300 page book in a week following the above steps. Initially it took 2+ months.
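
The proofreading check in step 4 could be partly automated. What follows is only a rough sketch, not the workflow described above: it assumes you have some word list to check against (the `vocabulary` set here is a made-up stand-in) and it only looks for the ff/tt/mm/rr mix-ups mentioned in step 4. The function name `flag_suspects` is hypothetical.

```python
import re

# Letter pairs reported in the post as commonly mistaken for each other by OCR.
CONFUSABLE = ["ff", "tt", "mm", "rr"]

def flag_suspects(text, vocabulary):
    """Return (word, suggestion) pairs where swapping one confusable
    letter pair turns an unknown word into a known one."""
    suspects = []
    for word in re.findall(r"[A-Za-z]+", text):
        lower = word.lower()
        if lower in vocabulary:
            continue  # known word, nothing to flag
        for pair in CONFUSABLE:
            if pair not in lower:
                continue
            for other in CONFUSABLE:
                if other == pair:
                    continue
                # Swap only the first occurrence of the pair.
                candidate = lower.replace(pair, other, 1)
                if candidate in vocabulary:
                    suspects.append((word, candidate))
    return suspects

# Tiny stand-in word list (an assumption; use a real dictionary in practice).
vocabulary = {"the", "letter", "was", "better", "coffee", "hammer"}
print(flag_suspects("The lemmer was berrer", vocabulary))
```

This only produces suggestions to review against the scans; it will not catch a mix-up that happens to produce another real word.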
Old 01-15-2016, 01:43 PM   #125
harriska2
Addict
Posts: 272
Karma: 8000000
Join Date: Oct 2010
Location: Corvallis, OR
Device: Kindle PW2, iPad Pro
I use MS Word to fix line break issues. I don't know why but it just seems to work. Sometimes I have to be smart about it (first search and replace .^p with |, then replace all ^p with a space, then finally replace all | with ^p), but I have lots of flexibility with a GUI.
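
For anyone who would rather script it, the same three-step idea can be sketched outside Word. This is an illustration under assumptions, not the Word procedure itself: `\n` stands in for Word's `^p`, the function name `unwrap_lines` is made up, and the last step puts the period back along with the paragraph break, which is my guess at the intent.

```python
def unwrap_lines(text, marker="|"):
    """Join hard-wrapped lines, keeping breaks that follow a period."""
    if marker in text:
        # The placeholder must not already occur in the text.
        raise ValueError("marker already present in text")
    # Step 1: protect real paragraph ends (period + line break).
    text = text.replace(".\n", marker)
    # Step 2: every remaining line break is a wrap; turn it into a space.
    text = text.replace("\n", " ")
    # Step 3: restore the protected breaks (with the period, an assumption).
    return text.replace(marker, ".\n")

sample = "It was a dark\nand stormy night.\nThe rain fell\nin torrents.\n"
print(unwrap_lines(sample))
```

Like the Word version, this treats every line ending in a period as a paragraph end, so it needs a manual pass afterwards for lines that merely happen to wrap after a sentence.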
Old 01-24-2016, 04:42 PM   #126
alecE
Evangelist
Posts: 412
Karma: 546196
Join Date: Mar 2009
Location: UK canal boat
Device: sony prs505, prs650, kobo Glo HD liseuses
I scan perhaps 24 old paperbacks per year, partly to ease space restrictions, partly to provide durable copies of paperbacks that are falling apart. Standard procedure is:
- split book into 32-page signatures
- scan signatures using Canon P150 duplex sheet scanner
- OCR the output using Abbyy Fine Reader
- correct the obvious typos in Abbyy, mark where sentences run over from one page to the next, identify required and redundant hyphens at page breaks, and then output to a .txt file
- further editing in Notepad++ (systematic treatment of speech marks, ellipses, en dashes, ligatures, apostrophes... specify the language as HTML and use entities... add markers to divide the text into chapters... use regex to tidy line breaks, etc.)
- construct an empty e-book in Sigil using standard components (pre-canned title-page, front-matter, chapter styles, pre-defined css sheets)
- move text from Notepad++ to Sigil, add italics, additional breaks etc plus further styling
- create a "nice" front cover (usually scanned from the original cover) and add that
- add to e-reader via Calibre
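
The Notepad++ clean-up step above (entities, regex tidying of line breaks) could look roughly like this as a script. It is a minimal sketch under assumptions, not the actual regexes used: the function name `tidy` and the entity table are illustrative, and it blindly removes every hyphen at a line break, which is exactly why required and redundant hyphens are marked by hand first.

```python
import re

# Typographic characters mapped to HTML named entities (illustrative subset).
ENTITIES = {
    "\u2018": "&lsquo;",   # left single quote
    "\u2019": "&rsquo;",   # right single quote / apostrophe
    "\u201c": "&ldquo;",   # left double quote
    "\u201d": "&rdquo;",   # right double quote
    "\u2013": "&ndash;",   # en dash
    "\u2026": "&hellip;",  # ellipsis
}

def tidy(text):
    # Rejoin words hyphenated across a line break: "exam-\nple" -> "example".
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    # Join single line breaks inside a paragraph; keep blank-line breaks.
    text = re.sub(r"(?<!\n)\n(?!\n)", " ", text)
    # Three ASCII dots become a single ellipsis character.
    text = text.replace("...", "\u2026")
    # Replace typographic characters with their entities.
    for char, entity in ENTITIES.items():
        text = text.replace(char, entity)
    return text

sample = "\u201cIt was an exam-\nple of bad\nwrapping...\u201d\n\nNext paragraph."
print(tidy(sample))
```

A genuinely hyphenated word split at a line end (say "well-" then "known") would lose its hyphen here, so the output still needs checking against the scan.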

Generally reckon on 4 to 10 hours' work per e-book, depending on the complexity and the quality of the OCR process. I feel the work is justified, as I gain digital copies of titles which are unlikely to be digitised by the publisher, at least in my lifetime!
All times are GMT -4. The time now is 01:34 AM.