Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > Workshop

Notices

Reply
 
Thread Tools Search this Thread
Old 02-21-2007, 11:01 PM   #31
RWood
Technogeezer
RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.
 
RWood's Avatar
 
Posts: 7,233
Karma: 1601464
Join Date: Nov 2006
Location: Virginia, USA
Device: Sony PRS-500
Cheapo-Cheapo Productions

Bob, I'll take "Making an eBook from a Binder of 8.5" x 11" for $100"
(Be sure to phrase your response ....)

Cheapo-cheapo production methods would be to use the scanning part of the multifunction printer such as the HP 6210 All in One that I picked up at CompUSA for just under $100 that includes bundled software for both the Mac and PC. Among that software is a fairly good OCR program.

The scanner part has an automatic feed (no duplex) that can be set to as small as a mass market ("it fits in your hip pocket") paperback. I have scanned, rearranged pages, and OCR on the software with very good results. Sure some ls were 1s; but, nobody's perfect all of the time.
RWood is offline   Reply With Quote
Old 02-22-2007, 06:44 AM   #32
Bob Russell
Recovering Gadget Addict
Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.
 
Bob Russell's Avatar
 
Posts: 5,381
Karma: 676161
Join Date: May 2004
Location: Pittsburgh, PA
Device: iPad
Quote:
Originally Posted by RWood
Bob, I'll take "Making an eBook from a Binder of 8.5" x 11" for $100"
(Be sure to phrase your response ....)

Cheapo-cheapo production methods would be to use the scanning part of the multifunction printer such as the HP 6210 All in One that I picked up at CompUSA for just under $100 that includes bundled software for both the Mac and PC. Among that software is a fairly good OCR program.

The scanner part has an automatic feed (no duplex) that can be set to as small as a mass market ("it fits in your hip pocket") paperback. I have scanned, rearranged pages, and OCR on the software with very good results. Sure some ls were 1s; but, nobody's perfect all of the time.
That's very interesting...I've been perusing those all-in-ones for such a purpose myself. I notice you can even get duplex scanning for about $400. But if you have auto sheet feeders without duplex scanning, isn't it tough to get the book pages in the right order? And don't you get a lot of misfeeds with inexpensive all-in-ones?

I'm hoping that those are not issues -- a decent $100 book scanning solution would be very impressive. Tell us more!
Bob Russell is offline   Reply With Quote
Old 02-22-2007, 10:01 AM   #33
RWood
Technogeezer
RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.
 
RWood's Avatar
 
Posts: 7,233
Karma: 1601464
Join Date: Nov 2006
Location: Virginia, USA
Device: Sony PRS-500
The first book I converted was an old late 1950s paperback with yellowing pages and glue falling into dust for the binding. All bound edges were clean cut to remove traces of the glue and the rough edges. The pages were not smooth surfaced and quite rough as many paperbacks were in those days. There were only about 2 misfeeds for the whole book. As you noted I then had to rearrange the pages manually. This only took about 20 minutes.

Since then I have scanned other books (I wait until I need a rubber band around the book to hold it together) and the higher quality paper does scan and OCR better. Magizines (and any thin clay coated stock) will not feed and must be done on a page-by-page basis.

From examination in the stores it seems to me that there is little difference between the feeders other than the duplexing as you increase in price from one model from a company to another. I suspect that it is cheaper for the companies to offer a standard unit rather than a unique unit for each model.
RWood is offline   Reply With Quote
Old 02-22-2007, 10:39 AM   #34
Bob Russell
Recovering Gadget Addict
Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.
 
Bob Russell's Avatar
 
Posts: 5,381
Karma: 676161
Join Date: May 2004
Location: Pittsburgh, PA
Device: iPad
That's very encouraging.

Now that I think about it, I suppose a clever person could write a DOS batch file or other kind of script to take the default order you get and rename the images in order. Then a simple review of the pages with windows slideshow in file explorer (or equivalent) would pretty quickly catch and fix any anomalies.

Even one misfeed per 100 pages is probably tolerable, although maybe a little bit of a headache. But we're talking a low-budget solution that's accessible to almost anyone.

One more question... what software were you using for scanning and OCR? Was it what comes with the printer, or did you use your own software also?
Bob Russell is offline   Reply With Quote
Old 02-22-2007, 05:22 PM   #35
RWood
Technogeezer
RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.
 
RWood's Avatar
 
Posts: 7,233
Karma: 1601464
Join Date: Nov 2006
Location: Virginia, USA
Device: Sony PRS-500
As noted in my first post this is a cheapo-cheapo method. Therefore all of the software used was provided by HP on the install CD.

Although I have other (they say more advanced) OCR programs like TextBridge, the results do not change that much when using good originals. Higher resolutions and better OCR programs would be critical if the base language was for example Russian. Since English has a fairly simple character set it is easier than many to OCR. If exotic designer fonts are used then the better quality programs are a must. For common everyday typeset books, the supplied software is quite fine.
RWood is offline   Reply With Quote
Old 02-22-2007, 05:45 PM   #36
Bob Russell
Recovering Gadget Addict
Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.
 
Bob Russell's Avatar
 
Posts: 5,381
Karma: 676161
Join Date: May 2004
Location: Pittsburgh, PA
Device: iPad
Okay, I was wrong... I have yet another question!

If you wanted to create a pdf from the images without doing OCR (e.g. a document with lots of formulas and diagrams), can you also do that with the included software?

I think you may have sold a lot of all-in-one HP printers with this information, by the way!
Bob Russell is offline   Reply With Quote
Old 02-23-2007, 08:31 PM   #37
RWood
Technogeezer
RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.
 
RWood's Avatar
 
Posts: 7,233
Karma: 1601464
Join Date: Nov 2006
Location: Virginia, USA
Device: Sony PRS-500
I don't own stock in HP so if sales increase I will not enjoy any additional advantage.

As for output going directly to PDF I am not sure. I had already installed a full copy of Adobe Acrobat 6 Professional on the machine before I hooked up the printer so I am not sure if the capability is native to the HP provided software or available simply because I had already installed Acrobat.

The software uses TIF as the native graphic format for the saved scans so it should be easy for most othe programs to pick up these formats and create a PDF from them.
RWood is offline   Reply With Quote
Old 02-23-2007, 11:42 PM   #38
RWood
Technogeezer
RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.
 
RWood's Avatar
 
Posts: 7,233
Karma: 1601464
Join Date: Nov 2006
Location: Virginia, USA
Device: Sony PRS-500
I went back and reviewed the install CD that came with the HP, there were no PDF creation programs on it that I could find. All of the documentation on the disk was in HTML.

When I set up the scan from the All-In-One using the "Scan To" button, one of the options is "Adobe Acrobat" Since I had a full copy installed before I installed the HP software I feel that this is where it picked up this option from. When used it transfers the scans directly to PDF files and leaves the Acrobat application open on the desktop with the scanned file ready for me. Likewise, if I pick "Scan to Word" it opens Word and leaves an image in Word -- not OCR text, an image. You must go to an HP application and select OCR with a destination of Word to get text. Any word processor will work, I just happen to use Word.

BTW: The All-In-One also comes with a set of Mac drivers and application programs that are said to do the same as the PC drivers and applications.
RWood is offline   Reply With Quote
Old 02-27-2007, 11:48 AM   #39
Studio717
Addict
Studio717 will become famous soon enoughStudio717 will become famous soon enoughStudio717 will become famous soon enoughStudio717 will become famous soon enoughStudio717 will become famous soon enoughStudio717 will become famous soon enough
 
Posts: 208
Karma: 575
Join Date: Oct 2006
Location: California
Device: Various Kindles, iPhone, iPad, Galaxy 10.1
Nag, one of the benefits of the Opticbook scanner is that #1 and #3 of your list aren't necessary. After the first Preview scan and boundary setup, the scanner continually scans only the page area you've set up. And the included software (PC only, much to my dismay) has auto odd or even page rotation.

Bob and RWood, the Mac has built in PDF generation under the Print dialog box, so the HP solution could generate PDFs with no additional cost. (I don't know if there's an upper page limit to the PDFs. I haven't run across one, but I use Acrobat for large projects.)
Studio717 is offline   Reply With Quote
Old 02-27-2007, 12:39 PM   #40
RWood
Technogeezer
RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.RWood ought to be getting tired of karma fortunes by now.
 
RWood's Avatar
 
Posts: 7,233
Karma: 1601464
Join Date: Nov 2006
Location: Virginia, USA
Device: Sony PRS-500
On the PC side I have genreated 3,000+ page documents with no problem (just a lot of time) so I would doubt that there would be a problem on the Mac.
RWood is offline   Reply With Quote
Old 03-27-2007, 10:26 AM   #41
Nag
Junior Member
Nag began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Feb 2007
Studio717:

I was suggesting alternatives, which ought to speed-up the scanning process. Not claiming that opticbook cannot do it.

For example, if the scanner-head doesn't illuminate beyond the page area, then there is no need to use white pages. OTOH, if it scans the entire area and then trims the margins, then using white paper is less stressful on the eye and also no need to close the cover of the scanner for every scan.

One can scan odd and even pages continuously and the opticbook will merge them. However, rotating the book for every scan takes time and effort. Scanning all odd pages first and then all even pages makes the scanning faster.

Try this for a book or two.

Best
Nag

Quote:
Originally Posted by Studio717
Nag, one of the benefits of the Opticbook scanner is that #1 and #3 of your list aren't necessary. After the first Preview scan and boundary setup, the scanner continually scans only the page area you've set up. And the included software (PC only, much to my dismay) has auto odd or even page rotation.
Nag is offline   Reply With Quote
Old 05-27-2007, 08:36 PM   #42
jackbrown
Enthusiast
jackbrown began at the beginning.
 
Posts: 36
Karma: 10
Join Date: Apr 2006
Location: San Francisco USA/North Africa
RE: Nag's question--

actually when I scan books, I don't use 8 bit grayscale, I do black and white: the files are smaller, and there's no need for anti-aliasing the text; OCR seems to work better without it anyway.
jackbrown is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Firmware Update Simplest method to register your Kindle w/2.5.x firmware, if you live outside US Nifty Amazon Kindle 153 12-08-2023 12:08 AM
Anyone Using An eReader on Gym Equipment? chazcop General Discussions 21 08-03-2010 08:53 PM
reading methods? ahammer Lounge 1 07-19-2009 02:51 PM


All times are GMT -4. The time now is 07:24 PM.


MobileRead.com is a privately owned, operated and funded community.