Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > Workshop

Notices

Reply
 
Thread Tools Search this Thread
Old 01-23-2010, 08:31 PM   #16
Greg Anos
Grand Sorcerer
Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.
 
Posts: 11,247
Karma: 35000000
Join Date: Jan 2008
Device: Pocketbook
As an experienced scanner, you don't want direct to HTML. Using a word processor format is much better to start with because you can then use the spell checker for your scan, which is awkward with an HTML file. Then use the word processor to do the HTML conversion.
Greg Anos is offline   Reply With Quote
Old 01-23-2010, 08:48 PM   #17
ascherjim
Addict
ascherjim has a complete set of Star Wars action figures.ascherjim has a complete set of Star Wars action figures.ascherjim has a complete set of Star Wars action figures.
 
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
Quote:
Originally Posted by Ralph Sir Edward View Post
As an experienced scanner, you don't want direct to HTML. Using a word processor format is much better to start with because you can then use the spell checker for your scan, which is awkward with an HTML file. Then use the word processor to do the HTML conversion.
I couldn't agree more. I fear that beginners to ebook creation through scanning underestimate the amount of editing (proof-reading) required, even when having scanned the most legible printed pages. And the use of word processors in the process, with all the editing tools (especially global) at their disposal, tend to simplify and shorten (somewhat) and relieve the tedium (somewhat) of the process.
ascherjim is offline   Reply With Quote
Advert
Old 01-23-2010, 10:59 PM   #18
Skydog
Wizard
Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.
 
Skydog's Avatar
 
Posts: 2,243
Karma: 7400001
Join Date: Mar 2009
Location: Circling Earth @ Mach .83
Device: Elipsa 2E, Sage, Forma, Libra 2, Clara 2E, Kindle Oasis3, Voyage
Quote:
Originally Posted by Ralph Sir Edward View Post
As an experienced scanner, you don't want direct to HTML. Using a word processor format is much better to start with because you can then use the spell checker for your scan, which is awkward with an HTML file. Then use the word processor to do the HTML conversion.
Indeed, but aren't most word processors, particularly Word, known for terrible HTML output?
Skydog is offline   Reply With Quote
Old 01-23-2010, 11:03 PM   #19
texasnightowl
Guru
texasnightowl ought to be getting tired of karma fortunes by now.texasnightowl ought to be getting tired of karma fortunes by now.texasnightowl ought to be getting tired of karma fortunes by now.texasnightowl ought to be getting tired of karma fortunes by now.texasnightowl ought to be getting tired of karma fortunes by now.texasnightowl ought to be getting tired of karma fortunes by now.texasnightowl ought to be getting tired of karma fortunes by now.texasnightowl ought to be getting tired of karma fortunes by now.texasnightowl ought to be getting tired of karma fortunes by now.texasnightowl ought to be getting tired of karma fortunes by now.texasnightowl ought to be getting tired of karma fortunes by now.
 
Posts: 699
Karma: 1001556
Join Date: Jul 2008
Location: Texas
Device: Oasis 3, K4B(NT), K3/KK
Quote:
Originally Posted by Skydog View Post
Indeed, but aren't most word processors, particularly Word, known for terrible HTML output?
Word is really really bad unless you choose the "filtered" html output...then it is better...at least comparatively speaking.
texasnightowl is offline   Reply With Quote
Old 01-23-2010, 11:28 PM   #20
ascherjim
Addict
ascherjim has a complete set of Star Wars action figures.ascherjim has a complete set of Star Wars action figures.ascherjim has a complete set of Star Wars action figures.
 
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
Quote:
Originally Posted by texasnightowl View Post
Word is really really bad unless you choose the "filtered" html output...then it is better...at least comparatively speaking.
My personal methodology -- which gives me ultimately my ebook in my preferred Mobipocket format -- is so convoluted that I am loathe to really recommend it to anyone else. I scan to txt, convert that to Wordperfect for editing, then convert it back to txt for conversion to html using txt2html, then convert from html to Mobipocket using Calibre. Crazy as it sounds (and less than perfect as it occasionally is) it works fine overall for me!!!
ascherjim is offline   Reply With Quote
Advert
Old 01-24-2010, 01:59 AM   #21
Skydog
Wizard
Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.Skydog ought to be getting tired of karma fortunes by now.
 
Skydog's Avatar
 
Posts: 2,243
Karma: 7400001
Join Date: Mar 2009
Location: Circling Earth @ Mach .83
Device: Elipsa 2E, Sage, Forma, Libra 2, Clara 2E, Kindle Oasis3, Voyage
Quote:
Originally Posted by ascherjim View Post
My personal methodology -- which gives me ultimately my ebook in my preferred Mobipocket format -- is so convoluted that I am loathe to really recommend it to anyone else. I scan to txt, convert that to Wordperfect for editing, then convert it back to txt for conversion to html using txt2html, then convert from html to Mobipocket using Calibre. Crazy as it sounds (and less than perfect as it occasionally is) it works fine overall for me!!!
Actually, it does not sound crazy at all - I follow your logic! I'm willing to try anything that will result in success.
Skydog is offline   Reply With Quote
Old 01-24-2010, 10:45 AM   #22
ascherjim
Addict
ascherjim has a complete set of Star Wars action figures.ascherjim has a complete set of Star Wars action figures.ascherjim has a complete set of Star Wars action figures.
 
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
Quote:
Originally Posted by Skydog View Post
Actually, it does not sound crazy at all - I follow your logic! I'm willing to try anything that will result in success.
Of course, before I settled on my system I tried (experimented with) many other options, which I encourage everyone to do. What gives me what I want in the end is not necessarily what would give anyone else what they might want. It depends on so many factors. Patience and perseverance are the requisites.
ascherjim is offline   Reply With Quote
Old 01-24-2010, 05:24 PM   #23
Greg Anos
Grand Sorcerer
Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.Greg Anos ought to be getting tired of karma fortunes by now.
 
Posts: 11,247
Karma: 35000000
Join Date: Jan 2008
Device: Pocketbook
Quote:
Originally Posted by Skydog View Post
Indeed, but aren't most word processors, particularly Word, known for terrible HTML output?

First, let us define terrible. Seriously. I define bad two ways, One, it doesn't function. Two, other programs I need to read the HTML won't read it.

I don't insist on pretty, tight HTML, just does it work, and can it be converted to Epub and Mobi, and be read in FBReader from a zipped file. (I use OpenInkPot on my Hanlin readers....

Now this is a very loose standard, inasmuch as I will go in and add my own bits of HTML to the output. (Usually images and internal hyperlinks.)

Now by this standard, Word 97 works somewhat (won't convert into anything in Calibre, but will convert to epub in Sigil and the Epub can be back converted into mobi by Calibre), Word 2000 works well, but produces XML (XHTML) which is needlessly wordy for e-book use compared to HTML, Atlantis doesn't work (can't handle RTF conversion properly and sets some lines of text to zero height.), and Open Office messes up the RTF formatting while editing it, which leads to bad HTML formatting output.

So currently I use scanner to RTF, spell-check in Open Office despite it's problems, because it highlights hypenated words nicely, clean up the format mangles in Wordpad, load into Word 2000 and save as web page, and load into Calibre, which zips up the XML (and converts to Epub and Mobi, if required.)

Oh yes, I also use Hexedit 3.0 to clean up RTF control language at byte level, as needed.

Anyway you look at it, lots of work...
Greg Anos is offline   Reply With Quote
Old 05-10-2010, 09:56 PM   #24
pomo81
Junior Member
pomo81 began at the beginning.
 
Posts: 1
Karma: 10
Join Date: May 2010
Device: Sony PRS 600
You can imagine my joy in finding this forum. I've been scanning texts for archival purposes for nearly 20 years. I've spent the past two days scouring this forum and getting introduced to fantastic software I had not known about. Abby, Omnipage, and Paperport I'm familiar with; Scan Tailor and ScanKromsator are new to me and had been desperately needed.

I don't have much need for OCR, but fast flatbed scanning is a must. In nearly 20 years, I have not come across a flatbed scanner that is faster than the old -- and big and heavy -- HP 4C scanners, which can do 4 300 dpi 1-bit (black and white) legal-sized scans in one minute. Opening up a book on the glass and doing two facing pages at a time will yield 240 scans (480 pages) an hour.

All of the "fast document" scanners seem to have a speed rating only on the ADF, not the flatbed, which seem to be terribly slow. I think some of the Epson flatbed scanners might be able to come close to 4 legal-sized 300 dpi, 1-bitscans a minute, but I'm not certain. The old AGFA SCSI scanner did 3 scans a minute.

I have no idea whether I can get my HP 4C, which can still be bought new or in very good condition on Ebay for nearly nothing (it's the shipping that costs), and required SCSI card to work on a Vista or Windows 7 machine, as I have mine connected to an XP machine.

I'm always on the lookout for a faster flatbed USB 2.0 scanner, but I've seen nothing. I'm thinking about a digital copier that can be connected to a PC, as I'd like the flatbed scanning time to be just as fast as a copier. I'm also thinking of going the route of a photo box with two SLR cameras.

Anyway, I just wanted to throw in a word on these old 4C scanners and finally take part in this forum, which I had sorely needed the past 20 years and found only two days ago.

Last edited by pomo81; 05-10-2010 at 10:56 PM.
pomo81 is offline   Reply With Quote
Old 05-11-2010, 02:22 PM   #25
Kirtai
Addict
Kirtai ought to be getting tired of karma fortunes by now.Kirtai ought to be getting tired of karma fortunes by now.Kirtai ought to be getting tired of karma fortunes by now.Kirtai ought to be getting tired of karma fortunes by now.Kirtai ought to be getting tired of karma fortunes by now.Kirtai ought to be getting tired of karma fortunes by now.Kirtai ought to be getting tired of karma fortunes by now.Kirtai ought to be getting tired of karma fortunes by now.Kirtai ought to be getting tired of karma fortunes by now.Kirtai ought to be getting tired of karma fortunes by now.Kirtai ought to be getting tired of karma fortunes by now.
 
Posts: 304
Karma: 2454436
Join Date: Sep 2008
Device: PRS-505, PRS-650, iPad, Samsung Galaxy SII (JB), Google Nexus 7 (2013)
There's a forum about building book scanners based on digital cameras at http://www.diybookscanner.org/ you might find interesting. I believe the ScanTailor developer posts there too.
Kirtai is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
DIY Scanner Eratosthenes News 14 04-16-2010 04:21 PM
Welcher Scanner? Jacques_N Lounge 15 04-01-2010 05:34 PM
What scanner do you use, recommendations Moejoe Lounge 8 07-11-2009 02:38 AM
Using handheld scanner Idoine Workshop 13 06-14-2009 06:42 PM
scanner problem scanner Workshop 6 09-29-2008 08:37 AM


All times are GMT -4. The time now is 03:25 AM.


MobileRead.com is a privately owned, operated and funded community.