#1 |
Junior Member
Posts: 3
Karma: 10
Join Date: Sep 2008
Device: prs-505
|
Collaborating on Proofing of Scanned Books
Where do I post to see if one or more folks would like to collaborate on proofing a large book converted to text via OCR (but w/ all the typical mangling such a process carries)? The goal is to have a clean ebook.
Thx |
#2 |
Cultural Artist
Posts: 1,128
Karma: 12829
Join Date: May 2008
Location: Georgia
Device: Sony 505, Kindle 2
|
|
#3 |
Reader
Posts: 11,504
Karma: 8720163
Join Date: May 2007
Location: South Wales, UK
Device: Sony PRS-500, PRS-505, Asus EEEpc 4G
|
If it's in the public domain then why not offer it to Distributed Proofreaders, if you don't want to correct it yourself?
|
#4 |
eBook Enthusiast
Posts: 85,544
Karma: 93383043
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
... and if it's not in the public domain you're probably breaking the law in giving your scans to anyone else.
|
#5 |
Wizard
Posts: 4,395
Karma: 1358132
Join Date: Nov 2007
Location: UK
Device: Palm TX, CyBook Gen3
|
DP is good, but it'll take ages.
What's the book? I wouldn't mind lending a hand if it wasn't too onerous. |
#6 | |
Ars longa
Posts: 1,179
Karma: 17404
Join Date: Sep 2008
Location: north carolina, usa
Device: Kindle K1, K3 wifi
|
Quote:
Regards, R.L. Last edited by rlparker; 10-07-2008 at 11:20 AM. |
|
#7 |
Grand Sorcerer
Posts: 11,489
Karma: 37057604
Join Date: Jan 2008
Device: Pocketbook
|
The throughput of PG US, with TIFF scan, Distributed Proofreading, and file formatting, is now pushing 2 years on average. Quite frankly, I have given up on them as far as contributing goes. I have 10 volumes that I am going to convert to e-book myself and post on PG Australia, which is less fussy.
Here are excerpts from the latest communique from Greg Weeks at PG US.
"> Who do I need to communicate with for scanning US western historical
> books? I have several (with original illustrations) that need a better
> P.G. version than what is currently available.
I don't typically handle anything outside of children's literature and SF. If you want to set up an account at www.pgdp.net and post in the forums, you might find someone interested.
> In addition, I looked at the recent (March 2008) HTML version of
> Abbott's Flatland, and it still didn't have the original illustration,
> merely the .txt simulations. Is there any way I can provide the scans
> of the original illustrations for incorporation with the text?
I thought someone was working on this. Yes, "Flatland. A Romance of Many Dimensions" is going through PGDP with the original cover even. It's in the P3 waiting queue. It's waiting for the final proofing pass and has two formatting passes left. Around a year probably before it posts."
No complaint about Greg, he doesn't make the system. But it's gotten extremely slow. |
#8 | |
curmudgeon
Posts: 1,487
Karma: 5748190
Join Date: Jun 2006
Location: Redwood City, CA USA
Device: Kobo Aura HD, (ex)nook, (ex)PRS-700, (ex)PRS-500
|
Quote:
Their throughput is quite astonishing. It's the latency that's made you give up.
Throughput, in this context, is the rate at which they provide proofed, formatted pages to PG. And that rate is quite high. Latency, by comparison, is the time it takes any particular book to make it through the entire process.
If you want to improve DP's throughput, just get in there and proof pages. But if you want to improve latency for a book, you need to (1) get in there and volunteer; (2) work your way up to a level that lets you contribute to the stage at which the book you care about is stalled; and (3) work on moving that particular book forward. It's quite straightforward, really, but takes a bit more effort.
Xenophon
P.S. In the interest of full disclosure, I should note that my sister is one of the folks who "runs" DP (inasmuch as such a loosely knit organization can be said to be "run," that is). |
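For anyone who wants the throughput/latency distinction in concrete terms, here is a toy sketch in Python. It is not a model of DP's actual workflow: the `simulate` function, the book names, the page counts, and the 40-pages-a-day figure are all made up for illustration. It also holds total volunteer effort fixed and only changes where that effort goes, which is a simplification.

```python
def simulate(book_pages, pages_per_day, focus=None):
    """Toy model (hypothetical): volunteers proof `pages_per_day` pages a day.

    Without `focus`, effort is spread one page at a time across all open
    books. With `focus`, that book gets each day's effort first.
    Returns (average pages per day, {book: day it finished}).
    """
    remaining = dict(book_pages)
    finished_day = {}
    day = 0
    while remaining:
        day += 1
        budget = pages_per_day
        # Effort on the focused book comes first, if it is still open.
        if focus in remaining:
            done = min(budget, remaining[focus])
            remaining[focus] -= done
            budget -= done
            if remaining[focus] == 0:
                del remaining[focus]
                finished_day[focus] = day
        # Spread whatever is left round-robin over the open books.
        while budget > 0 and remaining:
            for title in sorted(remaining):
                if budget == 0:
                    break
                remaining[title] -= 1
                budget -= 1
                if remaining[title] == 0:
                    del remaining[title]
                    finished_day[title] = day
    throughput = sum(book_pages.values()) / day
    return throughput, finished_day


# Four made-up books of 100 pages each, 40 pages proofed per day in total.
books = {"A": 100, "B": 100, "C": 100, "D": 100}

spread_rate, spread_days = simulate(books, pages_per_day=40)
focus_rate, focus_days = simulate(books, pages_per_day=40, focus="D")

print("spread evenly:", spread_rate, "pages/day, book D done on day", spread_days["D"])
print("focused on D: ", focus_rate, "pages/day, book D done on day", focus_days["D"])
```

In this toy run both schedules average the same 40 pages a day, so throughput is unchanged, but book D finishes on day 3 when effort is pointed at it instead of day 10 when effort is spread evenly. That is the sense in which the rate can be high while any one book still takes a long time.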
|
#9 |
Reader
Posts: 11,504
Karma: 8720163
Join Date: May 2007
Location: South Wales, UK
Device: Sony PRS-500, PRS-505, Asus EEEpc 4G
|
The Distributed Proofreaders at PG Canada seem to be relatively quick. And their books are of a very high standard indeed.
But I am intrigued. What is the book that you are interested in, Lizardcry? |
Tags |
collaborate, ocr, proofing |
|