View Single Post
Old 04-11-2008, 04:46 PM   #21
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by RWood View Post
I used the Internet Archive PDF versions as the basis for several volumes of the Harvard Classics series. I did run them through my own OCR as the provided OCR texts were a bit buggy (and the redone ones far better.) IA is a great resource.
RWood:

Did you see the DJVU text copy ( harvardclassics50eliouoft_djvu.txt ) of Volume 50 there.

I grabbed it as it was quite good. I started to make corrections to fix incorrect line-breaks, starting from the end and I am working backwards.

Is your OCR text better than that copy? BTW, the other (first) text format was a disaster.

Looking forward to converting any Volume 50 .prc you may create.

Regards,

Last edited by nrapallo; 04-11-2008 at 05:44 PM.
nrapallo is offline   Reply With Quote