Quote:
Originally Posted by RWood
I used the Internet Archive PDF versions as the basis for several volumes of the Harvard Classics series. I did run them through my own OCR as the provided OCR texts were a bit buggy (and the redone ones far better.) IA is a great resource.
|
RWood:
Did you see the DJVU text copy ( harvardclassics50eliouoft_djvu.txt ) of Volume 50 there.
I grabbed it as it was quite good. I started to make corrections to fix incorrect line-breaks, starting from the end and I am working backwards.
Is your OCR text better than that copy? BTW, the other (first) text format was a disaster.
Looking forward to converting any Volume 50 .prc you may create.
Regards,