View Single Post
Old 04-15-2013, 08:17 PM   #391
CheriePie
Connoisseur
CheriePie got an A in P-Chem.CheriePie got an A in P-Chem.CheriePie got an A in P-Chem.CheriePie got an A in P-Chem.CheriePie got an A in P-Chem.CheriePie got an A in P-Chem.CheriePie got an A in P-Chem.CheriePie got an A in P-Chem.CheriePie got an A in P-Chem.CheriePie got an A in P-Chem.CheriePie got an A in P-Chem.
 
CheriePie's Avatar
 
Posts: 91
Karma: 6020
Join Date: Feb 2009
Location: Silicon Valley, CA
Device: Kindle Voyage, Samsung Galaxy S23+, Galaxy Tab S6
I get an error when trying to use Tesseract OCR engine on the 64-bit windows platform (v1.65). After selecting Tesseract for the OCR choice, I've left all other choices in that selection at their default. The only other change I'm making is the Device settings (d) for Kindle Paperwhite.

So this is the command line I've built:

Selected options:
"C:\Users\Cherie\Documents\My eBooks\Calibre Library\Jesse
Petersen\Club Monstrosity (124)\Club Monstrosity - Jesse Petersen.pdf"
-dev kpw -ocr t -ocrhmax 1.5 -ocrvis s



After hitting enter to begin the conversion, I get the following errors:

Failed loading language 'eng'
Tesseract couldn't load any languages!
Could not find Tesseract data (env var TESSDATA_PREFIX = (not assigned)).
Using GOCR v0.49.

Reading 233 pages from C:\Users\Cherie\Documents\My eBooks\Calibre Library\Jesse
Petersen\Club Monstrosity (124)\Club Monstrosity - Jesse Petersen.pdf ...

Detecting document orientation ... No rotation necessary.

SOURCE PAGE 1 of 233 (7.5 x 9.4 in) ... 0 new pages saved.


And then it stops working completely at page 2, throwing up the standard k2pdfopt.exe has stopped working error dialog from Windows.

I don't get these errors using the Gocr engine, but I guess Tesseract is more accurate so I'd like to try to use that one if possible.
CheriePie is offline   Reply With Quote