View Single Post
Old 05-11-2013, 04:41 PM   #421
kundor
Junior Member
kundor shares his or her toyskundor shares his or her toyskundor shares his or her toyskundor shares his or her toyskundor shares his or her toyskundor shares his or her toyskundor shares his or her toyskundor shares his or her toyskundor shares his or her toyskundor shares his or her toyskundor shares his or her toys
 
Posts: 5
Karma: 5998
Join Date: Oct 2011
Device: Kindle 3
Tesseract math

I'm using k2pdfopt to convert a large mathematical text. On the Tesseract download page, I noticed a file "tesseract-ocr-3.02.equ.tar.gz" which says it's a "Math / equation detection module for Tesseract 3.02." This sounds like it would help to OCR the math part correctly. The majority of the text is English. Is there some way to get the OCR engine to use this, in combination with the English training data?
kundor is offline   Reply With Quote