View Single Post
Old 01-05-2016, 10:35 PM   #1224
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by timofonic View Post
Is it possible to user other OCR engines? Even commercial ones.
I have no idea how I'd integrate commercial source into a GPL project like k2pdfopt. Tesseract is the best open source OCR library I know of.

Quote:
Originally Posted by timofonic View Post
I tried to find a repository where k2pdfopt is stored, but it was impossible to find.
My official releases are always on my web site.

Quote:
Originally Posted by timofonic View Post
Do you use some source code versioning control system? It would be ideal to have it in GitHub.
I have very limited time to spend on k2pdfopt. I prefer to use it adding features / fixing bugs rather than learning how to use github.

Quote:
Originally Posted by timofonic View Post
Are you planning to update your software to latest Tesseract?
Tesseract v3.04.00 now depends on the ICU library, which has hundreds of source files. I'll have to wade through that at some point and see if it is worth the effort.

Quote:
Originally Posted by timofonic View Post
Latest MuPDF is 1.8.1-ios from 15 days ago...
I'll update to the latest MuPDF on my next k2pdfopt bug-fix release, but I'm not sure when that will be.

Quote:
Originally Posted by timofonic View Post
I wrote this because KOreader project got unable to use newer Tesseract and Leptonica, you need to update it in your project...
It used to be that k2pdfopt was compiled for KOreader with the MuPDF and Tesseract dependencies turned off (i.e. with macros set so that MuPDF and Tesseract library functions were not used). I'm not sure when that all changed, but apparently it has. Either way, it looks like the only reason KOreader uses Tesseract or Leptonica is so that it can compile them into the k2pdfopt library, so I'm not sure why you find it so urgent to update to bleeding edge versions of MuPDF and Tesseract. Is some other part of the KOreader source making use of the newer versions of MuPDF / Tesseract independently of the k2pdfopt source code?

Quote:
Originally Posted by timofonic View Post
Here's an historical reference why using GitHub would make other projects to deploy k2pdfopt a lot easier, plus potential contributions from other developers.
It probably wouldn't be that hard for some clever person, perhaps yourself, to write a script which would pull down my latest official release zip file and dump it to a github release.

Last edited by willus; 01-05-2016 at 10:39 PM.
willus is offline   Reply With Quote