06-06-2020, 11:56 PM   #5
roger64
Thanks for this interesting tip.

I am an Arch Linux user.

I have been using Tesseract extensively for over a year. Usually, when I have to deal with a PDF, I batch-convert it to PNG with ImageMagick, clean up the pages in Scan Tailor, and then run the OCR.
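
For anyone who wants to script that workflow, here is a rough sketch of how it might look in Python. It is only an illustration, not my exact commands: the function names, the 300 dpi density, the `eng` language and the file naming pattern are all assumptions, and on some ImageMagick 7 installs the command is `magick` rather than `convert`. The Scan Tailor cleanup is left as a manual step.

```python
from pathlib import Path
from typing import List
import subprocess


def pdf_to_png_command(pdf: Path, out_dir: Path, dpi: int = 300) -> List[str]:
    """Build an ImageMagick command that renders every PDF page to a PNG.

    The 300 dpi default and the "<name>-000.png" pattern are assumptions.
    """
    pattern = out_dir / f"{pdf.stem}-%03d.png"
    return ["convert", "-density", str(dpi), str(pdf), str(pattern)]


def ocr_command(image: Path, lang: str = "eng") -> List[str]:
    """Build a Tesseract command; the text lands in <image name>.txt."""
    output_base = image.with_suffix("")  # Tesseract appends ".txt" itself
    return ["tesseract", str(image), str(output_base), "-l", lang]


def run_pipeline(pdf: Path, out_dir: Path, lang: str = "eng") -> None:
    """Convert the PDF to PNG pages, then OCR each page."""
    out_dir.mkdir(parents=True, exist_ok=True)
    subprocess.run(pdf_to_png_command(pdf, out_dir), check=True)
    # The Scan Tailor cleanup would happen here, by hand, before the OCR.
    for image in sorted(out_dir.glob("*.png")):
        subprocess.run(ocr_command(image, lang), check=True)
```

Calling `run_pipeline(Path("book.pdf"), Path("pages"))` would need both ImageMagick and Tesseract on the PATH. Keeping the command builders separate from the runner makes it easy to print the commands and check them before running anything.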

I installed k2pdfopt from the AUR by compiling it. However, something was missing, because when I tried it I got this message:

Code:
[...]
k2pdfopt v2.51 (w/DjVuLibre) (c) 2020, GPLv3, http://willus.com
    Compiled Jun  7 2020 with Gnu C v10.1.0 for Linux on x64.

** No OCR capability in this compile of k2pdfopt! **
I have seen in the comments here that this package has some trouble in this regard (OCR). Using a Windows version would be overkill for me. So, regrettably, I am giving up on this for now.

Last edited by roger64; 06-07-2020 at 02:56 AM. Reason: regrets