MobileRead Forums - View Single Post

roger64 · 06-07-2020, 12:56 AM

Thanks for this interesting tip.

I am an Arch Linux user.

I have been using Tesseract extensively for over a year. Usually, when I have to deal with a PDF, I batch-convert it to PNG with ImageMagick, then run the pages through Scan Tailor before performing the OCR.
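For anyone curious, that workflow could be scripted roughly as below. This is only a minimal sketch, assuming ImageMagick's `convert` (called `magick` in newer releases) and the `tesseract` binary are on the PATH. The function names, the 300 dpi density, and the `page-%03d.png` naming are illustrative assumptions, and the Scan Tailor cleanup is left as a manual step between the two stages.

```python
import subprocess
from pathlib import Path


def convert_cmd(pdf, out_dir, dpi=300):
    """Build the ImageMagick command that rasterizes each PDF page to a PNG."""
    # The %03d pattern asks ImageMagick for numbered files such as page-000.png.
    pattern = str(Path(out_dir) / "page-%03d.png")
    return ["convert", "-density", str(dpi), str(pdf), pattern]


def tesseract_cmd(png, lang="eng"):
    """Build the Tesseract command; output goes to <image name without .png>.txt."""
    output_base = str(Path(png).with_suffix(""))
    return ["tesseract", str(png), output_base, "-l", lang]


def ocr_pdf(pdf, out_dir, lang="eng"):
    """Rasterize the PDF, then OCR every page image found in out_dir."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    subprocess.run(convert_cmd(pdf, out_dir), check=True)
    # Scan Tailor cleanup (deskew, margins, etc.) would be done here by hand.
    for png in sorted(out_dir.glob("page-*.png")):
        subprocess.run(tesseract_cmd(png, lang), check=True)
```

Calling `ocr_pdf("book.pdf", "pages")` would then leave one `.txt` file per page image in `pages/`. The two command builders are kept separate from the runner so the exact command lines can be inspected before anything is executed.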

I installed k2pdfopt from the AUR by compiling it. However, something was missing, because when I tried it I got this message:

Code:

[...]
k2pdfopt v2.51 (w/DjVuLibre) (c) 2020, GPLv3, http://willus.com
    Compiled Jun  7 2020 with Gnu C v10.1.0 for Linux on x64.

** No OCR capability in this compile of k2pdfopt! **

I have seen here in the comments that this package has some trouble in this regard (OCR). Using a Windows version would be overkill for me, so I am regrettably giving up on this attempt for now.

Last edited by roger64; 06-07-2020 at 03:56 AM. Reason: regrets