[GUI Plugin] OCRthisPDF – Version 1.0.0 – May 31, 2026
A Calibre GUI plugin that adds a searchable text layer to scanned PDFs ("in-place OCR", based on OCRmyPDF).
An optional proofreading function is planned but not yet implemented.
Version History:
----------------
Version 1.0.0 – May 31, 2026
- Initial release.
Installation:
-------------
1. Install OCRmyPDF and its dependencies (Tesseract, Ghostscript, etc.) according to the OCRmyPDF documentation at:
https://ocrmypdf.readthedocs.io/en/l...roduction.html
Additional training files for German and German Fraktur script are provided by the University of Mannheim:
https://github.com/UB-Mannheim/tesseract/wiki
2. Install the plugin as usual. No configuration is currently required.
3. Add the plugin to a toolbar or menu of your choice in the Calibre preferences (Toolbars & menus).
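
To check step 1 before using the plugin, a small script along these lines can report which external tools are not found on the PATH. This is only a sketch and not part of the plugin: the function name missing_tools is made up for illustration, and the executable names are assumptions (on Windows, Ghostscript is usually installed as "gswin64c" rather than "gs").

```python
import shutil

# Assumed executable names; adjust for your platform
# (e.g. "gswin64c" instead of "gs" on Windows).
REQUIRED_TOOLS = ("ocrmypdf", "tesseract", "gs")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the names in `tools` that cannot be found on the PATH."""
    return [name for name in tools if which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All required tools were found.")
```

If a tool is reported as missing although it is installed, its directory is probably not on the PATH that Calibre sees.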
Usage:
------
- In most cases, the default settings are sufficient.
- Detailed information can be found in the OCRmyPDF documentation.
- The OCR language passed to Tesseract is taken from the book's language metadata; if none is set, English is used.
- If the pages in the PDF are not correctly oriented (e.g., rotated by 90 degrees), check the "rotate" box.
- For books in German Fraktur script, check the "fraktur" box.
- The proofreading step has not yet been implemented (coming in version 2).
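
For readers who want to see roughly what these options correspond to on the OCRmyPDF command line, here is a minimal sketch. It is not the plugin's actual code. The function build_ocrmypdf_args and its parameters are hypothetical, and so is the mapping of the checkboxes: "rotate" is assumed to correspond to OCRmyPDF's --rotate-pages option, and "fraktur" is assumed to switch the language to a Fraktur model (here "frk"; the name of the model you have installed may differ, e.g. one of the Mannheim training files).

```python
def build_ocrmypdf_args(input_pdf, output_pdf, languages=None,
                        rotate=False, fraktur=False, fraktur_model="frk"):
    """Build an OCRmyPDF command line as a list of arguments.

    `languages` is a list of Tesseract language codes such as ["deu"];
    English is used when none are given. The Fraktur handling is an
    assumption for illustration, not the plugin's confirmed behavior.
    """
    langs = list(languages) if languages else ["eng"]
    if fraktur:
        # Assumption: the Fraktur model replaces the metadata language.
        langs = [fraktur_model]

    # Tesseract accepts several languages joined with "+".
    args = ["ocrmypdf", "-l", "+".join(langs)]
    if rotate:
        # Let OCRmyPDF detect and correct the page orientation.
        args.append("--rotate-pages")
    args += [str(input_pdf), str(output_pdf)]
    return args


if __name__ == "__main__":
    print(" ".join(build_ocrmypdf_args("scan.pdf", "scan_ocr.pdf",
                                       languages=["deu"], rotate=True)))
```

The resulting list can be passed to subprocess.run(args, check=True). Returning a list instead of a single string avoids quoting problems with file names that contain spaces.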
Bug Reports and Suggestions:
----------------------------
If you encounter any issues or have suggestions, please report them in the MobileRead forum.