Quote:
Originally Posted by j.p.s
|
Many thanks for your info. I was doing some experiments that confirm what you wrote:
1. I downloaded and installed this GUI:
https://github.com/Parathantl/tesseract_gui/releases
(It installs Tesseract 4 but is easy to replace V4 with V5).
2. That GUI is to OCR pdf files.
3. I OCRed a pdf with 25 pages and I noted the time to finish the task.
4. I repeated the job but in console mode. Results were practically the same.
5. After my tests, I can say that ABBy is -at least- twice faster than Tesseract while the accuracy is almost the same.
Finally, I think I discover the cause of the difference of speed; Tesseract is using ONLY ONE CPU. I don't know how was compiled the .exe (for 64bits) but is not multithreading or the user doesn't have the option to enable it (maybe under Linux things are different). A real pity because is a nice program with a very good OCR precision and free.