| 
			
			 | 
		#1 | 
| 
			
			
			
			 Custom User Title 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 11,359 
				Karma: 79528341 
				Join Date: Oct 2018 
				Location: Canada 
				
				
				Device: Kobo Libra H2O, formerly Aura HD 
				
				
				 | 
	
	
	
		
		
			
			 
				
				Is there a GUI for OCRmyPDF?
			 
			
			
			https://wiki.mobileread.com/wiki/OCRmyPDF 
		
	
		
		
		
		
		
		
		
		
		
		
	
	I am... fairly bad with using the command-line. Does anybody know if there was any sort of GUI made for this?  
		 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#2 | 
| 
			
			
			
			 Fuzzball, the purple cat 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,312 
				Karma: 11087488 
				Join Date: Jun 2011 
				Location: California 
				
				
				Device: iPad 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			For any particular platform / operating system?  Looks like OCRmyPDF is largely targeted at linux but can install on OS/X and Windows with some third party support (e.g. Python, Homebrew/Cygwin...).  There are a lot of other OCR apps if you find OCRmyPDF difficult to use.  If you have an iPhone, for example, there is Elucidate which also uses the Tesseract OCR engine.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#3 | 
| 
			
			
			
			 Custom User Title 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 11,359 
				Karma: 79528341 
				Join Date: Oct 2018 
				Location: Canada 
				
				
				Device: Kobo Libra H2O, formerly Aura HD 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			I somehow managed to completely skip over the fact that there's no real Windows implementation  
		
	
		
		
		
		
		
		
		
		
		
		
	
	 
		 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#4 | 
| 
			
			
			
			 Still reading 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 15,004 
				Karma: 111111255 
				Join Date: Jun 2017 
				Location: Ireland 
				
				
				Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			It's really for tesseract anyway. Even on Linux you might use something else with Tesseract! 
		
	
		
		
		
		
		
		
		
		
		
		
	
	You can run Linux for free, either on a VM (Openbox is free on Windows 10 and recommended MS solution for XP and Win7 on Win10), or USB stick or dual boot, or ditch windows (me entirely in Jan 2017, but I have a clone of my 2002 XP laptop on OpenBox VM on Linux and Office 2003 on WINE).  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#5 | 
| 
			
			
			
			 Resident Curmudgeon 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 80,782 
				Karma: 150249619 
				Join Date: Nov 2006 
				Location: Roslindale, Massachusetts 
				
				
				Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			There's no good program for OCRing PDF. It's not possible.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#6 | 
| 
			
			
			
			 Grand Sorcerer 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,842 
				Karma: 105494725 
				Join Date: Apr 2011 
				
				
				
				Device: pb360 
				
				
				 | 
	
	|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#7 | 
| 
			
			
			
			 Resident Curmudgeon 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 80,782 
				Karma: 150249619 
				Join Date: Nov 2006 
				Location: Roslindale, Massachusetts 
				
				
				Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 
				
				
				 | 
	
	|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#8 | |
| 
			
			
			
			 Grand Sorcerer 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,842 
				Karma: 105494725 
				Join Date: Apr 2011 
				
				
				
				Device: pb360 
				
				
				 | 
	
	
	
		
		
		
		
		 Quote: 
	
 You have proved nothing. Prove your assertion, retract, or rephrase to be true. (You have been told that makking an assertion does not establish a fact and it certainly does not constitute proof.)  | 
|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#9 | 
| 
			
			
			
			 Still reading 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 15,004 
				Karma: 111111255 
				Join Date: Jun 2017 
				Location: Ireland 
				
				
				Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper 
				
				
				 | 
	
	|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#10 | 
| 
			
			
			
			 Guru 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 968 
				Karma: 13558066 
				Join Date: Jul 2017 
				
				
				
				Device: Boox Nova 2 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			It really depends what you're using the OCR'd text to do. To make an ePub? Yeah it'll need some work. To search the text of a bunch of PDFs? It can do a pretty good job.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#11 | 
| 
			
			
			
			 Custom User Title 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 11,359 
				Karma: 79528341 
				Join Date: Oct 2018 
				Location: Canada 
				
				
				Device: Kobo Libra H2O, formerly Aura HD 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			I am now very confused. I just wanted to add a text layer to some scanned booklets I had for search purposes.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#12 | 
| 
			
			
			
			 Guru 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 968 
				Karma: 13558066 
				Join Date: Jul 2017 
				
				
				
				Device: Boox Nova 2 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Tesseract is an open source OCR engine that a lot of programs use, OCRmyPDF included. There's a number of programs to do OCR for free using it like gImageReader that works on Windows with a GUI. I don't think gImageReader will embed the text in a layer like OCRmyPDF but it will let you get a text document out at least.  
		
	
		
		
		
		
		
		
		
		
		
		
	
	JSWolf seems to think OCR on PDF is useless but he's wrong. If you have Windows 10/11 it's also not that hard to get OCRmyPDF working under the Windows Subsystem for Linux (WSL). There's also some websites that will do it like https://www.sandwichpdf.com/  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#13 | |
| 
			
			
			
			 Fuzzball, the purple cat 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,312 
				Karma: 11087488 
				Join Date: Jun 2011 
				Location: California 
				
				
				Device: iPad 
				
				
				 | 
	
	
	
		
		
		
		
		 Quote: 
	
 If you dig deeper there are options to adjust contrast, gamma correction, and output resolution. Last edited by willus; 03-08-2022 at 11:52 PM.  | 
|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
![]()  | 
            
        
    
            
  | 
    
			 
			Similar Threads
		 | 
	||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| [GUI Plugin] Save Virtual Libraries To Column (GUI) | chaley | Plugins | 14 | 04-04-2021 06:25 AM | 
| OCRmyPDF adds OCR text layer to scanned PDF files | orebmur | 0 | 01-20-2018 07:16 PM | |
| GUI Icons | Rellwood | Development | 1 | 07-09-2017 12:19 PM | 
| GUI Changes | luketheobscure | Development | 40 | 07-14-2011 05:23 PM | 
| Frustrated with GUI | yocalif | Library Management | 23 | 04-11-2011 04:09 PM |