|  07-11-2020, 08:14 PM | #16 | 
| Guru            Posts: 942 Karma: 53902736 Join Date: Jun 2015 Device: multiple | 
			
			Yes. In my experience the biggest problems are: 1. Either omitting important graphic tables, or retaining unimportant page frames, page backgrounds, and other cruft. 2. Mis-OCRing numbers, since it's especially hard to spot when these are off, and they throw the resulting figures off. 3. Screwing up text tables. 4. Columns, columns, columns. Last edited by MarjaE; 07-11-2020 at 08:17 PM. | 
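One way to catch mis-OCRed numbers is to compare the numeric tokens in a trusted text against the converted output. This is only a rough sketch: it assumes you already have a reliable reference text (for example, the text layer of a vector PDF), which a pure scan will not give you. The function name `number_mismatches` and the sample strings are made up for illustration.

```python
import re
from collections import Counter

# Digits, optionally grouped with "." or "," (e.g. 1,250.75 or 3.14)
NUMBER = re.compile(r"\d+(?:[.,]\d+)*")

def number_mismatches(reference_text, converted_text):
    """Return (missing, unexpected) numeric tokens as Counters.

    missing:    numbers in the reference that the conversion lost or changed
    unexpected: numbers in the conversion that the reference does not have
    """
    ref = Counter(NUMBER.findall(reference_text))
    out = Counter(NUMBER.findall(converted_text))
    return ref - out, out - ref

# Hypothetical sample: the conversion turned 1,250.75 into 1,260.75
missing, unexpected = number_mismatches(
    "Total 1,250.75 over 18 months",
    "Total 1,260.75 over 18 months",
)
print("missing:", dict(missing))
print("unexpected:", dict(unexpected))
```

It ignores position, so it tells you which numbers to go looking for, not where they are.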
|   |   | 
|  07-11-2020, 08:46 PM | #17 | |
| null operator (he/him)            Posts: 22,010 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | Quote: 
 To convert a PDF to something editable, the first tool I try is Word itself (2016 or later). It can't handle very large PDFs, but the results can be as good as or better than the FineReader scanning route. I've wondered if MS and Adobe use a common code base for the PDF-related improvements they've made to Acrobat and Word. There was speculation a few months ago that MS would acquire Adobe. BR | |
|   |   | 
|  07-11-2020, 09:25 PM | #18 | |
| Bookmaker & Cat Slave            Posts: 11,503 Karma: 158448243 Join Date: Apr 2010 Location: Phoenix, AZ Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2 | Quote: 
 Hitch | |
|   |   | 
|  07-12-2020, 04:31 AM | #19 | 
| Evangelist            Posts: 441 Karma: 77256 Join Date: Sep 2011 Device: none | 
			
			If I need OCR, I'll generally use FineReader. Of what I've tried so far, Acrobat seems in many cases to be the best overall at exporting commercial vector PDFs. Unfortunately there aren't many options. Sometimes HTML export is better than Word and easier to correct, though the HTML isn't great. It can style a whole paragraph as italic with a span for the normal text, which takes a bit of work to correct. Or it can create spurious lists: currently, when a line break within a paragraph leaves "1." or some initial at the start of the next line, it breaks the paragraph there and creates a list, which is a lot of work to correct. Maybe I will try Nitro, Phantom and whatever other PDF readers I can find again. Word might be an option, but it's a large PDF that I'd have to split. I'm afraid the splits would produce CSS style definitions with the same name but a different definition in each part, which would cause other pains to fix. | 
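The same-name, different-definition worry can at least be checked before merging the parts. Below is a minimal sketch, assuming each split's CSS has been pulled out into a string; the file names, the function names and the very simple rule parser (no comments, no @media blocks) are all assumptions for illustration, not anything the export tools provide.

```python
import re
from collections import defaultdict

# Matches flat "selector { declarations }" rules only
RULE = re.compile(r"([^{}]+)\{([^{}]*)\}")

def parse_rules(css):
    """Map each selector to an order-independent set of normalized declarations."""
    rules = {}
    for selectors, body in RULE.findall(css):
        decls = set()
        for decl in body.split(";"):
            if not decl.strip():
                continue
            prop, _, value = decl.partition(":")
            decls.add(f"{prop.strip().lower()}:{' '.join(value.split())}")
        for selector in selectors.split(","):
            rules[selector.strip()] = frozenset(decls)
    return rules

def conflicting_selectors(css_by_part):
    """Return {selector: {declarations: [parts]}} for selectors that differ between parts."""
    seen = defaultdict(dict)
    for part, css in css_by_part.items():
        for selector, decls in parse_rules(css).items():
            seen[selector].setdefault(decls, []).append(part)
    return {sel: variants for sel, variants in seen.items() if len(variants) > 1}

# Hypothetical stylesheets from two splits of the same PDF
parts = {
    "part1.css": ".s1 { font-style: italic }",
    "part2.css": ".s1 { font-weight: bold }\n.s2 { color: red }",
}
conflicts = conflicting_selectors(parts)
for selector, variants in conflicts.items():
    print(selector, "is defined", len(variants), "different ways")
```

Any selector it reports would need renaming in one of the parts (say, a per-part prefix) before the stylesheets are combined.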
|   |   | 
|  07-12-2020, 05:58 AM | #20 | 
| Resident Curmudgeon            Posts: 80,742 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | 
			
			Once you've converted the PDF to anything else, you'll need to A/B compare the PDF to the output format. That means every letter, every space, every punctuation mark, etc. Because if you don't, you WILL have errors.
		 | 
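Where the PDF has a usable text layer, part of that A/B comparison can be pointed in the right direction with a character-level diff. The following is a sketch using Python's standard `difflib`; it assumes you have already extracted the reference text and the converted text into strings (how you extract them is up to you), and `report_differences` is just an illustrative name. It narrows down where to look and does not replace reading the result.

```python
import difflib

def report_differences(reference, converted):
    """Return (operation, reference_fragment, converted_fragment) for every non-matching run."""
    matcher = difflib.SequenceMatcher(None, reference, converted, autojunk=False)
    return [
        (tag, reference[i1:i2], converted[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]

# Hypothetical one-line sample; real use would loop over paragraphs or pages
for operation, ref_part, out_part in report_differences("cat", "cut"):
    print(operation, repr(ref_part), "->", repr(out_part))
```

Character-level matching gets slow on long inputs, so comparing paragraph by paragraph is likely more practical than feeding it a whole book at once.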
|   |   | 