Thank you SavageNarce. Your post was very helpful.
I had several PDFs that would lose the double-L upon conversion to any format when using Calibre. These PDFs were created with a Python PDF Converter tool of some knd. I did some experimenting based on SavageNarce's post and here is what I found worked best for me:
1. Used Foxit Reader to open the PDF. Clicked Edit and Select All to have Foxit highlight all the text for me.
2. Copied that to Word 2010 and did a Save As, selecting Rich Text Format as the type so I had an original to fall back to in case I goofed up something. Also turned on the Show/Hide Paragraph Marks feature so I could see the spaces and paragraph marks.
3. Used the Replace tool to find and replace the following:
First Replace: find .^p (period and paragraph mark) replace with .| (period and vertical bar)
Second Replace: find "^p (double-quote and paragraph mark) replace with "| (double-quote and vertical bar)
Third Replace: find ^p (paragraph mark) replace with one blank space
Fourth Replace: find | (vertical bar) and replace with ^p (one paragraph mark)
I found that using one paragraph mark gave me a more readable output once I loaded it into Calibre and converted to ePub. The whole process took about 10 minutes once I had the process down.
Last edited by WitchyWolf; 06-24-2011 at 06:40 PM.
|