View Single Post
Old 06-24-2011, 06:38 PM   #11
WitchyWolf
Junior Member
WitchyWolf began at the beginning.
 
WitchyWolf's Avatar
 
Posts: 1
Karma: 10
Join Date: Jun 2011
Device: Nook Color
Thank you SavageNarce. Your post was very helpful.

I had several PDFs that would lose the double-L upon conversion to any format when using Calibre. These PDFs were created with a Python PDF Converter tool of some knd. I did some experimenting based on SavageNarce's post and here is what I found worked best for me:

1. Used Foxit Reader to open the PDF. Clicked Edit and Select All to have Foxit highlight all the text for me.
2. Copied that to Word 2010 and did a Save As, selecting Rich Text Format as the type so I had an original to fall back to in case I goofed up something. Also turned on the Show/Hide Paragraph Marks feature so I could see the spaces and paragraph marks.
3. Used the Replace tool to find and replace the following:
First Replace: find .^p (period and paragraph mark) replace with .| (period and vertical bar)
Second Replace: find "^p (double-quote and paragraph mark) replace with "| (double-quote and vertical bar)
Third Replace: find ^p (paragraph mark) replace with one blank space
Fourth Replace: find | (vertical bar) and replace with ^p (one paragraph mark)

I found that using one paragraph mark gave me a more readable output once I loaded it into Calibre and converted to ePub. The whole process took about 10 minutes once I had the process down.

Last edited by WitchyWolf; 06-24-2011 at 06:40 PM.
WitchyWolf is offline   Reply With Quote