01-11-2012, 09:03 PM | #1 |
Grand Sorcerer
Posts: 8,501
Karma: 64095689
Join Date: Jan 2008
Location: Harrisburg outskirts
Device: Palms, K1-4s, iPads, iPhones, KV, KO1
|
PDF italics yields line breaks
I have a book that exists only in PDF format, and no better version is available to buy. I'm using Calibre 0.8.34 to convert it to MOBI for a Kindle. Is there any setting or fix I can apply to stop italicized words in the PDF from generating spurious line breaks and incorrect word breaks? I tried outputting the debug info, and the problem begins in the input stage and has worsened by the parse stage. See pics for examples of extra line breaks being inserted, and of some italicized sections grabbing part of the neighboring word.
|
01-11-2012, 11:19 PM | #2 |
Member
Posts: 10
Karma: 60
Join Date: Jan 2012
Device: Kindle 4
|
I have the same issue, and it has been driving me nuts for over a week. I tried the Amazon converter, MobiPocket Creator, and everything I could find in Calibre, and nothing solves it. Ideally I would love to have some regex code that lets me control the spacing before and after every italicized word. No luck so far.
|
01-12-2012, 08:28 AM | #3 |
Member
Posts: 10
Karma: 60
Join Date: Jan 2012
Device: Kindle 4
|
Well, never mind, I solved my problem. In my case, every italicized word was missing the space after it, so it ran into the neighboring word and the two became a single word every time, e.g., "it all makessense after all".
So what I ended up doing was importing the PDF into MobiPocket Creator to get the HTML, then using its HTML editor function (which opened the book in Notepad). There I simply went to Edit -> Replace and replaced all occurrences of </i> with the same tag followed by a space. It ended up being a very simple solution. |
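For anyone who would rather script the replacement than do it in Notepad, here is a minimal Python sketch of the same idea. The function name `space_after_italics` and the sample string are made up for illustration. Unlike the plain Edit -> Replace above, this variant is an assumption on my part: it only adds the space when the closing tag is immediately followed by a letter or digit, so punctuation such as "</i>." is left alone.

```python
import re

# A closing italic tag directly followed by a letter or digit,
# i.e. the space after the italicized word is missing.
_MISSING_SPACE = re.compile(r"(</i>)(?=\w)", re.IGNORECASE)


def space_after_italics(html: str) -> str:
    """Insert one space after each </i> that runs into the next word."""
    return _MISSING_SPACE.sub(r"\1 ", html)


if __name__ == "__main__":
    sample = "it all <i>makes</i>sense after all, <i>really</i>."
    print(space_after_italics(sample))
```

To apply it to a whole book, read the exported HTML file into a string, pass it through the function, and write the result back out. Keep a copy of the original file, since the right pattern depends on how your particular PDF was converted.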
01-12-2012, 05:51 PM | #4 |
Grand Sorcerer
Posts: 8,501
Karma: 64095689
Join Date: Jan 2008
Location: Harrisburg outskirts
Device: Palms, K1-4s, iPads, iPhones, KV, KO1
|
I was going to do that, but I only found HTMLZ output in Calibre, which didn't show up as text in my editor. Where do I get MobiPocket Creator?
|
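On the HTMLZ point: an HTMLZ file is a ZIP archive with the HTML inside, which is why it does not open as text in an editor. A rough Python sketch for pulling the HTML out is below. The function name `read_htmlz` is made up, and the member name `index.html` and the UTF-8 encoding are assumptions; list the archive contents first if your file differs.

```python
import zipfile


def read_htmlz(path: str, member: str = "index.html") -> str:
    """Return the text of one file stored inside an HTMLZ (ZIP) archive.

    The default member name "index.html" is an assumption; call
    zipfile.ZipFile(path).namelist() to see what the archive holds.
    """
    with zipfile.ZipFile(path) as archive:
        return archive.read(member).decode("utf-8")
```

Renaming the file from .htmlz to .zip and unpacking it with any archive tool should get you to the same HTML without any code.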