Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old 01-11-2012, 09:03 PM   #1
badgoodDeb
Wizard
badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.
 
badgoodDeb's Avatar
 
Posts: 4,385
Karma: 12122505
Join Date: Jan 2008
Location: Chicago outskirts
Device: Palms, K1, K3, K4s, iPad, iPhone 5
PDF italics yeilds line breaks

I have a book only in PDF format, and not available to buy a better version. I'm using Calibre 0.8.34 to convert it to mobi for a kindle. Is there any setting or fix I can apply to prevent italicized words in the pdf from generating spurious line breaks and incorrect word breaks too? I tried outputting the DEBUG info, and the problem begins in the input section and has worsened by the parse section. See pics for examples of extra line breaks being inserted, and also for some italicized sections grabbing part of the neighboring word too.
Attached Thumbnails
Click image for larger version

Name:	1-pdf.jpg
Views:	55
Size:	54.3 KB
ID:	81103   Click image for larger version

Name:	2-Debug-input.jpg
Views:	60
Size:	48.5 KB
ID:	81104   Click image for larger version

Name:	3-debug-parsed.jpg
Views:	52
Size:	51.5 KB
ID:	81105  
badgoodDeb is offline   Reply With Quote
Old 01-11-2012, 11:19 PM   #2
voidwards
Junior Member
voidwards began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Jan 2012
Device: Kindle 4
I have the same issue, it has been driving me nuts for over a week. I tried the Amazon converter, MobiPocket creator, and everything I could find in Calibre and nothing will solve it. I would ideally love to have some regex code that would allow me to control the spacing before and after every italicized word. No luck so far.
voidwards is offline   Reply With Quote
 
Enthusiast
Old 01-12-2012, 08:28 AM   #3
voidwards
Junior Member
voidwards began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Jan 2012
Device: Kindle 4
Well, nevermind, I solved my problem. In my case, all italicized word were missing a space after the word, so it would run into the neighboring word and effectively make one word only every time, e.g., "it all makessense after all".

So what I ended up doing is I imported the PDF in MobiPocket Creator to get the html, and used their html editor function there (which opened my notepad with the book in it) and I simply went in Edit -> Replace, and replaced all the occurences of </i> with the same but with an added space after it. It ended up being a very simple solution.
voidwards is offline   Reply With Quote
Old 01-12-2012, 05:51 PM   #4
badgoodDeb
Wizard
badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.badgoodDeb ought to be getting tired of karma fortunes by now.
 
badgoodDeb's Avatar
 
Posts: 4,385
Karma: 12122505
Join Date: Jan 2008
Location: Chicago outskirts
Device: Palms, K1, K3, K4s, iPad, iPhone 5
I was going to do that, but only found HTMLZ output in Calibre. Which didn't show up as text, in my editor. Where do I get MobiPocket Creator?
badgoodDeb is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Kindle 3 PDF Conversion Line Breaks mvnjpy Calibre 3 09-26-2010 09:36 PM
Opening ePub in Sigil breaks TOC and loses italics PatNY Sigil 15 08-25-2010 07:05 PM
Ignoring line breaks in pdf file mike_bike_kite Calibre 0 06-14-2010 09:37 AM
No line breaks ecpepper Amazon Kindle 3 08-09-2009 06:42 PM
Calibre PDF to LRF losing line breaks kad032000 Calibre 11 06-23-2008 10:22 AM


All times are GMT -4. The time now is 01:29 AM.


MobileRead.com is a privately owned, operated and funded community.