Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 11-30-2010, 06:39 PM   #1
gnychis
Junior Member
gnychis began at the beginning.
 
Posts: 9
Karma: 10
Join Date: Nov 2010
Device: ipad
additional breakline after every line (need to wrap)

It seems that after I convert from PDF to ePub, the conversion has trouble wrapping lines. That is, between every line of text I have an additional blank line (whitespace). It turns out to look like double spaced text. I think this is the opposite problem as unwrapping the text, where I am looking to re-wrap some of it.

I found the un-wrap helper in the structure part of the conversion, is there something for wrapping?
gnychis is offline   Reply With Quote
Old 11-30-2010, 07:01 PM   #2
gnychis
Junior Member
gnychis began at the beginning.
 
Posts: 9
Karma: 10
Join Date: Nov 2010
Device: ipad
to provide a little more information, I think this is created by the fact that the PDF uses hard line breaks after each line.

So, if I look at the HTML I see something like this for a single paragraph and transition to the next paragraph (pardon the foreign language):
Code:
Οκύριος και η κυρία Ντάρσλι, που έμεναν στο νούμε-<br>
ρο 4 της οδού Πριβέτ, έλεγαν συχνά, και πάντα με<br>
υπερηφάνεια, πως ήταν απόλυτα φυσιολογικοί άν-<br>
θρωποι, τίποτα περισσότερο ή λιγότερο. Ήταν οι τελευταίοι<br>
άνθρωποι που θα περίμενε κανείς να δει ανακατεμένους σε<br>
κάτι παράξενο ή μυστήριο, απλώς και μόνο γιατί και οι ίδιοι<br>
πίστευαν πως δεν υπήρχαν αληθινά τέτοιες ανοησίες στη<br>
ζωή.<br>   ###### TRANSITION TO NEXT PARAGRAPH.....
Ο κύριος Ντάρσλι ήταν ο διευθυντής ενός εργοστασίου<br>
με το όνομα «Γκράνινγκς», το οποίο έφτιαχνε γεωτρύπανα.<br>
gnychis is offline   Reply With Quote
Advert
Old 11-30-2010, 07:18 PM   #3
ldolse
Wizard
ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.
 
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
Make sure the line un-wrap factor under pdf input is set to the default of 0.45. If that still doesn't work file a bug at bugs.calibre-ebook.com with an example file.
ldolse is offline   Reply With Quote
Old 12-01-2010, 10:29 AM   #4
gnychis
Junior Member
gnychis began at the beginning.
 
Posts: 9
Karma: 10
Join Date: Nov 2010
Device: ipad
It was the default, thanks I can build an example file
gnychis is offline   Reply With Quote
Old 12-01-2010, 02:18 PM   #5
gnychis
Junior Member
gnychis began at the beginning.
 
Posts: 9
Karma: 10
Join Date: Nov 2010
Device: ipad
I have created an example and filed a ticket, thanks!

http://bugs.calibre-ebook.com/ticket/7761
gnychis is offline   Reply With Quote
Advert
Old 12-02-2010, 12:30 AM   #6
ldolse
Wizard
ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.
 
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
The problem is because the current line unwrapping function requires a Roman alphabet. Handling Greek will require the new pdf engine. There isn't much you can do from the GUI, but you could use the command line with debug output to get an html conversion with the new pdf engine. It won't finish a complete conversion, it will error out, but there will be usable html in the debug directory.

The new engine should un-wrap the lines, but in my experience it un-wraps too much - YMMV. The command line should be something like this:[code]ebook-convert book.pdf book.epub --new-pdf-engine -v --debug-pipeline=/somedirectory/[code] That will vary slightly by platform, filenames, etc.
ldolse is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
PDF Line Un-Wrap Factor bug? jotekman Calibre 2 03-15-2010 11:43 AM
LRF and wrap-around text Seabound Calibre 13 12-28-2008 03:30 PM
Word wrap in the forum [closed] JSWolf Lounge 51 11-11-2007 10:22 PM


All times are GMT -4. The time now is 02:01 AM.


MobileRead.com is a privately owned, operated and funded community.