Old 12-17-2010, 09:58 AM   #1
Snakey
What would Snake do?
Snakey began at the beginning.
 
Posts: 49
Karma: 16
Join Date: Dec 2010
Location: Canada
Device: Kindle 3
Removing paragraph breaks present after every line in EPUB?

Hello, I'm new to e-books, but I've done hours of research, so I think I have a decent understanding of things. I have several EPUBs that I'm trying to convert to MOBI for my Kindle, and these problem EPUBs appear to have a paragraph break after every single line where there shouldn't be one. I've been trying to figure out how to get Calibre to remove them, with no luck. The "line unwrap factor" feature sounded like what I was looking for, but it hasn't helped. Can someone help? Is this even possible to fix? I converted a problem EPUB to TXT and loaded it into Word, and you can see paragraph symbols after every line, while a non-problem EPUB has those symbols only where they're supposed to be. I'm including an image of a problem EPUB converted to TXT in Word, and one of a non-problem EPUB, to better illustrate the issue. Thanks a lot.

Old 12-17-2010, 10:30 AM   #2
EowynCarter
Wizard
EowynCarter ought to be getting tired of karma fortunes by now.
 
Posts: 4,337
Karma: 4000000
Join Date: Oct 2008
Location: Paris
Device: Cybooks; Sony PRS-T1
Mmm, Sigil and a regex, maybe?
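For anyone wondering what such a regex might look like, here is a rough Python sketch rather than a tested Sigil recipe. It assumes every line of the book sits in its own `<p>…</p>` and that a real paragraph always ends with sentence-ending punctuation or a closing quote. Neither assumption comes from the thread, and the names `FALSE_BREAK` and `merge_false_paragraphs` are made up for illustration.

```python
import re

# Assumption: a </p><p> boundary is a false break when the text before it
# does NOT end in sentence-ending punctuation or a closing quote.
# Whitespace is in the lookbehind set so the match cannot start after a
# trailing space that follows real end-of-sentence punctuation.
FALSE_BREAK = re.compile(r"""(?<![.!?:"'”’\s])\s*</p>\s*<p(?:\s[^>]*)?>\s*""")


def merge_false_paragraphs(xhtml: str) -> str:
    """Join paragraphs that were split in the middle of a sentence."""
    return FALSE_BREAK.sub(" ", xhtml)


sample = (
    "<p>It was a dark</p>\n"
    "<p>and stormy night.</p>\n"
    "<p>The rain fell.</p>"
)
print(merge_false_paragraphs(sample))
```

A heuristic like this will wrongly merge headings or lines of verse that lack end punctuation, so it is worth checking the result chapter by chapter.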
Old 12-17-2010, 10:30 AM   #3
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by Snakey View Post
I have several EPUBs that I'm trying to convert to MOBI for my Kindle and it appears that there is a paragraph break after every single line in these problem EPUBs where there shouldn't be.
Your problem EPUBs were badly converted from something else (probably a text file, or perhaps a PDF). I would convert them to TXT, then convert the TXT back to EPUB using the paragraph control options for text input. You haven't shown what a real paragraph break looks like, but if it's indented, there's an option for that; otherwise, it should treat double paragraph breaks as the true paragraph breaks.
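The double-break idea can be sketched in a few lines of Python. This is only an illustration of the heuristic, not Calibre's actual text-input code: it assumes a blank line marks a true paragraph break and that every single line break inside a block is a false one. The function name `unwrap_text` is hypothetical.

```python
import re


def unwrap_text(text: str) -> str:
    """Treat blank lines as real paragraph breaks and join everything else."""
    text = text.replace("\r\n", "\n")  # normalise Windows line endings
    # One or more blank lines separate the true paragraphs.
    blocks = re.split(r"\n\s*\n", text.strip())
    paragraphs = [
        " ".join(line.strip() for line in block.splitlines() if line.strip())
        for block in blocks
    ]
    return "\n\n".join(paragraphs)


sample = "It was a dark\nand stormy night.\n\nThe rain fell\nin torrents.\n"
print(unwrap_text(sample))
```

If the source marks paragraphs with indentation instead of blank lines, this sketch would merge the whole text into one paragraph, which is why the paragraph break structure matters here.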

Alternatively, you can search and replace in Word, then save as filtered HTML and import into Calibre, then convert.
Old 12-17-2010, 10:32 AM   #4
ldolse
Wizard
ldolse is an accomplished Snipe hunter.
 
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
The preprocess/line unwrap doesn't work on epub files.

Change the file's extension from .epub to .zip, then add that file back to the book's entry; Calibre will treat it as zipped HTML. Convert from the ZIP (HTML) version of the book with the 'preprocess input file' option under Structure Detection enabled. Adjust line_unwrap if you need better results.
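If you have many books to process, the renaming step can be scripted. The sketch below assumes you would rather keep the original EPUB and work on a copy; the helper name `copy_epub_as_zip` is hypothetical, and adding the resulting file to Calibre and converting it still happens in Calibre itself.

```python
import shutil
from pathlib import Path


def copy_epub_as_zip(epub_path: str) -> Path:
    """Copy book.epub to book.zip next to it, leaving the original untouched."""
    src = Path(epub_path)
    if src.suffix.lower() != ".epub":
        raise ValueError(f"not an .epub file: {src}")
    dst = src.with_suffix(".zip")
    # An EPUB is already a ZIP container, so only the extension changes.
    shutil.copyfile(src, dst)
    return dst
```

Calling `copy_epub_as_zip("book.epub")` leaves a `book.zip` next to the original, which can then be added to the book's entry as described above.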
Old 12-17-2010, 10:36 AM   #5
Snakey
I'll try to make sense of your suggestions and work on this some more. Thanks a lot, will post again with my results. Would it help if I provided an image of a real paragraph break like Starson17 mentioned?
Old 12-17-2010, 11:05 AM   #6
Snakey

Quote:
Originally Posted by ldolse View Post
The preprocess/line unwrap doesn't work on epub files.

Change the file's extension from .epub to .zip, then add that file back to the book's entry; Calibre will treat it as zipped HTML. Convert from the ZIP (HTML) version of the book with the 'preprocess input file' option under Structure Detection enabled. Adjust line_unwrap if you need better results.
This seems to have worked virtually flawlessly; I didn't even have to use line_unwrap. This solves my problem, and hopefully I can figure out any further issues on my own. Thanks so much, everyone!
Old 12-17-2010, 11:08 AM   #7
Starson17
Quote:
Originally Posted by Snakey View Post
I'll try to make sense of your suggestions and work on this some more. Thanks a lot, will post again with my results. Would it help if I provided an image of a real paragraph break like Starson17 mentioned?
I see the type of problem you have in badly converted text files, so I was addressing it as that. That's where knowing the paragraph break structure may help. If your EPUB has formatting you want to preserve, then ldolse's suggestions make more sense, and the paragraph break structure is less critical. You actually have lots of options, from manual search and replace to various automatic conversions. Try each and you'll find one that works for you.