View Single Post
Old 01-14-2012, 09:33 AM   #6
jackie_w
Grand Sorcerer
jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.
 
Posts: 6,252
Karma: 16544692
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
Quote:
Originally Posted by Cptow View Post
Can someone tell me if its the original input problem or i miss some conversion steps?
Noticed that few of the books all have this problem.
This is not a great epub. The reason for the excess of blank lines is that the source contains 2 surplus linebreaks between each real paragraph.

If you are using Calibre you can produce a much-improved (not perfect) epub by doing an epub-to-epub conversion with the following conversion settings:

Look&Feel page:
check box 'Remove spacing between paragraphs'

Heuristic Processing page:
check box 'Enable heuristic processing'
check box 'Detect and markup unformatted chapter hdgs...'
uncheck all other boxes

Structure Detection page:
set box 'Detect chapters at' to //h:h2

Table of Contents page:
check box 'Force use of auto-generated TOC'

Search&Replace page:
First expression (to remove surplus linebreaks)
set box 'Search Regular Expression' to the following exact text
Code:
<br />\s*<br />
box 'Replacement text' should be empty
Second expression (to remove unwanted page numbers and improve Chapter detection)
set box 'Search Regular Expression' to the following exact text
Code:
Page\s*\d+\s*
box 'Replacement text' should be empty

EPUB Output page:
make sure box 'Split files larger than' is set to 260 KB
(It needs to be done to fix a problem with the original)
There are other ways to correct this epub, but I've detailed a method which can be achieved using Calibre conversion settings only, rather than manually editing the html & css inside the epub.
jackie_w is offline   Reply With Quote