View Single Post
Old 06-01-2010, 05:22 PM   #1
Daddy Warpig
Enthusiast
Daddy Warpig began at the beginning.
 
Posts: 49
Karma: 14
Join Date: Apr 2010
Device: iPad & iPhone
ePub Output Bug, Caused by MSWord

There is an annoying bug in the Calibre ePub conversion module, linked to a "feature" of MSWord.

This original text:

Code:
to Unseelie Court on King Street and tease
is converted to the following text:

Code:
to
Unseelie Court
on

King Street
and tease
Cause:

MS Word Generated HTML/XHTML includes "smart tags." When such an HTML file is converted to ePub, these tags are translated, but errant <p> tags are inserted into the new html.

Original HTML code:

Code:
to <st1:Street w:st="on"><st1:address
 w:st="on">Unseelie Court</st1:address></st1:Street> on <st1:Street w:st="on"><st1:address
 w:st="on">King Street</st1:address></st1:Street> and tease
Translated HTML code:

Code:
to</p>
<address class="calibre8"><span>Unseelie</span> Court</address>
<p>on</p>
<address class="calibre8">King Street</address>
<p>and tease
Some solutions for end users:

Either erase the MSWord smart tags before converting, or fix the <p> tags by hand after converting (unzip ePub, edit .html or .xhtml files, rezip).

This has been reported as ticket #5671 in the Calibre Bug Tracking system.
Daddy Warpig is offline   Reply With Quote