|
|
#1 |
|
Enthusiast
![]() Posts: 49
Karma: 14
Join Date: Apr 2010
Device: iPad & iPhone
|
ePub Output Bug, Caused by MSWord
There is an annoying bug in the Calibre ePub conversion module, linked to a "feature" of MSWord.
This original text: Code:
to Unseelie Court on King Street and tease Code:
to Unseelie Court on King Street and tease MS Word Generated HTML/XHTML includes "smart tags." When such an HTML file is converted to ePub, these tags are translated, but errant <p> tags are inserted into the new html. Original HTML code: Code:
to <st1:Street w:st="on"><st1:address w:st="on">Unseelie Court</st1:address></st1:Street> on <st1:Street w:st="on"><st1:address w:st="on">King Street</st1:address></st1:Street> and tease Code:
to</p> <address class="calibre8"><span>Unseelie</span> Court</address> <p>on</p> <address class="calibre8">King Street</address> <p>and tease Either erase the MSWord smart tags before converting, or fix the <p> tags by hand after converting (unzip ePub, edit .html or .xhtml files, rezip). This has been reported as ticket #5671 in the Calibre Bug Tracking system. |
|
|
|
|
|
#2 | |
|
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,273
Karma: 61916422
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
I noticed that street names seem to get broken up instead of just Italicized. Figured Kovid liked it that way
|
|
|
|
|
| Advert | |
|
|
|
|
#3 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6,270
Karma: 16544702
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
|
Hi Daddy Warpigs,
If you generate your HTML using MSWord, you should use the SaveAs Webpage-Filtered option rather than SaveAs Webpage. The "smart tags" should then not be created in your generated HTML and there is no need for manual editing. |
|
|
|
|
|
#4 |
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,798
Karma: 30548723
Join Date: Dec 2006
Location: Singapore
Device: Boyue
|
I would also recommend passing the html file through html tidy.
That cleans up many of the crap word add to the file. I have seen files go down from 1 mb to about 500kb sometimes |
|
|
|
![]() |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| [Old Thread] Epub Output: Line Height | greenapple | Conversion | 20 | 01-27-2013 10:27 AM |
| EPUB output | kovidgoyal | Calibre | 920 | 02-05-2011 12:59 PM |
| EPUB output justification | toki08 | Calibre | 10 | 01-08-2011 05:14 PM |
| Seems Amazon have caused an epub price war in the UK | ceebee_uk | General Discussions | 11 | 09-27-2010 05:20 AM |
| epub output metadata | troymc | Calibre | 5 | 05-22-2010 01:23 AM |