Cleaning up Word-generated html is fine if you're a masochist. If you regularly generate epubs from Word files then Atlantis Word processor is a far better option.
Valloric is right, this behaviour sounds very much like what happens when HTMLTidy tries to resolve an unclosed inline tag.
|