Here's a link to Toxaris's Word macro. Unfortunately it's in Dutch, so it might not be that helpful (I'm trying to translate it, but I'm only about a quarter of the way through).
Depending on your input source, Word's HTML isn't that hard to clean up in Sigil. Some of the scan conversions I get from FineReader can be a pain though.