Has anyone seen this article? It describes using XML stylesheets to make Word export much cleaner XHTML code. I tried it on a few files and it does seem to work rather nicely.
Basically, what you do is save the document as an XML file, but apply an XML stylesheet while saving. The result has the .xml extension, but is in fact a much cleaner HTML file.
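
To give a rough idea of what a stylesheet like that does, here is a minimal sketch in Python. It is not the article's stylesheet and it is not XSLT: it uses the standard library's ElementTree to do the same kind of mapping by hand. It assumes the Word 2003 XML layout (`w:p` paragraphs containing `w:r` runs with `w:t` text), and the `SAMPLE` document and the `word_xml_to_xhtml` function name are made up for illustration. It keeps the paragraph text and drops all of Word's formatting markup.

```python
import xml.etree.ElementTree as ET

# Namespace used by Word 2003 XML (WordprocessingML) documents.
W = "http://schemas.microsoft.com/office/word/2003/wordml"
XHTML = "http://www.w3.org/1999/xhtml"

# Hypothetical, hand-written sample; a real Word file carries far more markup.
SAMPLE = """<w:wordDocument xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml">
  <w:body>
    <w:p><w:r><w:t>Hello, </w:t></w:r><w:r><w:t>world.</w:t></w:r></w:p>
    <w:p><w:r><w:t>Second paragraph.</w:t></w:r></w:p>
  </w:body>
</w:wordDocument>"""


def word_xml_to_xhtml(source):
    """Turn each Word paragraph into a plain <p>, discarding formatting."""
    root = ET.fromstring(source)
    html = ET.Element("html", {"xmlns": XHTML})
    body = ET.SubElement(html, "body")
    for para in root.iter(f"{{{W}}}p"):
        # A paragraph's text is spread over several runs; join them back up.
        text = "".join(t.text or "" for t in para.iter(f"{{{W}}}t"))
        p = ET.SubElement(body, "p")
        p.text = text
    return ET.tostring(html, encoding="unicode")


if __name__ == "__main__":
    print(word_xml_to_xhtml(SAMPLE))
```

A real stylesheet would also need to handle headings, lists, tables and inline formatting, which this sketch ignores.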