I've just found out why.
For some reason, the HTML file had the following XML declaration at the top:
<?xml version="1.0" encoding="UTF-8" ?>
Removing this causes the HTML file to be converted fine (without the funny front page).
Whether you want to cater for this oddity I'm not sure (it's not really common, although all browsers handle it OK).
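
In case it helps anyone else hitting this, here is a rough sketch of the workaround I'd use: strip a leading XML declaration from the HTML text before handing it to the converter. The function name `strip_xml_declaration` is my own invention, not part of the converter, and this assumes the file has already been read in as a string.

```python
import re

# Matches an XML declaration (e.g. <?xml version="1.0" ... ?>) at the very
# start of the text, allowing leading whitespace before it and consuming any
# whitespace after it.
_XML_DECL = re.compile(r'^\s*<\?xml[^>]*\?>\s*', re.IGNORECASE)


def strip_xml_declaration(html: str) -> str:
    """Return html with a leading XML declaration removed, if present.

    Hypothetical helper for pre-processing; text without a leading
    declaration is returned unchanged.
    """
    return _XML_DECL.sub('', html, count=1)


if __name__ == "__main__":
    sample = '<?xml version="1.0" encoding="UTF-8" ?>\n<html><body>Hi</body></html>'
    print(strip_xml_declaration(sample))
```

It only touches a declaration at the very top of the file, so anything further down is left alone.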