You are correct that Calibre stores HTML books as a zip, but it should have no trouble converting from it. RTF should also work, although DOC will not. When you try what sort of error messages do you get?
The issues with characters displaying as question marks is normally because the character encoding type is incorrectl specified.
|