Quote:
Originally Posted by Dizzley
It might not be the sup tag.
I notice two changes in the input HTML (in Bold) -
1) there's a new <p> tag near versnum 11, and
2) some <p> tag changes near versenum 13 (which still renders correctly as a new paragraph).
|
I'm not sure how calibre would put <p> tags into content other than in the conversion process. To avoid the conversion code entirely, use the method of command line ebook-convert I linked you to. It won't produce the multiple directories from the debugged conversion process, just the raw html.
I also see you're using embedded content from the RSS feed. (You've doubled some lines in the posted recipe, but that shouldn't be a problem.) You might try not using the embedded content.
Still another possibility is that the problem is in the RSS feed, but you're not seeing it if you are looking with a browser (browsers sometimes change the raw source before showing the page). You can print the raw XML soup Calibre sees with :
Code:
def preprocess_html (self, soup):
print 'the Soup is:', soup
return soup
Sometimes I do that to make sure I know what Calibre's recipe is actually getting to work with.