MobileRead Forums - View Single Post

Starson17 · 10-05-2011, 03:40 PM

Quote:

Originally Posted by Dizzley

It might not be the sup tag.
I notice two changes in the input HTML (in Bold) -
1) there's a new <p> tag near versenum 11, and
2) some <p> tag changes near versenum 13 (which still renders correctly as a new paragraph).

I'm not sure how calibre would put <p> tags into content other than in the conversion process. To avoid the conversion code entirely, run ebook-convert from the command line using the method I linked you to. It won't produce the multiple directories you get from the debugged conversion process, just the raw HTML.

I also see you're using embedded content from the RSS feed. (You've doubled some lines in the posted recipe, but that shouldn't be a problem.) You might try not using the embedded content.
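
If you want to try that, here is a minimal sketch of a recipe with embedded content switched off. The class name, title, and feed URL are placeholders I made up, not taken from your recipe. The only line that matters is use_embedded_content.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Outside calibre the import fails; a bare stand-in keeps the sketch loadable.
    BasicNewsRecipe = object


class ExampleRecipe(BasicNewsRecipe):
    # Hypothetical title and feed URL, for illustration only.
    title = 'Example Feed'
    feeds = [('Example', 'http://example.com/rss')]

    # False: ignore the article text embedded in the feed and download
    # each article from the link the feed entry points to.
    use_embedded_content = False
```

With this set, the recipe works from the linked pages instead of the feed's own copy of the text, so if the stray <p> tags disappear, the feed's embedded content is the likely source.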

Still another possibility is that the problem is in the RSS feed itself, but you're not seeing it because you're looking with a browser (browsers sometimes change the raw source before showing the page). You can print the raw XML soup calibre sees with:

Code:

    def preprocess_html(self, soup):
        # Dump the parsed page exactly as the recipe receives it
        print('the Soup is:', soup)
        return soup

Sometimes I do that to make sure I know what the calibre recipe is actually getting to work with.
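
Once you have the raw text saved somewhere, a rough helper like the one below can show what surrounds each <p> tag, which makes it easier to tell whether the extra tag near versenum 11 is already in the source. This is only a sketch: the function name tag_context and the sample string are my own inventions, and the sample is not your actual feed content.

```python
import re


def tag_context(raw, tag="p", width=30):
    """Return a snippet of text around every opening <tag> in raw markup."""
    # \b stops "<p" from also matching longer tag names such as "<pre>".
    pattern = re.compile(r"<%s\b[^>]*>" % re.escape(tag), re.IGNORECASE)
    snippets = []
    for match in pattern.finditer(raw):
        start = max(match.start() - width, 0)
        end = min(match.end() + width, len(raw))
        snippets.append(raw[start:end])
    return snippets


# Made-up sample markup, for illustration only.
sample = 'verse 10 text<p><sup class="versenum">11</sup>verse 11 text'
for snippet in tag_context(sample, width=15):
    print(repr(snippet))
```

Run it once on the raw source and once on the converted output. If a <p> shows up only in the second, it was added during conversion rather than by the feed.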