View Single Post
Old 10-05-2011, 02:40 PM   #4
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by Dizzley View Post
It might not be the sup tag.
I notice two changes in the input HTML (in Bold) -
1) there's a new <p> tag near versnum 11, and
2) some <p> tag changes near versenum 13 (which still renders correctly as a new paragraph).
I'm not sure how calibre would put <p> tags into content other than in the conversion process. To avoid the conversion code entirely, use the method of command line ebook-convert I linked you to. It won't produce the multiple directories from the debugged conversion process, just the raw html.

I also see you're using embedded content from the RSS feed. (You've doubled some lines in the posted recipe, but that shouldn't be a problem.) You might try not using the embedded content.

Still another possibility is that the problem is in the RSS feed, but you're not seeing it if you are looking with a browser (browsers sometimes change the raw source before showing the page). You can print the raw XML soup Calibre sees with :
Code:
    def preprocess_html (self, soup):
        print 'the Soup is:', soup
        return soup
Sometimes I do that to make sure I know what Calibre's recipe is actually getting to work with.

Last edited by Starson17; 10-06-2011 at 09:18 AM.
Starson17 is offline   Reply With Quote