MobileRead Forums - View Single Post

nickredding · 01-26-2010, 03:56 PM

The Washington Post RSS feeds have some issues which involve empty articles in the index, causing the parser to crash. I've opened a ticket, and in the mean time the following code added to the recipe will work around the issue.

Code:

    def preprocess_html(self, soup):
        for tag in soup.findAll('font'):
            if tag['size']:
                if tag['size'] == '+2':
                    if tag.b:
                        return soup
        return None

01-26-2010, 03:56 PM	#1
nickredding onlinenewsreader.net Posts: 324 Karma: 10143 Join Date: Dec 2009 Location: Phoenix, AZ & Victoria, BC Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire	Washington Post news feed crash The Washington Post RSS feeds have some issues which involve empty articles in the index, causing the parser to crash. I've opened a ticket, and in the mean time the following code added to the recipe will work around the issue. Code: def preprocess_html(self, soup): for tag in soup.findAll('font'): if tag['size']: if tag['size'] == '+2': if tag.b: return soup return None