View Single Post
Old 01-26-2010, 04:56 PM   #1
nickredding
onlinenewsreader.net
nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'
 
Posts: 320
Karma: 10143
Join Date: Dec 2009
Location: Phoenix, AZ & Victoria, BC
Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire
Washington Post news feed crash

The Washington Post RSS feeds have some issues which involve empty articles in the index, causing the parser to crash. I've opened a ticket, and in the mean time the following code added to the recipe will work around the issue.
Code:
    def preprocess_html(self, soup):
        for tag in soup.findAll('font'):
            if tag['size']:
                if tag['size'] == '+2':
                    if tag.b:
                        return soup
        return None
nickredding is offline   Reply With Quote