Here's my latest attempt...still can't exclude the junk above and below the articles. Tried all the pages of web pages a few pages early on this, but don't quite have it.
Code:
class AdvancedUserRecipe1284145178(BasicNewsRecipe):
title = u'Blackhawks Headlines'
oldest_article = 7
max_articles_per_feed = 100
feeds = [(u'Blackhawks Recent Headlines', u'http://blackhawks.nhl.com/rss/news.xml')]
def print_version(self, url):
main1, replace1, end1 = url.partition('news.htm?')
url = main1 + 'newsprint.htm?' + end1
main2, middle2, end2 = url.partition('&')
return main2
keep_only_tags [dict(name='div', attrs={'class':'newsBody'})]
After about three hours on this total, I'd just love the answer if someone is willing to throw me a bone. I know I'm close...