I'm having trouble parsing my local newspaper feed, it's able to retrieve the articles but when I open the epub each article page is blank.
In the log it shows that once it gets to "parsing all content..." all the referenced files aren't found.
ex "Referenced file 'feed_0/article_23/PrintArticle.aspx%3fe%3d1715420' not found" and so on for each article. I've attached the log if that's more helpful.
Also this is what I have for the recipe:
class AdvancedUserRecipe1251250978(BasicNewsRecipe):
title = u'Intelligencer'
oldest_article = 7
no_stylesheets = True
max_articles_per_feed = 100
feeds = [(u'Recent Local News', u'http://www.intelligencer.ca/rss')]
def print_version(self, url):
turl = url.replace('ArticleDisplay', 'PrintArticle')
return turl
Any help would be appreciated, thanks!
edit: I was able to fix the problem by adding in the following:
keep_only_tags = [dict(id=['ctl00_ContentPlaceHolder1_FormView1'])]
Last edited by indole; 08-26-2009 at 09:51 PM.
|