Old 08-26-2009, 03:35 PM   #682
indole
Device: PRS-700
I'm having trouble parsing my local newspaper feed. It retrieves the articles, but when I open the EPUB each article page is blank.

The log shows that once it gets to "parsing all content...", none of the referenced files are found.
For example: "Referenced file 'feed_0/article_23/PrintArticle.aspx%3fe%3d1715420' not found", and so on for each article. I've attached the log in case that's more helpful.
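The %3f and %3d in that file name are the percent-encoded forms of "?" and "=", so the name in the log looks like the print page's URL, query string included. A quick way to read such log entries, as a small sketch using only the Python standard library (the variable names are mine):

```python
from urllib.parse import unquote

# File name exactly as it appears in the log message
logged_name = "feed_0/article_23/PrintArticle.aspx%3fe%3d1715420"

# Decode the percent-escapes to see the original characters
decoded = unquote(logged_name)
print(decoded)  # feed_0/article_23/PrintArticle.aspx?e=1715420
```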

Also this is what I have for the recipe:

class AdvancedUserRecipe1251250978(BasicNewsRecipe):
    title = u'Intelligencer'
    oldest_article = 7
    no_stylesheets = True
    max_articles_per_feed = 100

    feeds = [(u'Recent Local News', u'http://www.intelligencer.ca/rss')]

    def print_version(self, url):
        turl = url.replace('ArticleDisplay', 'PrintArticle')
        return turl
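To show what print_version does on its own, here is a standalone sketch of the same string replacement. The article URL is an assumption on my part, guessed from the file name in the log; the real feed links may look different.

```python
def print_version(url):
    """Swap the article page name for the print page name."""
    return url.replace('ArticleDisplay', 'PrintArticle')

# Hypothetical article link; only the 'ArticleDisplay' part matters here
example_url = 'http://www.intelligencer.ca/ArticleDisplay.aspx?e=1715420'
print_url = print_version(example_url)
print(print_url)
```

If a link does not contain 'ArticleDisplay', replace() returns it unchanged, so the recipe would fetch the normal article page instead.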

Any help would be appreciated, thanks!

Edit: I was able to fix the problem by adding the following line to the recipe:
keep_only_tags = [dict(id=['ctl00_ContentPlaceHolder1_FormView1'])]
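For anyone hitting the same thing, this is roughly how the whole recipe looks with that line added. It is a sketch rather than a tested recipe: the class name Intelligencer and the try/except around the import are my additions so the snippet also loads outside calibre, and the element id is specific to this site.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Stand-in so the snippet can be loaded outside calibre (assumption)
    class BasicNewsRecipe(object):
        pass


class Intelligencer(BasicNewsRecipe):
    title = u'Intelligencer'
    oldest_article = 7
    no_stylesheets = True
    max_articles_per_feed = 100

    feeds = [(u'Recent Local News', u'http://www.intelligencer.ca/rss')]

    # Keep only the element with this id from each downloaded page
    keep_only_tags = [dict(id=['ctl00_ContentPlaceHolder1_FormView1'])]

    def print_version(self, url):
        # Fetch the print page instead of the normal article page
        return url.replace('ArticleDisplay', 'PrintArticle')
```

keep_only_tags takes a list of tag specifications, so more entries can be added if the article body turns out to be split across several elements.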
Attached Files
File Type: txt intelligencer.log.txt (19.0 KB, 220 views)

Last edited by indole; 08-26-2009 at 09:51 PM.