View Single Post
Old 03-13-2011, 12:04 PM   #1
Dereks
Connoisseur
Dereks began at the beginning.
 
Posts: 57
Karma: 10
Join Date: Feb 2010
Device: Kindle Paperwhite 1
Ukrainian Legal News site - plishing the feed

Hi, can somebody help me with this one and tell me what's wrong with this code:

Code:
class AdvancedUserRecipe1300026627(BasicNewsRecipe):
    title          = u'Liga Zakon'
    oldest_article = 7
    max_articles_per_feed = 100
    remove_tags_before = dict(name='div ', attrs={'class':'news_title_7 content'})
    remove_tags_after = dict(id='main_content')
    no_stylesheets = True

    feeds          = [(u'\u041b\u0456\u0433\u0430 \u0417\u0430\u043a\u043e\u043d', u'http://news.ligazakon.ua/news_rss/tape_articles.xml')]
somehow remove_tags_before doesn't do it's job: feed remains polluted with the stuff.

am I doing something wrong? Or is there a better way to polish the feed?

defining this parameter with id='main_content' works perfectly, but main_content doesn't include article's title, which is a bit inconvenient.

Thanks for help!

Question on a side note: since this recipe has only one feed, how can it be modified, so it jumps to the contents directly, without creating a list of feeds, with only one link?

Last edited by Dereks; 03-13-2011 at 12:06 PM.
Dereks is offline   Reply With Quote