Hi, can somebody help me with this one and tell me what's wrong with this code:
Code:
from calibre.web.feeds.news import BasicNewsRecipe

class AdvancedUserRecipe1300026627(BasicNewsRecipe):
    title = u'Liga Zakon'
    oldest_article = 7
    max_articles_per_feed = 100
    remove_tags_before = dict(name='div ', attrs={'class':'news_title_7 content'})
    remove_tags_after = dict(id='main_content')
    no_stylesheets = True
    feeds = [(u'\u041b\u0456\u0433\u0430 \u0417\u0430\u043a\u043e\u043d', u'http://news.ligazakon.ua/news_rss/tape_articles.xml')]
Somehow remove_tags_before doesn't do its job: the articles still come out polluted with unwanted page content.
Am I doing something wrong? Or is there a better way to clean up the feed?
Defining this parameter with id='main_content' works perfectly, but main_content doesn't include the article's title, which is a bit inconvenient.
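For reference, here is a variant I could try next. It is only a sketch and rests on one unconfirmed guess: that the trailing space in name='div ' stops the tag name from ever matching, so nothing gets trimmed. The class name LigaZakon and the try/except stub are my own additions, there only so the snippet can be loaded outside calibre; the div class is the same one as in the recipe above.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Hypothetical stand-in so the sketch can be loaded outside calibre.
    class BasicNewsRecipe(object):
        pass


class LigaZakon(BasicNewsRecipe):  # hypothetical class name
    title = u'Liga Zakon'
    oldest_article = 7
    max_articles_per_feed = 100
    no_stylesheets = True

    # Assumption: the tag name is compared exactly, so 'div' is written
    # without the trailing space used in the original recipe.
    remove_tags_before = dict(name='div',
                              attrs={'class': 'news_title_7 content'})
    remove_tags_after = dict(id='main_content')

    feeds = [(u'\u041b\u0456\u0433\u0430 \u0417\u0430\u043a\u043e\u043d',
              u'http://news.ligazakon.ua/news_rss/tape_articles.xml')]
```

If the title div still isn't found after this change, the class value may be the next thing to check, since I'm not sure how a class attribute holding two names is matched.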
Thanks for the help!
A side question: since this recipe has only one feed, how can it be modified so that it jumps directly to the contents, instead of creating a list of feeds with only one link?