![]() |
#1 |
Zealot
![]() ![]() Posts: 119
Karma: 100
Join Date: Jan 2011
Location: Germany / NRW /Köln
Device: prs-650 / prs-350 /kindle 3
|
recipe for Neuss-Grevenbroicher-Zeitung (NGZ) - german
Code:
import string, re from calibre import strftime from calibre.web.feeds.recipes import BasicNewsRecipe from calibre.ebooks.BeautifulSoup import BeautifulSoup class AdvancedUserRecipe1303841067(BasicNewsRecipe): title = u'NGZ-online' __author__ = 'schuster' remove_tags_before = dict(id='bu') remove_tags_after = dict(id='noblock') remove_tags = [dict(attrs={'class':['articleTools', 'post-tools', 'side_tool', 'nextArticleLink clearfix', 'liketext']}), dict(id=['footer', 'toolsRight', 'articleInline', 'navigation', 'archive', 'side_search', 'blog_sidebar', 'side_tool', 'side_index', 'Verlinken', 'vorheriger', 'LESERKOMMENTARE', 'bei facebook', 'bei twitter', 'Schreiben Sie jetzt Ihre Meinung:', 'Thema', 'Ihr Beitrag', 'Ihr Name', 'Ich möchte über weitere Lesermeinungen zu diesem Artikel per E-Mail informiert werden.', 'banneroben', 'bannerrechts', 'inserieren', 'stellen', 'auto', 'immobilien', 'kleinanzeige', 'tiere', 'ferienwohnung', 'NGZ Card', 'Mediengruppe RP', 'Werben', 'Newsletter', 'Wetter', 'RSS', 'Abo', 'Anzeigen', 'Redaktion', 'Schulprojekte', 'Gast', 'Mein NGZ', 'Nachrichten', 'Sport', 'Wirtschaft', 'Stadt-Infos', 'Bilderserien', 'Bookmarken', 'del.icio.us', 'Mister Wong', 'YiGG', 'Webnews', 'Shortnews', 'Twitter', 'Newsider', 'Facebook', 'StudiVZ/MeinVZ', 'Versenden', 'Drucken']), dict(name=['script', 'noscript', 'style'])] oldest_article = 7 max_articles_per_feed = 100 no_stylesheets = True use_embedded_content = False language = 'de' remove_javascript = True cover_url = 'http://www.rhein-kreis-neuss-macht-sport.de/sport/includes/bilder/ngz_logo.jpg' def print_version(self, url): return url + '?ot=de.circit.rpo.PopupPageLayout.ot' feeds = [ (u'Grevenbroich', u'http://www.ngz-online.de/app/feed/rss/grevenbroich'), (u'Kreis Neuss', u'http://www.ngz-online.de/app/feed/rss/rheinkreisneuss'), (u'Dormagen', u'http://www.ngz-online.de/app/feed/rss/dormagen'), (u'J\xfcchen', u'http://www.ngz-online.de/app/feed/rss/juechen'), (u'Rommerskirchen', u'http://www.ngz-online.de/app/feed/rss/rommerskirchen') ] |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
recipe for Bild.de - German | schuster | Recipes | 2 | 05-22-2016 05:00 AM |
recipe for FAZ.net - german | schuster | Recipes | 10 | 05-28-2011 12:13 AM |
recipe for Astronomie heute - german | schuster | Recipes | 0 | 05-14-2011 12:42 PM |
Problem with recipe for Sueddeutsche Zeitung | amontiel69 | Recipes | 0 | 02-25-2011 11:05 AM |
German: Sueddeutsche Zeitung is broken | kbaerwald | Recipes | 3 | 11-18-2010 05:57 AM |