Thread: FAZ-Net Update
Old 12-19-2013, 08:09 AM   #2
Divingduck
Wizard
I updated this recipe to remove some useless footer add-ons and added one more feed, for lifestyle (Lebensstil).

Code:
__license__   = 'GPL v3'
__copyright__ = '2008-2011, Kovid Goyal <kovid at kovidgoyal.net>, Darko Miletic <darko at gmail.com>'
'''
Profile to download FAZ.NET
'''

from calibre.web.feeds.news import BasicNewsRecipe

class FazNet(BasicNewsRecipe):
    title                 = 'FAZ.NET'
    __author__            = 'Kovid Goyal, Darko Miletic'
    description           = 'Frankfurter Allgemeine Zeitung'
    publisher             = 'Frankfurter Allgemeine Zeitung GmbH'
    category              = 'news, politics, Germany'
    use_embedded_content  = False
    language              = 'de'

    max_articles_per_feed = 30
    no_stylesheets        = True
    encoding              = 'utf-8'
    remove_javascript     = True

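    # Keep only the article lead-in and the main article body.
    keep_only_tags = [{'class':'FAZArtikelEinleitung'},
            {'id':'ArtikelTabContent_0'}]

    # Drop everything that follows the article footer, then the footer itself.
    remove_tags_after = dict(name='div', attrs={'class':['ArtikelFooter']}) # AGe add 2013-12-19
    remove_tags = [dict(name='div', attrs={'class':['ArtikelFooter']})] # AGe add 2013-12-19
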
    feeds = [
              ('FAZ.NET Aktuell', 'http://www.faz.net/aktuell/?rssview=1'),
              ('Politik', 'http://www.faz.net/aktuell/politik/?rssview=1'),
              ('Wirtschaft', 'http://www.faz.net/aktuell/wirtschaft/?rssview=1'),
              ('Feuilleton', 'http://www.faz.net/aktuell/feuilleton/?rssview=1'),
              ('Sport', 'http://www.faz.net/aktuell/sport/?rssview=1'),
              ('Lebensstil', 'http://www.faz.net/aktuell/lebensstil/?rssview=1'), # AGe add 2013-12-19
              ('Gesellschaft', 'http://www.faz.net/aktuell/gesellschaft/?rssview=1'),
              ('Finanzen', 'http://www.faz.net/aktuell/finanzen/?rssview=1'),
              ('Technik & Motor', 'http://www.faz.net/aktuell/technik-motor/?rssview=1'),
              ('Wissen', 'http://www.faz.net/aktuell/wissen/?rssview=1'),
              ('Reise', 'http://www.faz.net/aktuell/reise/?rssview=1'),
              ('Beruf & Chance', 'http://www.faz.net/aktuell/beruf-chance/?rssview=1'),
              ('Rhein-Main', 'http://www.faz.net/aktuell/rhein-main/?rssview=1')
            ]
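
All feed URLs in the list above follow the same pattern: the section slug under http://www.faz.net/aktuell/ plus ?rssview=1. If you want to add further sections, a small helper can build the (title, url) tuples for you. This is only a sketch: faz_feed, BASE_URL and SECTIONS are names I made up for illustration, they are not part of calibre, and I am assuming that other sections follow the same URL scheme.

```python
# Hypothetical helper, not part of the calibre API.
# Assumption: every section feed follows the URL pattern used in the recipe.
BASE_URL = 'http://www.faz.net/aktuell/'

def faz_feed(title, slug=''):
    """Return a (title, url) tuple in the form the recipe's feeds list uses."""
    path = slug + '/' if slug else ''
    return (title, BASE_URL + path + '?rssview=1')

# A few of the sections from the recipe; an empty slug means the main feed.
SECTIONS = [
    ('FAZ.NET Aktuell', ''),
    ('Politik', 'politik'),
    ('Lebensstil', 'lebensstil'),
    ('Technik & Motor', 'technik-motor'),
]

feeds = [faz_feed(title, slug) for title, slug in SECTIONS]

for title, url in feeds:
    print(title, url)
```

Inside the recipe class you would assign the resulting list to feeds instead of writing the tuples out by hand. The explicit list in the recipe is just as good; the helper only saves typing.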
Attached Files
File Type: zip faznet_AGe V3.zip (873 Bytes, 312 views)