View Single Post
Old 02-01-2013, 05:49 AM   #1
zitronenrollchen
Junior Member
zitronenrollchen began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Feb 2013
Device: Kindle NT
VICE Magazin Deutschland

I shamelessly took the recipe for the Spanish website of vice.com and modified it for the German website. All credits go to the original author:

Code:
# vim:fileencoding=UTF-8:ts=4:sw=4:sta:et:sts=4:ai
import re
from calibre.web.feeds.news import BasicNewsRecipe

class ViceDERecipe(BasicNewsRecipe):
    title = u'Vice Magazin Deutschland'
    __author__ = 'atordo;alex'
    description = u'Die offizielle Website des Vice Magazins Deutschland'
    category = u'Nachrichten, Fotografie, Blogs, Mode, Kunst, Film, Musik, Literatur, Technik'
    cover_url = 'http://www.seeklogo.com/images/V/Vice-logo-668578AC94-seeklogo.com.gif'
    oldest_article = 14
    max_articles_per_feed = 100
    auto_cleanup = False
    no_stylesheets = True
    language = 'de'
    use_embedded_content  = False
    remove_javascript = True
    publication_type = 'magazine'

    recursions=10
    match_regexps = [r'/read/.*\?Contentpage=[2-9]$']

    keep_only_tags = [
        dict(attrs={'class':['article_title','article_content','next']})
    ]
    remove_tags = [
        dict(attrs={'class':['social_buttons','search','tweet','like','inline_socials'
           ,'stumblebadge','plusone']})
    ]

    extra_css = '''
        .author{font-size:small}
        img{margin-bottom: 0.4em; display:block; margin-left:auto; margin-right: auto}
    '''

    preprocess_regexps = [
        (re.compile(r'<img src="http://.*\.scorecardresearch\.com/'), lambda m: '')
    ]

    feeds = [('Vice', 'http://www.vice.com/de/rss')]
Hope this is of any use. (First post, sorry if anything's not correct).
zitronenrollchen is offline   Reply With Quote