MobileRead Forums - View Single Post - 20 Minutos (boletín) + La tribuna de Talavera

tolyluis · 01-24-2011, 07:42 PM

Hi again:

New versions of this recipes, a little changes (just gpl'ed). Here are the news recipes from me:

20 Minutos (boletín) - Simple recipe with highlights

name: 20minbol_(es).recipe

Code:

__license__   = 'GPL v3'

class AdvancedUserRecipe1295310874(BasicNewsRecipe):

    title          = u'20 Minutos (Boletin)'
    publisher      = u'Grupo 20 Minutos'

    __author__            = 'Luis Hernández'
    description           = 'Boletin del periódico gratuito en español - v1.0 - 25 Jan 2011'
    cover_url     = 'http://estaticos.20minutos.es/mmedia/especiales/corporativo/css/img/logotipos_grupo20minutos.gif'

    oldest_article = 2
    max_articles_per_feed = 50

    feeds          = [(u'VESPERTINO', u'http://20minutos.feedsportal.com/c/32489/f/478284/index.rss')
                        , (u'DEPORTES', u'http://20minutos.feedsportal.com/c/32489/f/478286/index.rss')
                        , (u'CULTURA', u'http://www.20minutos.es/rss/ocio/')
                        , (u'TV', u'http://20minutos.feedsportal.com/c/32489/f/490877/index.rss')
]

La tribuna de Talavera - Local Newspaper from Talavera de la Reina

name: Latribunatal_(es).recipe

Code:

__license__   = 'GPL v3'

class AdvancedUserRecipe1294946868(BasicNewsRecipe):

    title             = u'La Tribuna de Talavera'
    publisher      = u'Grupo PROMECAL'

    __author__  = 'Luis Hernández'
    description   = 'Diario de Talavera de la Reina - v1.0 - 25 Jan 2011'
    cover_url     = 'http://www.latribunadetalavera.es/entorno/mancheta.gif'

    oldest_article = 5
    max_articles_per_feed = 50

    remove_javascript = True
    no_stylesheets        = True
    use_embedded_content  = False

    encoding              = 'utf-8'
    language              = 'es'
    timefmt        = '[%a, %d %b, %Y]'

    keep_only_tags     = [dict(name='div', attrs={'id':['articulo']})
                                  ,dict(name='div', attrs={'class':['foto']})
                                  ,dict(name='p', attrs={'id':['texto']})                                
                                ]

    remove_tags_before = dict(name='div' , attrs={'class':['comparte']})
    remove_tags_after  = dict(name='div' , attrs={'id':['relacionadas']})


    feeds          = [(u'Portada', u'http://www.latribunadetalavera.es/rss.html')]

Sorry for my english, guys, of course my work can be taken by calibre developers, hope you find useful this!