![]() |
#1 |
Junior Member
![]() Posts: 4
Karma: 10
Join Date: Apr 2011
Device: Amazon Kindle 3
|
Recipes for Version2.dk and Alt om Herning
I made recipes for two danish sites:
Alt om Herning: Code:
__license__ = 'GPL v3' __copyright__ = '2011, Rasmus Lauritsen <rasmus at lauritsen.info>' ''' aoh.dk ''' from calibre.web.feeds.news import BasicNewsRecipe class aoh_dk(BasicNewsRecipe): title = 'Alt om Herning' __author__ = 'Rasmus Lauritsen' description = 'Nyheder fra Herning om omegn' publisher = 'Mediehuset Herning Folkeblad' category = 'news, local, Denmark' oldest_article = 14 max_articles_per_feed = 50 no_stylesheets = True delay = 1 encoding = 'utf8' use_embedded_content = False language = 'da' extra_css = """ body{font-family: Verdana,Arial,sans-serif } img{margin-bottom: 0.4em} .txtContent,.stamp{font-size: small} """ conversion_options = { 'comment' : description , 'tags' : category , 'publisher' : publisher , 'language' : language } feeds = [(u'All news', u'http://aoh.dk/rss.xml')] keep_only_tags = [ dict(name='h1') ,dict(name='span', attrs={'class':['frontpage_body']}) ] remove_tags = [ dict(name=['object','link']) ] Code:
import re __license__ = 'GPL v3' __copyright__ = '2011, Rasmus Lauritsen <rasmus at lauritsen.info>' ''' version2.dk ''' from calibre.web.feeds.news import BasicNewsRecipe class version2(BasicNewsRecipe): title = 'Version2.dk' __author__ = 'Rasmus Lauritsen' description = 'IT News' publisher = 'version2.dk' category = 'news, IT, hardware, software, Denmark' oldest_article = 14 max_articles_per_feed = 50 no_stylesheets = True remove_empty_feeds = True use_embedded_content = False encoding = 'iso-8859-1' language = 'da' extra_css = """ body {font-family: "Verdana",Times,serif} .articleauthor{color: #9F9F9F; font-family: Arial, sans-serif; font-size: small; text-transform: uppercase} .rubric,.dd,h6#credit{color: #CD0021; font-family: Arial, sans-serif; font-size: small; text-transform: uppercase} .descender:first-letter{display: inline; font-size: xx-large; font-weight: bold} .dd,h6#credit{color: gray} .c{display: block} .caption,h2#articleintro{font-style: italic} .caption{font-size: small} """ preprocess_regexps = [ (re.compile(r'</?a[^>]*>'),lambda match: ''), (re.compile(r'<span[^>]*article-link-id.*?<br\s*\/?><br\s*\/?>'), lambda match: '')] keep_only_tags = [dict(name='div', attrs={'class':'article'})] remove_tags = [ dict(name='p',attrs={'class':'meta links'}), dict(name='div',attrs={'class':'float-right'}), dict(name='span',attrs={'class':'article-link-id'}) ] feeds = [ (u'Seneste nyheder' , u'http://www.version2.dk/feeds/nyheder') ,(u'Forretningssoftware' , u'http://www.version2.dk/feeds/forretningssoftware') ,(u'Internet & styresystemer' , u'http://www.version2.dk/feeds/styresystemer') ,(u'It-arkitektur' , u'http://www.version2.dk/feeds/it-arkitektur') ,(u'It-styring & outsourcing' , u'http://www.version2.dk/feeds/it-styring') ,(u'Job & karriere' , u'http://www.version2.dk/feeds/karriere') ,(u'Mobil it & tele' , u'http://www.version2.dk/feeds/tele') ,(u'Server/storage & netværk' , u'http://www.version2.dk/feeds/server-storage') ,(u'Sikkerhed' , u'http://www.version2.dk/feeds/sikkerhed') ,(u'Softwareudvikling' , u'http://www.version2.dk/feeds/softwareudvikling') ] |
![]() |
![]() |
![]() |
#2 |
Enthusiast
![]() Posts: 30
Karma: 12
Join Date: Jun 2011
Location: India
Device: Kindle 3g
|
Thanks
|
![]() |
![]() |
Advert | |
|
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
alt z does not work | mmm2m8r | Amazon Kindle | 1 | 04-10-2011 02:14 AM |
Could somebody help with src and alt, please? | GillianMary | Sigil | 2 | 10-08-2010 08:12 AM |
Alt-Characters 174 and 175 | herby2908 | Calibre | 2 | 05-25-2010 03:42 PM |
Default for alt-shift-# is...? | Neil | Amazon Kindle | 2 | 07-04-2009 04:51 PM |
No Alt-t for time! | schulzmc | Amazon Kindle | 2 | 02-24-2009 08:48 PM |