01-13-2011, 05:18 PM   #5
Tegan
Ok, fixed it. Amazing what the placement of a couple of lines can do...

Code:
from calibre.web.feeds.news import BasicNewsRecipe

class TriCityHeraldRecipe(BasicNewsRecipe):
    title          = u'Tri-City Herald'
    description    = 'The Tri-City Herald Mid-Columbia.'
    __author__     = 'Laura Gjovaag'
    oldest_article        = 1.5
    max_articles_per_feed = 100
    no_stylesheets        = True
    remove_javascript     = True
    keep_only_tags = [
                      dict(name='div', attrs={'id':'story_header'}),
                      dict(name='img', attrs={'class':'imageCycle'}),
                      dict(name='div', attrs={'id':['cycleImageCaption', 'story_body']})
                      ]
    remove_tags    = [
                      dict(name='div', attrs={'id':'story_mlt'}),
                      dict(name='a', attrs={'id':'commentCount'}),
                      dict(name=['script', 'noscript', 'style'])]
    extra_css      = 'h1{font-weight: bold; font-size: 140%;} #cycleImageCaption{font-family: monospace; font-size: 60%;}'

    feeds          = [
                      (u'Tri-City Herald Mid-Columbia', u'http://www.tri-cityherald.com/901/index.rss')
                     ]

And here's the one for the Yakima Herald-Republic...

Code:
from calibre.web.feeds.news import BasicNewsRecipe

class YakimaHeraldRepublicRecipe(BasicNewsRecipe):
    title          = u'Yakima Herald-Republic'
    description    = 'The Yakima Herald-Republic.'
    __author__     = 'Laura Gjovaag'
    oldest_article        = 1.5
    max_articles_per_feed = 100
    no_stylesheets        = True
    remove_javascript     = True
    keep_only_tags = [
                      dict(name='div', attrs={'id':['searchleft', 'headline_credit']}),
                      dict(name='div', attrs={'class':['photo', 'cauthor', 'photocredit']}),
                      dict(name='div', attrs={'id':['content_body', 'footerleft']})
                      ]
    extra_css = '.cauthor {font-family: monospace; font-size: 60%;} .photocredit {font-family: monospace; font-size: 60%;}'

    feeds          = [
                      (u'Yakima Herald Online', u'http://feeds.feedburner.com/yhronlinenews'),
                     ]
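
For anyone reading along, here is a rough sketch of what the list-valued attrs entries in keep_only_tags mean. It assumes a list is treated as "any one of these values", which is how the recipes above rely on it. The helper matches_spec is hypothetical and is not calibre code; calibre hands each dict to its HTML parser, and this stand-in ignores details such as elements carrying several classes.

```python
# Simplified stand-in for how one keep_only_tags entry selects an element.
# matches_spec is a hypothetical helper for illustration, not calibre's API.

def matches_spec(tag_name, tag_attrs, spec):
    """Return True if an element (name plus attribute dict) satisfies spec."""
    names = spec.get('name')
    if names is not None:
        if isinstance(names, str):
            names = [names]          # a single name behaves like a one-item list
        if tag_name not in names:
            return False
    for key, wanted in spec.get('attrs', {}).items():
        if isinstance(wanted, str):
            wanted = [wanted]        # a single value behaves like a one-item list
        if tag_attrs.get(key) not in wanted:
            return False
    return True


# Same shape as the last keep_only_tags entry in the Yakima recipe.
spec = dict(name='div', attrs={'id': ['content_body', 'footerleft']})

kept = matches_spec('div', {'id': 'content_body'}, spec)        # id is in the list
dropped = matches_spec('div', {'id': 'sidebar'}, spec)          # id not in the list
wrong_tag = matches_spec('span', {'id': 'content_body'}, spec)  # tag name differs
```

The 'sidebar' id is made up for the example. In this sketch only the first element would be kept, which is why one dict with a list of ids can stand in for several single-id dicts.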