MobileRead Forums - View Single Post - Help with Daytona Beach News Journal Recipe

BRGriff · 05-12-2011, 11:14 PM

I have just started to learn about Calibre and tried my first recipe for a newspaper not listed in its Fetch News service. The publication is the Daytona Beach News Journal ( http://www.news-journalonline.com ). I have been able to remove all extraneous tags and limited the download to just the stories I want to read. Being totally new to this, I would welcome any input on streamlining the code or formatting the output which any member is willing to give me. Here is the recipe I have:

class AdvancedUserRecipe1305144260(BasicNewsRecipe):
title = u'Daytona Beach News Journal'
oldest_article = 1
max_articles_per_feed = 100

timefmt = ' [%a, %d %b, %Y]'

remove_tags_before = dict (id='page-title')
remove_tags = [dict(attrs={'class':['legal', 'subscribe-blurb', 'asset-footer', 'google-entry', 'relevent-stories', 'column-header', 'widget-content', 'submit', 'google-footer', 'footer-column', 'socilalinks-footer']}),
dict(id=['tabs', 'tabs1', 'tabs2', 'footer', 'footer-inner', 'footer-columnists', 'footer-content', 'footer-links', 'footer-column', 'footer-legal']),
dict(name=['script', 'noscript', 'style'])]

feeds = [(u'News', u'http://www.news-journalonline.com/rss.xml'),
(u'Business', u'http://www.news-journalonline.com/business/rss.xml'),
(u'Lifestyle', u'http://www.news-journalonline.com/lifestyle/rss.xml'),
(u'Entertainment', u'http://www.go386.com/rss.xml'),
(u'Opinion', u'http://www.news-journalonline.com/opinion/rss.xml')]

If this is of any benefit to anyone, I would like to see this source added to the next update of Calibre.

Thank you,
Randy

05-12-2011, 11:14 PM	#1
BRGriff Connoisseur Posts: 58 Karma: 12 Join Date: May 2011 Location: Deland, Florida Device: Kindle 3	Help with Daytona Beach News Journal Recipe I have just started to learn about Calibre and tried my first recipe for a newspaper not listed in its Fetch News service. The publication is the Daytona Beach News Journal ( http://www.news-journalonline.com ). I have been able to remove all extraneous tags and limited the download to just the stories I want to read. Being totally new to this, I would welcome any input on streamlining the code or formatting the output which any member is willing to give me. Here is the recipe I have: class AdvancedUserRecipe1305144260(BasicNewsRecipe): title = u'Daytona Beach News Journal' oldest_article = 1 max_articles_per_feed = 100 timefmt = ' [%a, %d %b, %Y]' remove_tags_before = dict (id='page-title') remove_tags = [dict(attrs={'class':['legal', 'subscribe-blurb', 'asset-footer', 'google-entry', 'relevent-stories', 'column-header', 'widget-content', 'submit', 'google-footer', 'footer-column', 'socilalinks-footer']}), dict(id=['tabs', 'tabs1', 'tabs2', 'footer', 'footer-inner', 'footer-columnists', 'footer-content', 'footer-links', 'footer-column', 'footer-legal']), dict(name=['script', 'noscript', 'style'])] feeds = [(u'News', u'http://www.news-journalonline.com/rss.xml'), (u'Business', u'http://www.news-journalonline.com/business/rss.xml'), (u'Lifestyle', u'http://www.news-journalonline.com/lifestyle/rss.xml'), (u'Entertainment', u'http://www.go386.com/rss.xml'), (u'Opinion', u'http://www.news-journalonline.com/opinion/rss.xml')] If this is of any benefit to anyone, I would like to see this source added to the next update of Calibre. Thank you, Randy Last edited by BRGriff; 05-12-2011 at 11:17 PM.