I have just started to learn about Calibre and tried my first recipe for a newspaper not listed in its Fetch News service. The publication is the Daytona Beach News Journal (
http://www.news-journalonline.com ). I have been able to remove all extraneous tags and limited the download to just the stories I want to read. Being totally new to this, I would welcome any input on streamlining the code or formatting the output which any member is willing to give me. Here is the recipe I have:
class AdvancedUserRecipe1305144260(BasicNewsRecipe):
title = u'Daytona Beach News Journal'
oldest_article = 1
max_articles_per_feed = 100
timefmt = ' [%a, %d %b, %Y]'
remove_tags_before = dict (id='page-title')
remove_tags = [dict(attrs={'class':['legal', 'subscribe-blurb', 'asset-footer', 'google-entry', 'relevent-stories', 'column-header', 'widget-content', 'submit', 'google-footer', 'footer-column', 'socilalinks-footer']}),
dict(id=['tabs', 'tabs1', 'tabs2', 'footer', 'footer-inner', 'footer-columnists', 'footer-content', 'footer-links', 'footer-column', 'footer-legal']),
dict(name=['script', 'noscript', 'style'])]
feeds = [(u'News', u'http://www.news-journalonline.com/rss.xml'),
(u'Business', u'http://www.news-journalonline.com/business/rss.xml'),
(u'Lifestyle', u'http://www.news-journalonline.com/lifestyle/rss.xml'),
(u'Entertainment', u'http://www.go386.com/rss.xml'),
(u'Opinion', u'http://www.news-journalonline.com/opinion/rss.xml')]
If this is of any benefit to anyone, I would like to see this source added to the next update of Calibre.
Thank you,
Randy