![]() |
#1 |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Dec 2010
Location: Fayetteville, NC USA
Device: kindle
|
Fayetteville Observer - Fayetteville, NC USA (dups)
How does one improve a recipe to avoid duplicate articles across a site?
Here is there recipe and it works, but an article can be in more than one sub section. I would like to download it only once. class AdvancedUserRecipe1293623816(BasicNewsRecipe): title = u'Fayetteville Observer' oldest_article = 7 max_articles_per_feed = 30 feeds = [(u'Local News', u'http://fayobserver.com/CMSPages/rssNews.aspx'), (u'Life', u'http://fayobserver.com/CMSPages/rssLife.aspx'), (u'Business', u'http://fayobserver.com/CMSPages/rssBusiness.aspx'), (u'Military', u'http://fayobserver.com/CMSPages/rssMilitary.aspx'), (u'Sports', u'http://fayobserver.com/CMSPages/rssSports.aspx'),(u'Livewire',u'http://blogs.fayobserver.com/CMSPages/BlogRss.aspx?aliaspath=/livewire'),(u'Crime',u'http://blogs.fayobserver.com/CMSPages/BlogRss.aspx?aliaspath=/brooks'),(u'FaytoZ',u'http://blogs.fayobserver.com/CMSPages/BlogRss.aspx?aliaspath=/faytoz'),(u'PeoplesBusiness',u'http://blogs.fayobserver.com/CMSPages/BlogRss.aspx?aliaspath=/peoplesbusiness'),(u'910pets',u'http://blogs.fayobserver.com/CMSPages/BlogRss.aspx?aliaspath=/910pets'),(u'DadFactor',u'http://blogs.fayobserver.com/CMSPages/BlogRss.aspx?aliaspath=/dadfactor'),(u'FreeStuff',u'http://blogs.fayobserver.com/CMSPages/BlogRss.aspx?aliaspath=/free'),(u'Cheers & Jeers',u'http://fayobserver.com/cheers'),(u'TechSassy',u'http://blogs.fayobserver.com/CMSPages/BlogRss.aspx?aliaspath=/techsassy'),(u'Deviere',u'http://blogs.fayobserver.com/CMSPages/BlogRss.aspx?aliaspath=/deviere')] |
![]() |
![]() |
![]() |
#2 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,190
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
i would advise against doing that. Sometimes it is nice to access an article from multiple feeds. And note that only the article html is downloaded multiple times. Not images/css/etc. so the bandwidth/size overhead is minimal.
|
![]() |
![]() |
Advert | |
|
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Hello from Fayetteville, NC. | Blades | Introduce Yourself | 12 | 01-15-2011 04:04 PM |
Review of the Kindle 3 from the Observer in the Guardian UK | DMcCunney | News | 18 | 08-29-2010 07:03 PM |
Hi from OK, USA | oksahmof2 | Introduce Yourself | 9 | 06-14-2010 04:37 AM |
Possible to use outside of USA? | Majorix | iRex | 6 | 11-08-2009 07:54 AM |
The Observer feeds and articles | Roger Wilmut | Calibre | 3 | 12-15-2008 12:02 PM |