![]() |
#1 |
Member
![]() Posts: 11
Karma: 10
Join Date: Feb 2011
Device: Kindle 3
|
Article Date Not Recognized - Is There a Work-Around?
My local paper is published weekly and the RSS is updated once a week. On the feed page the paper includes articles from the past two issues. No matter what I set oldest_article to calibre will always download all the articles. I looked at the source for the feed page (http://www.mahopacnews.com/rssheadlines.xml) and the only indication of the article date is in the article URL. Examples:
Code:
'http://www.mahopacnews.com/Articles-c-2011-03-01-207511.112113-Residents-question-timing-of-agencys-hearing.html' 'http://www.mahopacnews.com/Articles-c-2011-02-22-207434.112113-You-can-choose-to-be-happy.html' I can extract the date using url.split. Is there a way to tell calibre to use this date to determine article age? Must the date be in a particular format? Thanks |
![]() |
![]() |
![]() |
#2 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,318
Karma: 27111242
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
RSS feeds are supposed to have their publication dates specified. This is what calibre uses. If you want to filter based on URL, simply implement print_version() in you recipe and return None for the articles you dont want to be downloaded
|
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Member
![]() Posts: 11
Karma: 10
Join Date: Feb 2011
Device: Kindle 3
|
Thanks. Any chance you could point me to a recipe where print_version() is used to filter by date?
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Problem with Article Date in parse_index | spedinfargo | Recipes | 5 | 02-19-2011 07:12 PM |
Tip: Article Date needs to be Unicode String | spedinfargo | Recipes | 0 | 02-19-2011 07:08 PM |
Trying to strip the date from an article URL | Finbar127 | Recipes | 1 | 02-17-2011 03:02 PM |
Up-to-date candy teacher (date being 1921) | kacir | Deals and Resources (No Self-Promotion or Affiliate Links) | 0 | 06-16-2010 04:18 PM |
new official shipping date / US invitation date | R2D2 | iRex | 18 | 07-06-2006 02:32 PM |