|
|
#1 |
|
Member
![]() Posts: 11
Karma: 10
Join Date: Jan 2011
Device: Kindle
|
removing articles from feeds with regexp
The RSS feeds from RSS Technica have so-called "Etc" items which do not point to actual stories, but rather are just links to the web etc.
I would like to remove these from the feeds so that they do not get turned into articles. One way I'd go about it is with a regular expression - the titles for these items all start with "Etc:". Here is an example feed: http://feeds.arstechnica.com/arstechnica/apple/ Is there a BasicNewsRecipe method I could use to do this? I might have missed one when going through the API. Thanks! |
|
|
|
|
|
#2 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,626
Karma: 28549046
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
get_article_url, have it return None for articles you want skipped
|
|
|
|
| Advert | |
|
|
![]() |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Regexp and Alternate Page Header/Footer | adad | Calibre | 5 | 01-15-2011 10:03 PM |
| Multiple line regexp | janvanmaar | Calibre | 19 | 11-02-2010 02:02 PM |
| Regexp and header/footer problems | concern | Calibre | 0 | 02-07-2010 04:35 AM |
| Any way to import feeds/articles from Google reader into Calibre? | techie_007 | Calibre | 1 | 12-26-2009 12:15 PM |
| The Observer feeds and articles | Roger Wilmut | Calibre | 3 | 12-15-2008 01:02 PM |