Quote:
Originally Posted by spedinfargo
Curious if anyone has worked with a RSS feed that is essentially a link to an assortment of articles on other sites. For instance, Arts & Letters Daily hand-picks articles from a number of sources: http://www.aldaily.com/rss/rss.xml
Would it be hopeless to try and create a recipe to pull those down into one document? Most of them seem to work well with Readability so they should parse out fairly well.
|
It's not hopeless, but if they vary from site to site, and if you want to remove advertising and make them look like they have a standardized format, you are talking about writing something similar to Readability. OTOH, if you just want to get the data into a single ebook, and don't insist on making them look consistent with useless/objectionable links removed, then it's straight forward. I think the general opinion is that it's better not to try to deal with multiple different sites. You may be able to use an aggregator, like Google Reader to help get them consistent.