I do something similar by using the tools at Feedbooks for pulling RSS feeds.
Helpfully the Guardian provides full text feeds for pretty much everything they do so I can have an entire newspaper each day by grabbing the feeds and chucking them on the BeBook.