Quote:
Originally Posted by Starson17
You have what looks like a tough site to clean properly. Have you looked for print links? Sometimes they are the easiest way to get a clean feed.
|
Indeed, that's the first thing I looked for. The website manages printing in two ways, depending on the section:
1. Open a print dialog box that will print the current page as it shows, with all the icons, comments, menus and other garbage.
2. Open a pop-up window saying there's been a bad server request.
So, not very useful.
Also, the RSS is awful. Sometimes it gives links as
www.sociedade.publico.pt, sometimes as
www.publico.pt/sociedade, sometimes as
www.publico.pt, etc, etc I cannot make head or tails out of it, really.
I know, this newspaper website is a mess structurally and otherwise. But it's my favourite Portuguese newspaper (very popular there, too) and I gotta keep learning that beautiful language.