Sorry did not explain it very well - I tried the rss feeds and they work kind of ok. I tried doing some programming but to no avail in cleaning it up properly.
For example to convert
http://www.entrepreneur.com/marketin...cle203248.html
to
http://www.entrepreneur.com/article/...is/203248.html
also on another one
http://goal.com/en/feeds/news?id=1659&fmt=rss
The print option creates a pdf so not sure how to handle for example
http://www.goal.com/en/news/1863/wor...-ireland-south
prints to
http://www.goal.com/en/news/1863/wor...reland-south#p
which is actually a pdf file.