Ok, here 2 new rewrite rules for other newspapers.
Washington Post:
Pattern: http://www\.washingtonpost\.com(.*)\.html(.*)
Rewrite rule: http://www.washingtonpost.com$1_pf.html
Corriere della Sera (Italian newspaper)
Pattern: http://www\.corriere\.it(.*)shtml
Rewrite rule: http://www.corriere.it$1html
Now I'm analyzing International Herald Tribune and Reuters
With IHT I tried this rule:
Pattern: http://www\.iht\.com(.*)
Rewrite rule: http://www.iht.com/bin/print_ipub.php?file=$1
But it doesn't work: articles downloaded have only title, author and date.
See later
Gaetano
|