Quote:
Originally Posted by kovidgoyal
@JTravers
match_regexp works on the contents of the href attribute, i.e. the URL itself, not on the <a> tag.
|
Here's the code I'm using for the link regexp:
Code:
match_regexps = ['http://online.barrons.com/.*?html\?mod=.*?']
But I can see webpages being fetched from entirely different domains than barrons.com. I've attached my profile for Barrons. You should be able to test it (at your convenience, of course) without supplying a username and password, as there are some articles that are available to non-subscribers.