View Single Post
Old 12-07-2011, 05:54 PM   #2
Serpentine
Evangelist
Serpentine ought to be getting tired of karma fortunes by now.Serpentine ought to be getting tired of karma fortunes by now.Serpentine ought to be getting tired of karma fortunes by now.Serpentine ought to be getting tired of karma fortunes by now.Serpentine ought to be getting tired of karma fortunes by now.Serpentine ought to be getting tired of karma fortunes by now.Serpentine ought to be getting tired of karma fortunes by now.Serpentine ought to be getting tired of karma fortunes by now.Serpentine ought to be getting tired of karma fortunes by now.Serpentine ought to be getting tired of karma fortunes by now.Serpentine ought to be getting tired of karma fortunes by now.
 
Posts: 416
Karma: 1045911
Join Date: Sep 2011
Location: Cape Town, South Africa
Device: Kindle 3
Not that I use any terrible crapple specific stuff but:
Sounds like you've got some invalid characters (commas, colons etc) in the hrefs or using direct addressing; ie href="something.html#stuff" often causes trouble while href="../Text/something.html#stuff" will be fine.

Anyway, if you you need help, it's a good idea to provide us with an example(s), this regex :
Code:
<a\b(?:\s*\w+="[^"]*")*\s*(href="[^"]+")(?:\s*\w+="[^"]*")*\s*>
will grab all href's from <a> tags, group 1 contains just the href itself, if you could run this over your html and save the matches, that would be quite helpful.
Serpentine is offline   Reply With Quote