Earlier I mentioned that I was worried that:
might find other tags that started with 'a'--and indeed it does. The addr, abbr, and area tags are probably able to be dismissed, but the <aside> tag is one that I'm sure we're only going to see more and more of. And my regex will include it.
So for the paranoid/pedantic type (like myself), it's probably best to use:
instead (should work in pretty-much all regex flavors).
The \b just matches a "word" boundary so that no other tags that start with 'a' will be caught up in the match.