FWIW, there is no single official standard for regular expressions, especially for the more advanced features. PCRE now differs even from Perl, Java's flavor is different again, and so on. And yes, we use version 1 of the PCRE library inside Sigil.
Have you tried removing the square brackets around the [\w]+ part of your regular expression? You are not selecting characters from a set here; \w is already a set. Also, what controls the greediness of the remaining letters of Anzil? And what exactly are you trying to do with capture groups 4 and 5? With the * there, couldn't it easily match the nzil part?
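To illustrate both points, here is a rough sketch in Python. Python's re module is not PCRE, so treat this only as a demonstration of the general behavior. The sample text and the patterns are made up for illustration; they are not your actual expression.

```python
import re

# Hypothetical sample text; the real input from the thread is not shown.
text = "Anzil went home"

# \w is already a character class, so wrapping it in [] changes nothing.
bracketed = re.match(r"[\w]+", text).group(0)
plain = re.match(r"\w+", text).group(0)

# A greedy * grabs as many word characters as it can after the "A";
# the lazy *? grabs as few as it can (here, none).
greedy = re.match(r"(A)(\w*)", text).groups()
lazy = re.match(r"(A)(\w*?)", text).groups()

print(bracketed, plain)  # both are the same match
print(greedy)            # second group holds the rest of the word
print(lazy)              # second group is empty
```

If the greedy group swallows the nzil part, any later groups that were meant to capture those letters will be left with nothing to match.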