And by a binary search of hrefs in that file, it seems the culprit is this line:
href="https://en.wikipedia.org/wiki/Chamb%C3%A9ry"
So the boost library to unquote that url seems to be barfing on Chambéry
and utf-8 for é is the bytesequence 0xc3 0xa9 so I think this is a valid url
The problem must be someplace in boosts library but I am not sure where or a missing url unquote someplace.
|