Thanks for the suggestion, but the page really does use UTF-8. Setting the encoding to any other value just adds garbage characters to the text.
I'm afraid this may require a more complex solution. I'll have a look at the built-in recipes for more inspiration and report back if and when I make any progress.