Quote:
Originally Posted by tompe
I had hoped that Encode was included in the core files. Maybe I can try to use another function. I will check with the Sherlock Holmes Omnibus and see what the problem is.
The problem is that this is an old file, so it does not carry any codepage information, and I had (maybe wrongly) assumed that if no codepage was specified the content was UTF-8...
I have now changed the code so that if no "codepage" is specified, it does nothing. If 1252 is specified, it adds the meta header for that. If 65001 (UTF-8) is specified, it adds the meta header for that and decodes the byte stream before sending it to HTML::TreeBuilder.
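For anyone following along, the logic is roughly like the sketch below. This is not the actual code from the tool, just an illustration of the three cases. The names prepare_html and add_meta are made up for this example, and so are the charset labels and the way the meta header is inserted (after the opening head tag, or at the very start if there is none). It uses only the Encode module.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper: insert a charset meta header into an HTML string.
# Placement (right after <head>, else at the very start) is an assumption.
sub add_meta {
    my ($html, $charset) = @_;
    my $meta = qq{<meta http-equiv="Content-Type" content="text/html; charset=$charset">};
    $html =~ s/(<head\b[^>]*>)/$1$meta/i
        or $html = $meta . $html;
    return $html;
}

# Hypothetical helper: prepare the raw HTML bytes according to the codepage.
sub prepare_html {
    my ($bytes, $codepage) = @_;

    # Old files carry no codepage: do nothing and pass the bytes through.
    return $bytes unless defined $codepage;

    # Windows-1252: only add the meta header, leave the bytes as they are.
    return add_meta($bytes, 'windows-1252') if $codepage == 1252;

    # UTF-8: decode the byte stream to Perl characters, then add the header.
    if ($codepage == 65001) {
        my $text = decode('UTF-8', $bytes);
        return add_meta($text, 'utf-8');
    }

    # Any other codepage is left untouched in this sketch.
    return $bytes;
}
```

The string returned by prepare_html would then be what gets handed to HTML::TreeBuilder for parsing.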
It worked for the Sherlock Holmes omnibus. I will check the UTF-8 example I have and maybe do a new release later today...