View Single Post
Old 02-27-2008, 03:41 PM   #359
tompe
Grand Sorcerer
tompe ought to be getting tired of karma fortunes by now.tompe ought to be getting tired of karma fortunes by now.tompe ought to be getting tired of karma fortunes by now.tompe ought to be getting tired of karma fortunes by now.tompe ought to be getting tired of karma fortunes by now.tompe ought to be getting tired of karma fortunes by now.tompe ought to be getting tired of karma fortunes by now.tompe ought to be getting tired of karma fortunes by now.tompe ought to be getting tired of karma fortunes by now.tompe ought to be getting tired of karma fortunes by now.tompe ought to be getting tired of karma fortunes by now.
 
Posts: 7,452
Karma: 7185064
Join Date: Oct 2007
Location: Linköpng, Sweden
Device: Kindle Voyage, Nexus 5, Kindle PW
Quote:
Originally Posted by tompe View Post
I had hoped that Encode was included in the core files. Maybe I can try to use another function. I will check with the Sherlock Holmes Omnibus and see what the problem is.
The problem is that this is an old file so it does not have the information about codepage and I had maybe wrongly assumed that if it was not specified it was UTF-8...

I have now changed the code so that if not "codepage" is specified it does nothing. If 1252 is specified it add the meta header for that. If 65001 (UTF-8) is specified I add meta header for that and decode the byte stream with before sending it to HTML::TreeBuilder.

It worked for the Sherlock Holmes omnibus. I will check the UTF-8 example I have and maybe do a new release later today...
tompe is offline   Reply With Quote