Quote:
Originally Posted by ldolse
Bug then? I did a bit of googling and it seems like Python has some quirks in its handling of BOMs across different Unicode encodings. I didn't look very hard though.
Then it's in chardet, which TXT input uses to detect the encoding. I haven't had time yet, but I need to look at the input file to see whether it actually has a BOM. chardet will only detect UTF-16 if the file has a valid BE or LE BOM.
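For anyone who wants to check their own file in the meantime, here is a rough sketch of how to look for a BOM by hand. It uses only the standard library `codecs` constants and does not call chardet; the names `sniff_bom` and `sniff_file_bom` are made up for this example and are not part of calibre or chardet.

```python
import codecs

# Check the 4-byte UTF-32 BOMs before the 2-byte UTF-16 ones, because the
# UTF-32 LE BOM (FF FE 00 00) begins with the UTF-16 LE BOM (FF FE).
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def sniff_bom(data):
    """Return the encoding implied by a leading BOM, or None if there is none."""
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name
    return None


def sniff_file_bom(path):
    """Hypothetical helper: read the first four bytes of a file and check them."""
    with open(path, "rb") as f:
        return sniff_bom(f.read(4))
```

If `sniff_file_bom("book.txt")` (with your own path) returns None for a file you know is UTF-16, the file has no BOM, which would fit the detection failure described above.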