View Single Post
Old 03-08-2011, 06:56 AM   #18
user_none
Sigil & calibre developer
user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.
 
user_none's Avatar
 
Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
Quote:
Originally Posted by ldolse View Post
Bug then? I did a bit of googling and it seems like Python has some quirks in it's handling of BOMs across different unicode encodings. I Didn't look very hard though.
Then it's in chardet which TXT input uses for detecting the encoding. I haven't had time but I need to look at the input file to see if it actually has a BOM. chardet will only detect UTF-16 if it has a valid BE or LE BOM.
user_none is offline   Reply With Quote