Just discovered the one thing that chardet is very good at -- chardet is faultless in its detection of the whole family of UTF encodings including utf-7, utf-8, utf-8-sig, utf-16, utf-16BE utf-32 etc. I've therefore changed my encoding detection function to only allow chardet results for the UTF family of encodings. This is as far as I can go I think.
Last edited by slowsmile; 01-13-2017 at 07:49 PM.
|