View Single Post
Old 12-03-2012, 06:55 AM   #9
andrusz
Member
andrusz ought to be getting tired of karma fortunes by now.andrusz ought to be getting tired of karma fortunes by now.andrusz ought to be getting tired of karma fortunes by now.andrusz ought to be getting tired of karma fortunes by now.andrusz ought to be getting tired of karma fortunes by now.andrusz ought to be getting tired of karma fortunes by now.andrusz ought to be getting tired of karma fortunes by now.andrusz ought to be getting tired of karma fortunes by now.andrusz ought to be getting tired of karma fortunes by now.andrusz ought to be getting tired of karma fortunes by now.andrusz ought to be getting tired of karma fortunes by now.
 
Posts: 24
Karma: 476046
Join Date: Nov 2012
Location: Poland
Device: Kobo Touch, Kindle K3
Quote:
Originally Posted by tshering View Post
I have to confess that this is nonsense. The filenames are stored in the zip file in UTF-8 encoding. You can verify this by opening the zip file in a HEX editor and inspecting the filenames. When the files are unzipped the encoding of the filenames depends on the settings of the unzipping program and the OS. 7-zip under windows encodes them by default as UTC/unicode.
I do not have hex editor right now to verify it but look at my previous post. This is a list of zip file contents run on RedHat Linux with UTF-8 language settings using the following command: unzip -l dicthtml-ja.zip

I have observed the same on Windows 7 (english version) opening the files using Winzip:

Old dictionary:


New dictionary:


Have you observed the same?
andrusz is offline   Reply With Quote