View Single Post
Old 08-11-2015, 02:32 AM   #142
Doitsu
Grand Sorcerer
Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.
 
Doitsu's Avatar
 
Posts: 5,737
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by trloha View Post
Hey all, I tried doing this for a Vietnamese dictionary and it's having trouble with some characters like đ Đ ụ ư ă and i'm sure a bunch of others. Any thoughts? I tried adding -utf to the script but it didn't work.
IIRC, the tab-delimited source file needs to be saved as a UTF-8 file. To ensure the correct encoding open it with Windows Notepad and save it as a UTF-8 file.
If you don't have a Window machine, save it as a UTF-8 file with a BOM.

Last edited by Doitsu; 08-11-2015 at 02:57 AM.
Doitsu is offline   Reply With Quote