09-16-2011, 05:26 PM   #18
Pr.BarnArt
Posts: 245
Karma: 44966
Join Date: Jun 2011
Device: Pocketbook 902 / Ipad2
Quote:
Originally Posted by dave9000
There's a small problem with character encoding (windows-1252).
E.g. this sentence:
"... uno dei piÃ¹ famosi scrittori ..."
should be
"... uno dei più famosi scrittori ..."

It looks like the original text is encoded TWICE with UTF-8.
Thanks for the useful plugin anyway!

P.S. the following patch in worker.py seems to fix the issue:
OLD: raw = raw.decode('windows-1252', errors='replace')
NEW: raw = raw.decode('utf-8', errors='replace')
Thanks, I will check this.
Can you give me the title and author, please?
Do you use anobii or bol-it as the source?
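
For reference, the effect described in the quoted report can probably be reproduced with a small sketch like the one below. It is not the plugin's actual code: `decode_page` and the sample sentence are made up for illustration, and it assumes the page bytes really are UTF-8 (written for Python 3).

```python
def decode_page(raw: bytes, encoding: str) -> str:
    """Hypothetical helper: decode downloaded page bytes with a given encoding."""
    return raw.decode(encoding, errors="replace")


# Assumption: the source site serves UTF-8 bytes.
sample = "uno dei più famosi scrittori"
raw = sample.encode("utf-8")

# Decoding UTF-8 bytes as windows-1252 turns "ù" (bytes C3 B9) into "Ã¹".
wrong = decode_page(raw, "windows-1252")  # 'uno dei piÃ¹ famosi scrittori'

# Decoding with the matching encoding gives back the original text.
right = decode_page(raw, "utf-8")  # 'uno dei più famosi scrittori'
```

If the two sources use different encodings, hard-coding either value would break one of them, which is why knowing the source matters here.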