11-29-2013, 05:09 AM   #9
Jellby
frumious Bandersnatch
Quote:
Originally Posted by pgfiore
- HEX
65CC8020616371756973746162696C65 (continuous string, no special html inside)

"CC80"???? They are two, definitely; tea for two bloody chars?

OK, that is correct. CC 80 is the combining grave accent (U+0300) encoded in UTF-8. And yes, UTF-8 uses two bytes for many characters outside basic ASCII. It uses three bytes for most CJK characters, and four bytes for characters outside the Basic Multilingual Plane (rarer CJK ideographs, for instance). There's just no way you can use only one byte for every character when you have many more than 256 of them.

http://www.fileformat.info/info/unic...0300/index.htm
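If you want to check the byte counts yourself, here is a minimal Python sketch. The sample characters and the names `samples` and `byte_counts` are just my picks for illustration, nothing to do with Sigil.

```python
# How many bytes does UTF-8 need for a few sample characters?
# The selection below is illustrative only.
samples = {
    "e": "\u0065",                        # plain ASCII letter
    "combining grave accent": "\u0300",   # encodes as CC 80
    "e with grave": "\u00e8",             # precomposed, encodes as C3 A8
    "CJK U+4E2D": "\u4e2d",               # common CJK ideograph
    "CJK U+20000": "\U00020000",          # CJK Extension B, outside the BMP
}

byte_counts = {label: len(ch.encode("utf-8")) for label, ch in samples.items()}

for label, count in byte_counts.items():
    print(f"{label}: {count} byte(s)")
```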

So the string is "e", "combining grave accent", "space", "a", "c", etc. Nothing wrong with that, and there's definitely a bug in code view if it's not showing the space. But it's not necessarily a bug in Sigil, it could be in some of the libraries it's using.
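To see that breakdown for yourself, a short Python sketch (using only the standard library, variable names are mine) can decode the hex dump from the quoted post and name each character:

```python
import unicodedata

# The hex dump quoted above, decoded as UTF-8.
raw = bytes.fromhex("65CC8020616371756973746162696C65")
text = raw.decode("utf-8")

# Official Unicode name of every character in the decoded string.
names = [unicodedata.name(ch) for ch in text]

for ch, name in zip(text, names):
    print(f"U+{ord(ch):04X}  {name}")
```

That gives 16 bytes but only 15 characters, because the combining accent is the one character that takes two bytes.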

As you have been told already, the easiest solution is to use the precomposed character "è", instead of "e"+"combining grave accent". That's:

C3 A8 20 61 63 71 75 69 73 74 61 62 69 6C 65

http://www.fileformat.info/info/unic...00e8/index.htm
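If there are many such sequences in a file, replacing them by hand gets tedious. One possible approach (a sketch, assuming you can run the text through Python outside Sigil) is Unicode NFC normalization, which composes "e" + combining grave accent into the precomposed character:

```python
import unicodedata

# Decomposed form: "e" followed by the combining grave accent.
decomposed = bytes.fromhex("65CC8020616371756973746162696C65").decode("utf-8")

# NFC normalization composes base letter + combining mark where a
# precomposed character exists.
composed = unicodedata.normalize("NFC", decomposed)

# Hex dump of the result, one byte per group.
composed_hex = " ".join(f"{b:02X}" for b in composed.encode("utf-8"))
print(composed_hex)
```

The resulting bytes should match the ones listed above. Note that NFC touches every composable sequence in the text, not only this one, so check the output before saving over the original.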