About charsets
Another little hitch, with another file. I am learning slowly.
Starting from a file in another format, I made an odt out of it some time ago, exported it to xhtml from OpenOffice yesterday, then made an epub out of it with Sigil.
The validator gives me many "errors", mostly about an unneeded "lang" attribute, but I can open the epub on my Sony. I have one hitch though.
What I see is not exactly what I get. While it looks perfect in Sigil, on my PRS-505 a lot of characters are replaced by question marks.
I think there is probably a wrong charset somewhere.
Looking at the epub metadata, I read:
<meta content="text/html; charset=utf-8" http-equiv="Content-Type" />
I am not sure, though, that the file really uses the utf-8 charset.
In Linux, I know it's possible to ascertain the charset of a text file, using the command line with
file
I do not know how to proceed with html or epub files, or how to set them on the right track.
Any hint?
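For reference, since an epub is just a zip archive, one way to check whether the declared utf-8 charset matches the actual bytes is to try decoding each (x)html entry as UTF-8 and list the ones that fail. Below is a minimal sketch of that idea in Python; the function name find_non_utf8_entries and the file name book.epub are made up for illustration, and this has not been tried against the PRS-505.

```python
import zipfile


def find_non_utf8_entries(epub_path, suffixes=(".xhtml", ".html", ".htm")):
    """Return the names of (x)html entries in the epub that are not valid UTF-8.

    Hypothetical helper: it only checks that the bytes decode as UTF-8,
    it does not look at the charset declared in the <meta> tag.
    """
    bad = []
    # An epub is a zip archive, so the standard zipfile module can read it.
    with zipfile.ZipFile(epub_path) as epub:
        for name in epub.namelist():
            if name.lower().endswith(suffixes):
                data = epub.read(name)
                try:
                    data.decode("utf-8")
                except UnicodeDecodeError:
                    # The bytes are in some other charset (e.g. ISO-8859-1).
                    bad.append(name)
    return bad
```

Calling find_non_utf8_entries("book.epub") returns a list of entry names. An empty list means every (x)html file decodes as UTF-8, in which case the charset declaration is probably not the culprit; any name in the list points to a file that would need converting before the declaration is true.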