View Single Post
Old 03-31-2011, 10:44 PM   #2
oneillpt
Connoisseur
oneillpt began at the beginning.
 
Posts: 63
Karma: 46
Join Date: Feb 2011
Device: Kindle 3 (cracked screen!); PW1; Oasis
Quote:
Originally Posted by phiznlil View Post
added charset ISO-8859-15, is the correct? It seems to work best with all the fadas (acute accents) in Irish and the euro symbol.
The encoding does not appear to be critical for the Irish Times recipe, as the potential encoding problem which might arise with the Euro symbol (vowels with fadas would be encoded identically with ISO-8859-15, ISO-8859-1 or Windows-1252 in any case) are avoided by using HTML entities, "& #8364;" for the Euro symbol, "& #237;" for i fada, etc. (without the space between & and #, inserted here to display the entity rather than the resulting character)

The Irish Times pages are declared as us-ascii, which implies 7-bit ASCII codes, and which, if followed consistently, would require all these "special" characters to be encoded as HTML entities. I'm not sure whether the entities defined extend to traditional script, but the séimhiú can be handled with roman letters, for example "& #7682;" for Ḃ (B séimhiú), "& #7683" for ḃ (b séimhiú), etc.

If 7-bit ASCII encoding with HTML entities for all characters not included among the 7-bit set is followed consistently it should not matter whether the encoding is specified as ISO-8859-15, ISO-8859-1, Windows-1252 or UTF-8, as all these encodings are identical for the 7-bit codes. Have you noticed any instances where the Euro symbol or any letters with fadas (or any other special character for that matter) appeared incorrectly?
oneillpt is offline   Reply With Quote