The safest is just to use ASCII, and convert all other characters to numerical or named entities:
á -> á
ü -> ü
— -> —
’ -> &rsquo (or & #8217;)
If you do this, you won't have to worry about encodings, because the HTML text is actually just plain ASCII.
|