Quote:
Originally Posted by Snuffi
html literals are characters expressed by html tags. I'm not familiar with the codes for cyrillic letters but here are a few examples I have stumbled across:
ö = ö
ß = ß
& = &
I assume that for cyrillic characters there are similar html-codes and for literals expressed via these codes you don't need special fonts, codepages, etc.
|
Those are called "entities". I don't think there are named entities for Cyrillic, but there are numbered entites like & #1044; (with no space between & and #). See
here for a list
Quote:
|
I am not aware of any program to "convert" cyrillic text into html literals, though. I guess you'd have to look for a program to convert regular text to html which comes from a region using cyrillic letters...
|
In linux I use "recode", which is as simple as: "recode utf8..html myfile.htm" (that converts from UTF-8 encoding to HTML entities, it converts < to <, & to &, etc. too). It seems there's a Windows port
here.