You could create a simple sed script with one line for each character that you need to fix. E.g.
Then simply save the lines as a utf8 text file (without BOM), e.g. fix.sed
, and execute it with sed:
sed -f fix.sed -i *.html
(Note that this will overwrite the original files.)