I find it useful to convert the problem file to html and setting the encoding to utf8 use the w3c verification page to find the problem characters. It takes me less time than going through to find each instance and every time the file is unable to verify because of a problem character I do a search and replace for that character.
http://www.ascii.cl/htmlcodes.htm is a good reference for the most common problem characters.