Quote:
Originally Posted by Lightsource
When I'm viewing it in a text editor before conversion, it's ok to me (whether it's a quote or an emdash, whatever) because I am using a smart editor (editpad, mostly) the displays the character.
|
I see the same thing dealing with older DOS text files. My "smart text editor" (TextPad) has a "dumb down" text mode which it calls "Convert to DOS". It usually helps with quotes and hyphens, but may not do so well with UTF-8 "characters". In that case try the "conversion to .html" method you outlined below.
Quote:
I'm going to take a shot in a few with converting them all to html and doing some f/r, then converting to mobi - I'll update with the outcome.
|
When the text is converted to html (I've use the freeware utility called
text2html), I would then also pass it through Tidy to change the literal characters to HTML codes. Those you could then convert using regex's to their equivalent more popular character encodings, if you like.