View Single Post
Old 04-10-2011, 04:45 PM   #15
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by Lightsource View Post
When I'm viewing it in a text editor before conversion, it's ok to me (whether it's a quote or an emdash, whatever) because I am using a smart editor (editpad, mostly) the displays the character.
I see the same thing dealing with older DOS text files. My "smart text editor" (TextPad) has a "dumb down" text mode which it calls "Convert to DOS". It usually helps with quotes and hyphens, but may not do so well with UTF-8 "characters". In that case try the "conversion to .html" method you outlined below.

Quote:
I'm going to take a shot in a few with converting them all to html and doing some f/r, then converting to mobi - I'll update with the outcome.
When the text is converted to html (I've use the freeware utility called text2html), I would then also pass it through Tidy to change the literal characters to HTML codes. Those you could then convert using regex's to their equivalent more popular character encodings, if you like.
nrapallo is offline   Reply With Quote