@tropoy, it seems that the dictionary program you use while reading a book handles HTML automatically and knows the apparently typical character entity references defined in the doctype. This is somewhat expected, because converter.exe seems to strip out all HTML tags that are more complex than bold and italics: all the interpretation needs to happen on the interpreter side.
However, your dictionary manager doesn't seem to know HTML at all. Which dictionary manager are you referring to?
@EastEriq The tags <f> and </f> are new and were mangled by my script's conversion of ">" to "<". This is now corrected in the script. Also, ampersands are now left alone if they are closely followed by a ";".
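For anyone who wants to see the idea, here is a minimal sketch in Python. It is not the actual script. It assumes the conversion in question turns literal angle brackets into entities such as &lt; and &gt;, that "closely followed" means a ";" within about ten characters, and that <b> and <i> are to be kept alongside <f>. The names ALLOWED_TAG, BARE_AMP and escape_entry are made up for this example.

```python
import re

# Hypothetical allow-list: <f> is named in the post, <b> and <i> are assumed.
ALLOWED_TAG = re.compile(r"(</?(?:f|b|i)>)")

# An ampersand is treated as the start of an entity when a ";" follows within
# a few characters. The limit of 10 is an assumption, not taken from the script.
BARE_AMP = re.compile(r"&(?![A-Za-z0-9#]{1,10};)")


def escape_entry(text: str) -> str:
    """Escape stray &, < and > but keep allowed tags and existing entities."""
    # Splitting on a pattern with one capturing group puts the matched tags
    # at the odd indices of the resulting list.
    parts = ALLOWED_TAG.split(text)
    out = []
    for index, part in enumerate(parts):
        if index % 2 == 1:
            out.append(part)  # allowed tag, copied through untouched
        else:
            part = BARE_AMP.sub("&amp;", part)  # only bare ampersands
            part = part.replace("<", "&lt;").replace(">", "&gt;")
            out.append(part)
    return "".join(out)


print(escape_entry("<f>a & b</f> &eacute; 1 < 2"))
```

The tags are split off before anything is escaped, so the bracket conversion never sees them. That is one way to avoid the kind of damage described above.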
Last edited by Markismus; 02-05-2020 at 05:53 AM.