View Single Post
Old 11-18-2019, 01:36 AM   #1
odamizu
just an egg
odamizu ought to be getting tired of karma fortunes by now.odamizu ought to be getting tired of karma fortunes by now.odamizu ought to be getting tired of karma fortunes by now.odamizu ought to be getting tired of karma fortunes by now.odamizu ought to be getting tired of karma fortunes by now.odamizu ought to be getting tired of karma fortunes by now.odamizu ought to be getting tired of karma fortunes by now.odamizu ought to be getting tired of karma fortunes by now.odamizu ought to be getting tired of karma fortunes by now.odamizu ought to be getting tired of karma fortunes by now.odamizu ought to be getting tired of karma fortunes by now.
 
odamizu's Avatar
 
Posts: 1,833
Karma: 8006102
Join Date: Mar 2015
Device: Kindle, iOS
Entities oddities / 0.9.991 bug?

I may have found another bug in 0.9.991, but I'm struggling in how to describe it.

I have Preferences set to Mend on Open and Preserve Entities only for #160.

I have one epub (so far) that, when loaded into 0.9.991, all the character entities (quotes, apostrophes, etc.) persist, even though Prefs are set to preserve only #160. Running "Mend All HTML Files" fixes this: all the character entities are properly converted, and all is well.

I have another epub where the quotes and apostrophes are converted, but the non-breaking spaces show up as #x00A0. Again, running Mend fixes this: #x00A0 gets converted to #160 and all is well.

Now, when I load the exact same epubs into 0.9.18, all the character entities (except #160) are automatically and properly converted without my having to do anything extra.

So why is 0.9.991 struggling with these character entities, requiring me to run Mend manually, when 0.9.18 is handling it all seamlessly and automatically right off the bat?

Note1: the first epub where none of the character entities converted had <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd"> which was corrected on Mend along with the character entities.

Note2: Both of these epubs originated as AZW3, brought into Sigil via KindleImport plugin. At first I thought it was a KindleImport plugin problem, but when I saved the epub then re-opened them, the character entities continued to persist. But manually running Mend fixed things. So it seems like Mend wasn't being run on Open, despite the Preference settings?

I will play with this more tomorrow to see if I can find more clues, but I wanted to throw this out there. Also, if anyone has suggestions for what I should look for, let me know.
odamizu is offline   Reply With Quote