Thread: Losing content
View Single Post
Old 07-17-2015, 12:43 PM   #12
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 8,841
Karma: 6120478
Join Date: Nov 2009
Device: many
Hi,

One further piece of information - what are your preference settings for auto-fixing the file on import or save with Tidy? Have you enabled or disabled that?

In general, it is never a good idea to run an automated update utility on a malformed file (ie. on a file an xml parser can not properly parse - and so barfs on it). The only thing I can do without allowing auto-fixing is to prevent use of any set of automated changes (renaming files, deleting files, creating toc, editing metadata, etc) if any html file is not well-formed.

This will require a very fast well-formed check that can be quickly run over all html files before allowing the command to proceed.

Not sure what else I can do. The upcoming Sigil 0.9.X series that replace Tidy with googles gumbo html5 parser which is self-reparing by the very definition of html5 parsing. I will check your test case against current master to see what impact it may have.

Thanks!

KevinH
KevinH is offline   Reply With Quote