If I were them, I would be a little hesitant. It is hard enough to eliminate bugs now, and adding 20-30 variations of Tidy might make it even harder. But it would be nice to have more options for cleaning if they were reliable, and that, I think, is the crux of the matter.
It is easy for the regex guys to do this stuff, but I spend all my energy on PDF-origin documents that substitute "a" for "o" and scramble parts of sentences around within paragraphs. That leaves not much energy for the misnamed "regular" expressions.