ath, one could quite easily make an XML schema which would support tagging for parts of speech --- the problem is the effort of applying all those tags to a document of any length completely and accurately. It's quite enough work just looking through a document, finding the small percentage of instances where things have gone wrong, and applying a fix.
Probably a better choice here is to forgo hyphenation entirely and set everything FLRR (flush left, ragged right) --- certainly Sony's ebook viewing program wouldn't have some of the spacing issues which really irk me if it didn't set things fully justified (FLFR: flush left, flush right).
William