View Single Post
Old 07-26-2016, 08:20 AM   #61
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 8,841
Karma: 6120478
Join Date: Nov 2009
Device: many
When cleaning the text and parsing it before passing it to the spell checker it should be easy to filter out entities like & s h y ; and its numerical equivalents. But truly, soft-hyphenating words is probably best left to the very last step, after all other changes including spellchecking are completed.

So I recommend removing all soft-hyphens from the document using search and replace, until the text and epub are in an "as desired" state and then using a hyphenation library to add back in soft-hyphens if and only if you are producing an epub for readers that support them.
KevinH is online now   Reply With Quote