View Single Post
Old 02-28-2016, 11:47 AM   #6
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 8,911
Karma: 6120478
Join Date: Nov 2009
Device: many
I looked at what calibre does. For every single word they spellcheck they use a regular expression replacement of the regular and short hyphens with nothing. If that shortens the word, they then spellcheck the shortened version first as a new word, if it passes they add it first to the suggestions, and then go and spellcheck the original word, and then test each new suggestion to prevent duplication with the no hyphen suggestion.

Sorry, Sigil is not going to go through all of that for a special case that only comes up for OCR text. Either a plugin or just normal find and replace can be done before the spellcheck to easily detect real hyphenation from OCR induced hyphenation and even better this would present the word in context.

Sorry.

KevinH
KevinH is offline   Reply With Quote