There are a number of algorithms that could be used. Levenshtein (edit) distance and Jaccard similarity are two that come to mind, and Python implementations of both can be found on GitHub.
DaltonSt might be interested in creating a new plugin, or adding it to one of his existing plugins. It would be good if it offered a choice of algorithms.
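
To give a rough idea of what a choice of algorithms could look like, here is a minimal Python sketch. It is only an illustration and is not taken from any existing plugin: the function names (`levenshtein`, `jaccard`, `similarity`) and the `algorithm` parameter are my own assumptions, and I have assumed Jaccard is computed over lowercase words split on whitespace, which is only one of several reasonable choices.

```python
def levenshtein(a: str, b: str) -> int:
    """Number of single-character insertions, deletions or substitutions
    needed to turn a into b (classic dynamic-programming version)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution (or match)
            ))
        previous = current
    return previous[-1]


def jaccard(a: str, b: str) -> float:
    """Jaccard similarity of the two word sets: |A & B| / |A | B|.
    Assumption: words are lowercased and split on whitespace."""
    set_a, set_b = set(a.lower().split()), set(b.lower().split())
    if not set_a and not set_b:
        return 1.0  # treat two empty inputs as identical
    return len(set_a & set_b) / len(set_a | set_b)


def similarity(a: str, b: str, algorithm: str = "levenshtein") -> float:
    """Score from 0.0 (nothing in common) to 1.0 (identical),
    using whichever algorithm the user picked (hypothetical option names)."""
    if algorithm == "levenshtein":
        longest = max(len(a), len(b))
        return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest
    if algorithm == "jaccard":
        return jaccard(a, b)
    raise ValueError(f"unknown algorithm: {algorithm}")
```

Something like `similarity(title_a, title_b, algorithm="jaccard")` could then be compared against a user-set threshold. Levenshtein tends to suit short strings with small typos, while word-based Jaccard is more forgiving of reordered words.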
BR