Old 02-16-2022, 09:21 AM   #265
xxyzz
Evangelist
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
WordDumb uses spaCy's named entity recognition (NER) pipeline to find person names and other X-Ray terms. If a character is missing from X-Ray, it's because spaCy didn't recognize the name. Try the large spaCy model; it has higher NER precision.
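
To check whether spaCy picks up a particular character name, something like the rough sketch below may help. It is not WordDumb's actual code: the function names are made up for illustration, and `en_core_web_lg` is only assumed to be the "large" English model meant above (it has to be downloaded separately).

```python
from collections import Counter


def person_names(doc):
    """Count PERSON entities in a spaCy Doc (any object with .ents works)."""
    return Counter(ent.text for ent in doc.ents if ent.label_ == "PERSON")


def extract_people(text, model="en_core_web_lg"):
    """Run a spaCy pipeline over `text` and return the PERSON counts.

    Assumption: the model was installed beforehand with
    `python -m spacy download en_core_web_lg`.
    """
    import spacy  # imported lazily so person_names() has no hard dependency

    nlp = spacy.load(model)
    return person_names(nlp(text))
```

If a name never shows up in the result of `extract_people(...)` for a passage where it clearly appears, it won't make it into X-Ray either. Running the same passage through a small and a large model is a quick way to compare them.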

I already use the RapidFuzz library to merge similar X-Ray entities, and it works quite well for names.
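
For anyone curious what fuzzy merging looks like, here is a minimal sketch, again not the plugin's real code. The `fuzz.ratio` scorer, the threshold of 85, and the greedy grouping are all assumptions chosen for illustration. If RapidFuzz isn't installed, the sketch falls back to the standard library's `difflib`, which is a stand-in and not what WordDumb uses.

```python
try:
    from rapidfuzz import fuzz

    def similarity(a, b):
        # RapidFuzz normalized similarity on a 0-100 scale.
        return fuzz.ratio(a, b)
except ImportError:
    from difflib import SequenceMatcher

    def similarity(a, b):
        # Stand-in scorer (assumption): difflib ratio scaled to 0-100.
        return 100 * SequenceMatcher(None, a, b).ratio()


def merge_similar(names, threshold=85):
    """Greedily group names whose similarity to a group's first name
    is at least `threshold` (hypothetical default)."""
    groups = []
    for name in names:
        for group in groups:
            if similarity(name.lower(), group[0].lower()) >= threshold:
                group.append(name)
                break
        else:
            # No existing group was close enough, so start a new one.
            groups.append([name])
    return groups


groups = merge_similar(["Hermione Granger", "Ron Weasley", "Hermione Grainger"])
print(groups)
```

Here the two spellings of Hermione end up in one group and Ron stays on his own. A lower threshold merges more aggressively but risks joining different characters with similar names.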

Last edited by xxyzz; 02-16-2022 at 09:27 AM.