06-30-2024, 08:08 PM   #39
democrite
Evangelist
Posts: 441
Karma: 77256
Join Date: Sep 2011
Device: none
Quote:
Originally Posted by KevinH View Post
The days of removing accents just to search for pseudo text are long gone.
Maybe that is something for a future date or consideration. Some may not want their text normalized (and I'm not sure whether ligatures or other features are affected too), so it may someday be worth normalizing internally, for search only, in such cases. It could get messy, and I'm not sure how some readers handle this. Perhaps some keep copies in RAM that are normalized and also stripped of diacritics to speed up search.
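To make the in-RAM copy idea concrete, here is a minimal Python sketch. It is only my assumption of how such a search copy could be built, not how any particular reader or editor does it. The function name `fold_for_search` is made up for illustration; the `unicodedata` calls are from the standard library. The stored text is never modified; only the copy used for searching is folded.

```python
import unicodedata

def fold_for_search(text: str) -> str:
    """Build a search-only copy: ligatures expanded, diacritics dropped, case folded."""
    # NFKD applies compatibility decomposition, so the ligature U+FB01 becomes "fi"
    # and an accented letter becomes a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop combining marks (nonzero canonical combining class), keep everything else.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # casefold() is a more aggressive lower() intended for caseless matching.
    return stripped.casefold()

# Hypothetical usage: fold the book once, keep the result in memory,
# and fold each query the same way before searching.
book_text = "Caf\u00e9 \ufb01anc\u00e9"          # "Café ﬁancé" with an fi ligature
folded_book = fold_for_search(book_text)
print("fiance" in folded_book)
```

Letters that have no decomposition, such as "ø", pass through unchanged, so this would not catch every case. Positions found in the folded copy also do not map one-to-one onto the original text, which is part of why it could be a mess.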

Being able to match canonically equivalent text, and possibly to search without regard to diacritics, might be useful for some, though such use is probably rarer. For the latter, there can be typos or OCR errors in diacritics and accents, and some languages have words that are spelled the same apart from an accent or diacritic, so that kind of search could be useful for some.
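For the canonical equivalence part, a small Python sketch of what I mean, assuming both the text and the query are normalized to NFC just before comparing. `canonical_find` is a hypothetical helper name, not an existing API. Unlike a diacritic-insensitive search, this still treats "si" and "sí" as different words.

```python
import unicodedata

def canonical_find(haystack: str, needle: str) -> int:
    """Find needle in haystack, treating canonically equivalent sequences as equal.

    The returned index refers to the NFC-normalized haystack, not the original.
    """
    h = unicodedata.normalize("NFC", haystack)
    n = unicodedata.normalize("NFC", needle)
    return h.find(n)

precomposed = "caf\u00e9"    # "é" as the single code point U+00E9
decomposed = "cafe\u0301"    # "e" followed by U+0301 COMBINING ACUTE ACCENT

print(precomposed == decomposed)                 # raw comparison of the two spellings
print(canonical_find(decomposed, precomposed))   # comparison after NFC
print(canonical_find("si\u0301", "si"))          # accent is still significant
```

Note the last case: a raw substring search for "si" would hit the first two code points of the decomposed "sí", so normalizing first also avoids that kind of false match.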