|  12-06-2024, 03:03 AM | #1 | 
| Connoisseur  Posts: 76 Karma: 10 Join Date: Feb 2022 Device: None | 
				
				Problem finding words with diacriticals in them
			 
			
			One issue with regex is that it does not appear to be able to ignore diacriticals. IME, this is often necessary because diacritical marks may have been incorrectly applied and need to be replaced. For example. suppose you want to find the word "protege" (i.e. protégé). It will not return any results if there are diacriticals in it, unless you enter the letters with the exact same diacriticals.. Calibre needs to have a setting allowing it to read letters with diacriticals as ordinary letters. This will allow the user to find all instances of the word even if the diacriticals have been incorrectly applied. | 
|   |   | 
|  12-06-2024, 03:15 AM | #2 | 
| creator of calibre            Posts: 45,604 Karma: 28548974 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			Not something I am interested in implementing, too much work to try to get regex engines to act like that. It's a reasonable amount of work to have plain text searches do that, but not something I care enough about to implement.
		 | 
|   |   | 
|  12-06-2024, 03:20 AM | #3 | 
| Wizard            Posts: 1,687 Karma: 9500498 Join Date: Sep 2021 Location: Australia Device: Kobo Libra 2 | 
			
			There are a few ways around that... 1. Search for é to find all instances where it is used, 2. From Tools>Reports>Characters, you can see a listing of all diacritics used and click on them to jump to them 3. Use the Spellcheck which does exactly what you ask... | 
|   |   | 
|  12-16-2024, 09:53 AM | #4 | 
| Junior Member  Posts: 2 Karma: 10 Join Date: Apr 2024 Device: Paperwhite | 
			
			It's not exactly an answer to your question, but a palliative: why not search : prot?g? More precise: prot[eéèê]g[eéèê] Last edited by Biblos; 12-16-2024 at 09:57 AM. | 
|   |   | 
|  12-17-2024, 06:27 PM | #5 | |
| Grand Sorcerer            Posts: 5,763 Karma: 24088559 Join Date: Dec 2010 Device: Kindle PW2 | Quote: 
 For example: (these){e<=1} will find these, those, there (and many other strings). If you search for (protege){e<=2} in Regex mode, it'll find protege , protegé and protégé. It might of course also find other similar strings... | |
|   |   | 
|  | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Problem finding font adjustments | Strether | PocketBook | 24 | 05-16-2024 01:10 AM | 
| Finding Series of Capitalized Words? | enuddleyarbl | ePub | 5 | 02-12-2023 12:18 PM | 
| First post: Problem between Words and Sigil | Ti-Ron | Sigil | 12 | 03-28-2012 04:11 PM | 
| Highlighting, Defining Words on K3 - Problem with | detacht69 | Amazon Kindle | 2 | 09-29-2010 10:36 PM | 
| Having problem finding the eBook I want to buy. | Jary316 | Deals and Resources (No Self-Promotion or Affiliate Links) | 5 | 08-25-2008 01:12 PM |