#1 |
Connoisseur
Posts: 58
Karma: 10
Join Date: Oct 2023
Device: glo hd
|
dictionaries not recognizing ligatures and diacritics
I have a problem with some of my texts that contain diacritics (principally apices and diaereses) and ligatures (principally æ, œ). These are encoded as standalone characters, but my dictionaries don't include them at all, so they can't identify the word.
Does anyone know how to format my texts so that:
Æ is always recognized as AE
Œ as OE
Ë as E
ÁÉÍÓÚ as AEIOU
I want to do this without replacing the diacritics in the text. Or, conversely, is there a way to make the dictionaries themselves recognize those letters as such? Please help me if you have any information. Thank you.
#2 |
Wizard
Posts: 1,688
Karma: 16307824
Join Date: Sep 2022
Device: Kobo Libra 2
|
I don't think you can do what you want. You would either have to replace all the letters in the book with the letters that the dictionary recognises, or you would have to add new entries to the dictionary for every word that might be spelled with special characters. Since you don't want to do the former, typing in the words manually to look them up is easier and faster than doing the latter.
If it's possible to use NickelMenu to perform dictionary lookups, you could potentially inject a shell script that would find and replace special characters before passing the word to the dictionary, but I don't know of a way to perform dictionary lookups with a shell script on Kobo.
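For illustration only, the find-and-replace step described above could look roughly like the Python sketch below. It is not a Kobo or NickelMenu feature; the `fold` function and the `LIGATURES` table are hypothetical names made up for this example. It assumes ligatures need an explicit table (Æ and Œ have no Unicode decomposition), while accented letters can be decomposed and stripped of their combining marks.

```python
import unicodedata

# Ligatures do not decompose under Unicode normalization,
# so they are expanded with an explicit (hypothetical) table.
LIGATURES = {"Æ": "AE", "æ": "ae", "Œ": "OE", "œ": "oe"}


def fold(word: str) -> str:
    """Return a plain-letter form of `word` for dictionary lookup."""
    # Step 1: expand ligatures into their two-letter equivalents.
    expanded = "".join(LIGATURES.get(ch, ch) for ch in word)
    # Step 2: split accented letters into base letter + combining mark.
    decomposed = unicodedata.normalize("NFD", expanded)
    # Step 3: drop the combining marks, keeping only the base letters.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


if __name__ == "__main__":
    for sample in ("Cæsar", "Phœbë", "ÁÉÍÓÚ"):
        print(sample, "->", fold(sample))
```

The book text itself is left untouched; only the word handed to the lookup would be folded.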
#3 |
Resident Curmudgeon
Posts: 79,745
Karma: 145864619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
delete post
#4 | |
Connoisseur
Posts: 58
Karma: 10
Join Date: Oct 2023
Device: glo hd
|
Quote:
#5 |
Wizard
Posts: 2,804
Karma: 7025947
Join Date: May 2016
Location: Ontario, Canada
Device: Kobo Mini, Aura Edition 2 v1, Clara HD
|
New Kobo dictionaries can contain a `prefix_exceptions` file, which is a marisa-trie that maps each original word to the word that should be looked up in its place.
Unfortunately, I have not had a chance to implement this in dictutil, and there's no publicly available software for working with the newer dictionary format. It can still be done manually if you care enough.
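As a rough illustration of the "original word to replacement word" idea, the Python sketch below collects such pairs from a piece of text. It does not read or write the `prefix_exceptions` file or any marisa-trie; the on-disk layout is not described in this thread, so the output here is just an in-memory dict. The names `fold` and `exception_pairs` are hypothetical.

```python
import re
import unicodedata

# Hypothetical ligature table; these characters have no Unicode decomposition.
LIGATURES = str.maketrans({"Æ": "AE", "æ": "ae", "Œ": "OE", "œ": "oe"})


def fold(word: str) -> str:
    """Expand ligatures and strip combining marks from `word`."""
    decomposed = unicodedata.normalize("NFD", word.translate(LIGATURES))
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def exception_pairs(text: str) -> dict[str, str]:
    """Map each word containing special characters to its folded form."""
    # Normalize to NFC first so accented letters are single code points
    # and the word regex does not split on combining marks.
    text = unicodedata.normalize("NFC", text)
    pairs = {}
    for word in re.findall(r"[^\W\d_]+", text):
        folded = fold(word)
        if folded != word:  # only words that actually change are exceptions
            pairs[word] = folded
    return pairs


if __name__ == "__main__":
    for original, lookup in exception_pairs("Cæsar met Phœbë in Rome").items():
        print(f"{original}\t{lookup}")
```

A list like this would only be the input to the manual step; turning it into a file the device accepts is the part that currently has no public tooling.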
#6 | |
Connoisseur
Posts: 58
Karma: 10
Join Date: Oct 2023
Device: glo hd
|
Quote:
Thread | Thread Starter | Forum | Replies | Last Post |
Diacritics | crutledge | ePub | 22 | 06-21-2016 06:24 AM |
GIMP and diacritics | AlexBell | Workshop | 9 | 10-22-2015 08:53 AM |
Gen3 Diacritics for Epub | coolfishx | Bookeen | 0 | 07-11-2011 07:49 AM |
Troubleshooting Diacritics support? | amobile | Amazon Kindle | 13 | 01-16-2011 07:42 PM |
Diacritics, Czech | kucera | Kobo Reader | 9 | 12-24-2010 12:13 PM |