Old 02-19-2025, 03:01 PM   #1
h3ct0r
Connoisseur
Posts: 58
Karma: 10
Join Date: Oct 2023
Device: glo hd
dictionaries not recognizing ligatures and diacritics

Some of my texts contain diacritics (principally apices and diaereses) and ligatures (principally æ, œ) that are encoded as standalone letters. My dictionaries don't include those characters at all, so they can't identify the words.

Does anyone know how to format my texts so that:
Æ is always recognized as AE
Œ is OE
Ë is E
ÁÉÍÓÚ is AEIOU

I want to do this without replacing the diacritics in the text. Or, conversely, make the dictionaries themselves recognize those letters as such.

Please help me if you have any information.
Thank you
Old 02-19-2025, 05:11 PM   #2
Aleron Ives
Wizard
Posts: 1,688
Karma: 16307824
Join Date: Sep 2022
Device: Kobo Libra 2
I don't think you can do what you want. You would either have to replace all the letters in the book with the letters that the dictionary recognises, or you would have to add new entries to the dictionary for every word that might be spelled with special characters. Since you don't want to do the former, typing in the words manually to look them up is easier and faster than doing the latter.

If it's possible to use NickelMenu to perform dictionary lookups, you could potentially inject a shell script that would find and replace special characters before passing the word to the dictionary, but I don't know of a way to perform dictionary lookups with a shell script on Kobo.
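
For what the find-and-replace step itself might look like, here is a minimal Python sketch. It is not known to run on a Kobo and is not hooked into NickelMenu or any dictionary lookup; `FOLD_TABLE` and `fold_word` are made-up names, and the table only covers the characters listed in the first post.

```python
import unicodedata

# Explicit table for the characters named in the first post (assumption:
# extend it by hand for any other letters your books use).
# Ligatures expand to two letters, so a dict is needed instead of a 1:1 map.
FOLD_TABLE = str.maketrans({
    "Æ": "AE", "æ": "ae",
    "Œ": "OE", "œ": "oe",
    "Ë": "E", "ë": "e",
    "Á": "A", "á": "a",
    "É": "E", "é": "e",
    "Í": "I", "í": "i",
    "Ó": "O", "ó": "o",
    "Ú": "U", "ú": "u",
})


def fold_word(word: str) -> str:
    """Return the word with the listed ligatures and diacritics replaced."""
    # Compose first, so "e" + combining diaeresis becomes the single letter "ë"
    # and is caught by the table.
    composed = unicodedata.normalize("NFC", word)
    return composed.translate(FOLD_TABLE)


print(fold_word("cœlum"))
```

Anything not in the table passes through unchanged, so the original word is still looked up as-is when it contains none of these characters.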
Old 02-19-2025, 06:15 PM   #3
JSWolf
Resident Curmudgeon
Posts: 79,745
Karma: 145864619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
delete post
Old 02-20-2025, 12:17 AM   #4
h3ct0r
Quote:
Originally Posted by Aleron Ives
I don't think you can do what you want. You would either have to replace all the letters in the book with the letters that the dictionary recognises, or you would have to add new entries to the dictionary for every word that might be spelled with special characters. Since you don't want to do the former, typing in the words manually to look them up is easier and faster than doing the latter.

If it's possible to use NickelMenu to perform dictionary lookups, you could potentially inject a shell script that would find and replace special characters before passing the word to the dictionary, but I don't know of a way to perform dictionary lookups with a shell script on Kobo.
In that case, do you know if there's a way to automate the process of changing every letter in the book?
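
One possible way to automate it, as a rough Python sketch for DRM-free EPUB files (an EPUB is a zip archive of XHTML files). `fold_text` and `fold_epub` are hypothetical names. The sketch assumes the content files are UTF-8, strips every combining mark rather than only the ones listed above, and also touches attribute values, so a non-ASCII file name inside a link could break. Work on a copy of the book.

```python
import unicodedata
import zipfile

# Ligatures have no Unicode decomposition, so they are expanded by hand.
LIGATURES = {"Æ": "AE", "æ": "ae", "Œ": "OE", "œ": "oe"}


def fold_text(text):
    """Expand the ligatures above and strip all combining marks."""
    for ligature, plain in LIGATURES.items():
        text = text.replace(ligature, plain)
    # NFD splits "É" into "E" plus a combining acute accent; drop the accent.
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(c for c in decomposed if not unicodedata.combining(c))
    return unicodedata.normalize("NFC", stripped)


def fold_epub(src, dst):
    """Copy the EPUB at src to dst, folding the text of its (X)HTML files."""
    with zipfile.ZipFile(src) as zin, zipfile.ZipFile(dst, "w") as zout:
        # Keep the original entry order and per-entry compression, so the
        # "mimetype" entry stays first and uncompressed if it was that way.
        for item in zin.infolist():
            data = zin.read(item)
            if item.filename.lower().endswith((".xhtml", ".html", ".htm")):
                data = fold_text(data.decode("utf-8")).encode("utf-8")
            zout.writestr(item, data)
```

Usage would be along the lines of `fold_epub("book.epub", "book-folded.epub")`, with both paths being placeholders.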
Old 02-20-2025, 12:22 AM   #5
geek1011
Wizard
Posts: 2,804
Karma: 7025947
Join Date: May 2016
Location: Ontario, Canada
Device: Kobo Mini, Aura Edition 2 v1, Clara HD
New Kobo dictionaries can contain a `prefix_exceptions` file, which is a marisa-trie mapping each original word to the word to look up in its place.

Unfortunately, I have not had a chance to implement this in dictutil, and there's no publicly available software for working with the newer dictionary format. It can still be done manually if you care enough.
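
As a rough illustration of the mapping such a file would have to hold, the Python sketch below collects original-word to lookup-word pairs from a book's text and a list of dictionary headwords. It does not write a marisa-trie or any Kobo file, and the on-disk layout of `prefix_exceptions` is not covered here. `fold` and `exception_pairs` are hypothetical helper names.

```python
import re
import unicodedata

# Ligatures have no Unicode decomposition, so expand them explicitly.
LIGATURES = str.maketrans({"Æ": "AE", "æ": "ae", "Œ": "OE", "œ": "oe"})


def fold(word):
    """Expand ligatures and strip combining marks from one word."""
    decomposed = unicodedata.normalize("NFD", word.translate(LIGATURES))
    return "".join(c for c in decomposed if not unicodedata.combining(c))


def exception_pairs(book_text, headwords):
    """Map each accented word in the text to a plain form the dictionary has."""
    known = {h.casefold() for h in headwords}
    pairs = {}
    # Compose first so a letter plus combining mark counts as one letter.
    text = unicodedata.normalize("NFC", book_text)
    for word in re.findall(r"[^\W\d_]+", text):
        folded = fold(word)
        # Keep only words that changed and whose plain form is a headword.
        if folded != word and folded.casefold() in known:
            pairs[word] = folded
    return pairs


pairs = exception_pairs("The æther and Noël.", ["aether", "noel"])
```

The resulting dictionary is only the data; turning it into a file the reader accepts is the manual part.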
Old 02-20-2025, 07:06 AM   #6
h3ct0r
Quote:
Originally Posted by geek1011
New Kobo dictionaries can contain a `prefix_exceptions` file, which is a marisa-trie mapping each original word to the word to look up in its place.

Unfortunately, I have not had a chance to implement this in dictutil, and there's no publicly available software for working with the newer dictionary format. It can still be done manually if you care enough.
They actually aren't official Kobo dictionaries; I converted them myself with PyGlossary. Maybe someone knows of a way to fix or circumvent this issue. Thanks for your response.


All times are GMT -4.

