View Single Post
Old Yesterday, 11:30 AM   #12
Musrar
Member
Musrar began at the beginning.
 
Posts: 10
Karma: 10
Join Date: Sep 2024
Device: Kindle Oasis
Quote:
I found something informative in the Amazon docs.
Yeah, makes sense that it uses a fuzzy algorithm. That was solved by removing the ë in the orth value

After more testing I'm encountering more issues.

Apparently, some inflections don't get picked up in some words, whereas an inflection with the same form of another word gets picked up.

For example, the word капли can be both an inflection of ка́пнуть and ка́пля, but in my kindle only ка́пнуть gets picked up, and it makes no sense because in both cases it's just an iform, not even the orth. I tried deleting orth values but it change nothing.

The order in the xhtml files can't explain it, as ка́пнуть (picked up) is 06 6696 (xhtml, line) and ка́пля (not picked up) is 01 7778.

The same happens for example with есть. It's a form of быть and есть, and even removing orth values doesn't change only есть being picked up, and есть (picked up) comes later in the files, 00 955, whereas быть (not picked up) is 00 113.


Guess I'm yet again stuck.
Attached Thumbnails
Click image for larger version

Name:	капнуть.png
Views:	14
Size:	33.5 KB
ID:	218163   Click image for larger version

Name:	капля.png
Views:	13
Size:	23.8 KB
ID:	218164   Click image for larger version

Name:	быть.png
Views:	10
Size:	27.5 KB
ID:	218165   Click image for larger version

Name:	есть.png
Views:	13
Size:	29.5 KB
ID:	218166  
Musrar is offline   Reply With Quote