![]() |
#1 |
Member
![]() Posts: 13
Karma: 10
Join Date: Nov 2014
Device: android
|
koreader and stardic
Dear all, I am re-posting here a question I have asked in a different section of the forum, but perhaps here it is more appropriate since koreader is much used for koboreader devices -
would you be interested in a stardic version of Latin-English inflected? I have been trying to convert it from a bgl file for several weeks with no success, it's because I have recently discovered the wonderful Koreader which does not flash on my e-ink device (icarus illumina) when looking for words and the Latin-English inflected would solve an enormous problem. So, I have tried to convert it in various ways, being no expert at all in such things, somehow I understood how to use the Python converter, I got the files, with a very big syn. file (around 21M) but Koreader does not recognize the dictionary, even if it does recognize all the other stardictionaries I could find around for other languages. I am posting here the bgl file and the stardic files I converted it into - they work nicely on Golden Dictionary and Color Dictionary, but not on Koreader. If anyone could help with this, I would be extremely grateful. It might be because I have not used Stardic Tools, they wouldn't install on my computer due to some dll missing; or perhaps there's some feature of Koreader that I don't know of. Or, the syn. file could somehow be integrated into the idx. file? |
![]() |
![]() |
![]() |
#3 | |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 955
Karma: 149907
Join Date: Jul 2013
Location: Rotterdam
Device: HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
@didina
I've checked your dictionary with StarDict-Editor. It checks out o.k. When I de- and recompile it via xml (Textual StarDict dictionary) it gives only some small errors. I can decompile to a tabbed text-file, but it gives the error "Error, duplicate word a." when I try to recompile it. Koreader uses sdcv, the commandline version of stardict. When I give the command Code:
sdcv -l --data-dir . So I googled "sdcv syn-files" and got the result that syn support for sdcv is still a feature request. If that doesn't stop you, you could try to edit the tabbed text file generated by StarDict-Editor. It crashes on the recompilation of it, because of the multiple entries for a: interj./prep./prep.: Quote:
If you remove the "end of line | a | tab" between the 1st and second line and the second and third line, you would end up with one entry for 'a'. Now the new error would become "Error, duplicate word abactus." So you could write a script to eliminate the multiple entries and recompile it with StarDict-Editor. However, you would still lose the function provide by the syn-files. Better look at Whitaker's Words here on mobileread before you decide that your dictionary is worth that extra effort. |
|
![]() |
![]() |
![]() |
#4 |
Member
![]() Posts: 13
Karma: 10
Join Date: Nov 2014
Device: android
|
thank you very much. yes, I had kept googling and one of the things that crossed my mind was that the syn. file is the problem, but I wasn't sure since no one else seemed to have worried about it in connection with koreader. another member of the forum has kindly reconverted the bgl file with stardic tools and his version works on my device for the basic forms (just like Whitaker's Words, thanks for the suggestion). https://www.mobileread.com/forums/sho...88#post3121788
the inflected forms are not yet seen, I am still hoping it might be something with my icarus illumina, but the chances are low. I was wondering if it would not be possible to somehow modify the tabulated text version so as to make a separate entry for each inflected form. that would increase greatly the dimensions of the dictionary, but I think it might work. I have the stardic version of rosen's greek forms, which has around 900.000 entries, and that works without any problem. the inflected latin would have about 1.000.000 entries, by analogy it might work as well. Do you think I could do something to the tab txt file, or to the csv file, to make these separate entries? I am sorry, I am a complete ignorant in such stuff, this might a silly question, but I thought I should ask. |
![]() |
![]() |
![]() |
#5 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 955
Karma: 149907
Join Date: Jul 2013
Location: Rotterdam
Device: HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
@didina Just looking at these two entries:
Code:
abactus <b>abactus, abacta, abactum</b><br/><font color="#457820">adj.</font> driven away/off/back; forced to resign (office); restrained by; passed (night); abactus <b>abactus, abactus</b><br/><font color="#457820">n. m.</font> cattle thieving, stealing of cattle, rustling; Code:
abactus<TAB><b>abactus, abacta, abactum</b><br/>adj. driven away/off/back; forced to resign (office); restrained by; passed (night);<EOL> abactus<TAB><b>abactus, abactus</b><br/>n. m. cattle thieving, stealing of cattle, rustling; <EOL> Now two things need doing. First off: remove double entry: Code:
abactus<TAB><b>abactus, abacta, abactum</b><br/>adj. driven away/off/back; forced to resign (office); restrained by; passed (night); b>abactus, abactus</b><br/>n. m. cattle thieving, stealing of cattle, rustling; <EOL> Code:
abactus<TAB><b>abactus, abacta, abactum</b><br/>adj. driven away/off/back; forced to resign (office); restrained by; passed (night); b>abactus, abactus</b><br/>n. m. cattle thieving, stealing of cattle, rustling; <EOL> abactum<TAB><b>abactus, abacta, abactum</b><br/>adj. driven away/off/back; forced to resign (office); restrained by; passed (night); b>abactus, abactus</b><br/>n. m. cattle thieving, stealing of cattle, rustling; <EOL> abacta<TAB><b>abactus, abacta, abactum</b><br/>adj. driven away/off/back; forced to resign (office); restrained by; passed (night); b>abactus, abactus</b><br/>n. m. cattle thieving, stealing of cattle, rustling; <EOL> I seem to remember something like: Code:
abactus<TAB><b>abactus, abacta, abactum</b><br/>adj. driven away/off/back; forced to resign (office); restrained by; passed (night); b>abactus, abactus</b><br/>n. m. cattle thieving, stealing of cattle, rustling; <EOL> abactum<TAB><GOTO>abactus<EOL> abacta<TAB><GOTO>abactus<EOL> However, as koreader is ever developping, I do know that a feature to allow to search in the dictionary popup has been requested. Don't know whether it is implemented yet. That would get rid of the need of a coded <GOTO> and you could just code: Code:
abactus<TAB><b>abactus, abacta, abactum</b><br/>adj. driven away/off/back; forced to resign (office); restrained by; passed (night); b>abactus, abactus</b><br/>n. m. cattle thieving, stealing of cattle, rustling; <EOL> abactum<TAB>see: abactus<EOL> abacta<TAB>see: abactus<EOL> Last edited by Markismus; 06-24-2015 at 04:34 PM. |
![]() |
![]() |
![]() |
#6 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 955
Karma: 149907
Join Date: Jul 2013
Location: Rotterdam
Device: HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
For later reference the link to the stardict file-formats might be handy.
|
![]() |
![]() |
![]() |
#7 |
Member
![]() Posts: 13
Karma: 10
Join Date: Nov 2014
Device: android
|
thank you very much for your time and advice. I finally gave up and I bought a tablet, to use the golden dictionary with my texts. it does not feel alright, the e-ink was so much closer to real paper and less tiring. it would be great if koreader will at some point support syn files, I will continue to check the releases.
|
![]() |
![]() |
![]() |
#8 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,326
Karma: 1077205
Join Date: Jun 2011
Device: Kobo Touch, Sony T1, Kobo Mini
|
Some Onyx ereaders use Android with poor to good sucess. Read Onyx forum to stay current on their progress.
https://www.mobileread.com/forums/forumdisplay.php?f=220 |
![]() |
![]() |
![]() |
#9 |
Member
![]() Posts: 13
Karma: 10
Join Date: Nov 2014
Device: android
|
managed to convert the syn file using penelope. I've attached the result here. https://www.mobileread.com/forums/sho...36#post3128536
|
![]() |
![]() |
![]() |
#10 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 955
Karma: 149907
Join Date: Jul 2013
Location: Rotterdam
Device: HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
So that new option in Penelope is useful! Very nice.
I've checked your dictionary with stardict editor and sdcv (v0.42). Code:
$ sdcv -l --data-dir ./ Dictionary's name Word count Dictionary latininfl -> english 1223585 Code:
$ sdcv --data-dir ./ quaerit Found 1 items, similar to quaerit. -->Dictionary latininfl -> english -->quaerit <b>quaero, quaerere, quaesivi, quaesitus</b><br/><font color="#457820">v.</font> search for, seek, strive for; obtain; ask, inquire, demand; Code:
Verifing dictionary files... Loading index file: /home/markismus/Downloads/stardict-dic/fulllatininflected[1]/latin infl+lewis.idx Wrong order, first word = HS, second word = Habraea Wrong order, first word = IIvirum, second word = Iacchus Wrong order, first word = PRO, second word = Pactolus Wrong order, first word = SS., second word = Sabaeus Wrong order, first word = Zephyrus, second word = a Incorrect number of words: in .ifo file, wordcount=1223585, while the real word count is 1427152 Failed! Code:
$ sdcv --data-dir ./ servus Found 2 items, similar to servus. 0)Dictionary latininfl -> english-->cervus 1)Dictionary latininfl -> english-->nervus Your choice[-1 to abort]: -1 Code:
sdcv --data-dir ./ canis Found 1 items, similar to canis. -->Dictionary latininfl -> english -->canis <b>canus, cana, canum</b><br/><font color="#457820">adj.</font> white, gray; aged, old, wise; hoary; foamy, white-capped; white w/snow/frost; So my conclusions are that you're onto something here, but it's not done yet. I would decompile your dictionary with stardict-editor to a tabbed text-file and compare the results from stardict-editor verify to it, run a lot of words through it to test it and see whether you can find a logic to its imperfect functioning. If we know what's actually wrong, instead of just not-correct translation, we could create an issue on penelope's github page: https://github.com/pettarin/penelope/issues |
![]() |
![]() |
![]() |
#11 |
Member
![]() Posts: 13
Karma: 10
Join Date: Nov 2014
Device: android
|
Apologies for replying late. Unfortunately I am very busy this period, I don't think I would be able to do this checking anyway. I noticed definitions go astray for longer forms, e.g. plus que parfait forms. I guess the problem is rather that one cannot move within the dictionary window from one inflected form (found in latin-english inflected or rosen's greek forms) to the basic dictionary entry in the larger dictionary (lewis for latin and liddle scott for greek). I recall that there was at some point the intention of enabling this feature within the dictionary window. But perhaps it would be too difficult to implement, again, in this technical matters I have very very little knowledge, I am anyway grateful to koreader developers for all they did.
|
![]() |
![]() |
![]() |
#12 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 955
Karma: 149907
Join Date: Jul 2013
Location: Rotterdam
Device: HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
Koreader's chrox just implemented the feature hold to lookup word in dictionary window.
|
![]() |
![]() |
![]() |
#13 |
Member
![]() Posts: 13
Karma: 10
Join Date: Nov 2014
Device: android
|
that is great, I've checked it and it works, and it is really useful for latin. it does not work for ancient greek unfortunately, when tapping on a word inside the dictionary window it just selects a letter instead of the whole word.
|
![]() |
![]() |
![]() |
#14 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,466
Karma: 6900052
Join Date: Dec 2009
Location: The Heart of Texas
Device: Boox Note2, AuraHD, PDA,
|
Quote:
Luck; Ken |
|
![]() |
![]() |
![]() |
#15 |
Member
![]() Posts: 13
Karma: 10
Join Date: Nov 2014
Device: android
|
thank you for your reply, I've checked and it still does select just a letter instead of the whole ancient greek word, even when holding on the word and not just tapping. the selection works when taking directly a word from the text, but not inside the dictionary window.
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
KOreader on Android | Ken Maltby | KOReader | 518 | 01-22-2025 12:36 PM |
KoReader! | Readirect | Kindle Developer's Corner | 2 | 10-31-2014 09:37 AM |
KoReader | Readirect | Kindle Developer's Corner | 1 | 10-29-2014 07:27 AM |
Aura HD Koreader - what's it good for | HarpingOn | Kobo Reader | 4 | 06-21-2014 09:28 PM |
Koreader v. Coolreader | 93terp | Kobo Developer's Corner | 8 | 11-05-2013 04:37 AM |