Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Readers > PocketBook > PocketBook Developer's Corner

Notices

Reply
 
Thread Tools Search this Thread
Old 08-12-2020, 02:09 PM   #46
Sakke
Junior Member
Sakke began at the beginning.
 
Posts: 4
Karma: 10
Join Date: May 2019
Device: PocketBook Inkpad 3
Hi,

I noticed that there has been some kind offers to convert dictionary files to .dic. I'm using Mac and it seems like a big thing to achieve this conversion, so please, if anyone could convert the attached .dict to .dic so I could read more comfortably in Swedish and learn the language.

Thanks.
Attached Files
File Type: zip swe-fin.dict.zip (323.2 KB, 315 views)
Sakke is offline   Reply With Quote
Old 08-27-2020, 10:22 AM   #47
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 895
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
Quote:
Originally Posted by lplazas View Post
Hi guys, first of all thanks for sharing all your knowledge around this topic. I've used the dictionary below in the past and I'm trying to get it into my new pocketbook. I'm not really tech savvy so I must say I'm lost at reading through must of the things discussed in this thread. I would appreciate your help guiding me on how to convert this dictionary to .dic (I'm a mac user if it helps). TIA

http://download.huzheng.org/bigdict/...-2.4.2.tar.bz2
The result is here.

Last edited by Markismus; 11-14-2020 at 06:02 AM.
Markismus is offline   Reply With Quote
Old 08-27-2020, 10:35 AM   #48
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 895
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
Quote:
Originally Posted by firewalker View Post
I managed to convert it but it doesn't seem to work. It always says "Translation not found".
I've ran it, too. The result is here. Hope it helps. I used pyglossary for babylon->stardict and then the script with the toggle
Code:
my $Just4PocketBook = 1;
Markismus is offline   Reply With Quote
Old 08-27-2020, 10:54 AM   #49
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 895
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
@Sakke The dictfmt format is new to me and Lingae, Penelope nor PyGlossary are able to do much with it. Try getting the dictionary in another format and I'll give it another go.
Code:
00-database-dictfmt-1.12.1
Svenska-suomi FreeDict+WikDict dictionary

Maintainer: Karl Bartel

Edition: 2020.02.08
Size: 12080 headwords

Publisher: Karl Bartel.


Availability:

    Licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported
  license

Publication date: 2020-02-08.

Notes:

 * Database Status: big enough to be useful


Source(s):

  Automatic creation of this bilingual dictionary by WikDict
  [http://www.wikdict.com/]. Base data from Wiktionary.org
  [https://www.wiktionary.org/] via , DBnary
  [http://kaiko.getalp.org/about-dbnary/].

The Project:

  This dictionary comes to you through nice people making it available for
  free and for good. It is part of the FreeDict project,
  http://www.freedict.org/. This project aims to make available many
  translating dictionaries for free. Your contributions are welcome!

Changelog:

 * 2018-09-12 Karl Bartel:
   Change numbering scheme from YYYY-MM-DD to YYYY.MM.DD
 * 2018-05-15 Karl Bartel:
   Add links to sourceDesc
 * 2017-12-15 Karl Bartel:
   Add changelog
 * 2017-11-22 Karl Bartel:
   Use ref tag to encode license URL
Svenska-suomi FreeDict+WikDict dictionary ver. 2020.02.08
http://www.wikdict.com/
-are /ˈarə/ <affix>
1. -mpi, -mmin
adjektiv- och adverbkomparator
2. -lainen, -läinen
invånare
3. -ja, -jä
person som utför något
-bar /bˈɑːr/ <affix>
-tava, -kelpoinen
möjlig att utsätta för verbhandlingen
-gon /ɡˈɔn/ <affix>
Markismus is offline   Reply With Quote
Old 09-12-2020, 06:37 AM   #50
ichnilatis
Groupie
ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.
 
Posts: 169
Karma: 1686
Join Date: Jul 2020
Location: Greece
Device: Pocketbook Touch Lux 5
Babylon dictionary for KOReader

I would like to ask if there is anywhere the Babylon English-Greek dictionary in .dict format for KOReader. I have installed the one that Markismus converted for the Pocketbook and it satisfies me a lot.

Unfortunately, I do not succeed in scripts and convertions.

@Markismus: Thank you very much for your work!
ichnilatis is offline   Reply With Quote
Old 09-14-2020, 10:37 AM   #51
Photonoxx
Junior Member
Photonoxx began at the beginning.
 
Posts: 3
Karma: 10
Join Date: May 2020
Device: Vivlio Touch HD plus=PocketBook touch HD 3=P632
Hi,

As I am on Ubuntu 20.04, Linguae or Stardict tools are no more available. So I guess your script can't works in this situation.

I test Koreader on my PB632 with vivlio (french vendor of Pocketbook device), but it was terribly slow for some things (pinch to zoom for example takes many second to refresh display) and was not usable (may be linked to the vivlio FW ?).

I tried to convert a Stardict dictionary I have (Larousse Chambers eng-fra) with penelope, but I get a failure when it try to output xml file (it works with csv output), may be I missed an argument ?

Code:
penelope -d -i Larousse_Chambers_francais_anglais.zip -j stardict -f en -t fr -p xml -o Larousse_Chambers_francais_anglais.xml
[DEBU] Running with the command line arguments:
Namespace(apply_css=None, author='Penelope', bookeen_collation_function=None, bookeen_install_file=False, copyright='GNU GPL v3', cover_path=None, csv_fs=',', csv_ignore_first_line=False, csv_ls='\n', debug=True, description='Dictionary en to fr', dictzip_path=None, email='penelopedictionaryconverter@gmail.com', epub_no_compress=False, escape_strings=False, flatten_synonyms=False, group_by_prefix_function=None, group_by_prefix_length=2, group_by_prefix_merge_across_first=False, group_by_prefix_merge_min_size=0, identifier='c3925750f315433d81e1240cc50955e3', ignore_case=False, ignore_synonyms=False, include_index_page=False, input_file='Larousse_Chambers_francais_anglais.zip', input_file_encoding='utf-8', input_format='stardict', input_parser=None, keep=False, kindlegen_path=None, language_from='en', language_to='fr', license='GNU GPL v3', marisa_bin_path=None, marisa_index_size=1000000, merge_definitions=False, merge_separator=' | ', mobi_no_kindlegen=False, no_definitions=False, output_file='Larousse_Chambers_francais_anglais.xml', output_format='xml', sd_ignore_sametypesequence=False, sd_no_dictzip=False, sort_after=False, sort_before=False, sort_by_definition=False, sort_by_headword=False, sort_ignore_case=False, sort_reverse=False, title='Dictionary en to fr', version=False, website='https://goo.gl/EB5XSR', year='2020')
[INFO] Reading input file(s)...
[DEBU] Reading from file 'Larousse_Chambers_francais_anglais.zip'...
[DEBU] Working in temp dir '/tmp/tmpy26_xpeo'
[DEBU] Extracted /tmp/tmpy26_xpeo/d.ifo
[DEBU] Extracted /tmp/tmpy26_xpeo/d.idx
[DEBU] Extracted /tmp/tmpy26_xpeo/d.dict.dz
[DEBU] Uncompressed /tmp/tmpy26_xpeo/d.dict
[DEBU] Extracted /tmp/tmpy26_xpeo/d.syn
[DEBU] Read .ifo file with values:
{'version': '2.4.2', 'wordcount': 48978, 'synwordcount': 275022, 'idxfilesize': 999773, 'bookname': 'Larousse Chambers français-anglais', 'author': 'Informatique Editoriale Larousse', 'email': 'linguistic-support@babylon.com', 'description': 'Le plus pédagogique et le plus accessible des grands dictionnaires, dans une nouvelle édition moderne et complète : 250 000 mots et expressions, 450 000 traductions, le vocabulaire spécialisé le plus récent.', 'sametypesequence': 'h'}
[DEBU] The input StarDict file contains a .syn file, parsing it...
[DEBU] The input StarDict file contains a .syn file, parsing it... done
[DEBU] Deleted temp dir '/tmp/tmpy26_xpeo'
[DEBU] Reading from file 'Larousse_Chambers_francais_anglais.zip'... success
[INFO] Reading input file(s)... done
[INFO] Writing output file(s)...
[DEBU] Creating XML tree...
[ERRO] Writing to file 'Larousse_Chambers_francais_anglais.xml'... failure
[ERRO] Unable to write the output file(s)
Additionally, PyGlossary don't manage to write xdxf file, so I don't know what to do now.

So if anybody had an idea or could help me, it would be very appreciated.

Thanks in advance and the hell of proprietary and undocumented file's formats….

Nicolas
Photonoxx is offline   Reply With Quote
Old 09-15-2020, 06:33 AM   #52
Photonoxx
Junior Member
Photonoxx began at the beginning.
 
Posts: 3
Karma: 10
Join Date: May 2020
Device: Vivlio Touch HD plus=PocketBook touch HD 3=P632
Finally I see your script accept CSV dictionary too, and I can convert Stardict to CSV with penelope and CSV to Pocketbook dic with your script.

Thanks for your work.
Photonoxx is offline   Reply With Quote
Old 09-15-2020, 05:22 PM   #53
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 895
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
@ichnilatis I've uploaded the dictionary generated with the settings:
Code:
# Shortcuts to Collection of settings.
my $Just4Koreader = 1;
my $Just4PocketBook = 0;
@Photonoxx Nice that you like it.
Just because Ubuntu doesn't maintain those packages anymore, doesn't mean that they won't run on Ubuntu. The code and install files of huzheng001 are hosted on his github page. You can decompress the tools from the tar-file, install them following the directions from INSTALL and use them to either generate the xml directly or use my script to start from the triplet files (dict-idx-ifo).

Last edited by Markismus; 09-15-2020 at 05:26 PM.
Markismus is offline   Reply With Quote
Old 09-16-2020, 03:45 AM   #54
eMole
Zealot
eMole ought to be getting tired of karma fortunes by now.eMole ought to be getting tired of karma fortunes by now.eMole ought to be getting tired of karma fortunes by now.eMole ought to be getting tired of karma fortunes by now.eMole ought to be getting tired of karma fortunes by now.eMole ought to be getting tired of karma fortunes by now.eMole ought to be getting tired of karma fortunes by now.eMole ought to be getting tired of karma fortunes by now.eMole ought to be getting tired of karma fortunes by now.eMole ought to be getting tired of karma fortunes by now.eMole ought to be getting tired of karma fortunes by now.
 
Posts: 128
Karma: 546310
Join Date: Jun 2011
Device: PocketBook Touch HD, PocketBook Era
And Linguae should run on Ubuntu, download the source and wrestle with python 2. Good luck to it.
eMole is offline   Reply With Quote
Old 09-24-2020, 06:49 AM   #55
ichnilatis
Groupie
ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.
 
Posts: 169
Karma: 1686
Join Date: Jul 2020
Location: Greece
Device: Pocketbook Touch Lux 5
Quote:
Originally Posted by Markismus View Post
@ichnilatis I've uploaded the dictionary generated with the settings:
Code:
# Shortcuts to Collection of settings.
my $Just4Koreader = 1;
my $Just4PocketBook = 0;
Markismus, thank you a lot!

Sorry for my ignorance of the scripts, but what should I do with the code?

Thanks again!
ichnilatis is offline   Reply With Quote
Old 09-24-2020, 04:07 PM   #56
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 895
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
I’ve just wanted to illustrate how easy the script has become—well, after installing it properly, of course.

Apparently, windows 10 now has a fully fledged Linux subsystem. So best way to use the script is going that route.
Markismus is offline   Reply With Quote
Old 09-28-2020, 02:15 PM   #57
Sakke
Junior Member
Sakke began at the beginning.
 
Posts: 4
Karma: 10
Join Date: May 2019
Device: PocketBook Inkpad 3
Quote:
Originally Posted by Markismus View Post
@Sakke The dictfmt format is new to me and Lingae, Penelope nor PyGlossary are able to do much with it. Try getting the dictionary in another format and I'll give it another go.
Thank you so much.

I have found another form, .tei. Would this help? The folder it came in is attached, and the dictionary itself seems to be the .tei file.
Attached Files
File Type: zip swe-fin 3.zip (419.3 KB, 288 views)
Sakke is offline   Reply With Quote
Old 10-04-2020, 05:52 AM   #58
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 895
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
From their webpage:
Quote:
The databases are available in TEI XML. From there we convert to different dictionary lookup application formats, eg. we support

* dictd, a server implementation of the DICT protocol * Evolutionary Dictioanry, an application for Windows CE * zbedic, an application for the Sharp Zaurus PDA

More platforms are planned to be supported
So it's an initiative that doesn't support much formats, nor does it have a structurally easily converted format:
Code:
 <entry>
      <form>
        <orth>nervsystem</orth>
      </form>
      <gramGrp>
        <pos>n</pos>
      </gramGrp>
      <sense>
        <cit type="trans" xml:lang="fi">
          <quote>hermosto</quote>
        </cit>
        <sense>
          <def>system av nervceller i djur</def>
        </sense>
      </sense>
    </entry>
You would have to write a converter function. I recommend using a dictionary of which you know both languages, so that you can decided what's what. E.g. I have no idea what this stands for:
Code:
<quote>hermosto</quote>
As you can't get the script to run on your own, I guess you reached the end of that initiative.

Last edited by Markismus; 10-04-2020 at 05:57 AM.
Markismus is offline   Reply With Quote
Old 11-09-2020, 04:56 AM   #59
ichnilatis
Groupie
ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.ichnilatis once ate a cherry pie in a record 7 seconds.
 
Posts: 169
Karma: 1686
Join Date: Jul 2020
Location: Greece
Device: Pocketbook Touch Lux 5
Hi,

I have the greek-greek dictionary "Papyros" for the GoldenDict and I was wondering if it could be converted to dic format (for pocketbook) or even to dict format (for KoReader).

You can download the "Papyros" files from here

Thank you!
ichnilatis is offline   Reply With Quote
Old 11-09-2020, 08:59 AM   #60
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 895
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
No idea what format it is. Looking at the homepage of GoldenDict, it should be Abbyy. It tried Abbyy Lingvo in pyglossary which resulted in:
Code:
[DEBUG] readOptions: {}
[DEBUG] writeOptions: {}
[INFO] Encoding of DSL file is not utf-8
[INFO] Encoding of DSL file detected: utf-16
[ERROR] unknown language '\t"Greek"'

[WARNING] Full sort enabled, falling back to indirect mode
[CRITICAL] Traceback (most recent call last):
  File "/usr/lib/python3.8/site-packages/pyglossary/plugins/babylon_bgl/bgl_reader.py", line 726, in __iter__
    raise StopIteration
StopIteration

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/usr/lib/python3.8/site-packages/pyglossary/ui/ui_gtk.py", line 995, in convertClicked
    finalOutputFile = self.glos.convert(
  File "/usr/lib/python3.8/site-packages/pyglossary/glossary.py", line 1264, in convert
    finalOutputFile = self.write(
  File "/usr/lib/python3.8/site-packages/pyglossary/glossary.py", line 1093, in write
    self._inactivateDirectMode()
  File "/usr/lib/python3.8/site-packages/pyglossary/glossary.py", line 865, in _inactivateDirectMode
    self.loadReader(reader)
  File "/usr/lib/python3.8/site-packages/pyglossary/glossary.py", line 842, in loadReader
    for index, entry in enumerate(self._applyEntryFiltersGen(reader)):
  File "/usr/lib/python3.8/site-packages/pyglossary/glossary.py", line 439, in _applyEntryFiltersGen
    for index, entry in enumerate(gen):
RuntimeError: generator raised StopIteration
Traceback locals:
    self = <pyglossary.glossary.Glossary object at 0x7f2879c895b0>
    gen = <babylon_bgl.bgl_reader.BglReader object at 0x7f287a2a5370>

Last edited by Markismus; 11-09-2020 at 09:03 AM.
Markismus is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Pocketbook dictionary logan PocketBook 322 03-05-2024 09:48 AM
Dictionary coversion from .mobi to pocketbook format? doctorat PocketBook 16 07-01-2020 05:34 PM
Webster's 1913 Dictionary in Pocketbook Format luqmaninbmore PocketBook 8 05-27-2020 10:41 AM
SW>EN Dictionary for Pocketbook tttrine PocketBook 3 06-09-2015 06:01 AM


All times are GMT -4. The time now is 02:13 AM.


MobileRead.com is a privately owned, operated and funded community.