MobileRead Forums - View Single Post - dictutil: Tools, documentation, and libraries related to Kobo dictionaries

geek1011 · 03-05-2020, 06:38 PM

dictutil
Tools, documentation, and libraries related to Kobo dictionaries (and a few converted ones).

This project contains a collection of tools and libraries to work with Kobo dictionaries, plus comprehensive documentation of Kobo's dictionary format.

Unlike previous attempts at working with Kobo dictionaries, dictutil fully supports every feature that nickel supports (word prefixes, Unicode, variants, images, etc.). It focuses on simplicity, correctness (prefix generation and other features are tested directly against libnickel's code and regexps, and v1 and v2 dictionaries are differentiated), and completeness (most of the research was done by reverse-engineering libnickel).

In addition, it provides a custom format for creating Kobo dictionaries, with a simple syntax and full support for all features.

Dictutil consists of multiple tools and libraries:

dictutil provides commands for installing, removing, unpacking, packing, and performing low-level modifications and tests on Kobo dictionaries. All operations are intended to be correct, lossless, and deterministic.
dictgen simplifies creating full-featured dictionaries for Kobo eReaders, with support for images, Unicode prefixes, raw HTML, Markdown, and more.
dicthtml documents Kobo's dictionary format and how it works.
examples/gotdict-convert is a working example of using dictutil to convert GOTDict into a Kobo dictionary.
examples/webster1913-convert is a working example of using dictutil to convert Project Gutenberg's Webster's Unabridged Dictionary into a Kobo dictionary.
examples/dictzip-decompile is an experimental tool to convert a dictzip into a dictfile.
examples/bgl-convert is a simple tool to convert Babylon BGL dictionaries to a dictfile.
Library: kobodict provides support for reading, writing, encrypting, and decrypting Kobo dictionaries.
Library: dictgen provides the functionality of dictgen as a library.
Library: marisa provides a simplified self-contained CGO wrapper for marisa-trie.
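
To give a feel for what "deterministic" can mean for a packing step, here is a minimal sketch in Go using only the standard library. It is not dictutil's actual code. The helper name `packDeterministic` and the placeholder file names are hypothetical, and the real dictionary layout is more involved. The idea is just that sorting the entries and not embedding timestamps makes the same input produce byte-identical output.

```go
package main

import (
	"archive/zip"
	"bytes"
	"crypto/sha256"
	"fmt"
	"sort"
)

// packDeterministic writes files into a zip archive so that the same input
// yields byte-identical output. Hypothetical helper for illustration only;
// it is not part of dictutil.
func packDeterministic(files map[string][]byte) ([]byte, error) {
	// Go randomizes map iteration order, so sort the names first.
	names := make([]string, 0, len(files))
	for name := range files {
		names = append(names, name)
	}
	sort.Strings(names)

	var buf bytes.Buffer
	zw := zip.NewWriter(&buf)
	for _, name := range names {
		// Leaving the header's Modified time unset keeps the current
		// time out of the archive.
		w, err := zw.CreateHeader(&zip.FileHeader{Name: name, Method: zip.Deflate})
		if err != nil {
			return nil, err
		}
		if _, err := w.Write(files[name]); err != nil {
			return nil, err
		}
	}
	if err := zw.Close(); err != nil {
		return nil, err
	}
	return buf.Bytes(), nil
}

func main() {
	// Placeholder contents, not a real dictionary layout.
	files := map[string][]byte{
		"b.txt": []byte("second file"),
		"a.txt": []byte("first file"),
	}
	first, err := packDeterministic(files)
	if err != nil {
		panic(err)
	}
	second, err := packDeterministic(files)
	if err != nil {
		panic(err)
	}
	// Print a digest of each run and whether the two archives match.
	fmt.Printf("%x\n%x\n", sha256.Sum256(first), sha256.Sum256(second))
	fmt.Println(bytes.Equal(first, second))
}
```

Running it packs the same input twice and prints a digest for each archive, followed by whether the two archives are byte-for-byte equal.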

Dictutil implements version 2 of the Kobo dictionary format, which is supported by firmware versions 4.7.10364 and later.

See the website for more details and examples.

Quick reference (one collapsed section per entry in the original post):

dictgen
dictgen dictfile format
dictutil
dictutil install
dictutil uninstall
dictutil pack
dictutil unpack
dictutil prefix
gotdict-convert
webster1913-convert
dictzip-decompile
Download | Website