View Single Post
Old 02-05-2021, 10:00 AM   #1
InMyPocket
Member
InMyPocket can teach chickens to fly.InMyPocket can teach chickens to fly.InMyPocket can teach chickens to fly.InMyPocket can teach chickens to fly.InMyPocket can teach chickens to fly.InMyPocket can teach chickens to fly.InMyPocket can teach chickens to fly.InMyPocket can teach chickens to fly.InMyPocket can teach chickens to fly.InMyPocket can teach chickens to fly.InMyPocket can teach chickens to fly.
 
Posts: 21
Karma: 3620
Join Date: Feb 2021
Device: Pocketbook
Wiktionary for Pocketbook

Hi,

I converted the french wiktionary from this GitHub project: https://github.com/BoboTiG/ebook-reader-dict

This project extract every day dictionaries from Wiktionary in several languages. So, the work I made could easiliy be done for other language too.


The french version I made is base on the 28th of January 2021. It contains more than 1.5 millions word !

It is downloadable here:

https://gofile.io/d/a27MMk

The .dic file must be copied into the "system\dictionaries" folder of your Pocketbook reader. (Note: the "System" folder is hidden)

Any comment and suggestion is welcomed !


------------------

If you want to make the convertion yourself, here are the steps to proceed:


# 1 ## Download the woktionary in the Stardict format:

https://github.com/BoboTiG/ebook-reader-dict

I used the french version:
https://github.com/BoboTiG/ebook-rea...eleases/tag/fr

# 2 ## Convert the Stardict files to a XDXF file using Linguae 0.15.

Download Linguae here:
https://linguae.stalikez.info/

# 3 ## Proceed to some cleanup of the XDXF with the "sed" tools:

1) Change the dictionary name : "Wiktionnaire"
2) Remove colors codes not needed on reader : #([0-9a-fA-F]){6}
3) Add "\n" after each </li> : this truncates long lines to avoid crash of the Pocketbook converter (step 4)
4) Add "#" at the beginning of definitions : a matter of look and lisibility. Change to whatever you like...
5) For french wiktionary, remove <i>(Date ŕ préciser)</i> : this text is useless for readers. Change the text to your language.


I used this light and standalone sed for windows:
https://raw.githubusercontent.com/mb...ed-4.8-x64.exe

and used this command:
sed -e "s/<full_name>.*<\/full_name>/<full_name>Wiktionnaire<\/full_name>/" -e "s/\(<\/li>\)/\1\n/g" -e "s/#\([0-9a-fA-F]\)\{6\}//g" -e "s/<li>/<li># /g" -e "s/<i>(Date .* pr.*ciser)<\/i>//g" dict.xdxf >Wiktionnaire.xdxf

# 4 ## Convert to Pocketbook format:

The official converter can be downloader here:

https://support.pocketbook-int.com/d...yConverter.zip

This version includes only few "locales" files. The "fr" local was not included.

These locales are available in an older version in this post:
https://www.mobileread.com/forums/sh...06&postcount=4

The file is here:
https://www.mobileread.com/forums/at...4&d=1435862948

I used this patched version of the converter which avoid crash with long XDXF content:
https://www.mobileread.com/forums/sh...82&postcount=4



To convert the file from XDXF to Pocketbook format:
converter Wiktionnaire.xdxf fr
=> Create: Wiktionnaire.dic
InMyPocket is offline   Reply With Quote