View Single Post
Old 07-12-2025, 04:49 AM   #1
Moonbase59
Addict
Moonbase59 ought to be getting tired of karma fortunes by now.Moonbase59 ought to be getting tired of karma fortunes by now.Moonbase59 ought to be getting tired of karma fortunes by now.Moonbase59 ought to be getting tired of karma fortunes by now.Moonbase59 ought to be getting tired of karma fortunes by now.Moonbase59 ought to be getting tired of karma fortunes by now.Moonbase59 ought to be getting tired of karma fortunes by now.Moonbase59 ought to be getting tired of karma fortunes by now.Moonbase59 ought to be getting tired of karma fortunes by now.Moonbase59 ought to be getting tired of karma fortunes by now.Moonbase59 ought to be getting tired of karma fortunes by now.
 
Moonbase59's Avatar
 
Posts: 223
Karma: 1000244
Join Date: Oct 2021
Location: Germany
Device: Tolino Vision 5, Tolino Tab 8", Pocketbook Era (16GB)
New single-source German hyphenation — please test!

German readers: Parallel thread at E-Reader Forum

I’m working on new single-source German hyphenation patterns for Linux and many types of e-readers, including Kobo. These are based on a ~600,000 word text corpus and generate patterns for the German reformed spelling (1996/2006).

Since I don’t own a Kobo, I cannot test this, and ask for your help in testing, please.

I’d be interested in:
  • Does it work at all?
  • Can the Kobo software use UTF-8 hyphenation dictionaries?
  • Are special word boundary cases recognized and handled correctly, like shown in the screenshots?

I tried to prepare a Kobo-compatible file. As far as I have read, you’ll have to:
  • Unpack the ZIP archive.
  • Connect the Kobo via USB.
  • Copy the unpacked KoboRoot.tgz into the .kobo folder on the device.
  • Safely eject the device.
  • The Kobo reader should then install the new German hyphenation dictionary.

Here is one of the books I used for testing: Hans Dominik - Atlantis. It has real crazy, manually constructed "ellpises" at word endings, constructed like (NNBSP)(.)(NNBSP)(.)(NNBSP)(.).

Screenshots: Left—bad hyphenation; Right—new version hyph_de.dic
Attached Thumbnails
Click image for larger version

Name:	scr0020.png
Views:	43
Size:	161.4 KB
ID:	216857   Click image for larger version

Name:	scr0019.png
Views:	32
Size:	159.3 KB
ID:	216858  
Attached Files
File Type: zip Moonbase59-hyph_de_DE.zip (175.1 KB, 15 views)

Last edited by Moonbase59; 07-12-2025 at 05:09 AM.
Moonbase59 is offline   Reply With Quote