10-02-2022, 12:03 AM   #125
pzack
Device: kobo sage,elipsa
textx pyglossary conversion

Good evening M.Sarmat89,

As you requested, I have attached two files: newTSV.txt, a section of the final output text for pyglossary, and djvu1txt.txt, a section of the original file djvu.txt.

Both files differ from the ones you already have, because I wanted to give you a reasonably well-matched pair. Please look for the word "affadissant, e" near the beginning of both files.

In the unfolded file, newTSV.txt, that headword is a wee bit harder to find, but it is near the beginning.

Unless I am mistaken, the headwords do not appear at the start of the unfolded lines, but I don't know how pyglossary reads the tab when building the index.
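
To make the question concrete, here is a rough Python sketch of the check I have in mind. It assumes, and this is only my assumption about the tab-separated format, that everything before the first tab on a line is taken as the headword and everything after it as the definition. The function names and the sample lines are made up for illustration; they do not come from pyglossary or from the attached files.

```python
def split_tsv_line(line):
    """Return (headword, definition), or None if the line has no tab.

    Assumption: the text before the FIRST tab is the headword.
    """
    line = line.rstrip("\n")
    if "\t" not in line:
        return None
    headword, definition = line.split("\t", 1)
    return headword, definition


def find_headword(lines, target):
    """Yield (line_number, position) for each line that mentions target.

    position is "headword" when target occurs before the first tab,
    "definition" when it only occurs after it, and "no tab" when the
    line contains no tab at all.
    """
    for number, line in enumerate(lines, start=1):
        if target not in line:
            continue
        parts = split_tsv_line(line)
        if parts is None:
            yield number, "no tab"
        elif target in parts[0]:
            yield number, "headword"
        else:
            yield number, "definition"


# Made-up sample lines, NOT taken from newTSV.txt.
sample = [
    "affadissant, e\tadj. (hypothetical definition text)\n",
    "some earlier text affadissant, e adj. with no tab at all\n",
    "other\tsee affadissant, e\n",
]
results = list(find_headword(sample, "affadissant, e"))
for number, position in results:
    print(number, position)
```

Running the same loop over the lines of the real file, for example with open("newTSV.txt", encoding="utf-8"), would show whether "affadissant, e" sits before the first tab or somewhere later in its line.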

I certainly hope this helps. At least we now have unfolded lines.

Cordially,
pz
Attached Files
File Type: txt newTSV.txt (352.1 KB, 127 views)
File Type: txt djvu1txt.txt (365.8 KB, 116 views)