![]() |
#91 |
Connoisseur
![]() Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
|
Hello, DNSB-David,
Thank you much for responding and for the notepad information. I did the show all characters that you recommended and I see only CR symbols for return. Nothing other than that. Any ideas? Cordially, pz |
![]() |
![]() |
#92 |
Connoisseur
![]() Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
|
Good evening M. Sarmat89,
Using the information the responder DNSB kindly gave me I did a show all characters in notepad on the tsv file. No tab indications at all just CR for line return. Cordially, pz |
![]() |
Advert | |
|
![]() |
#93 |
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 24,369
Karma: 111805467
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Forma, Clara HD, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
This was on your original input file where the head word should have been separated from the pronunciation/definition by a tab character?
|
![]() |
![]() |
#94 |
Connoisseur
![]() Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
|
Goodevening M. Sarmat89 and DNSB-David,
This message updates what I just sent to both of you this evening. There are,in fact, some tabs but only scattered throughout. I searched the tabbed word in the dictionary in koreader. The word is found, however, only the line containing the headword and whatever part of the definition on that line is shown. The lines of definition after the tabbed headword line are not shown. Nevertheless, this is progress and I think that we are on the right track. The question is why certain head words are tabbed and the majority not tabbed and how to get the entire definition read with its headword. Your help so far has begun to pay-off, thankyou. Very cordially, pz |
![]() |
![]() |
#95 |
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 431
Karma: 2146264
Join Date: Nov 2015
Device: none
|
Please post somr lines that don't have tabs in them.
|
![]() |
Advert | |
|
![]() |
#96 |
Addict
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 372
Karma: 2636542
Join Date: Sep 2020
Device: Calibre E-book viewer/ PW3
|
There's a world of dictionaries in e-book format waiting to be converted for electronic use, that's so lamentable
![]() |
![]() |
![]() |
#97 |
Connoisseur
![]() Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
|
Pyglossary conversion to stardict; tabbed and untabbed lines
Hello, M. Sarmat89,
Thank you for your message. I have to further revise what I wrote last night; some of the tabbed headwords are not found in the stardict word search. I assumed that all tabbed words would be found but this is not the case. I have inclosed some lines which include two words, colorisation and coloristique which do show tabs and are found. Of two other included headwords tabbed, "lustre" is not found while "bagagiste" is found. As stated before, only the line containing the headword is shown in stardict-koreader. Cordially, pz |
![]() |
![]() |
#98 |
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 431
Karma: 2146264
Join Date: Nov 2015
Device: none
|
Process the file which you got now with
Code:
perl -pe "s:^ +::" |
![]() |
![]() |
#99 |
Connoisseur
![]() Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
|
Good afternoon, M. Sarmat89,
Thank you for your response and the code. I don't know how to apply this line of code to the tsv file. I imagine that the file name would appear somewhere in the perl line of code. Also, just to confirm, we are using the tsv file generated from the previous codes? cordially, pz |
![]() |
![]() |
#100 |
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 431
Karma: 2146264
Join Date: Nov 2015
Device: none
|
Yes. To pass data to a perl script, use <filename; to save the output, use >filename.
|
![]() |
![]() |
#101 |
Connoisseur
![]() Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
|
Hello, Sarmat89,
I am sorry,but, that is still a wee bit confusing to me. May I trouble you to give me the line of code that incorpôrates my file(yourfile). I do not know where to place it so it would simply make things easier for me, a novice with Perl, to see the full syntaxe with the file included. I am not sure if I place it at the end of the line and with what sign. Am I passing data or receiving data or both at the same time? Cordially, pz |
![]() |
![]() |
#102 |
Connoisseur
![]() Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
|
Good evening Sarmat89,
I looked at your other perl line code so I think it is a safe bet to follow what you did there; the file goes at the end of the line. Please, then, ignore the other message just sent and thank you again for your help. I'll see if I can do this tonight and will let you know the results. Very cordially, pz |
![]() |
![]() |
#103 |
Connoisseur
![]() Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
|
Hello M. Sarmat89,
I used your line of code from post 98 on the tsv file created from last time then took the resulting file and ran it with the four lines of perl code. The file built from that was used with your line of code post 63. The file created from that code was used with pyglossary. This time, pyglossary took another 200 seconds longer to build the stardict files. There were still "[error] Warning: line staring with'....' has no tab" warnings and the index file had that question mark. I didn't see any difference when I loaded the dictIonary into koreader from the previous files built. There are still a great amount of headwords without tabs and if a headword is found, only the line with the tabbed headword is shown. It seems, with just a cursory check on the file, that the same headwords are being found from last time. Cordially, pz Last edited by pzack; 09-28-2022 at 12:19 PM. |
![]() |
![]() |
#104 |
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 431
Karma: 2146264
Join Date: Nov 2015
Device: none
|
Please post some lines that have no tabulation.
|
![]() |
![]() |
#105 |
Connoisseur
![]() Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
|
Pyglossary conversion to stardict; tabbed and untabbed lines
Good evening, M Sarmat89
Thank you for your message. I have attached some untabbed headwords. Cordially, pz |
![]() |
![]() |
Tags |
pyglossary |
Thread Tools | Search this Thread |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
PDF to PDF conversion causes all the text to be aligned to the left | Swifty4635 | Conversion | 1 | 01-16-2022 10:17 PM |
Desktop App How do I run PyGlossary on Windows ? | Bilingual | Kobo Reader | 2 | 07-12-2020 01:54 PM |
epub 2 PDF conversion with OCR in PDF possible? | hobi2000 | Conversion | 2 | 03-25-2019 03:20 AM |
PDF conversion keeping pdf page | highstream | Conversion | 3 | 05-31-2016 11:46 AM |
PDF to PDF conversion creates much larger file? | rocketcat | Conversion | 11 | 09-30-2011 07:37 PM |