Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > Workshop

Notices

Closed Thread
 
Thread Tools Search this Thread
Old 09-26-2022, 06:28 PM   #91
pzack
Connoisseur
pzack began at the beginning.
 
Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
Hello, DNSB-David,

Thank you much for responding and for the notepad information.

I did the show all characters that you recommended and I see only CR symbols for return. Nothing other than that.

Any ideas?

Cordially,
pz
pzack is offline  
Old 09-26-2022, 06:30 PM   #92
pzack
Connoisseur
pzack began at the beginning.
 
Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
Good evening M. Sarmat89,

Using the information the responder DNSB kindly gave me I did a show all characters in notepad on the tsv file. No tab indications at all just CR for line return.

Cordially,
pz
pzack is offline  
Old 09-26-2022, 06:46 PM   #93
DNSB
Bibliophagist
DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.
 
DNSB's Avatar
 
Posts: 44,779
Karma: 168765399
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
Quote:
Originally Posted by pzack View Post
Hello, DNSB-David,

Thank you much for responding and for the notepad information.

I did the show all characters that you recommended and I see only CR symbols for return. Nothing other than that.

Any ideas?

Cordially,
pz
This was on your original input file where the head word should have been separated from the pronunciation/definition by a tab character?
DNSB is online now  
Old 09-26-2022, 07:11 PM   #94
pzack
Connoisseur
pzack began at the beginning.
 
Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
Goodevening M. Sarmat89 and DNSB-David,

This message updates what I just sent to both of you this evening.

There are,in fact, some tabs but only scattered throughout. I searched the tabbed word in the dictionary in koreader. The word is found, however, only the line containing the headword and whatever part of the definition on that line is shown. The lines of definition after the tabbed headword line are not shown.

Nevertheless, this is progress and I think that we are on the right track.

The question is why certain head words are tabbed and the majority not tabbed and how to get the entire definition read with its headword.

Your help so far has begun to pay-off, thankyou.

Very cordially,
pz
pzack is offline  
Old 09-26-2022, 08:27 PM   #95
Sarmat89
Fanatic
Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.
 
Posts: 515
Karma: 2268308
Join Date: Nov 2015
Device: none
Please post somr lines that don't have tabs in them.
Sarmat89 is offline  
Old 09-27-2022, 12:41 AM   #96
Comfy.n
want to learn what I want
Comfy.n ought to be getting tired of karma fortunes by now.Comfy.n ought to be getting tired of karma fortunes by now.Comfy.n ought to be getting tired of karma fortunes by now.Comfy.n ought to be getting tired of karma fortunes by now.Comfy.n ought to be getting tired of karma fortunes by now.Comfy.n ought to be getting tired of karma fortunes by now.Comfy.n ought to be getting tired of karma fortunes by now.Comfy.n ought to be getting tired of karma fortunes by now.Comfy.n ought to be getting tired of karma fortunes by now.Comfy.n ought to be getting tired of karma fortunes by now.Comfy.n ought to be getting tired of karma fortunes by now.
 
Posts: 1,595
Karma: 7463599
Join Date: Sep 2020
Device: none
There's a world of dictionaries in e-book format waiting to be converted for electronic use, that's so lamentable
Comfy.n is offline  
Old 09-27-2022, 11:32 AM   #97
pzack
Connoisseur
pzack began at the beginning.
 
Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
Pyglossary conversion to stardict; tabbed and untabbed lines

Hello, M. Sarmat89,

Thank you for your message. I have to further revise what I wrote last night; some of the tabbed headwords are not found in the stardict word search. I assumed that all tabbed words would be found but this is not the case.

I have inclosed some lines which include two words, colorisation and coloristique which do show tabs and are found. Of two other included headwords tabbed, "lustre" is not found while "bagagiste" is found.

As stated before, only the line containing the headword is shown in stardict-koreader.

Cordially,
pz
Attached Files
File Type: txt somelines.txt (4.7 KB, 112 views)
pzack is offline  
Old 09-27-2022, 12:17 PM   #98
Sarmat89
Fanatic
Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.
 
Posts: 515
Karma: 2268308
Join Date: Nov 2015
Device: none
Process the file which you got now with
Code:
perl -pe "s:^ +::"
, and redo all the steps with the result.
Sarmat89 is offline  
Old 09-27-2022, 04:35 PM   #99
pzack
Connoisseur
pzack began at the beginning.
 
Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
Good afternoon, M. Sarmat89,

Thank you for your response and the code.

I don't know how to apply this line of code to the tsv file. I imagine that the file name would appear somewhere in the perl line of code.

Also, just to confirm, we are using the tsv file generated from the previous codes?

cordially,
pz
pzack is offline  
Old 09-27-2022, 04:41 PM   #100
Sarmat89
Fanatic
Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.
 
Posts: 515
Karma: 2268308
Join Date: Nov 2015
Device: none
Yes. To pass data to a perl script, use <filename; to save the output, use >filename.
Sarmat89 is offline  
Old 09-27-2022, 06:54 PM   #101
pzack
Connoisseur
pzack began at the beginning.
 
Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
Hello, Sarmat89,

I am sorry,but, that is still a wee bit confusing to me. May I trouble you to give me the line of code that incorpôrates my file(yourfile). I do not know where to place it so it would simply make things easier for me, a novice with Perl, to see the full syntaxe with the file included.

I am not sure if I place it at the end of the line and with what sign.

Am I passing data or receiving data or both at the same time?

Cordially,
pz
pzack is offline  
Old 09-27-2022, 07:39 PM   #102
pzack
Connoisseur
pzack began at the beginning.
 
Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
Good evening Sarmat89,

I looked at your other perl line code so I think it is a safe bet to follow what you did there; the file goes at the end of the line.

Please, then, ignore the other message just sent and thank you again for your help.

I'll see if I can do this tonight and will let you know the results.

Very cordially,
pz
pzack is offline  
Old 09-28-2022, 11:50 AM   #103
pzack
Connoisseur
pzack began at the beginning.
 
Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
Hello M. Sarmat89,

I used your line of code from post 98 on the tsv file created from last time then took the resulting file and ran it with the four lines of perl code. The file built from that was used with your line of code post 63. The file created from that code was used with pyglossary.

This time, pyglossary took another 200 seconds longer to build the stardict files.

There were still "[error] Warning: line staring with'....' has no tab" warnings and the index file had that question mark.

I didn't see any difference when I loaded the dictIonary into koreader from the previous files built. There are still a great amount of headwords without tabs and if a headword is found, only the line with the tabbed headword is shown.

It seems, with just a cursory check on the file, that the same headwords are being found from last time.

Cordially,
pz

Last edited by pzack; 09-28-2022 at 12:19 PM.
pzack is offline  
Old 09-28-2022, 03:51 PM   #104
Sarmat89
Fanatic
Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.
 
Posts: 515
Karma: 2268308
Join Date: Nov 2015
Device: none
Please post some lines that have no tabulation.
Sarmat89 is offline  
Old 09-28-2022, 08:20 PM   #105
pzack
Connoisseur
pzack began at the beginning.
 
Posts: 79
Karma: 10
Join Date: Aug 2022
Device: kobo sage,elipsa
Pyglossary conversion to stardict; tabbed and untabbed lines

Good evening, M Sarmat89

Thank you for your message. I have attached some untabbed headwords.

Cordially,

pz
Attached Files
File Type: txt somemore-lines.txt (1,008 Bytes, 104 views)
pzack is offline  
Closed Thread

Tags
pyglossary


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
PDF to PDF conversion causes all the text to be aligned to the left Swifty4635 Conversion 1 01-16-2022 10:17 PM
Desktop App How do I run PyGlossary on Windows ? Bilingual Kobo Reader 2 07-12-2020 01:54 PM
epub 2 PDF conversion with OCR in PDF possible? hobi2000 Conversion 2 03-25-2019 03:20 AM
PDF conversion keeping pdf page highstream Conversion 3 05-31-2016 11:46 AM
PDF to PDF conversion creates much larger file? rocketcat Conversion 11 09-30-2011 07:37 PM


All times are GMT -4. The time now is 11:28 AM.


MobileRead.com is a privately owned, operated and funded community.