02-05-2020, 02:46 PM | #286 |
Guru
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
Yes, those are indeed the wrong ones.
Sorry, my kid and me were having a sick day. I'll look into it later. Last edited by Markismus; 02-05-2020 at 02:50 PM. |
02-05-2020, 03:17 PM | #287 |
Enthusiast
Posts: 34
Karma: 510
Join Date: Feb 2016
Device: Kobo
|
|
Advert | |
|
02-05-2020, 03:19 PM | #288 |
Enthusiast
Posts: 34
Karma: 510
Join Date: Feb 2016
Device: Kobo
|
|
02-05-2020, 04:47 PM | #289 |
Guru
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
Allright, I just wrote the conversion from xdxf-format to Stardict text-format. Now I can generate the Stardict files without having to use Linguae. (I returned my pocketbook Inkpad 3 Pro, so I am using koreader on my Kobo Aura H2O again.)
Here is the link to the dictionaries. All other links now point to them, too. Tested in koreader: Last edited by Markismus; 02-06-2020 at 04:24 PM. |
02-05-2020, 05:43 PM | #290 |
Groupie
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
|
Uhm, appreciate, but still no joy. The stock Dictionary.app displays the codes literally, inside pbreader they are simply dropped. In CR3 I can't open the dictionary anymore (not necessarily because of this file).
|
Advert | |
|
02-05-2020, 05:52 PM | #291 |
Guru
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
Seems that your dictionary app can't handle html entities at all.
The last thing you could do is convert the numerical symbols to characters and expect modern day apps to be able to handle unicode directly. When I posted the DOCTYPE within a code block, I found that some codes were automatically interpreted. So you'll want a low level parser, that only interprets the &#xXXXX; parts. I had expected the lookup in reader to be working, though? Should expect to be better even because the <f>-tags are retained now. But apparently it can handle entity names, but not numbers? Last edited by Markismus; 02-05-2020 at 05:54 PM. |
02-05-2020, 06:14 PM | #292 |
Groupie
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
|
Seems so. That's why I wrote "replace with unicode" Anyway, not tonight... Now what buggers me is why CR3 can't see dictionaries anymore (which most likely is unrelated to your file, now removed from PB)
|
02-06-2020, 05:52 AM | #293 |
Enthusiast
Posts: 34
Karma: 510
Join Date: Feb 2016
Device: Kobo
|
|
02-06-2020, 04:23 PM | #294 |
Guru
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
So the unicode conversion is a tough one in Perl. Learned a lot...
For your testing purposes: Here are your dictionary files. I tested it in koreader: I also redirected _all_ previous links to files. |
02-06-2020, 04:25 PM | #295 |
Groupie
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
|
Try this.
How done:
Last edited by EastEriq; 02-06-2020 at 05:07 PM. |
02-06-2020, 04:26 PM | #296 |
Groupie
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
|
ETA: crossed mid-air....
|
02-06-2020, 04:26 PM | #297 |
Guru
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
You just missed me! Sorry, didn't realize you were also tinkering.
How do the results compare? Why did you remove the <f> tags? BTW thanks for the link to more locales! Last edited by Markismus; 02-06-2020 at 04:46 PM. |
02-06-2020, 04:40 PM | #298 |
Groupie
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
|
Uh, I don't know, byte-checking? At first sight on a few entries (on my one) the content seems about ok, I wouldn't go for an extensive check beyond that.
The appearance in the three (I manage to resurrect CR3 in the meantime) dictionary viewers I have varies a bit, each one has its formatting quirks, like adding spacing, using a font which misses some of the graphisms. None of the three dict viewers has really an impressing usability by itself, I once was aware of some configuration files to fiddle a bit with fonts in the one or the other, but that's all it is. This one .dic seems as good as others now. |
02-06-2020, 04:54 PM | #299 |
Groupie
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
|
because they were showing up in the dictionary app. But actually that maybe was because they were always tagging nonrecognised entities? Will try yours, if you kept them...
|
02-06-2020, 04:55 PM | #300 |
Guru
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
Comparing the two, I think I like the Stardict with sametypesetting=m and unicoded text (post #294) better than the one with sametypesetting=h and html-handling of the text (post #289).
Though that could just mean that we should do some work on Koreader's html-handling. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Webster's 1913 Dictionary in Pocketbook Format | luqmaninbmore | PocketBook | 8 | 05-27-2020 10:41 AM |
Russian dictionary for Pocketbook 301+ | irbit | PocketBook | 9 | 03-29-2010 03:05 AM |
Pocketbook 301 und Pocketbook 360° im Test, Teil 1 | Forkosigan | PocketBook | 11 | 02-11-2010 03:54 AM |
Oxford built-in dictionary disappears after changing default dictionary | YYZscientist | Amazon Kindle | 4 | 01-24-2010 08:42 PM |
Pocketbook und Netronix Inc. fusionieren zu PocketBook Global | Forkosigan | Deutsches Forum | 0 | 01-08-2010 01:13 PM |