Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Readers > PocketBook

Notices

Reply
 
Thread Tools Search this Thread
Old 02-05-2020, 02:46 PM   #286
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
Yes, those are indeed the wrong ones.

Sorry, my kid and me were having a sick day. I'll look into it later.

Last edited by Markismus; 02-05-2020 at 02:50 PM.
Markismus is offline   Reply With Quote
Old 02-05-2020, 03:17 PM   #287
tropoy
Enthusiast
tropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enough
 
tropoy's Avatar
 
Posts: 34
Karma: 510
Join Date: Feb 2016
Device: Kobo
Quote:
Originally Posted by EastEriq View Post
... but it seems they are all still there. Are you sure you uploaded your last conversion?
I checked and the first conversion and the second one seems to be the same, indeed. Same date and size...
tropoy is offline   Reply With Quote
Advert
Old 02-05-2020, 03:19 PM   #288
tropoy
Enthusiast
tropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enough
 
tropoy's Avatar
 
Posts: 34
Karma: 510
Join Date: Feb 2016
Device: Kobo
Quote:
Originally Posted by Markismus View Post
Yes, those are indeed the wrong ones.

Sorry, my kid and me were having a sick day. I'll look into it later.
Ok, thanks, take care of yourselves.
tropoy is offline   Reply With Quote
Old 02-05-2020, 04:47 PM   #289
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
Allright, I just wrote the conversion from xdxf-format to Stardict text-format. Now I can generate the Stardict files without having to use Linguae. (I returned my pocketbook Inkpad 3 Pro, so I am using koreader on my Kobo Aura H2O again.)

Here is the link to the dictionaries. All other links now point to them, too.


Tested in koreader:

Last edited by Markismus; 02-06-2020 at 04:24 PM.
Markismus is offline   Reply With Quote
Old 02-05-2020, 05:43 PM   #290
EastEriq
Groupie
EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!
 
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
Uhm, appreciate, but still no joy. The stock Dictionary.app displays the codes literally, inside pbreader they are simply dropped. In CR3 I can't open the dictionary anymore (not necessarily because of this file).
Attached Thumbnails
Click image for larger version

Name:	pbreaderdict.png
Views:	250
Size:	83.2 KB
ID:	176942   Click image for larger version

Name:	pbdict.png
Views:	236
Size:	85.6 KB
ID:	176943  
EastEriq is offline   Reply With Quote
Advert
Old 02-05-2020, 05:52 PM   #291
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
Seems that your dictionary app can't handle html entities at all.

The last thing you could do is convert the numerical symbols to characters and expect modern day apps to be able to handle unicode directly.
When I posted the DOCTYPE within a code block, I found that some codes were automatically interpreted. So you'll want a low level parser, that only interprets the &#xXXXX; parts.

I had expected the lookup in reader to be working, though? Should expect to be better even because the <f>-tags are retained now. But apparently it can handle entity names, but not numbers?

Last edited by Markismus; 02-05-2020 at 05:54 PM.
Markismus is offline   Reply With Quote
Old 02-05-2020, 06:14 PM   #292
EastEriq
Groupie
EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!
 
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
Quote:
Originally Posted by Markismus View Post
Seems that your dictionary app can't handle html entities at all.
Seems so. That's why I wrote "replace with unicode" Anyway, not tonight... Now what buggers me is why CR3 can't see dictionaries anymore (which most likely is unrelated to your file, now removed from PB)
EastEriq is offline   Reply With Quote
Old 02-06-2020, 05:52 AM   #293
tropoy
Enthusiast
tropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enoughtropoy will become famous soon enough
 
tropoy's Avatar
 
Posts: 34
Karma: 510
Join Date: Feb 2016
Device: Kobo
Quote:
Originally Posted by EastEriq View Post
Uhm, appreciate, but still no joy. The stock Dictionary.app displays the codes literally, inside pbreader they are simply dropped.
On my Inkpad 3 too, of course...

Thanks you both a lot anyway.
tropoy is offline   Reply With Quote
Old 02-06-2020, 04:23 PM   #294
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
So the unicode conversion is a tough one in Perl. Learned a lot...

For your testing purposes: Here are your dictionary files.

I tested it in koreader:


I also redirected _all_ previous links to files.
Markismus is offline   Reply With Quote
Old 02-06-2020, 04:25 PM   #295
EastEriq
Groupie
EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!
 
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
Try this.

How done:
  1. Took Markismus last xdxf
  2. substituted xml entities with unicode characters with this script:
    Code:
    cat Nouveau\ Littre\ 2011\ \(from\ Bookeen\ by\ Penelope\)_reconstructed.xdxf |\
      sed -e "s/&nbsp;/ /g" \
          -e "s/&apos;/'/g" \
          -e "s/<[/]*f>//g" \
          | perl -CS -pe 's/&#x([\dA-Fa-f]{3,4});/chr(hex($1))/eg' \
          | perl -CS -pe 's/&#(\d{3,4});/chr($1)/eg' \
    > N1.xdxf
  3. converted using DictionaryConverter-neu 171109, augmented with the locales found in this post (rather than using the converter Markismus used - for no reason, just ran into the others first).

Last edited by EastEriq; 02-06-2020 at 05:07 PM.
EastEriq is offline   Reply With Quote
Old 02-06-2020, 04:26 PM   #296
EastEriq
Groupie
EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!
 
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
ETA: crossed mid-air....
EastEriq is offline   Reply With Quote
Old 02-06-2020, 04:26 PM   #297
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
You just missed me! Sorry, didn't realize you were also tinkering.
How do the results compare?

Why did you remove the <f> tags?

BTW thanks for the link to more locales!

Last edited by Markismus; 02-06-2020 at 04:46 PM.
Markismus is offline   Reply With Quote
Old 02-06-2020, 04:40 PM   #298
EastEriq
Groupie
EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!
 
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
Quote:
Originally Posted by Markismus View Post
How do the results compare?
Uh, I don't know, byte-checking? At first sight on a few entries (on my one) the content seems about ok, I wouldn't go for an extensive check beyond that.

The appearance in the three (I manage to resurrect CR3 in the meantime) dictionary viewers I have varies a bit, each one has its formatting quirks, like adding spacing, using a font which misses some of the graphisms. None of the three dict viewers has really an impressing usability by itself, I once was aware of some configuration files to fiddle a bit with fonts in the one or the other, but that's all it is. This one .dic seems as good as others now.
EastEriq is offline   Reply With Quote
Old 02-06-2020, 04:54 PM   #299
EastEriq
Groupie
EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!EastEriq rocks like Gibraltar!
 
Posts: 169
Karma: 100516
Join Date: Jan 2018
Device: Cybook Orizon, PocketBook Touch HD
Quote:
Originally Posted by Markismus View Post
Why did you remove the <f> tags?
because they were showing up in the dictionary app. But actually that maybe was because they were always tagging nonrecognised entities? Will try yours, if you kept them...
EastEriq is offline   Reply With Quote
Old 02-06-2020, 04:55 PM   #300
Markismus
Guru
Markismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicingMarkismus causes much rejoicing
 
Markismus's Avatar
 
Posts: 897
Karma: 149877
Join Date: Jul 2013
Location: Netherlands
Device: Cracked HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
Comparing the two, I think I like the Stardict with sametypesetting=m and unicoded text (post #294) better than the one with sametypesetting=h and html-handling of the text (post #289).

Though that could just mean that we should do some work on Koreader's html-handling.
Markismus is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Webster's 1913 Dictionary in Pocketbook Format luqmaninbmore PocketBook 8 05-27-2020 10:41 AM
Russian dictionary for Pocketbook 301+ irbit PocketBook 9 03-29-2010 03:05 AM
Pocketbook 301 und Pocketbook 360° im Test, Teil 1 Forkosigan PocketBook 11 02-11-2010 03:54 AM
Oxford built-in dictionary disappears after changing default dictionary YYZscientist Amazon Kindle 4 01-24-2010 08:42 PM
Pocketbook und Netronix Inc. fusionieren zu PocketBook Global Forkosigan Deutsches Forum 0 01-08-2010 01:13 PM


All times are GMT -4. The time now is 02:39 AM.


MobileRead.com is a privately owned, operated and funded community.