@ezdiy Great! Thank you!
Quote:
Turns out the "100kb limit" is actually 64k (after removal of tags).
|
What tags are retained in the conversion? Are color-tags removed? Blockquote, ex, abr?
Quote:
you'll have to abbreviate the entry or split it via perl. Out of the whole dict there's only one such entry though.
|
The maximum article- and line-lengths are already implemented, so I’ll tune them in the script. That's why the reconstructed xdxf-file still only had one left, that was too long. The original is teeming with them.
What is the limiting entity, precisely? I saw with Greek letters, that it isn't bytes: Some accepted entries stayed below 3500 chars, while being 7500 Bytes. But the chars are not exactly 4k either, somewhat less.
Is there a way to encode for resources? Audio tags for pronunciation? I know Stardict-tools can convert Lingvo audio resources to Stardict format, however, I have no idea how to implement them in xdxf-format, yet. Would be great to use the audio feature of the pocketbook!
Image resources would be nice, too. Maybe with bbencode? I encoded fonts that way into xml when further processing needed it.
That looks a bit like the de-assembler I used as a kid. (I had to hack CGA games to work on my dad's monochrome Hercules graphics card.) What could I look into for that, nowadays?