Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Plugins

Notices

Reply
 
Thread Tools Search this Thread
Old 12-13-2021, 10:31 PM   #226
ArthurQ
Junior Member
ArthurQ began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Dec 2021
Device: kindle paper white 4
Quote:
Originally Posted by xxyzz View Post
BTW, calibre can create MOBI news directly. You need to set 'preferred output format' to MOBI in preferences->behavior.
Thank you, now I can create MOBI news .
And after I tried to dis-checked 'columns have three values(needs restart)' option, WordDumb works for me
ArthurQ is offline   Reply With Quote
Old 12-18-2021, 08:41 AM   #227
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
I added images to X-Ray for MOBI and AZW3 books in commit https://github.com/xxyzz/WordDumb/co...1888f4ef1b5f95, the latest build can be downloaded from GitHub Actions(https://github.com/xxyzz/WordDumb/ac...uns/1596064865). Please try it out and report bugs.

For KFX books, I'll need jhowell's help again. The X-Ray data for KFX looks like this:
Code:
sqlite> SELECT * FROM excerpt WHERE image NOT NULL;
+-----+---------+--------+-------+------------------+---------+
| id  |  start  | length | image | related_entities |  goto   |
+-----+---------+--------+-------+------------------+---------+
| 576 | 1       | 0      | e6    |                  | 1       |
| 577 | 2179585 | 71     | e2GX  |                  | 2179583 |
| 578 | 2179667 | 57     | e2H8  |                  | 2179665 |
The "goto" column is the image tag offset in the file, I guess the "image" will be similar to the "src" of the img tag in MOBI file.

The "start" and "length" are for the sentence below the image. For convenience(laziness), I set "start" to the same as "goto" and "length" to 0 in this commit.

Last edited by xxyzz; 12-18-2021 at 08:55 AM.
xxyzz is offline   Reply With Quote
Old 12-18-2021, 11:16 AM   #228
jhowell
Grand Sorcerer
jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.
 
jhowell's Avatar
 
Posts: 7,099
Karma: 92190113
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
Quote:
Originally Posted by xxyzz View Post
For KFX books, I'll need jhowell's help again. The X-Ray data for KFX looks like this:
Code:
sqlite> SELECT * FROM excerpt WHERE image NOT NULL;
+-----+---------+--------+-------+------------------+---------+
| id  |  start  | length | image | related_entities |  goto   |
+-----+---------+--------+-------+------------------+---------+
| 576 | 1       | 0      | e6    |                  | 1       |
| 577 | 2179585 | 71     | e2GX  |                  | 2179583 |
| 578 | 2179667 | 57     | e2H8  |                  | 2179665 |
It never occurred to me that images would be relevant to the X-Ray feature. I would be happy to help.

In that table the image field has values starting with "e". Those are identifiers for KFX resources.

The code I wrote to extract text and positions from KFX files currently handles only text. I can enhance that to also include the resource names and positions for images. Each image in KFX occupies one "position" within the book, the same as a single character.

If you need to access the actual image content then I can also include the mapping from resource name to image location. That is the filename of the image within the unpacked book. (You can unpack KFX using a different option of the KFX Input plugin.)

Is there anything else you might need?
jhowell is online now   Reply With Quote
Old 12-18-2021, 11:22 AM   #229
yiming
Zealot
yiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animals
 
Posts: 127
Karma: 6744
Join Date: Dec 2011
Device: Kindle Touch, PW2, PW5
Thank you for creating this great plugin. I've have used it on a few books from the Standard Ebooks and found that soft hyphens are still causing problem. For eg, in Pride and Prejudice, the word "property" is treated as "prop", and in The Great Gatsby, the word "wagon" is treated as "wag".

Could you look into the problem. I'm using the latest 3.14.4 of the plugin. Thanks.

links to the books:
https://standardebooks.org/ebooks/ja...prejudice.azw3
https://standardebooks.org/ebooks/f-...at-gatsby.azw3

Last edited by yiming; 12-18-2021 at 11:41 AM.
yiming is offline   Reply With Quote
Old 12-18-2021, 07:00 PM   #230
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
Quote:
Originally Posted by jhowell View Post
It never occurred to me that images would be relevant to the X-Ray feature.
There is an image tab in X-Ray shows all the images in the book, it's quite convenient to find images.

Quote:
Originally Posted by jhowell View Post
Is there anything else you might need?
The image resource name and position are sufficient for the task. Thanks for your help.
xxyzz is offline   Reply With Quote
Old 12-18-2021, 07:04 PM   #231
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
Quote:
Originally Posted by yiming View Post
that soft hyphens are still causing problem
Both libraries(flashtext and spaCy) I used for creating Word Wise and X-Ray can't deal with soft hyphen. You have to remove soft hyphens or convert the book to KFX with KFX Output plugin.
xxyzz is offline   Reply With Quote
Old 12-18-2021, 07:35 PM   #232
jhowell
Grand Sorcerer
jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.
 
jhowell's Avatar
 
Posts: 7,099
Karma: 92190113
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
Quote:
Originally Posted by xxyzz View Post
The image resource name and position are sufficient for the task. Thanks for your help.
No problem. I will let you know when it is ready.
jhowell is online now   Reply With Quote
Old 12-18-2021, 11:41 PM   #233
yiming
Zealot
yiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animals
 
Posts: 127
Karma: 6744
Join Date: Dec 2011
Device: Kindle Touch, PW2, PW5
Quote:
Originally Posted by xxyzz View Post
Both libraries(flashtext and spaCy) I used for creating Word Wise and X-Ray can't deal with soft hyphen. You have to remove soft hyphens or convert the book to KFX with KFX Output plugin.
Thanks for your reply. I misunderstood this: "Update to v3.7.5, this version ignores soft hyphens in the book." Thought that it applies to AZW3 as well.
yiming is offline   Reply With Quote
Old 12-19-2021, 12:00 AM   #234
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
Quote:
Originally Posted by yiming View Post
I misunderstood this: "Update to v3.7.5, this version ignores soft hyphens in the book." Thought that it applies to AZW3 as well.
That version does ignore soft hyphen when creating Word Wise but not X-Ray. And since v3.13.0 I'm using flashtext to improve speed, so soft hyphen affects Word Wise again.
xxyzz is offline   Reply With Quote
Old 12-20-2021, 07:58 AM   #235
jhowell
Grand Sorcerer
jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.jhowell ought to be getting tired of karma fortunes by now.
 
jhowell's Avatar
 
Posts: 7,099
Karma: 92190113
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
Quote:
Originally Posted by jhowell View Post
No problem. I will let you know when it is ready.
Version 1.46 of the KFX Input plugin has the change to include position information for images. See the section on "Generating content/position information" in the KFX Input documentation for a description of how the output format has changed.

Let me know if there are any problems.
jhowell is online now   Reply With Quote
Old 12-20-2021, 12:29 PM   #236
yiming
Zealot
yiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animalsyiming is kind to children and small, furry animals
 
Posts: 127
Karma: 6744
Join Date: Dec 2011
Device: Kindle Touch, PW2, PW5
Quote:
Originally Posted by xxyzz View Post
That version does ignore soft hyphen when creating Word Wise but not X-Ray. And since v3.13.0 I'm using flashtext to improve speed, so soft hyphen affects Word Wise again.
Thanks for the clarification.
yiming is offline   Reply With Quote
Old 12-20-2021, 10:28 PM   #237
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
v3.15.0

- update dependencies
- close MOBI file
- support X-Ray images, thanks for jhowell's help
xxyzz is offline   Reply With Quote
Old 12-30-2021, 09:33 PM   #238
mmobes
Enthusiast
mmobes began at the beginning.
 
Posts: 39
Karma: 10
Join Date: Dec 2021
Device: Kindle Oasis
So when the instructions say the plugin won't support macOS, does that mean I can't use macOS to add X-Ray to my books through Calibre or does that mean X-Ray won't work when reading on macOS?
mmobes is offline   Reply With Quote
Old 12-30-2021, 10:35 PM   #239
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
Quote:
Originally Posted by mmobes View Post
So when the instructions say the plugin won't support macOS, does that mean I can't use macOS to add X-Ray to my books through Calibre or does that mean X-Ray won't work when reading on macOS?
It means you can't use this plugin to create X-Ray file on the official calibre macOS app. Use WordDumb on Linux if you want to try that feature or build calibre from source and disable library validation.
xxyzz is offline   Reply With Quote
Old 01-07-2022, 07:07 AM   #240
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
v3.16.0

- X-Ray can be created on macOS
- add 5038 and update 608 lemmas
- update dependencies
xxyzz is offline   Reply With Quote
Reply

Tags
worddumb, x-ray


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
[GUI Plugin] KindleUnpack - The Plugin DiapDealer Plugins 527 Today 01:36 PM
[GUI Plugin] CalibreSpy DaltonST Plugins 245 08-18-2024 09:33 PM
[GUI Plugin] Manga plugin mastertea Plugins 6 01-06-2022 02:43 AM
[GUI Plugin] Save Virtual Libraries To Column (GUI) chaley Plugins 14 04-04-2021 05:25 AM
[GUI Plugin] Plugin Updater **Deprecated** kiwidude Plugins 159 06-19-2011 12:27 PM


All times are GMT -4. The time now is 09:33 PM.


MobileRead.com is a privately owned, operated and funded community.