Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Plugins

Notices

Reply
 
Thread Tools Search this Thread
Old 04-07-2022, 06:58 AM   #301
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
New test version
  • Send files to Android via adb, please read the document at GitHub
  • Improve X-Ray and Word Wise quality:
    • Merge person partial name X-Ray entities
    • Unescape HTML text
    • Change fuzz algorithm
    • Add 149 and update 53 lemmas
    • Remove WORK_OF_ART NER label
    • Remove cardinal directions and ordinal directions X-Ray entities
    • Remove whitespaces from book quote sentence
    • Add option to enable or disable EPUB locator map
    • Insert X-Ray image caption data

Commits: https://github.com/xxyzz/WordDumb/co....19.0...master

The latest plugin will be uploaded to GitHub Actions Artifacts at each git push automatically.
Attached Files
File Type: zip worddumb-f63030d7836ad8dc706a4bd3ad8c64a5f24a418f.zip (408.0 KB, 90 views)

Last edited by xxyzz; 04-19-2022 at 10:38 PM. Reason: Update test plugin
xxyzz is offline   Reply With Quote
Old 04-27-2022, 10:34 PM   #302
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
v3.20.0

New features:
  • Send files to Android device via adb
  • Customize Word Wise lemmas
  • Insert X-Ray image caption data
  • Add an option to enable or disable EPUB locator map

Bug fixes:
  • Fix the code that should check ASIN of books on Kindle but check books in library
  • Disable pip cache to fix memory error on some devices
  • Fix a typo in dump_lemmas.py

Other changes:
  • Merge person X-Ray entities when partial name appears first
  • Merge X-Ray entities that get redirected or normalized by MedaiWiki API
  • Unescape MOBI and EPUB HTML text
  • Only use texts inside the MOBI and EPUB body tag
  • Select orthographic locator map in SPARQL query
  • Add lemmas that contain parentheses
  • Change X-Ray fuzz algorithm to token_set_ratio()
  • Remove WORK_OF_ART NER label
  • Remove cardinal directions and ordinal directions X-Ray entities
  • Add 639 and update 173 lemmas

Looks like spaCy will release v3.3.0 soon.
Attached Thumbnails
Click image for larger version

Name:	Screen Shot 2022-04-28 at 10.41.57.png
Views:	144
Size:	437.6 KB
ID:	193485  

Last edited by xxyzz; 04-27-2022 at 10:43 PM.
xxyzz is offline   Reply With Quote
Old 04-29-2022, 06:04 AM   #303
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
v3.20.1
  • Add support of Finnish, Korean and Swedish language
  • Update spaCy to 3.3.0
  • Disable jobs_pointer
xxyzz is offline   Reply With Quote
Old 05-04-2022, 09:44 PM   #304
Magic815
Enthusiast
Magic815 has learned how to buy an e-book online
 
Posts: 38
Karma: 98
Join Date: May 2022
Device: Kindle Paperwhite 11th Gen (2021)
Kind of a dumb question, but since I see that WordDumb handles the creation of the files and sends the ebook to my Kindle device, I wanted to confirm something.

My typical workflow is to ingest EPUB files in Calibre, and then I have them auto-convert to KFX when I use the 'Send to device' button. So does that mean that I need to previously have a KFX file already made, when I click the WordDumb button? I wasn't sure if sending via WordDumb would also handle the same auto-conversion that occurs with the Send to device method in Calibre.
Magic815 is offline   Reply With Quote
Old 05-05-2022, 12:02 AM   #305
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
Quote:
Originally Posted by Magic815 View Post
So does that mean that I need to previously have a KFX file already made, when I click the WordDumb button?
That's right. WordDumb doesn't convert book formats. It'll create a new EPUB file with X-Ray footnotes if a book only has EPUB format.

Last edited by xxyzz; 05-05-2022 at 12:05 AM.
xxyzz is offline   Reply With Quote
Old 05-05-2022, 09:17 AM   #306
muggleMode
Enthusiast
muggleMode began at the beginning.
 
muggleMode's Avatar
 
Posts: 28
Karma: 10
Join Date: May 2022
Device: PW5
hi, thanks for this amazing plugin.

just a couple newbie questions
Does "Wordwise" only work in English?
Do you have a book where I can test all the functions working fine on a kindle pw5?
muggleMode is offline   Reply With Quote
Old 05-05-2022, 07:20 PM   #307
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
Quote:
Originally Posted by muggleMode View Post
Does "Wordwise" only work in English?
Yes. It might be possible to create a new dictionary file and set the book language to English to enable Word Wise for other languages.

Quote:
Originally Posted by muggleMode View Post
Do you have a book where I can test all the functions working fine on a kindle pw5?
https://standardebooks.org/ebooks
xxyzz is offline   Reply With Quote
Old 05-07-2022, 08:03 PM   #308
Magic815
Enthusiast
Magic815 has learned how to buy an e-book online
 
Posts: 38
Karma: 98
Join Date: May 2022
Device: Kindle Paperwhite 11th Gen (2021)
Hi there - I had a follow-up question about this plugin. Let's say I have already converted an EPUB file to a KFX file - and it's the KFX file that I want to send to my eReader device.

When I hit the WordDumb plugin on that eBook, is WordDumb actually modifying the KFX file that is stored in my Calibre library permanently - and then it sends it over to my eReader? Or is WordDumb modifying a copy of the KFX file and sending that copy to the eReader device, and the KFX file in my library stays untouched? Or is WordDumb sending over extra files that sit alongside the KFX file on my eReader as it's way of embedding that information?
Magic815 is offline   Reply With Quote
Old 05-07-2022, 10:28 PM   #309
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
Both Word Wise and X-Ray data are stored in a separate sqlite file. WordDumb only adds a random ASIN metedata to the book if the book doesn't have this data, otherwise it won't alter the book file.

Last edited by xxyzz; 05-07-2022 at 10:32 PM.
xxyzz is offline   Reply With Quote
Old 05-07-2022, 10:55 PM   #310
Magic815
Enthusiast
Magic815 has learned how to buy an e-book online
 
Posts: 38
Karma: 98
Join Date: May 2022
Device: Kindle Paperwhite 11th Gen (2021)
Quote:
Originally Posted by xxyzz View Post
Both Word Wise and X-Ray data are stored in a separate sqlite file. WordDumb only adds a random ASIN metedata to the book if the book doesn't have this data, otherwise it won't alter the book file.
Interesting. Where does that separate sqlite file get stored/reside? Does it move to the eReader and sit next to the KFX file? As for the IDs, so you're saying if my workflow always ensures that I have ISBN, ASIN, and GoodReads ID information in my IDs metadata field, WordDumb will not touch the KFX file itself in any way?
Magic815 is offline   Reply With Quote
Old 05-07-2022, 11:46 PM   #311
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
Quote:
Originally Posted by Magic815 View Post
Where does that separate sqlite file get stored/reside?
It's located at the same folder of the book file before being moved to your device.

Quote:
Originally Posted by Magic815 View Post
Does it move to the eReader and sit next to the KFX file?
It's sent to a "book_name.sdr" folder next to the book file.

Quote:
Originally Posted by Magic815 View Post
As for the IDs, so you're saying if my workflow always ensures that I have ISBN, ASIN, and GoodReads ID information in my IDs metadata field, WordDumb will not touch the KFX file itself in any way?
As long as the book file has a valid ASIN metadata, WordDumb won't alter the file.
xxyzz is offline   Reply With Quote
Old 05-31-2022, 06:03 AM   #312
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
v3.21.0

New features
  • Add option to adjust preferred book format order
  • Add option to create files for all available book format

Improve X-Ray quality
  • Thanks to jhowell, KFX footnote reference numbers are ignored(requires KFX input 1.49.0)
  • Don't merge different person with the same given name or surname
  • Increase KFX image caption length limit to 450
  • Ignore URL links and page number references
  • Delete " of" from the end of x-ray label

Other changes
  • Convert NER label constant variables to frozenset
  • Only save MediaWiki cache if new data is added
  • Update dependencies
xxyzz is offline   Reply With Quote
Old 06-02-2022, 08:26 AM   #313
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
v3.21.1
  • Disable pip version check
  • Fix PY_PATH undefined error on macOS
xxyzz is offline   Reply With Quote
Old 06-11-2022, 10:58 PM   #314
xxyzz
Evangelist
xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.xxyzz ought to be getting tired of karma fortunes by now.
 
Posts: 442
Karma: 2666666
Join Date: Nov 2020
Device: none
v3.21.2

Bug fixes
  • Fix media_type undefined error when adding locator map image to EPUB footnote
  • Fix Fandom cache folder not exists error
  • Fix substring not found error for some MOBI , AZW3 and EPUB books

Other changes
  • Replace full name in Chinese and Japanese books that have interpunct
  • Only extent X-Ray length when the next character is punctuation
  • Update dependencies
xxyzz is offline   Reply With Quote
Old 06-18-2022, 12:49 PM   #315
j.p.s
Grand Sorcerer
j.p.s ought to be getting tired of karma fortunes by now.j.p.s ought to be getting tired of karma fortunes by now.j.p.s ought to be getting tired of karma fortunes by now.j.p.s ought to be getting tired of karma fortunes by now.j.p.s ought to be getting tired of karma fortunes by now.j.p.s ought to be getting tired of karma fortunes by now.j.p.s ought to be getting tired of karma fortunes by now.j.p.s ought to be getting tired of karma fortunes by now.j.p.s ought to be getting tired of karma fortunes by now.j.p.s ought to be getting tired of karma fortunes by now.j.p.s ought to be getting tired of karma fortunes by now.
 
Posts: 5,804
Karma: 103362673
Join Date: Apr 2011
Device: pb360
Thank you for making this plugin.

Does WordDumb have an option for the user to supply descriptions for x-ray entities?
j.p.s is offline   Reply With Quote
Reply

Tags
worddumb, x-ray


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
[GUI Plugin] KindleUnpack - The Plugin DiapDealer Plugins 525 Today 06:16 PM
[GUI Plugin] CalibreSpy DaltonST Plugins 245 08-18-2024 09:33 PM
[GUI Plugin] Manga plugin mastertea Plugins 6 01-06-2022 02:43 AM
[GUI Plugin] Save Virtual Libraries To Column (GUI) chaley Plugins 14 04-04-2021 05:25 AM
[GUI Plugin] Plugin Updater **Deprecated** kiwidude Plugins 159 06-19-2011 12:27 PM


All times are GMT -4. The time now is 09:20 PM.


MobileRead.com is a privately owned, operated and funded community.