07-31-2020, 01:03 PM | #31 |
Bibliophagist
Posts: 35,406
Karma: 145435140
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Forma, Clara HD, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
One oddity. When I installed 1.2.0 and did a search in my main library, it added a couple of epubs which were there when the original install was done.
|
07-31-2020, 01:12 PM | #32 |
Connoisseur
Posts: 77
Karma: 90088
Join Date: Jul 2020
Device: android
|
This can happen when plugin detects that book modification time was changed. Not sure though why and when it might change, need to experiment a bit.
|
Advert | |
|
07-31-2020, 02:30 PM | #33 |
Bibliophagist
Posts: 35,406
Karma: 145435140
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Forma, Clara HD, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
I wouldn't have been surprised if it happened in my Intake library but my Main library doesn't get many changes. One of the book was an epub version of a dictionary which has never been changed.
|
07-31-2020, 05:54 PM | #34 |
Guru
Posts: 927
Karma: 1177583
Join Date: Dec 2016
Location: Goiânia - Brazil
Device: iPad, Kindle Paperwhite
|
Nice work. Now we can easily spot when a file chokes on conversion (it will stay there on the list "forever").
|
08-08-2020, 01:05 PM | #35 |
Connoisseur
Posts: 77
Karma: 90088
Join Date: Jul 2020
Device: android
|
Version 1.3.0 released.
Contains following usability improvements:
|
Advert | |
|
08-10-2020, 07:27 PM | #36 |
Custom User Title
Posts: 8,622
Karma: 61176603
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
|
I ran into an issue: I have some large image-based CBZ files that don't have any text (empty output file if I convert manually). It takes a few hours for Power Search to index them (note that even with one thread and calibre's job priority set to low, it still manages to eat up my system resources and make my computer unusable for the duration), and it tries to re-index them every time.
Any suggestions? EDIT: If there was an option to exclude certain books (or even exclude certain formats entirely, I don't mind forgoing the ability to search my other CBZ files) that might help. Last edited by ownedbycats; 08-11-2020 at 03:50 AM. |
08-11-2020, 05:44 AM | #37 |
Connoisseur
Posts: 77
Karma: 90088
Join Date: Jul 2020
Device: android
|
Hi ownedbycats,
Sure, it will be a great feature. I will try to add it in near future. And thanks for your feedback, helpful as usual! |
08-17-2020, 04:49 AM | #38 |
Junior Member
Posts: 8
Karma: 10
Join Date: Jul 2019
Device: boox note pro, auro h2o
|
I recently started de-duplicating my library with a newly discovered tool that compares 2 textfiles A, Bagainst each other and gives a percentage of text A occuring in text B.
The Debian package is called similarity-tester, the software was written in 1989 and lives here: https://dickgrune.com/Programs/similarity_tester/ Since I don't regularly use Calibre and only nibble at this forum very occasionally, I write this here, because your plugin seems to me to have nearly all aspects available to use this: - convert books to text - run external program - do something with the result and I've found no other mention of sim_text in combination with Calibre. A couple of points I found when using it: - it takes time to run. 3000 files on an Intel J1900 use about 60 seconds and there is no provision for a progress indicator. Can be added relatively simple, of course. There's three main loops: reading files, hashing files & comparing hashes. - it runs on a single core. If you split the filelist and run permutations of the split sections on multiple cores, it runs faster - if you have enough memory. - it takes memory to run also. 3000 files use about 2 GiB of memory. - some patches to make it compile cleanly exists in Debian's bug tracker. - the best way to run it is to feed it a list of files (-i parameter), then parse the output and if something is found, run the comparison for those single files in reverse (since if A occurs for 80% in B, maybe B is the 'extended edition' with a short story added, or something like that). So, maybe someone can use this, I notice that detecting similar books is a regularly occuring question in Calibre, and this is a foolproof method. |
08-22-2020, 11:07 PM | #39 |
Diligent dilettante
Posts: 3,417
Karma: 48736498
Join Date: Sep 2019
Location: in my mind
Device: Kobo Sage; Kobo Libra H2O
|
"pdftotext" is listed as "optional", but attempting to configure Power Search without it brings up path errors? I can't be bothered with pdftotext as I have no PDFs in my Calibre library.
EDIT Never mind, pebkac as usual. I started it without opening the Options dialog, all seems VERY well. Thank you! Last edited by Uncle Robin; 08-22-2020 at 11:43 PM. Reason: Add correction |
08-24-2020, 11:46 AM | #40 |
Connoisseur
Posts: 77
Karma: 90088
Join Date: Jul 2020
Device: android
|
Version 1.4.0 released.
Now user can select supported file formats in options dialog. Thanks to ownedbycats for feature suggestion. |
08-24-2020, 11:59 AM | #41 |
Custom User Title
Posts: 8,622
Karma: 61176603
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
|
I installed it. Excluding CBZ files fixed my issue quite nicely. Thank you.
I did run into another issue though: for some reason it stopped searching the older books, only results are the ones added/updated recently. I reinstalled ElasticSearch while trying to figure out the CBZ issue and I think it broke the index. Is there a way to reset it? EDIT: I removed the plugin and reinstalled it and that didn't work. Last edited by ownedbycats; 08-24-2020 at 12:54 PM. |
08-24-2020, 01:10 PM | #42 |
Connoisseur
Posts: 77
Karma: 90088
Join Date: Jul 2020
Device: android
|
I'm not sure, it might happen if you are switching your library path. Otherwise it should be working well...
|
08-24-2020, 02:01 PM | #43 |
Custom User Title
Posts: 8,622
Karma: 61176603
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
|
Moving caps.json from the plugins folder seems to reset the Power Search settings. It's re-indexing all the files.
I accidentally deleted the ES data folder when I reinstalled ElasticSearch (I use Geek Uninstaller) but since the caps.json file listed the files as already-indexed, it didn't re-index them. Another possible bug I noticed: When canceling partway through indexing (e.g. to adjust the number of threads), although the files are indexed (as can be confirmed by monitoring the elasticsearch programdata folder), the datetimes don't get added to caps.json so Power Search ends up re-indexing them unnecessarily. Last edited by ownedbycats; 08-24-2020 at 02:17 PM. |
08-24-2020, 04:17 PM | #44 | ||
Connoisseur
Posts: 77
Karma: 90088
Join Date: Jul 2020
Device: android
|
Quote:
Quote:
|
||
08-24-2020, 04:23 PM | #45 |
Diligent dilettante
Posts: 3,417
Karma: 48736498
Join Date: Sep 2019
Location: in my mind
Device: Kobo Sage; Kobo Libra H2O
|
Do we need to uninstall 1.3.0 before installing 1.4.0? If so, how is that done, since Power Search doesn't show up in my list of installed plugins?
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
[GUI Plugin] Clipboard Search | kiwidude | Plugins | 29 | 04-02-2024 10:05 PM |
[GUI Plugin] Search the Internet | kiwidude | Plugins | 433 | 04-01-2024 05:48 PM |
[GUI Plugin] Recoll Full Text Search | Satas | Plugins | 16 | 08-05-2016 03:54 AM |
[GUI Plugin] Full Text Search (SOLR) | peterpisljar | Plugins | 2 | 08-09-2015 08:16 AM |
Make a simple Plugin for Full Text Search using Recoll | Satas | Development | 9 | 07-20-2013 04:15 PM |