![]() |
#961 |
Junior Member
![]() Posts: 2
Karma: 10
Join Date: Mar 2022
Device: Various
|
Doh, not sure how I missed that LOL
Thanks for the help. |
![]() |
![]() |
![]() |
#962 |
Library Breeder (She/Her)
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,268
Karma: 1937891
Join Date: Apr 2015
Location: Fullerton, California
Device: Paperwhite 2015 (2), PW 2024 (12 GEN), PW 2023 (11 GEN), Scribe (1st)
|
Might be a dumb or obvious question/answer, but in the case of tags where there are multiple matches for duplicates and only a couple of those matches in the group are actual matches, how do I specify that I only want to use those specific ones out of the group? It seems like it's an all or none.
Example 2stars, 2-stars,2.stars,3-stars,3.stars,5-stars I match 2-stars for all of them and I want to use 2.stars for the 2 star ones but none of the other ones, which should get their own matches. I ignore them and end up missing out all together. Am I missing something? |
![]() |
![]() |
![]() |
#963 | |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,792
Karma: 146391129
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
|
|
![]() |
![]() |
![]() |
#964 |
Junior Member
![]() Posts: 3
Karma: 10
Join Date: May 2022
Device: none
|
Stored Binary Hashes?
Hey, I'm a big fan of your Find Duplicates Plugin. I was wondering if it would be feasible to have it store the binary hashes in the database, to reduce the amount of work needed to do a comparison. Specifically, the binary compare has gotten slow as my library had increased beyond the 1TB mark. Is this a feature that can be added, or can I fork the code and attempt to add it myself somewhere?
|
![]() |
![]() |
![]() |
#965 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,076
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
MR user should be cautioned about using experimental code. Now the critique. A hash may only be good at the time. ANY change to the book (polish, embed, edit) will change that value. How do you NOW know the value is stale??? Value Storage: A custom column will stay with the Library |
|
![]() |
![]() |
![]() |
#966 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 12,447
Karma: 8012886
Join Date: Jan 2010
Location: Notts, England
Device: Kobo Libra 2
|
Quote:
Plugins are provided as source code. You are free to change them however you want. Getting your changes released requires consent of the plugin's current maintainer. See Writing your own plugins to extend calibre’s functionality for some guidance on writing & changing plugins. You can store the hashes in calibre's database using db.cache.add_custom_book_data. Personally I would store both the hash and the format's last-modified date so you can have a clue that the hash is still valid. |
|
![]() |
![]() |
![]() |
#967 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,196
Karma: 1995558
Join Date: Aug 2015
Device: Kindle
|
Version 1.9.7
Version 1.9.7
|
![]() |
![]() |
![]() |
#968 |
Junior Member
![]() Posts: 9
Karma: 10
Join Date: Sep 2022
Device: Kobo Touch, Like Book Mars, Android
|
Optimise Binary Compare using hashes/digests
I'd like to suggest that Find Duplicates's Binary Compare be enhanced to support storage and comparison by hash/digest to significant speed it up, by avoiding most/all redundant full file compares. A file compare on hash/digest could be an option if hash/digest collision false-matches are suspected, as were discovered occurring sometimes for MD5.
If suggest storage of binary comparison metadata in a custom field, and that this field contains a JSON map with a hash/digest type, and a key for each file format, with the value map containing, a hexadecimal formatted file hash/digest (I'd suggest SHA256), the last file size, and the file last modified timestamp; the later two for validation. If this custom field was not configured, like the "Last Modified" plugin does, a warning should be displayed, then the current _slow_ full file compare functionality used instead. During a binary search: * If any of the field maps are missing, they should be created. * If the field value is junk or if the hash/digest type is obsolete, the whole field map should be recreated. * If a format file is missing, it should be removed from the field map. * If a format file was added, it should be added to the map. * If the hash/digest value is out-of-date (file size or last modified changed), the type map should be re-built. It would be a nice if the above rules were applied after a book entry was added, after any formats were added/removed, and after any in-calibre format file changes; obviously, I would not expect this to be able to spot any updates outside of calibre. I'd suggest that this field, and its value creation, validation, and updating should really be provided by calibre itself. Last edited by jluaioyj; 09-08-2022 at 02:22 PM. |
![]() |
![]() |
![]() |
#969 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,196
Karma: 1995558
Join Date: Aug 2015
Device: Kindle
|
Quote:
Given the last point, automatic hash calculation on book additions does not make much sense. It can be done but will not be of much use, because only a small subset of these hashes will be needed based on size comparisons. In addition to this, calculating book hashes will slow down adding books, especially if the user is adding a large number of books. |
|
![]() |
![]() |
![]() |
#970 |
Library Breeder (She/Her)
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,268
Karma: 1937891
Join Date: Apr 2015
Location: Fullerton, California
Device: Paperwhite 2015 (2), PW 2024 (12 GEN), PW 2023 (11 GEN), Scribe (1st)
|
I was wondering if you could add a feature that put tags with a specific character as the default rename. I hate having to keep right clicking on the tags with periods.
|
![]() |
![]() |
![]() |
#971 |
Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 34
Karma: 130226
Join Date: Apr 2020
Device: Kindle Voyage
|
Not sure if I asked this before or if this is some other place discussed.
I'm puzzled how when I do "similar" search in Metadata duplicates it can't find what I guess should be obvous hits like: Doe, John Doe, John L. I have so many authors in my database with and without their middle names, but can't find most of them in duplicate. Similar gives nothing, Fuzzy gives 20x more false positives...not even close to being useful. Am I missing something? TNx |
![]() |
![]() |
![]() |
#972 | |
Calibre Plugins Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,730
Karma: 2197770
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
Quote:
|
|
![]() |
![]() |
![]() |
#974 |
Calibre Plugins Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,730
Karma: 2197770
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
@capink - nice one - my bad for not properly reading the post while I was focused on other things...
|
![]() |
![]() |
![]() |
#975 |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,076
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Then I would suggest the old Eyeball Mk II
In the Tag Browser: Right click (authors): Manage Be sure to check (just do a Search and validate what you see) before you rename. Some Authors use their MI to differentiate from another Author with the same Fn Ln And some Publishers leave the Initial off ![]() ![]() BTW I have to do this with Series. Sometimes I add a 'tie breaker': Series (Authors Initials), to the name. The Paren does not mess the sort. |
![]() |
![]() |
![]() |
Tags |
cross library duplicates, in library duplicates |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
[GUI Plugin] Generate Cover | kiwidude | Plugins | 862 | 07-24-2025 08:49 PM |
[GUI Plugin] View Manager | kiwidude | Plugins | 416 | 07-16-2025 05:35 PM |
[GUI Plugin] Quality Check | kiwidude | Plugins | 1251 | 07-07-2025 09:13 PM |
[GUI Plugin] Open With | kiwidude | Plugins | 404 | 02-21-2025 05:42 AM |
[GUI Plugin] Plugin Updater **Deprecated** | kiwidude | Plugins | 159 | 06-19-2011 12:27 PM |