#121
Groupie
Quote:
As to the rest, they really sound like jobs for the Mark I eyeball.

#122
Addict
But it only counts EPUB and MOBI books. Most of my books are PDFs, because before calibre had an integrated reader I found that format worked best for me. So now my library has lots of old PDFs.
I need to find dupes among the PDFs (for example, to tell whether a file is a true dupe or a converted version of an original).

#123
Groupie
Quote:
1) Find Duplicates isn't going to be looking inside any files. It is a tool to analyze the metadata stored in the Calibre database to identify possible duplicates, leaving content analysis up to you.
2) For the most part, Calibre developers are only minimally interested in supporting PDFs, since PDF is an extraordinarily unfriendly format to work with.
Sorry, I know these are not very helpful comments.

#124
Addict
As a solution I made a new library for dupes. I leave one copy in my main library and move the other to the Dupes library. This way there is no danger of deleting an original or a different copy, and it doesn't mess up my main library. Books don't take up much space, so deleting dupes isn't as important as keeping them from cluttering the library.
And now I do hate PDF too. It's impossible to count pages and hard to convert (the author and title remain on every page of the resulting EPUB, etc.). Thanks for the help.

#125
US Navy, Retired
Quote:
My library currently consists of 8600 ePubs and 5 PDFs.

#126
Wizard
Plugin starts eating huge amounts of memory
I found out that using 'Soundex' for the Title and 'Ignore' for the Author in an Author/Title duplicate search doesn't work well. With my library (~40K books) the plugin starts eating memory like mad and ends up crashing Calibre when it runs out, which happens in half a minute or less (this on a 2.4 GHz Core i5 with 2 GB RAM + the same VM).
It doesn't do this when using Soundex for both, or indeed with any other combo I have tried (mostly Fuzzy/Fuzzy or Fuzzy/Ignore), or when using ISBN matching. Those all work just fine, and have weeded out literally thousands of dupes (probably close to 4000) from the mess that was my ebook collection.

#127
eBook Junkie
Hi Kiwidude:
I ran into some files that seem to have the author name reversed in my db, such as Brockmann Suzanne and Suzanne Brockmann. Is there any way to use the plugin to find these files?
Nyn

#128
Calibre Plugins Developer
@Nyn - if you do an "Ignore Title, Similar Author" search that should help you find those.
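For anyone curious why a similar-author match can catch reversed names, here is a minimal Python sketch. It is not the plugin's actual code: `author_key` and `find_reversed_authors` are hypothetical names, and the only assumption is that two author strings count as "similar" when they contain the same words in any order.

```python
import re
from collections import defaultdict


def author_key(name):
    """Order-independent key: lowercase words, punctuation dropped, sorted."""
    words = re.findall(r"\w+", name.lower())
    return " ".join(sorted(words))


def find_reversed_authors(authors):
    """Group distinct author spellings that share the same set of words."""
    groups = defaultdict(set)
    for name in authors:
        groups[author_key(name)].add(name)
    # Keep only keys that were reached by more than one distinct spelling.
    return [sorted(names) for names in groups.values() if len(names) > 1]


if __name__ == "__main__":
    sample = ["Suzanne Brockmann", "Brockmann Suzanne", "Jane Austen"]
    for group in find_reversed_authors(sample):
        print(group)
```

Because the title is ignored, every book by either spelling ends up in the same group, which is what makes the reversed entries easy to spot.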

#129
eBook Junkie
Quote:
It worked great; it found 4 more authors like that. Thanks again.
Nyn
Last edited by nynaevelan; 08-16-2011 at 06:18 PM. Reason: more info

#130
Wizard
Quote:
Thanks for a great plugin!

#131
eBook Junkie
Hi Kiwidude:
Me again,
Nyn

#132
Addict
That would be a great option indeed, for the quality check.

#133
Connoisseur
This is certainly one of the most useful and significant plugins, and I am glad it is set to become a part of the main program. The ability to find duplicates by the file itself, and not just by duplicate names, is especially powerful and useful.
I do have one suggestion for making it even better, although I am not sure if it is possible (it seems like it should be). I would really like to be able to limit my checking for duplicates, at times, to a selected set of books rather than the entire library. This would be especially useful for focusing on cleaning up one area of my library.
One thing I should note is that I do (intentionally) keep multiple copies of some books. Those copies, however, differ in the file itself (not in the name/identity). I also often wind up with multiple copies of the same file by accident, and like to routinely clean that up. (This is especially the case as I build the library from my files, often with the contents of old drives dumped in, to find out what is missing and what is duplicated.)
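As an illustration of what "duplicates by the file itself" within a selected set can mean, here is a rough Python sketch that works outside Calibre. It does not use the plugin or the Calibre API; `file_digest` and `find_identical_files` are hypothetical helpers, and the assumption is that two files count as the same when they have the same size and the same SHA-256 digest.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path, chunk_size=1 << 20):
    """SHA-256 of a file, read in chunks so large books don't fill memory."""
    h = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_identical_files(paths):
    """Return groups of paths that share both size and SHA-256 digest."""
    by_size = defaultdict(list)
    for p in map(Path, paths):
        by_size[p.stat().st_size].append(p)

    by_hash = defaultdict(list)
    for size, same_size in by_size.items():
        if len(same_size) < 2:
            continue  # a file with a unique size cannot have an identical twin
        for p in same_size:
            by_hash[(size, file_digest(p))].append(p)
    return [sorted(group) for group in by_hash.values() if len(group) > 1]
```

Passing only the paths from one folder or one old drive dump limits the check to that subset. Intentional copies that differ in content land in different groups, so only the accidental identical files are reported.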

#134
Calibre Plugins Developer
@Philosopher - you can do this already. Find Duplicates will respect any search restriction you have put in place. So do a search to bring back just the subset of books you are interested in. Then in the Restriction dropdown on the top left, select "*Current Search". Now if you use Find Duplicates (or indeed the Quality Check plugin as well) all operations are limited to just those books. When you are finished, clear the search restriction in the restriction dropdown to go back to your full library.

#135
Connoisseur
OK - didn't realize that. But that is because I tried it using the User Category to select the group - and it didn't seem to restrict it. I'll have to go back and see the difference. Thanks. (I thought that the User Category effectively does a search itself - but perhaps there is something different).