Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Library Management

Notices

Reply
 
Thread Tools Search this Thread
Old 04-18-2022, 09:05 AM   #1
realzi
Junior Member
realzi began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Apr 2022
Device: Amazon kindle
Finding duplicate ISBNs & compare size

Hey there,
trying to find an answer to my question i did read a lot on this forum, but to no avail. I understand that automatic deletion of duplicates, which are not binary duplicates, seems to be frowned upon. I want to do it anyway.
I would love to find duplicate ISBN numbers (making sure it is the same edition of the book) and then to compare the size of the books to only keep the biggest. My assumption being that in this case the quality of the cover and the book per se is better.
The finding duplicates helps a lot in this case. But as my library is quite big i would also like to have an automated process, where the file sizes of books with the same isbn get compared.
Is there any easy way to achieve this?
Regards realzi
realzi is offline   Reply With Quote
Old 04-18-2022, 12:12 PM   #2
theducks
Well trained by Cats
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 31,047
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
Your Q prompted me to run an Find Duplicates:ISBN (the plugin) search.
1) DON'T Automate
2) See #1, don't trust a simple compare.

Each and every one was uniquely wrong (and tedious to find a resolution).
theducks is offline   Reply With Quote
Advert
Old 04-18-2022, 02:08 PM   #3
realzi
Junior Member
realzi began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Apr 2022
Device: Amazon kindle
Well obviously a prerequesite to this procedure would be to verify the ISBNs are correct!

There is a plugin that extracts the ISBNs from ebooks. The scenario you are describing sounds like you might have a couple of wrong numbers there. As the plugin was able to identify/verify a bit more than 70% of the ebooks in my library i am quite confident that an automated approach would be useful.

Also i tested the find duplicates plugin yesterday and in the first 50 Books the results were on point. But i don't feel like comparing all of the books by hand. That is why i made this post.

So is there a way to compare the results of the plugin?

As far as i understand the plugin iterates through the whole library and makes a list of ISBNs. If the most recent one was already on the list, it gets marked as a duplicate, if it was not, it is put on the list. So far so simple, i just struggle to see a way to compare filesizes in this process.
realzi is offline   Reply With Quote
Old 04-18-2022, 03:28 PM   #4
chaley
Grand Sorcerer
chaley ought to be getting tired of karma fortunes by now.chaley ought to be getting tired of karma fortunes by now.chaley ought to be getting tired of karma fortunes by now.chaley ought to be getting tired of karma fortunes by now.chaley ought to be getting tired of karma fortunes by now.chaley ought to be getting tired of karma fortunes by now.chaley ought to be getting tired of karma fortunes by now.chaley ought to be getting tired of karma fortunes by now.chaley ought to be getting tired of karma fortunes by now.chaley ought to be getting tired of karma fortunes by now.chaley ought to be getting tired of karma fortunes by now.
 
Posts: 12,444
Karma: 8012886
Join Date: Jan 2010
Location: Notts, England
Device: Kobo Libra 2
Quote:
Originally Posted by realzi View Post
As far as i understand the plugin iterates through the whole library and makes a list of ISBNs. If the most recent one was already on the list, it gets marked as a duplicate, if it was not, it is put on the list. So far so simple, i just struggle to see a way to compare filesizes in this process.
You can't with the find duplicates plugin. It is possible with Action Chains using a template that for each book searches for others with the same ISBN then for each of the results check for identical book sizes, but the template will be very complicated and the process horribly slow. I am not interested in helping write such a template.

If you really want to do this ( ) then I suggest you make a custom column containing the book format size (you are comparing the same formats?). Then use find duplicates to find ISBN duplicate sets. For each set, sort the booklist by the format file size and manually delete any that your Mark 1 eyeball say are the same size.

Or write an Action Chains template that iterates through the selection sets finding books with the same size and deleting one of them. Personally I think this is madness, but I am not you.

The easiest solution is probably to write a python program that uses the calibre API to do the nested scans, but that is "easiest" only if you are a programmer.
chaley is offline   Reply With Quote
Old 04-19-2022, 04:32 AM   #5
realzi
Junior Member
realzi began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Apr 2022
Device: Amazon kindle
Thx @chaley, you have given me an idea. I can Just save the duplicates into folders, delete them from the library and let a python script or another program do the size comparison. Obviously i will need to have all the ISBNs in Order and all the Metadata downloaded first, which will take quite some time.
realzi is offline   Reply With Quote
Advert
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Finding duplicate using book cover yamusing Library Management 0 08-21-2020 06:54 PM
What applications do you guys use for finding and removing duplicate books? bajillioneer Kobo Reader 5 05-25-2020 04:15 PM
Finding Duplicates: Ignoring certain formats in 'binary compare' ownedbycats Calibre 0 11-16-2018 08:59 PM
Finding Duplicate files without launching Calibre? Dullahir Calibre 8 04-13-2013 12:37 AM
Finding and Deleting Duplicate Files of different formats dpayment General Discussions 19 10-19-2011 03:02 PM


All times are GMT -4. The time now is 01:55 AM.


MobileRead.com is a privately owned, operated and funded community.