OK!
I have tested it on around 100 books and I am quite impressed.
I let it work on some books from my Baen monthly bundles, some non-fiction PDFs and some random files, and the organize-ebooks.sh script did a great job. I will definitely start using this to preprocess all my ebook files before importing them into calibre!
I had the script place books it was uncertain about in a specific folder. Only one book, a comic book I have no idea where I got, was placed there. But it was correctly renamed...
However, I never got -oft (output filename template) working. I tried quoting it in many different ways, including copying straight from the docs, but it never worked. So all the books are in a single folder. If you, sealbeater, could post a snippet showing how to get organize-ebooks.sh to use -oft and/or output books to subfolders, it would be great!
A few caveats: The scripts are intended for actual published books with an ISBN. Fan fiction or self-published stuff would, I suspect, be very hit-and-miss depending on how metadata is stored in the file.
Specify -owi to attempt to organize books without ISBN.
The WorldCat ISBN calibre plugin had to be rezipped with a shallower folder structure before it could be installed in calibre. It also has a limit of 1000 lookups per day. Fine for my needs...
Tesseract (the OCR software) in Ubuntu 18.04 is the latest version, 4.0, so it can be installed from the normal repositories. Seems to OCR fine!
If you have the source books in subfolders, the now-empty subfolders are left behind when the renamed books are moved out of them.
Some familiarity with Linux and Bash scripting is needed.
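For the leftover empty source subfolders mentioned above, something like the following might do as a cleanup step afterwards. This is only a rough sketch, assuming GNU find (as shipped with Ubuntu). The function name remove_empty_subfolders and the example path are made up for illustration and are not part of the organize-ebooks.sh scripts.

```shell
#!/usr/bin/env bash
# Hypothetical helper, NOT part of the ebook scripts.
# Removes empty directories below the given source folder,
# but keeps the source folder itself.
remove_empty_subfolders() {
    local src="$1"
    # -mindepth 1 skips $src itself; -delete implies depth-first
    # traversal, so nested empty folders are removed bottom-up.
    find "$src" -mindepth 1 -type d -empty -delete
}

# Example call (placeholder path, adjust to your own source folder):
# remove_empty_subfolders "$HOME/ebooks/unsorted"
```

Folders that still contain any file are left alone, so it should be safe to run on a source folder where some books were skipped.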
But, as said, I am very pleased. Thanks to sealbeater for making me look at these scripts a little closer. I've seen them before, but not tried them.
Just point the script at a folder with books and have it chug away. It takes a few seconds per book unless the book has to be OCR'ed.
Then import to calibre. I still have to manually organize by genre... and perhaps also download covers.
Last edited by Adoby; 08-09-2018 at 11:23 AM.