![]() |
#1 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,327
Karma: 5306
Join Date: Jan 2014
Device: none
|
Adding Books to Empty Library
Hi.
Is there any way to speed up the process of adding new books to an empty library? Unfortunately I only have epubs and opf-files and no database. Would it be quicker if I create zip or rar files? They are in folders by authors with multiple epubs & opf-files in each folder. Thanks |
![]() |
![]() |
![]() |
#2 |
hopeless n00b
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,110
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
|
If the files were in Calibre folder structure, you can do a restore database. Alas, that doesn't seem to be the case.
Unfortunately, I think this task is pretty disk bound and I don't believe zip/rar would help. Perhaps don't use the computer and disable antivirus during import? |
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,048
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
I see no real reason to have 'real time scans' on a Library folder.
I agree with ilovejedd, your pretty much IO bound adding. 1)get rid of the A/V scan 2)Have the source on a different drive spindle, to reduce head position latency Is the OPF from a Calibre Library (one opf + title per folder)? That might help. Normal Add, is ~5+ per minute, on basic desktop hardware (no SSD or I7 ) |
![]() |
![]() |
![]() |
#4 |
hopeless n00b
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,110
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
|
|
![]() |
![]() |
![]() |
#5 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,327
Karma: 5306
Join Date: Jan 2014
Device: none
|
Quote:
The OPF are from Calibre already, but it's one folder per author and in that folder the epubs and opf's are all together, to avoid the limit on characters in windows 10. If I were to split everything into separate folders, I guess it would have to be manual and considering how many there are a very large task. |
|
![]() |
![]() |
Advert | |
|
![]() |
#6 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,079
Karma: 14079267
Join Date: Oct 2007
Location: Almere, The Netherlands
Device: Kobo Sage
|
No, it's not *that* bad. With Calibre itself and the source on an SSD and the library on a dedicated 5400 RPM drive, I get about 25 adds/min (older mobile Core i7).
Library size also plays a role, as Calibre does basic dup checking when adding. |
![]() |
![]() |
![]() |
#7 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,327
Karma: 5306
Join Date: Jan 2014
Device: none
|
Could I avoid the duplicates check? Would that make it go faster?
|
![]() |
![]() |
![]() |
#8 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,048
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
I think some of my speed issue is this 'refurbished Core Duo PC' came with 8G of slow end RAM (according to UserBenchmark) , DDR3-1100 instead of 1600 (listed by Crucial). I put in a WD Black drive thinking the drive was the slow part, but saw no obvious improvement. |
|
![]() |
![]() |
![]() |
#9 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,327
Karma: 5306
Join Date: Jan 2014
Device: none
|
It's an empty library I add the books to.
|
![]() |
![]() |
![]() |
#10 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,305
Karma: 10259306
Join Date: May 2016
Device: kobo forma, Kobo Libra, Huawei media Tab, fire HD10, PW3 HDX8.9,
|
just let it run overnight, there is no need to sit and watch it .
that is a old, slow CPU by modern standards. are we talking hundreds of books, or thousands, or tens of thousands ??? |
![]() |
![]() |
![]() |
#11 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,327
Karma: 5306
Join Date: Jan 2014
Device: none
|
I think the biggest library is 60.000 ...
|
![]() |
![]() |
![]() |
#12 | ||
hopeless n00b
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,110
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
|
Quote:
Mind, on a single "slow" SSD (Samsung 840 Basic) and i5-3450S, I get 1 book/second and that's probably only because I call calibredb add --library-path per book which I expect is far less efficient than a normal batch import. No auto-merge/dupe check during import. I just use the Find Duplicates plugin after. Quote:
Yeah, just leave it running overnight. Maybe run them in batches of 5-10K? Last edited by ilovejedd; 10-11-2018 at 10:33 AM. |
||
![]() |
![]() |
![]() |
#13 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,327
Karma: 5306
Join Date: Jan 2014
Device: none
|
Is it possible to avoid the duplicate check when adding books to empty library?
I don't have a checkmark in the duplicates check but when I add new books to an empty library it asks what to do about a bunch of duplicates that aren't really duplicates. It only checks the title and not both title and author or any other columns. Could it be possible to either override this duplicate check or make sure that it compares more than one column to figure out if it's a duplicate. |
![]() |
![]() |
![]() |
#14 | |
hopeless n00b
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,110
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
|
Quote:
I use the Find Duplicates plugin to check for duplicates after import. I've got plenty of fanfics that have changed author pseudonym or title so the url identifier (e.g. https://archiveofourown.org/works/1234567) is the most reliable method of catching duplicates. I was actually quite curious about performance so I ran some import tests using part of my AO3 fanfic library. SSD used is a 500GB Samsung 840 with planar TLC NAND. It's 5-6 years old and 90% full so probably quite slow by modern SSD standards as well as due to normal performance degradation. HDD used is a brand new, empty 1TB 7200RPM Seagate Barracuda (found it in my box of spare parts). Flash drive used is a 128GB Samsung Bar USB 3.1 (connected to USB 3.0 port). Code:
calibredb add --duplicates --recurse --library-path "X:\Calibre Portable\TestLibrary" "X:\ebooks\import" Import Stats mm:ss.00 MB/min books/min SSD to SSD 13:54.10 178 355 SSD to HDD 16:55.52 147 291 HDD to HDD 20:19.20 122 243 Flash to HDD 17:37.79 141 280 Import Structure: \Fandom\Authors\Authors - Title (id).ext 2.422 GB, 979 folders, 24,650 files 4,930 "unique" books (based on checksum) * each book has epub, mobi, txt, opf & cover 2,411 unique titles (based on url identifier) Options: -d, --duplicates Add books to database even if they already exist. Comparison is done based on book titles. -r, --recurse Process directories recursively |
|
![]() |
![]() |
![]() |
#15 |
hopeless n00b
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,110
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
|
P.S.
System runs a Core i5-3450S 2.80 GHz (Ivy Bridge) and 16GB RAM. Operating system (Windows 7 64-bit), pagefile, etc. are all on a separate SSD. Import process was definitely I/O bound. The only time the import managed to max out one core (25% CPU utilization) was during the SSD to SSD import. On all imports to HDD, CPU usage was typically 5-15%. |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Empty file window when adding books...? | patrik | Calibre | 2 | 06-19-2018 12:46 PM |
Empty folders when adding books | LucyOne | Calibre | 0 | 07-15-2017 06:27 AM |
Adding Books to Library | MikeHackney | Calibre | 11 | 10-16-2015 10:47 AM |
Adding books already in library directory but not library database | sfguy5874 | Library Management | 1 | 07-07-2012 04:33 AM |
Adding empty books | vbussell | Calibre | 4 | 12-10-2009 05:42 PM |