|  08-21-2014, 09:06 AM | #1 | 
| Member  Posts: 14 Karma: 10 Join Date: Aug 2014 Device: Kobo | 
				
				bulk operations - a couple of questions
			 
			
			Hi there!  I'm brand new here, so hopefully I'm posting this in the right place. I'm using Calibre (huge love!) to manage a very large collection (100,000 books, about 100 GB) and I've run into a couple of problem areas. First, with bulk cover and metadata downloads, I run into a python error when I try to update 4000 books at a time. If I break it up and do batches of say 1000 at a time, it seems ok. Is there any way to overcome this limitation? I have been processing in batches like this, but it would be way less labor intensive to do them all at once. Second issue is copying from one library to another seems to slow down dramatically as the job progresses. The first few books copy really quickly, but as time goes on, I notice it taking longer and longer to process each book. Copying 4000 books takes many hours. Is that normal, and is there any way to speed it up? | 
|   |   | 
|  08-21-2014, 09:57 AM | #2 | 
| Well trained by Cats            Posts: 31,240 Karma: 61360164 Join Date: Aug 2009 Location: The Central Coast of California Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A | 
			
			Hitting cover or metadata servers with huge number of requests is A NOT NICE practice to do to someone who provides us with free access to their metadata.   We have been blocked before for overwhelming servers that were simply intended for human users. IMHO limit requests (20-30 at a time), there are millions of other Calibre users that need access to that metadata | 
|   |   | 
|  08-21-2014, 01:53 PM | #3 | 
| Member  Posts: 14 Karma: 10 Join Date: Aug 2014 Device: Kobo | 
			
			I appreciate your input.  I had no idea that this kind of thing was considered bad practice.  I'd appreciate any links where I can read about it further. In any case, back to the point at hand, my issues are more with the Calibre software itself. I was hoping to understand better how Calibre is behaving during these bulk operations, particularly the bulk library-to-library copy where the performance is degrading over time. Do you have any insights that might help me speed things up? | 
|   |   | 
|  08-21-2014, 07:42 PM | #4 | 
| null operator (he/him)            Posts: 21,997 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | 
			
			What OS are you using?  What are your h/w specs?   You might want to consider putting the databases onto a PcIE SSD drive, and the library/author/book folders onto SATA 3.0 BR | 
|   |   | 
|  08-22-2014, 08:49 AM | #5 | 
| Member  Posts: 14 Karma: 10 Join Date: Aug 2014 Device: Kobo | 
			
			I'm pursuing some hardware optimizations to try to improve performance, but I guess I was more hoping to catch a developer's attention to try to understand why the speed of the copy operation would degrade with each successive operation.  The first copies take probably less than half a second per book.  As time goes on, each book can take upwards to 5-10 seconds to process.
		 | 
|   |   | 
|  08-22-2014, 09:00 AM | #6 | 
| creator of calibre            Posts: 45,594 Karma: 28548962 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			Because every copy has to check for duplicates against all existing books. As you copy more books the list of books to check against becomes longer.
		 | 
|   |   | 
|  08-22-2014, 02:09 PM | #7 | |
| Member  Posts: 14 Karma: 10 Join Date: Aug 2014 Device: Kobo | Quote: 
 From my observation, each copy job starts fast and slows down as the job progresses. When I start a second job, the next job starts out fast and slows down. If it was the dupe checking, I'd think each job would get progressively slower than the one before it, which doesn't seem to be the case. For every job, the last 5% takes 10x longer than the first 5%. This is when copying a library of about 4000 titles into one that has about 100,000. Hopefully that makes sense. Any insight on the python error when trying to download metadata for 4000+ titles? I'm guessing I'm hitting an maximum within the software for a list of books to update. | |
|   |   | 
|  08-22-2014, 02:20 PM | #8 | 
| creator of calibre            Posts: 45,594 Karma: 28548962 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			It would help if you posted the actual error message.And the suplicate check has to check not just the books in the library but also any previous dupes from the current job.
		 | 
|   |   | 
|  08-23-2014, 10:06 AM | #9 | 
| Junior Member  Posts: 2 Karma: 10 Join Date: Aug 2014 Device: Nook HD+ | 
				
				adding to library
			 
			
			i have a similar issue. calibre will max out my cpu (I7-3930K) and memory (32GB). the last effort i killed it when memory was at 99%. the transfer gets to 78% then stops. i have to use task manager to kill the process because trying to use the abort button does nothing. i am thinking that perhaps the size i am trying to add (254GB) may be overwhelming the program.    | 
|   |   | 
|  08-23-2014, 05:23 PM | #10 | |
| Member  Posts: 14 Karma: 10 Join Date: Aug 2014 Device: Kobo | Quote: 
 If I manage to generate the python error again, I'll start a new thread and post it there. Thanks again! | |
|   |   | 
|  08-23-2014, 07:20 PM | #11 | 
| Well trained by Cats            Posts: 31,240 Karma: 61360164 Join Date: Aug 2009 Location: The Central Coast of California Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A | 
			
			Preferences: Adding books: (the various rules too)
		 | 
|   |   | 
|  08-23-2014, 11:46 PM | #12 | 
| creator of calibre            Posts: 45,594 Karma: 28548962 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			Not for a currently running job, no.
		 | 
|   |   | 
|  08-25-2014, 08:30 PM | #13 | 
| Member  Posts: 14 Karma: 10 Join Date: Aug 2014 Device: Kobo | |
|   |   | 
|  08-30-2014, 02:55 AM | #14 | |
| Zealot            Posts: 116 Karma: 48334 Join Date: Sep 2013 Location: Australia Device: Fire HD 10 | 
				
				Huge Requests to Metadata Servers
			 Quote: 
 | |
|   |   | 
|  | 
| Thread Tools | Search this Thread | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Bulk search and replace operations - question | SFD1968 | Calibre | 1 | 03-01-2013 09:23 AM | 
| PRS-T1 questions on collections and slow operations | Jade Aislin | Sony Reader | 9 | 08-14-2012 01:49 PM | 
| A couple of questions | sadievan | Apple Devices | 7 | 06-05-2012 10:28 AM | 
| Couple of questions... | Diegan | HanLin eBook | 8 | 01-23-2009 04:57 PM | 
| Couple Questions | d10sfan | Bookeen | 9 | 12-03-2007 05:17 AM |