Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Library Management

Notices

Reply
 
Thread Tools Search this Thread
Old 06-05-2020, 04:32 AM   #16
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,598
Karma: 28548962
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
I dont follow th espeed of the content server is the same as the spped of the gui they use the same code for actual library operations.
kovidgoyal is offline   Reply With Quote
Old 06-05-2020, 11:13 AM   #17
kjdavies
Zealot
kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.
 
Posts: 112
Karma: 53342
Join Date: Jun 2013
Device: Sony PRS-600
Quote:
Originally Posted by kovidgoyal View Post
I dont follow th espeed of the content server is the same as the spped of the gui they use the same code for actual library operations.
So I would expect. I suspect it had to do it being a network connection (to localhost, but still) rather than direct file copy. I admit I didn't run a long sample, just a handful of files before deciding to go back to what worked faster.

I'll give it another shot after I finish this set.
kjdavies is offline   Reply With Quote
Old 06-05-2020, 11:32 AM   #18
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,598
Karma: 28548962
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Unless your ebooks are really large I dont see that making a significant difference.
kovidgoyal is offline   Reply With Quote
Old 06-05-2020, 02:29 PM   #19
kjdavies
Zealot
kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.
 
Posts: 112
Karma: 53342
Join Date: Jun 2013
Device: Sony PRS-600
Quote:
Originally Posted by kovidgoyal View Post
Unless your ebooks are really large I dont see that making a significant difference.
I was a little surprised myself... but that's what I observed.

I have noticed that the first addition after a large edit (i.e. many entries edited in bulk) tends to be quite a bit slower than usual too.

... for that matter, even the individual additions locally (i.e. via calibredb directly to the library, not the server) are going slow. About the same rate (four or five per minute) as I saw with calibre server. Maybe the other ones were just faster?

Last edited by kjdavies; 06-05-2020 at 02:33 PM. Reason: accidental double post
kjdavies is offline   Reply With Quote
Old 06-05-2020, 05:36 PM   #20
kjdavies
Zealot
kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.
 
Posts: 112
Karma: 53342
Join Date: Jun 2013
Device: Sony PRS-600
Quote:
Originally Posted by kjdavies View Post
I was a little surprised myself... but that's what I observed.

I have noticed that the first addition after a large edit (i.e. many entries edited in bulk) tends to be quite a bit slower than usual too.

... for that matter, even the individual additions locally (i.e. via calibredb directly to the library, not the server) are going slow. About the same rate (four or five per minute) as I saw with calibre server. Maybe the other ones were just faster?
Slow than that, I modified my script so it shows a timestamp as it starts processing each file. Each file sees
  • One file loaded (optionally setting cover to the file itself if loading an image file).
  • Set_metadata publisher (apparently no flag in 'add').
  • Five or six set_custom (depending on if I've got page count)
  • Fetch metadata back via list $id so I can see what got loaded.

Timestamps indicate the script has loaded 32 files in 11:14... slightly more than 20 seconds each. It used to be closer to 4-5 seconds each. Does library size make a difference? It's approaching 10,000 files.
kjdavies is offline   Reply With Quote
Old 06-05-2020, 10:34 PM   #21
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,598
Karma: 28548962
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Additions via calibredb without a server are going to be slow, because the library has to be opened and read into RAM for each invocation.
kovidgoyal is offline   Reply With Quote
Old 06-06-2020, 01:18 PM   #22
kjdavies
Zealot
kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.
 
Posts: 112
Karma: 53342
Join Date: Jun 2013
Device: Sony PRS-600
Quote:
Originally Posted by kovidgoyal View Post
Additions via calibredb without a server are going to be slow, because the library has to be opened and read into RAM for each invocation.
That definitely aligns with what I see.

It never occurred to me that it would do that (I mean, I think I saw that described in documentation, but forgot until you mentioned it).

And changing to query as needed would be a significant architecture change, wouldn't it?

Right now the process via calibredb takes about 21 seconds per title, with having half a dozen calls per addition.

I wonder how difficult it would be to add a couple new parameters to 'calibredb add':
  • -p PUBLISHER, --publisher=PUBLISHER (seems to be missing)
  • -m METADATA, --metadata=METADATA, which can take multiple metadata fields at once, including custom.

METADATA takes the form field:'value',field:'value',*field:'value',*field: 'value' (identifiers:'isbn:#####,asin:#####').

We can mix built-in and custom columns in other contexts, so I hope we can do the same here.

I'd be inclined to add the same -m/--metadata to set_metadata (doesn't affect those who use it today because they're using -f).

$ calibredb set_metadata -m *own:1,*source=OneBookShelf,*filename='somefilenam e.pdf',*filepath='o:/DriveThruRPG/Some Publisher',*filesize=3183845 11345

[hmm, weird. when I preview I see an extra space in 'somefilename.pdf' that I didn't put in]

would let me do all the metadata in one pass... and if the change is made to calibredb add, I can do both steps at once and reduce the entire thing to a single call.

It looks like using an OPF file can reduce the metadata setting to a single call (custom columns look kind of complex in the OPF file). It looks like adding a file won't accept an OPF, though.

Last edited by kjdavies; 06-06-2020 at 01:20 PM. Reason: added 'OPF' comment
kjdavies is offline   Reply With Quote
Old 06-06-2020, 01:46 PM   #23
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,598
Karma: 28548962
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Simply use the server, then the db is not re-opened on each call, it stays opened.
kovidgoyal is offline   Reply With Quote
Old 06-06-2020, 02:08 PM   #24
kjdavies
Zealot
kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.
 
Posts: 112
Karma: 53342
Join Date: Jun 2013
Device: Sony PRS-600
Quote:
Originally Posted by kovidgoyal View Post
Simply use the server, then the db is not re-opened on each call, it stays opened.
I tried that. By the time I load the file and set the metadata, it takes 50 seconds per title.

What I'm doing now takes 20 seconds.

On paper the server should be faster. It takes 2.5 times as long.

It does work. But it's not efficient for my purpose.

(Pretending) I sleep 8 hours a night, I can run my load scripts and load about (8*60*3=) 1440 files per night, and use calibre normally while I'm awake (and since I'm not using calibre while I'm working, I can sneak in another 8+ hours load time per day, so 2,880 files/day).

Or run via content server all day and load (24*60*6/5) 1728 files per day, and be limited to the content server web interface. This might see higher throughput overall if I can run multiple load scripts concurrently, but I haven't proven this does not slow them down.
kjdavies is offline   Reply With Quote
Old 06-06-2020, 02:13 PM   #25
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,598
Karma: 28548962
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
No, I mean use calibredb but connect it to the server. See the first couple of paras at https://manual.calibre-ebook.com/gen...calibredb.html
kovidgoyal is offline   Reply With Quote
Old 06-06-2020, 02:13 PM   #26
kjdavies
Zealot
kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.
 
Posts: 112
Karma: 53342
Join Date: Jun 2013
Device: Sony PRS-600
Thank you, Kovid

Before I say anything else, I'd like to thank you for your time, Kovid, and what you've created in calibre. Considering the size of my libraries -- this thread is mostly talking about only one of them -- I'd be lost trying to manage them.

I admit to some consternation regarding how calibredb performs, and that might be coming through in my writing. For that I apologize.

In the meantime, I'm suggesting things that would make it work better for my purpose (and, I think, for others who use it as I do... which might be a small number of people).

If you point me at the correct place in the code to make these changes, I can see if I can work up a patch. I think the command line changes shouldn't be that difficult (find 'metadata argument', parse metadata elements apart and apply update metadata function multiple times instead of once per calibredb call).
kjdavies is offline   Reply With Quote
Old 06-06-2020, 02:21 PM   #27
kjdavies
Zealot
kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.
 
Posts: 112
Karma: 53342
Join Date: Jun 2013
Device: Sony PRS-600
Quote:
Originally Posted by kovidgoyal View Post
No, I mean use calibredb but connect it to the server. See the first couple of paras at https://manual.calibre-ebook.com/gen...calibredb.html
I did. In both cases I was making the same calibredb calls:
  • Start content server
  • calibredb add <entry parameters here>
  • calibredb set_metadata -f publisher 'publisher name' id
  • calibredb set_custom <field> <id> <value>
  • calibredb set_custom <field> <id> <value>
  • calibredb set_custom <field> <id> <value>
  • calibredb set_custom <field> <id> <value>
  • calibredb set_custom <field> <id> <value>

When "--with-library=n:/libraries/rpg-auto", run time was ~20 seconds per entry.

When "--with-library='http://localhost:8080/#rpg-auto'", run time was ~50 seconds per entry.

It does work. It just works very slowly.

If I can instead do the entire add in one call, or do add and set metadata (built-in and custom) in two calls, I expect I can greatly reduce the time needed.

I'm not an efficiency freak, but I think in a use case like mine it could reduce the run time by a factor of four or five, and could reduce a job that I started last weekend to one day (67 hours runtime down to perhaps 15... not quite 'overnight', but two nights or night-and-while-working).

Last edited by kjdavies; 06-06-2020 at 02:24 PM. Reason: added 'efficiency' comment
kjdavies is offline   Reply With Quote
Old 06-06-2020, 06:12 PM   #28
kjdavies
Zealot
kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.
 
Posts: 112
Karma: 53342
Join Date: Jun 2013
Device: Sony PRS-600
Hang on. I have another way.

This almost certainly invalidates my warranty.

Adding a new title has lots of baggage (copying files, creating/copying cover images, and so on) that I don't want to duplicate. Fair enough, calibredb can take care of that for me.

Setting metadata, on the other hand, is just updating a record (if it's built-in, like publisher) or adding a record (custom column). I don't need calibredb to do that. I can handle that externally.

I know that each script works with only one database at a time. I can do
  • calibredb add
  • set_metadata via script
  • set_custom (multiple) via script
safely enough. Even if the next calibredb loads the entire database and saves the entire database after making changes, the scripted updates/inserts happen before that starts. No chance of having both active concurrently, so no chance of one overwriting the other.

If calibre-server means that each calibredb call is handled (more or less) atomically I might still be able to have two load processes running against the same library (brokered by the server) safely... but I don't count on that. Which is okay, I can have processes loading against different libraries and that's enough for now.
kjdavies is offline   Reply With Quote
Old 06-06-2020, 11:54 PM   #29
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,598
Karma: 28548962
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
First off, I find it extremely hard to believe that calibredb performs worse through the server than without it, I certainly cannot replicate that. Do you perchance have an antivirus/firewall getting in the way?

Secondly you dont need multiple set_custom calls, you can set all metadata with set_metadata, including custom columns in a single call. Simply specify --field multiple times.
kovidgoyal is offline   Reply With Quote
Old 06-07-2020, 02:26 AM   #30
kjdavies
Zealot
kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.kjdavies is no e-book dilettante.
 
Posts: 112
Karma: 53342
Join Date: Jun 2013
Device: Sony PRS-600
Quote:
Originally Posted by kovidgoyal View Post
First off, I find it extremely hard to believe that calibredb performs worse through the server than without it, I certainly cannot replicate that. Do you perchance have an antivirus/firewall getting in the way?

Secondly you dont need multiple set_custom calls, you can set all metadata with set_metadata, including custom columns in a single call. Simply specify --field multiple times.
It could be that other elements are impeding performance, but... yep, about 2-2.5 times as long per title.

And strange, I did try setting multiple metadata (standard and custom) at once, using that syntax... didn't seem to work.

However, I will try again. Perhaps I had it formatted incorrectly.

set_custom doesn't have '--field', though.

calibredb set_custom [options] column id value

I take it I should try

calibredb set_custom [options] [column id value]*
kjdavies is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Support multiple instances of same format in same book entry masp Library Management 3 09-23-2014 10:44 PM
Multiple identical server instances detected didierm Calibre Companion 2 08-17-2014 10:19 AM
Two or multiple instances of Calibre on one computer clockmaker Library Management 2 06-30-2012 01:55 PM
Replace multiple matching instances within paragraph? murphycc Conversion 2 02-23-2012 09:53 AM
Trouble with multiple content server instances perx Calibre 3 02-17-2012 01:24 AM


All times are GMT -4. The time now is 08:56 PM.


MobileRead.com is a privately owned, operated and funded community.