If you want speed in a process that is started up for every file, you dont want to implement things in python, the startup cost of python processes is too high. And note that reading metadata from ebook files actually does require things like Qt. For example, for some epub files, the only way to get the cover is to render the first html file in the epub, for which you need a browser (Qt WebKit). Similarly, for PDF getting the cover requires rendering the first page of the PDF.
If you do want to persist with using python, then you might as well just use ebook-meta.exe you wont gain very much in the way of performance by extracting the metadata reading code from calibre into a separate python project. But if you want to do that, the metadata reading code for all the various formats is in metadata/*.py (although that code depends on facilities provided by various parts of calibre so extracting it wont be trivial).
|