Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre > Plugins

Notices

Reply
 
Thread Tools Search this Thread
Old Yesterday, 07:47 PM   #1
RelUnrelated
Junior Member
RelUnrelated is on a distinguished road
 
RelUnrelated's Avatar
 
Posts: 9
Karma: 64
Join Date: Apr 2014
Location: Homer, Illinois, US
Device: Kindle Paperwhite 2
[GUI Plugin] AI Vision Metadata

[Plugin] AI Vision Metadata

Overview
AI Vision Metadata is a custom calibre plugin designed to automate the extraction of metadata from publication covers. By leveraging state-of-the-art AI vision models, it analyzes cover art to identify specific issue numbers, publication dates, publishers, and creators, making it an invaluable tool for cataloging comics, magazines, and vintage periodicals.

Originally built around Google's Gemini API, the plugin has evolved into a robust, multi-agent engine tailored for power users.

Key Features
  • Multi-Provider Routing: Seamlessly switch between cloud-based AI models (Google Gemini, OpenAI, Anthropic) or route requests to your own local, offline models using Ollama or LM Studio.
  • Sequential Batch Processing: Select multiple publications at once. The plugin intelligently queues the requests in the background, preventing rate-limit bans and UI lockups.
  • Side-by-Side Review GUI: Never fly blind. The plugin presents a crisp, scaled thumbnail of the cover image right next to the extracted metadata, allowing you to easily verify the AI's accuracy.
  • Isolated Memory Banks: The configuration menu securely remembers your distinct API keys, model selections, and custom system prompts for every individual provider.
  • Advanced Prompt Tuning: Directly edit the AI's core instructions to fine-tune extraction behavior for the unique quirks of your specific collection.
  • Thread-Safe Architecture: Background processing ensures your main calibre window never freezes, while gracefully catching and reporting network or API errors.

Installation
  1. Download the attached AI_Vision_Metadata_v1.0.0.zip file at the bottom of this post. (Do not unzip this plugin file).
  2. Open calibre and click on Preferences (the gear icon) in the top toolbar.
  3. Under the "Advanced" section, click on Plugins.
  4. Click the Load plugin from file button in the bottom right corner.
  5. Navigate to and select the zip file.
  6. Click Yes to accept the security warning and install the plugin.
  7. Restart calibre for the changes to take effect.

Configuration
Before using the tool, you must configure it with an API key or a local server address.
  1. Go to Preferences > Plugins and locate AI Vision Metadata under the User interface action category. Double-click to open the configuration window.
  2. AI Provider: Select your preferred AI engine from the dropdown (Google Gemini, OpenAI, Anthropic, or Local). The UI will dynamically update to show the settings for that specific provider.
  3. API Key / Local URL: Paste your API key for the selected cloud provider. If using a local model, ensure your Local Base URL is correct (e.g., http://localhost:11434 for Ollama).
  4. Model Name: Click Fetch Available Models to populate the dropdown menu directly from your chosen provider, then select the specific model you wish to use.
  5. System Prompt (Advanced): You can safely tweak the AI's core instructions here. Every provider remembers its own prompt.
  6. Click Apply or OK to save.

Usage
Once configured, the plugin integrates seamlessly into your standard calibre workflow.
  1. Select Publications: Highlight one or more entries in your calibre library that have cover images. (Batch processing is fully supported).
  2. Trigger the Plugin: Click the AI Vision Metadata button in your main toolbar, or right-click the highlighted books and select it from the context menu.
  3. Wait for Processing: The plugin runs in a safe background thread. It will analyze the first image and compile the data.
  4. Review the Data: A "Review AI Metadata" window will appear, featuring the cover image on the left and the extracted data on the right.
    • Action Indicators: Every field includes a muted sub-label (e.g., Replaces, Merges, Appends) so you know exactly whether checking the box will overwrite your existing calibre data or safely add to it.
    • Checkboxes: Use the checkboxes to select exactly which fields you want to import. Unchecked fields will be ignored, preserving your existing calibre database entries.
    • Editable Dropdowns: Fields like Series Index offer auto-generated formats, but you can manually type directly into the box for edge cases.
  5. Apply & Auto-Advance: Click OK to save the checked metadata directly to calibre. If you selected multiple books, the plugin will seamlessly load the next cover in your queue and begin processing it immediately.

Provider Setup Guide
To use the cloud features of this plugin, you will need to generate an API key from your preferred provider. Treat these keys like passwords.

Google Gemini (Recommended for Free Tier)
  • Navigate to Google AI Studio to generate a free API key.
  • Note on Limits: gemini-2.0-flash offers generous free daily quotas. Using gemini-2.5-pro for complex covers and deep web searching is highly recommended, but it requires adding a billing profile to your Google Cloud account to lift strict rate limits.

OpenAI (ChatGPT)
  • Navigate to the OpenAI Platform to generate a key.
  • Requirements: OpenAI no longer offers free API grants. You must add prepaid credits (minimum $5) to your developer dashboard for the API to process requests.

Anthropic (Claude)
  • Navigate to the Anthropic Console to generate a key.
  • Requirements: Like OpenAI, Anthropic requires you to load prepaid credits to your account before API requests will be authorized (otherwise you will receive an immediate HTTP 400 error).

Local Models (Ollama / LM Studio)
  • You can run vision-capable models (like llava) completely offline on your own hardware.
  • Download Ollama or LM Studio. Make sure your local server is running, verify the Base URL in the plugin settings, and fetch the models you have downloaded.

License
This plugin is licensed under the GNU GPL v3.

Version History
Spoiler:
  • v1.0.0 - Initial public release. Features multi-provider AI routing, background batch processing, and the side-by-side review GUI.
  • v0.9.0 - Pre-release testing, architecture validation, and UI stabilization.
Attached Thumbnails
Click image for larger version

Name:	Screenshot at 2026-03-25 18-36-27.png
Views:	8
Size:	294.3 KB
ID:	222060  
Attached Files
File Type: zip AI_Vision_Metadata_v1.0.0.zip (41.7 KB, 3 views)
RelUnrelated is offline   Reply With Quote
Old Yesterday, 08:04 PM   #2
RelUnrelated
Junior Member
RelUnrelated is on a distinguished road
 
RelUnrelated's Avatar
 
Posts: 9
Karma: 64
Join Date: Apr 2014
Location: Homer, Illinois, US
Device: Kindle Paperwhite 2
Reserved for future test versions
RelUnrelated is offline   Reply With Quote
Advert
Reply

Tags
artificial intelligence, cover, metadata, periodicals

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
[GUI Plugin] ePub Extended Metadata un_pogaz Plugins 34 11-17-2025 02:38 AM
[GUI Plugin] Embed Comic Metadata dickloraine Plugins 116 11-02-2025 07:01 AM
[GUI Plugin] Zotero Metadata Importer DaltonST Plugins 295 03-22-2025 09:47 AM
Fetch Audio Metadata (GUI Plugin) PetrusV Plugins 0 01-24-2025 08:36 AM
[GUI Plugin] Clean Metadata WS64 Plugins 28 01-06-2022 08:09 PM


All times are GMT -4. The time now is 02:02 AM.


MobileRead.com is a privately owned, operated and funded community.