Released: May 13, 2018
Current Version: "0.1.0"
This plugin allows you to generate MP3 files of all files selected in the Book View window. (To select all files, click the Text folder.)
Note that this plugin has not been optimized for speed. If you want to generate mp3 files for a whole book, use jackie_w's TTS to MP3 Calibre plugin, which is two to three times faster than my plugin.
Credits: This plugin uses
Joshua Milas's SAPI5 TTS library for generating WAV files and
LAME for generating MP3 files.
System requirements
This plugin requires Windows 7 or higher and Sigil 0.8.9 or higher. If you're using an external Python 3 interpreter, you'll need to install bs4, tts and comtypes. (Note that this plugin does not work with
Linux and
macOS machines.)
Installation
1. Select Manage Plugins from the Plugins menu. Make sure that
Use Bundled Python is checked. (If you're using an older Sigil version, select a Python 3 interpreter and install bs4, tts and comtypes.)
2. Click Add Plugin and select TTS_v0.1.0.zip. This will install the TTS plugin, which you can select via Plugins >
Output > TTS.
Settings
The plugin doesn't come with a settings dialog box. However, advanced users can change some settings by adding/editing the following
TTS.json entries.
Add this entry with an integer value between -10 and 10 to modify the speech rate (default = 0).
Code:
"desktop_folder": "C:\\MyFolder",
Change this entry, if you don't want the plugin to write MP3 files to the Desktop folder. (Note that you'll have to use two backslashes.)
Installing additional TTS voices
Advanced Windows users can install additional Catalan, Danish, German, English, Spanish, Finnish, French, Italian, Japanese, Korean, Norwegian, Dutch, Polish, Portuguese, Russian, Swedish and Chinese Microsoft Speech Platform voices. For details see
this website.
If you have Windows 10 you can also enable hidden Cortana voices, e.g., Microsoft Mark and Microsoft Eva. For more information see
this website. Make sure to check the following registry key before importing the registry files provided by the website!
Code:
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Speech_OneCore\Voices\Tokens
License: GNU General Public License v3 (GPL-3)