MobileRead Forums - View Single Post - [Plugin] TTS

Doitsu · 05-13-2018, 07:29 AM

Released: May 13, 2018
Current Version: "0.1.0"

This plugin allows you to generate MP3 files of all files selected in the Book View window. (To select all files, click the Text folder.)

Note that this plugin has not been optimized for speed. If you want to generate mp3 files for a whole book, use jackie_w's TTS to MP3 Calibre plugin, which is two to three times faster than my plugin.

Credits: This plugin uses Joshua Milas's SAPI5 TTS library for generating WAV files and LAME for generating MP3 files.

System requirements

This plugin requires Windows 7 or higher and Sigil 0.8.9 or higher. If you're using an external Python 3 interpreter, you'll need to install bs4, tts and comtypes. (Note that this plugin does not work with Linux and macOS machines.)

Installation

1. Select Manage Plugins from the Plugins menu. Make sure that Use Bundled Python is checked. (If you're using an older Sigil version, select a Python 3 interpreter and install bs4, tts and comtypes.)
2. Click Add Plugin and select TTS_v0.1.0.zip. This will install the TTS plugin, which you can select via Plugins > Output > TTS.

Settings

The plugin doesn't come with a settings dialog box. However, advanced users can change some settings by adding/editing the following TTS.json entries.

Code:

 "rate": 0,

Add this entry with an integer value between -10 and 10 to modify the speech rate (default = 0).

Code:

  "desktop_folder": "C:\\MyFolder",

Change this entry, if you don't want the plugin to write MP3 files to the Desktop folder. (Note that you'll have to use two backslashes.)

Installing additional TTS voices

Advanced Windows users can install additional Catalan, Danish, German, English, Spanish, Finnish, French, Italian, Japanese, Korean, Norwegian, Dutch, Polish, Portuguese, Russian, Swedish and Chinese Microsoft Speech Platform voices. For details see this website.

If you have Windows 10 you can also enable hidden Cortana voices, e.g., Microsoft Mark and Microsoft Eva. For more information see this website. Make sure to check the following registry key before importing the registry files provided by the website!

Code:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Speech_OneCore\Voices\Tokens

License: GNU General Public License v3 (GPL-3)