|
![]() |
|
Thread Tools | Search this Thread |
![]() |
#1 |
Junior Member
![]() Posts: 1
Karma: 10
Join Date: Aug 2025
Device: iPad
|
Looking for clean chapter splitting workflow (for custom audiobook creation)
Hello everyone,
I’m working on a personal project: I’d like to create my own audiobooks from DRM-free ePubs, with content I choose, using text-to-speech (Amazon Polly long-form). I’m a developer, so I’m comfortable with a somewhat clunky workflow that mixes Python scripts and Calibre plugins. I understand this is a messy problem — but here’s what I’ve tried so far: • I experimented with the EpubSplit plugin, but got frustrated that it “pollutes” my library with all the split parts. • My goal was to combine EpubSplit with a CLI utility I found on GitHub, epub2txt2, which works nicely once chapters are properly separated. • The idea: split the ePub → run epub2txt2 on each split ePub → get one clean text file per chapter. • The next step would be cleaning each chapter of footnotes and other artifacts, which I’m planning to handle with AI. • Finally, I’ll feed those cleaned chapter texts into Amazon Polly to generate high-quality long-form audio. So my questions: • Is there a way to use EpubSplit (or another approach) without cluttering the main Calibre library with all the split sub-books? • Has anyone built or seen a plugin/workflow specifically for “chapter-per-chapter export” to external files? • Or is the recommended way to script this entirely outside Calibre, by parsing the ePub directly? Thanks in advance for any guidance! |
![]() |
![]() |
![]() |
#2 |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,116
Karma: 60406498
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Just U(npack) the EPUB <a built in tool
Then call your magic (unpack opens in file explorer. No need to save/repack in this use case) But Why? There are quit a few Viewers the have TTS capability and they understand EPUB flow without effort |
![]() |
![]() |
Advert | |
|
![]() |
#3 |
null operator (he/him)
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 21,797
Karma: 30237628
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
If your EPUB uses header tags, do an EPUB->EPUB Conversion with the default settings, especially for Structure detection, the output EPUB will have an .html per chapter. Here's an example
Input - substantive text is in 1 file - Morley_0553.html Output - substantive text is in 43 files - Morley_0553_split_000.html → Morley_0553_split_042.html Now you can unpack as previously suggested, perhaps after using the calibre editor to remove the unwanted artefacts. BR |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Chapter splitting broken? | Sarmat89 | Sigil | 5 | 08-23-2020 10:52 AM |
chapter splitting | failure1 | Editor | 5 | 08-20-2019 11:16 PM |
Automating splitting at Chapter Breaks | John Everett | Sigil | 9 | 06-01-2010 02:53 AM |
html2epub: chapter splitting on more than 1 heading level | Portnull | Calibre | 1 | 06-25-2009 09:17 AM |
Chapter or TOC Creation help needed | gandor62 | Calibre | 4 | 04-15-2009 02:18 PM |