Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old Yesterday, 02:31 PM   #1
bernardsirius
Junior Member
bernardsirius began at the beginning.
 
Posts: 1
Karma: 10
Join Date: Aug 2025
Device: iPad
Looking for clean chapter splitting workflow (for custom audiobook creation)

Hello everyone,

I’m working on a personal project: I’d like to create my own audiobooks from DRM-free ePubs, with content I choose, using text-to-speech (Amazon Polly long-form).

I’m a developer, so I’m comfortable with a somewhat clunky workflow that mixes Python scripts and Calibre plugins. I understand this is a messy problem — but here’s what I’ve tried so far:
• I experimented with the EpubSplit plugin, but got frustrated that it “pollutes” my library with all the split parts.
• My goal was to combine EpubSplit with a CLI utility I found on GitHub, epub2txt2, which works nicely once chapters are properly separated.
• The idea: split the ePub → run epub2txt2 on each split ePub → get one clean text file per chapter.
• The next step would be cleaning each chapter of footnotes and other artifacts, which I’m planning to handle with AI.
• Finally, I’ll feed those cleaned chapter texts into Amazon Polly to generate high-quality long-form audio.

So my questions:
• Is there a way to use EpubSplit (or another approach) without cluttering the main Calibre library with all the split sub-books?
• Has anyone built or seen a plugin/workflow specifically for “chapter-per-chapter export” to external files?
• Or is the recommended way to script this entirely outside Calibre, by parsing the ePub directly?

Thanks in advance for any guidance!
bernardsirius is offline   Reply With Quote
Old Yesterday, 04:49 PM   #2
theducks
Well trained by Cats
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 31,116
Karma: 60406498
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
Just U(npack) the EPUB <a built in tool
Then call your magic (unpack opens in file explorer. No need to save/repack in this use case)

But Why? There are quit a few Viewers the have TTS capability and they understand EPUB flow without effort
theducks is offline   Reply With Quote
Advert
Old Yesterday, 06:29 PM   #3
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,797
Karma: 30237628
Join Date: Mar 2012
Location: Sydney Australia
Device: none
If your EPUB uses header tags, do an EPUB->EPUB Conversion with the default settings, especially for Structure detection, the output EPUB will have an .html per chapter. Here's an example

Input - substantive text is in 1 file - Morley_0553.html

Click image for larger version

Name:	input.jpg
Views:	10
Size:	954.2 KB
ID:	217603

Output - substantive text is in 43 files - Morley_0553_split_000.html → Morley_0553_split_042.html

Click image for larger version

Name:	output.jpg
Views:	8
Size:	984.4 KB
ID:	217604

Now you can unpack as previously suggested, perhaps after using the calibre editor to remove the unwanted artefacts.

BR
BetterRed is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Chapter splitting broken? Sarmat89 Sigil 5 08-23-2020 10:52 AM
chapter splitting failure1 Editor 5 08-20-2019 11:16 PM
Automating splitting at Chapter Breaks John Everett Sigil 9 06-01-2010 02:53 AM
html2epub: chapter splitting on more than 1 heading level Portnull Calibre 1 06-25-2009 09:17 AM
Chapter or TOC Creation help needed gandor62 Calibre 4 04-15-2009 02:18 PM


All times are GMT -4. The time now is 01:45 PM.


MobileRead.com is a privately owned, operated and funded community.