View Single Post
Old 02-14-2026, 07:06 PM   #2
theducks
Well trained by Cats
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 31,521
Karma: 62503986
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
Quote:
Originally Posted by wolpi View Post
Hello,

I am trying to extract the text from various ebook formats to corresponding ".txt" files, and want to preserve page/chapter breaks (ideally with the "Formfeed" control character)

I found "ebook-convert"s "--chapter-mark" option with the "pagebreak" parameter, but am unable to figure out from the documentation how this is intended to work and if it might do what I am trying to achieve.

When I run the command like

Code:
ebook-convert "~/Book.epub" "~/Book.txt" --chapter-mark pagebreak
the command output shows the chapters detected in the Book.epub file, but in the generated Book.txt I can find nothing that can be interpreted as a pagebreak indicator.

This is running Calibre 9.2.1 on macOS 15

Thanks!
Control auto-detection of document structure.
I believe this is for the Input (source)

You selected .txt as an output .

Quote:
The form feed character is a control character in ASCII and Unicode, historically used to tell printers to advance to the start of the next page.

Key Details:
ASCII Code: Decimal 12, Hex 0x0C, Unicode U+000C
Escape Sequence in C/Java/Python: \f
Purpose:
In printers: Moves the print head to the top of the next page.
In text files: Acts as a page break or section separator.
theducks is offline   Reply With Quote