With Project Gutenberg books I merge all of the chapters together into one file, then figure out what excess crud can be removed and remove it, then add a sigil split marker in front of each chapter heading with a regular expression replace, then use the split on split marker command to split it properly.
|