Thanks a lot for the help. I like the way that GuteBook leaves all of the source files and the batch file so that the user can manipulate these and rebuild the output files if necessary.
The line of text indicating the end of a PG header always seems to be a standardized "*** START OF THIS PROJECT GUTENBERG EBOOK..." and it is no different in the Schopenhauer file, so it's a mystery why Gutenmark mishandled this one. Anyway, from now on if something seems missing from the output file I will either rebuild the output using the method you suggested (if the file is too long to re-download) or re-run GuteBook using the "Keep PG Header" option and manually removing the header later.
Thanks again for all your efforts.
Last edited by Stephanos; 08-24-2009 at 09:05 AM.
|