Hi Everybody:
I stumbled on a great little application written by Duncan Jauncey that removes the embedded line-returns in a Project Gutenberg text file.
http://www.duncanjauncey.com/gutenberg.html
Update: Thursday, March 18, 2010
I ran across some other issues with a Project Gutenberg text file ... and decided to work up my own, expanded, script to handle embedded line returns, starting tabs, insert & remove blank lines, etc. The "GrannyFix" page is 100% HTML and JavaScript, so it runs on your own computer. I've pasted in the entire "Pride & Prejudice" text and the textarea handles it OK.
Because I'm using Regular Expressions to comb through the file, it's a lot faster than the Jauncey script. Future development plans are to (1) tighten up the regex and (2) add more options as I work with other Gutenberg files.
Link: www.rocketgranny.com/freebies/GrannyFix.html
PS: Right-click the page to view source, copy and paste to your own system. Thanks.