Thanks for the nice tutorial, Manichean!
I've found the following regex snippet very handy for stripping the navigation header and footer from the HTML version of ProGit (http://progit.org/, CC by-nc-sa, saved with wget). You only have to find the start of the header/footer div, then tell the regex how many lines to delete:
Code:
<div id="footer">(\n.+){27}\n.+script>
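Replacing the match with nothing deletes the opening footer div, the 27 lines that follow it, and one more line ending in a closing script tag. Since .+ needs at least one character per line, the count only works if the block contains no blank lines.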
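In case anyone wants to script this instead of doing it in an editor, here is a rough sketch in Python (my choice, nothing above depends on it). The helper name strip_footer and the sample page are made up for illustration; only the pattern itself comes from the snippet above.

```python
import re

# Same pattern as above: the opening footer div, 27 non-empty lines,
# then one more line that ends in "script>".
FOOTER_RE = re.compile(r'<div id="footer">(\n.+){27}\n.+script>')


def strip_footer(html: str) -> str:
    """Delete the first footer block from html (hypothetical helper)."""
    return FOOTER_RE.sub("", html, count=1)


# Made-up stand-in for a saved page: 27 filler lines inside the footer,
# followed by a line that closes with a script tag.
sample = "\n".join(
    ["<body>", "<p>content</p>", '<div id="footer">']
    + [f"<p>nav line {i}</p>" for i in range(27)]
    + ['<script src="x.js"></script>', "</body>"]
)

cleaned = strip_footer(sample)
print(cleaned)
```

The newline in front of the div is not part of the match, so an empty line stays behind where the footer used to be. If your footer has a different number of lines, change the 27 to match.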
HTH