Well, I'm not DarkKipper, but here are a few regular expressions I use. They have worked on my test files, but could probably be improved or modified:
Delete header/footer that starts with "file///" and ends with either ".txt" or ".htm" or "html"
Delete line that starts with "file///" and ends with numbers
Combine the two above
Delete a segment of a line in which the segment ends with a specific string
.* - Baroness Orczy
(the " - Baroness Orczy" is in the line)
Here is one that seems to work, but might need a bit of tweaking. It looks for EITHER a line that starts with "file:///" and ends with numbers, OR a line that starts with a specified string, and deletes the found string. Quite handy when looking for headers / footers that may vary somewhat across a subdirectory
Header with "Generated By ABC ... etc .html (the ABC Amber header)
Google "The Regex Coach" for a very nice freeware that is extremely helpful in designing regexes.
Hope these help!