I have a few books in HTML format, each with a separate .html page per chapter. I am trying to convert them to EPUB and remove the headers and footers, but I am really stuck on the regex.
The header and footer look like this in the .html file:
<a href="Chapter19.html">previous</a> |
<a href="-Book Title-.html">Table of Contents</a> |
Obviously, the ChapterXX.html link will increment or decrement depending on the current page. Could someone help me construct a regex to remove these headers and footers, please? (The header and footer contain identical code, i.e. the same links appear at the top and bottom of each page.)
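
For illustration, here is a minimal Python sketch of one way such a regex could look. It assumes every navigation link has exactly the form shown above (an `<a href="....html">` tag followed by an optional `|`). The names `NAV_LINK` and `strip_navigation` are made up for this example, and the "next" label is a guess, since only "previous" and "Table of Contents" appear in the sample. Matching on the link text rather than on the file name means the changing chapter number does not matter.

```python
import re

# One navigation link, e.g.  <a href="Chapter19.html">previous</a> |
# Assumption: "next" is a guessed label; the sample only shows
# "previous" and "Table of Contents".
NAV_LINK = re.compile(
    r'<a\s+href="[^"]*\.html">\s*'
    r'(?:previous|Table of Contents|next)'
    r'\s*</a>\s*\|?[ \t]*\n?',
    re.IGNORECASE,
)


def strip_navigation(html: str) -> str:
    """Remove every navigation link (header and footer) from one page."""
    return NAV_LINK.sub("", html)


# The header and footer are identical, so one pattern covers both.
sample = (
    '<a href="Chapter19.html">previous</a> |\n'
    '<a href="-Book Title-.html">Table of Contents</a> |\n'
    '<p>Chapter text.</p>\n'
    '<a href="Chapter19.html">previous</a> |\n'
    '<a href="-Book Title-.html">Table of Contents</a> |\n'
)
cleaned = strip_navigation(sample)
print(cleaned)
```

Regexes on HTML are fragile, so this only holds while the markup stays as regular as in the sample; if the real pages have extra attributes or different link text, the pattern would need adjusting.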