MobileRead Forums - View Single Post

Solicitous · 03-29-2010, 11:09 PM

Hi All,

I have a few books in .HTML format, with each book having individual .HTML pages for each chapter. Now I am trying to convert these to .EPUB and remove the headers/footers and am really stuck with regex.

The header and footers look like this from the .HTML file.

Quote:

<hr>
<p class="nav">
<a href="Chapter19.html">previous</a> |
<a href="-Book Title-.html">Table of Contents</a> |
<a href="Chapter21.html">next</a></p>
<hr>

So obviously the ChapterXX.html will increment/decrement depending on the current page. Is someone able to help me construct a regex to remove those headers/footers please? (both the header and footer have identical code ie: top and bottom of the page had identical links).