03-29-2010, 10:09 PM   #1
Solicitous
Header/Footer removal

Hi All,

I have a few books in .HTML format, where each book has a separate .HTML page for each chapter. I am trying to convert these to .EPUB and remove the headers/footers along the way, but I am really stuck on the regex.

The headers and footers look like this in the .HTML files:

Quote:
<hr>
<p class="nav">
<a href="Chapter19.html">previous</a> |
<a href="-Book Title-.html">Table of Contents</a> |
<a href="Chapter21.html">next</a></p>
<hr>
So obviously the ChapterXX.html links will increment/decrement depending on the current page. Is someone able to help me construct a regex to remove those headers/footers, please? (The header and footer have identical code, i.e. the top and bottom of each page have identical links.)
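
To make the goal concrete, here is a minimal sketch in Python of the sort of pattern I have in mind. It assumes every header/footer has exactly the shape quoted above: an <hr>, then one <p class="nav"> paragraph, then another <hr>. The names NAV_BLOCK and strip_nav are made up for illustration. The pattern does not look at the link targets at all, so the changing chapter numbers should not matter.

```python
import re

# Matches: <hr>, one <p class="nav">...</p> paragraph, then <hr>.
# Assumption: the nav paragraph never contains a nested </p>.
NAV_BLOCK = re.compile(
    r'<hr\s*/?>\s*'                 # opening rule (<hr>, <hr/> or <hr />)
    r'<p\s+class="nav">'            # start of the navigation paragraph
    r'(?:(?!</p>).)*'               # anything up to the first closing </p>
    r'</p>\s*'
    r'<hr\s*/?>\s*',                # closing rule plus trailing whitespace
    re.IGNORECASE | re.DOTALL,
)


def strip_nav(html: str) -> str:
    """Return the page with every header/footer nav block removed."""
    return NAV_BLOCK.sub("", html)
```

Calling strip_nav on the text of each chapter file should drop both the top and the bottom block in one pass, since they are identical. A regex is only a reasonable tool here because the blocks are so regular; if the markup varies between books, a real HTML parser would be the safer route.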