The person who built the PDF I'm using used a trial version of some XML formatter which spits out some text on every page, but this is hidden in the PDF, but when I convert to ePUB it shows up. I figured I could just remove this using a RegEx on the Header/Footer, but no luck.
Code:
String:
<a href="http://www.antennahouse.com">Antenna House XSL Formatter (Evaluation) http://www.antennahouse.com</a><br>
RegEx:
<a href="http://www.antennahouse.com">Antenna House XSL Formatter (Evaluation) http://www.antennahouse.com</a><br>
I'd also like to remove page numbers and page titles, here's an example
Code:
String:
<A name=13></a><IMG src="index-13_1.jpg"><br>Title <br>11 <br>
RegEx:
<A name=[0-9][0-9][0-9]></a><IMG src="index-[0-9][0-9][0-9]_1.jpg"><br>Title <br>[0-9][0-9][0-9] <br>
Did I completely misunderstand how regular expressions work?