Working off of earlier regex expressions I'm still having a problem as many others are removing a page number at the bottom of a pdf file when converting. I am converting to mobi for a kindle and have used the following regex to remove it.
My test output was as follows:
Last line of text on page. <br>
3<br>
So I took the preceding expression of (<p>\s*\d+\s*<p>)
and changed it to (<br>\s*\d+\s*<br>)
when tested the <br>3<br> became highlighted.
When converted it appears to have worked. Will need to scan through text to verify though.
Hope this helps anyone else trying to fix this issue.
Eric.
|