05-24-2010, 09:44 AM   #37
Starson17
Quote:
Originally Posted by vinco
I need to come up with a regex to detect and remove page numbers from the bottom of PDF pages to convert to Epub for nook usage. The page numbers translate over as bolded, with a paragraph break after them. The HTML code I'd like to remove is (page numbers indicated below by ###)

<b>Page ###</b></p><p>

Thanks for the help.
Try this:
Code:
<b>Page \d+.*?<p>
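If you want to check the pattern before running a conversion, here is a rough sketch in Python. Note that the digit class is written with a backslash (`\d`), and that I've used a non-greedy `.*?` so a match stops at the first `<p>` after the page number rather than the last one on the line. The sample HTML, the page number 12, and the `strip_page_numbers` helper are all made up for illustration; your converter's actual output may differ.

```python
import re

# Pattern for a bolded page number followed by a paragraph break.
# \d+ matches the page number; .*? (non-greedy) stops at the first <p>.
PAGE_NUMBER = re.compile(r"<b>Page \d+.*?<p>")


def strip_page_numbers(text):
    """Remove every '<b>Page ###</b></p><p>' run from text (hypothetical helper)."""
    return PAGE_NUMBER.sub("", text)


# Hypothetical sample of converted PDF output.
html = "<p>end of one page <b>Page 12</b></p><p>start of the next page</p>"
print(strip_page_numbers(html))
```

Removing the match also removes the paragraph break, so the text on either side of the page number ends up joined into one paragraph.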