MobileRead Forums - View Single Post - Need Help with Search and Replace please!

theducks · 07-28-2012, 07:08 AM

Quote:

Originally Posted by worley

Hello guys,

Forgive my ignorance, but I need someone's help please. I don't know anything about html. I'm trying to remove the headers from a PDF ebook. I used the following expression to remove the header from page 6, which is the page where the headers begin.

 <hr> <A name=6></a>6 • J o ã o G u i m a r ã e s R o s a 

The Expression above obviously only found 1 match in the file. The following one (marked in red);

Utilizamos ainda outras edi- ções tanto para corrigir variações indevidas como para insistir em outras. Essas grafias em desuso podem parecer simplesmen- te uma questão de atualização ortográfica, mas, se essa atualiza- ção já era exigida pela norma quando da publicação dos livros e <hr> <A name=6></a>6 • J o ã o G u i m a r ã e s R o s a de suas várias edições durante a vida do autor, partimos do prin- cípio de que elas são provavelmente intencionais e devem, por- tanto, ser mantidas. Para justificar essa decisão, lembramos aos leitores que as antigas edições da obra de Guimarães Rosa apre- sentavam uma nota alertando justamente para a grafia persona- líssima do autor e que algumas histórias registram a sua teimosia em acentuar determinadas palavras.

How on earth do I make it match every page, there are 608 pages? I'm sure it should be easy, but I become dyslexic when dealing with html. Again, I would appreciate someone's help! Thanks!

Probably the '6' is a page number (and there is only 1 @ 6 )

The REGEX wildcard for (any quantity of sequential) Numbers is \d+

Code:

<br> <hr> <A name=\d+></a>\d+ <b>•</b> J o ã o G u i m a r ã e s R o s a<br>

What looks odd to me is this part: <A name=6>, The part after the = should normally be in quotes AND to be valid if it was in a EPUB, start with at least a letter (can't be just numbers)