Quote:
Originally Posted by frostschutz
Regular expressions usually support "word boundary" matching in some way or other (I forget which pattern, maybe \< \> or \b?).
So you could replace "boundary a l boundary" with all.
\<al\> -> all.
There is very little HTML inside the actual text. In the HTML files it is really 'final y al are wel .'
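For illustration, here is a minimal Python sketch of the word-boundary idea applied to that sample string. Python's `re` (like PHP's PCRE functions) spells the boundary as `\b`; `\<` and `\>` are the GNU grep/sed spelling. The `FIXES` table and the `repair` function are hypothetical names, and a real table would have to be built from the words actually damaged in the book.

```python
import re

# Hypothetical table of broken fragments -> intended words.
FIXES = {
    "al": "all",
    "wel": "well",
    "final y": "finally",
}

# Longest fragments first, so "final y" is tried before the shorter entries.
# \b matches at a word boundary, so "al" inside "also" or "final" is left alone.
PATTERN = re.compile(
    r"\b("
    + "|".join(re.escape(k) for k in sorted(FIXES, key=len, reverse=True))
    + r")\b"
)

def repair(text):
    """Replace each known broken fragment, then tidy spaces before punctuation."""
    fixed = PATTERN.sub(lambda m: FIXES[m.group(1)], text)
    # Drop the stray space left in front of punctuation ("well ." -> "well.").
    return re.sub(r"\s+([.,;:!?])", r"\1", fixed)

print(repair("final y al are wel ."))  # finally all are well.
```

The match is case-sensitive, so a name like "Al" is not changed, but any fragment that is also a real word would need to be kept out of the table.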
I have finished my PHP script now. I can choose an ePub file, open it, correct all the HTML files inside it, clean them with regex, and reconstruct the ePub file.
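The same round trip can be sketched in Python, purely as an illustration and not the PHP script itself. An ePub is a zip archive whose `mimetype` entry must come first and be stored uncompressed, so the sketch preserves that. The function name `clean_epub`, its `clean` callback, and the assumption that the HTML files are UTF-8 are all hypothetical.

```python
import zipfile

HTML_SUFFIXES = (".html", ".xhtml", ".htm")

def clean_epub(src_path, dst_path, clean):
    """Copy src_path to dst_path, passing every HTML file through clean().

    clean is any function taking and returning a str (assumption: the
    HTML files are UTF-8 encoded).
    """
    with zipfile.ZipFile(src_path) as src, \
         zipfile.ZipFile(dst_path, "w") as dst:
        # Stable sort that moves "mimetype" to the front of the archive.
        names = sorted(src.namelist(), key=lambda n: n != "mimetype")
        for name in names:
            if name.endswith("/"):
                continue  # skip explicit directory entries
            data = src.read(name)
            if name.lower().endswith(HTML_SUFFIXES):
                data = clean(data.decode("utf-8")).encode("utf-8")
            # mimetype must be stored uncompressed; deflate everything else.
            method = (zipfile.ZIP_STORED if name == "mimetype"
                      else zipfile.ZIP_DEFLATED)
            dst.writestr(name, data, compress_type=method)
```

A call would look like `clean_epub("book.epub", "book-fixed.epub", my_cleaner)`, where both file names and `my_cleaner` are placeholders for your own file and cleaning function.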
@mrmikel
I tried a few conversion tools and got the same errors. I see the PHP as a fun little brain exercise anyway.