MobileRead Forums - View Single Post

ebookreadr · 01-27-2017, 05:51 PM

Quote:

Originally Posted by Doitsu

My regex is only useful for books whose formatting can't be salvaged. You could simply remove span from the regex:

Code:

<(div|p|b|i|u|h[1-6]|ol|ul|li|body) [^>]+>

and then handle the remaining spans with TagMechanic.

Thanks, Doitsu.
I will try TagMechanic in a couple of days. Right now I am in the middle of cleaning out extraneous stuff and backing up my hard drive.

Update: I used a combination of TagMechanic and Find/Replace to clean it up. TagMechanic did most of the work, but there were some minor things left over that had to be removed.
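
For anyone who wants to see what the quoted regex does outside of Sigil, here is a rough Python sketch. It assumes the replacement is the bare tag (`<\1>`), which the quote above does not actually show, and the `strip_attributes` name and the sample markup are made up for illustration.

```python
import re

# Pattern quoted from Doitsu's post (the version without span): it matches
# opening tags from the listed set that carry at least one attribute.
TAG_WITH_ATTRS = re.compile(r"<(div|p|b|i|u|h[1-6]|ol|ul|li|body) [^>]+>")


def strip_attributes(html: str) -> str:
    """Replace each matched opening tag with the bare tag.

    Assumption: the replace string is <\\1>; the quoted post only
    shows the search pattern, not the replacement.
    """
    return TAG_WITH_ATTRS.sub(r"<\1>", html)


# Hypothetical sample markup, not taken from any real book.
sample = '<p class="calibre1"><span class="italic">Hello</span> <b style="x">world</b></p>'
print(strip_attributes(sample))
# prints: <p><span class="italic">Hello</span> <b>world</b></p>
```

The span keeps its class because span is not in the list of tag names, so those are the leftovers that still need TagMechanic or a separate Find/Replace pass.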