Old 01-27-2017, 05:51 PM   #16
ebookreadr
Quote:
Originally Posted by Doitsu
My regex is only useful for books whose formatting can't be salvaged. You could simply remove span from the regex:

Code:
<(div|p|b|i|u|h[1-6]|ol|ul|li|body) [^>]+>
and then handle the remaining spans with TagMechanic.
Thanks, Doitsu.
I will try TagMechanic in a couple of days. Right now I am in the middle of cleaning out extraneous stuff and backing up my hard drive.

Update: I used a combination of TagMechanic and Find/Replace to clean it up. TagMechanic did most of the work, but there were some minor things left over that had to be removed.
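
For anyone who wants to see what the span-less regex quoted above matches before running it on a book, here is a rough sketch in Python. The replacement string `<\1>` (keep the tag name, drop its attributes), the function name, and the sample markup are assumptions for illustration; they are not taken from Doitsu's post.

```python
import re

# The regex quoted above: an opening tag from the list, then a space,
# then everything up to the closing ">" (i.e. the tag's attributes).
TAG_WITH_ATTRS = re.compile(r"<(div|p|b|i|u|h[1-6]|ol|ul|li|body) [^>]+>")


def strip_attributes(html: str) -> str:
    """Replace each matched tag with the bare tag name.

    The replacement <\\1> is an assumed choice, not part of the quoted regex.
    """
    return TAG_WITH_ATTRS.sub(r"<\1>", html)


# Hypothetical sample markup; span is not in the tag list, so it is left alone.
sample = '<p class="calibre1">Some <span class="c2">text</span> and <b style="x">bold</b>.</p>'
print(strip_attributes(sample))
```

The span keeps its class here, which is why a second pass with TagMechanic (or a separate Find/Replace) is still needed for the leftover spans.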

Last edited by ebookreadr; 02-06-2017 at 12:51 PM. Reason: Update