Quote:
Originally Posted by AlanHK
Won't that leave all the text using those styles with undefined styles?
The problem is that in one file
<p class="p6 sgc-2">October 11</p> is bold, in another it would be italic.
|
Knew there was something I was forgetting.
Quote:
That looks useful, but first I need to get all the text tagged consistently.
Anyway, I think I can do this by unzipping the epub and sorting the files into groups with common style definitions, using Far file manger, then doing S&R on groups of files to make them all consistent, then making a new epub.
|
Yeah, my second regex assumes unreasonably that tidy named classes "bold" and "italic" (where did I get that from?) which I blame on the lateness of the hour.
So I can't think of a purely regex way to fix these, matching class to style. You'd have to do each class on its own.
Best thing is to avoid this entirely. HTML Tidy is a really annoying crutch.