Thanks for the insights, and they're pretty much right on.
The input to the conversion that uses such radical surgery are usually garbaged up, having 2 or 3 TOC's embedded, bad chapter breaks, inconsistant fonts, etc. The original format is long gone.
I typically just want to clean them up for be readable on my Kindle, and have a usable TOC. Doing the cleaning in Sigil, manually spliting and marking the chapters, and using Calibre to convert the EPUB to MOBI works well enough to have a readable book for personal use.
My CSS with my <p> and <hx> includes the .italic, .bold, .underline that Calibre likes to add, as well a serif font (personal preference)
I just try to tidy things up by deleteing the Calibre added classes, etc. and doing the F&R manually the way I only know how to do it is tedious and error prone.
So I was looking for some technique or RegEx guidence to make it a little easier
Paul
|