Old 09-03-2012, 04:09 PM   #3
phossler
Thanks for the insights, and they're pretty much right on.

The inputs that need such radical surgery are usually garbaged up: 2 or 3 embedded TOCs, bad chapter breaks, inconsistent fonts, etc. The original format is long gone.

I typically just want to clean them up so they're readable on my Kindle and have a usable TOC. Doing the cleaning in Sigil, manually splitting and marking the chapters, and using Calibre to convert the EPUB to MOBI works well enough to give me a readable book for personal use.

My CSS, with my <p> and <hx> styles, includes the .italic, .bold, and .underline classes that Calibre likes to add, as well as a serif font (personal preference).

I just try to tidy things up by deleting the Calibre-added classes, etc., but doing the F&R manually, the only way I know how to do it, is tedious and error-prone.

So I was looking for some technique or RegEx guidance to make it a little easier.
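
As a rough illustration of the kind of cleanup described above, here is a minimal Python sketch. It assumes the Calibre-added classes are named like `calibre`, `calibre1`, `calibre23`, and that class attributes use double quotes; the function name `strip_calibre_classes` is hypothetical, and real files may well need a different pattern.

```python
import re

# Assumption: Calibre-generated class names look like "calibre", "calibre1", "calibre12".
CALIBRE_CLASS = re.compile(r"^calibre\d*$")
# Assumption: class attributes are written with double quotes.
CLASS_ATTR = re.compile(r'\s+class="([^"]*)"')


def strip_calibre_classes(html):
    """Drop Calibre-style class names; remove the attribute if nothing is left."""
    def fix(match):
        # Keep only the class names that do not look Calibre-generated.
        kept = [c for c in match.group(1).split() if not CALIBRE_CLASS.match(c)]
        if not kept:
            return ""
        return ' class="' + " ".join(kept) + '"'

    return CLASS_ATTR.sub(fix, html)


print(strip_calibre_classes('<p class="calibre1">Some <span class="calibre3 italic">text</span></p>'))
```

Classes such as `italic` or `bold` are left in place so the stylesheet still applies. For the simplest case (an attribute holding a single Calibre class), the same idea could probably be tried as a regex Find & Replace with `\s+class="calibre\d*"` as the search and an empty replacement, on a backup copy first.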

Paul