View Single Post
Old 08-17-2015, 08:33 AM   #2
Ripplinger
350 Hoarder
Ripplinger ought to be getting tired of karma fortunes by now.Ripplinger ought to be getting tired of karma fortunes by now.Ripplinger ought to be getting tired of karma fortunes by now.Ripplinger ought to be getting tired of karma fortunes by now.Ripplinger ought to be getting tired of karma fortunes by now.Ripplinger ought to be getting tired of karma fortunes by now.Ripplinger ought to be getting tired of karma fortunes by now.Ripplinger ought to be getting tired of karma fortunes by now.Ripplinger ought to be getting tired of karma fortunes by now.Ripplinger ought to be getting tired of karma fortunes by now.Ripplinger ought to be getting tired of karma fortunes by now.
 
Ripplinger's Avatar
 
Posts: 3,574
Karma: 8281267
Join Date: Dec 2010
Location: Midwest USA
Device: Sony PRS-350, Kobo Glo & Glo HD, PW2
This looks to be a typical paragraph from your selection, and if it's the same throughout the book, I think you could clean it by hand pretty easily using Sigil's Find and Replace:
Quote:
<p class="dlct-000"><span class="dlct-007">Ro haalde haar hand weg</span><span class="dlct-007">.</span> <span class="dlct-007">'Sorry,'</span> <span class="dlct-007">fluisterde hij</span><span class="dlct-007">. 'Ik weet het niet</span><span class="dlct-007">,</span> <span class="dlct-007">maar we moeten iets doen.'</span></p>
I'd search for <p class="dlct-000"> and replace all instances of it with <p class="calibre"> (whatever you decide to use and set up in the stylesheet).

Then search for <span class="dlct-007"> and in the replace box just leave it blank so it will delete all instance of it. Note the number of instances it finds here for the next step.

Then search for </span> and do the same thing, leave the replace box blank to they'll all be deleted. Check that the number of instances found matches with the previous step, or there might be other span classes somewhere, some of which you might want to keep. You can find those after this step is done by just searching for "span" and see what comes up.

If there are a lot of other instances of "dict-###" with various numbers, you could use regex the same way and get them all.

Then just let Sigil clean up the stylesheet to delete any unused styles and try it.

I've never found any automated software that can make the proper decisions about what stays and what should go.
Ripplinger is offline   Reply With Quote