View Single Post
Old 12-15-2011, 09:44 AM   #3
theducks
Well trained by Cats
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 31,103
Karma: 60406498
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
Quote:
Originally Posted by Serpentine View Post
The best advice I can give you is to do the pdf->epub conversion as cleanly as possible, preserving the text. Then take the epub and open it in Sigil to do the regex work - You can use the latest Sigil beta which has a nice new regex engine.

There should not be a space with that replacement, however if you are replacing it with a space, or the following line starts with a space character, perhaps using something like -<br(\s*/?)>\s* will better match. In either case I would suggest doing work like this outside of Calibre itself. While it may seem like a bit of extra work, it often saves a lot of time in the long run and will get you the results you're looking for.

With Sigil, you get to see the results of your mis-steps.

Those hyphens could be ndash or minus signs. different search terms ar needed. In sigil, you an copy and paste the character and never worry about what flavor it really was
theducks is online now   Reply With Quote