Quote:
Originally Posted by friktion
How do you "clean up" formats not in ePub? Right now, I use Sigil to clean ePubs, and unfortunately haven't found anything to let me go into Mobi formats to clean them up. In other words, everything I convert into ePub, clean up, then reconvert to mobi for my Kindle.
I've been thinking there's got to be a better way. Especially when I'm only deleting headers/footers/page #'s.
Can I ask you, what do you use to clean up your books?
|
If you know Regular Expressions well, use regex to strip header/footer/page# during a calibre conversion, in the Conversion dialog box, Search&Replace tab. I've done this to strip "Amber LIT Converter" headers but am not good enough with regex yet to strip anything else.
See Manichean's Conversion Search & Replace sticky.
I usually use Word or Open Office. Convert to RTF in calibre, Save To Disk to a "Fix Formats" folder. Open the RTF in Word or Open Office. If in Word, use Edit/Find/Advanced Find&Replace. If in Open Office, use Edit/Find&Replace. If you know Regular Expressions well, use Regular Expression mode. If not, use character mode (the default mode). Search for the problem, replace it with nothing or a solution. Using character mode, for each string to replace, basically choose the longest or most complex string and whittle it down in successive passes. Repeat for all problem strings until done. Use the Word or Open Office help to figure out how to represent digits and paragraph, line feed, tab marks in the search/replace strings. Discussed in more detail in the last Workflow post.
For MOBIs you may be able to use Kindlegen or Mobipocket Creator for editing purposes. Links for those are in Links section of last Workflow post. I'm not sure about those. I don't work with MOBIs. I have a Kindle but prefer keeping everything EPUB and generating MOBIs on the fly from the EPUBs.
Edit: If you're skilled at stripping headers/footers in Sigil, you know more than I do. Why ask me?