Quote:
Originally Posted by pietro99
The resulting .mobi book is interspersed with unwanted stuff that appears freqently, like this:
54
9781416585855TEXT.indd 54
25/11/09 3:31:56 PM
Changing the various options in Calibre doesn't eradicate it, perhaps I am missing something. The 54 is the page number but can anyone suggest a way to get rid of this please?
|
This should be doable using the regular expression replacement feature of Calibre (you can replace 3 expressions - here all of them should be replaced by the empty string). From the top of my head and from the example you have provided, I would guess the 3 expressions would be:
\[B\]\d+
\d+TEXT\.indd \d+
\d+\/\d+\/+d+ \d+:+d+:+d+ PM\[\/B\]
Since this isn't Perl (which is the variation of regexps I usually use), you may not have to put a "\" behind a "/" as I have done above. Try to experiment with these strings and if supported by Calibre, put "^" in front of the expressions to denote beginning of line and "\s*$" at the end of the expressions to denote end of line with possible trailing white space. If the date and time strings are the same in all instances of the unwanted strings, you can use the actual numbers rather than "\d+" (which denotes one or more digits).
Experimentation is the key here and you will learn how to do this. Regexps are great stuff, though looks like Greek to the uninitiated (except for the Greek uninitiated

).
-- bob_tm