Quote:
Originally Posted by eschwartz
Are you sure you are matching the right stuff?
You need to match against the input content, not the post-processed result.
In the S&R tab of the conversion dialog, click the wand button to get a preview of the pdftohtml result, which is what the regex will operate on.
Or in the Editor, you can S&R the EPUB (?) with more granularity.
The input content is PDF text, not HTML, no? I tried with
Code:
Title of the Book by Author [0-9]*
but this doesn't work either. I'd prefer the regex to work on the plain PDF text like this, because Calibre marks up the same string in several different ways. That makes it hard to write one regex that finds every variation of the tags and removes them along with the text.
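To illustrate what I mean by matching on the text rather than the markup, here is a rough sketch in plain Python, run outside Calibre (it is not something to paste into the S&R box). It assumes each running header ends up in its own <p> element in the converted HTML, which may not hold for every PDF. The names drop_running_headers, PARAGRAPH, TAG and HEADER_TEXT are made up for the example.

```python
import re

# Any single HTML tag, e.g. <b>, </i>, <br/>.
TAG = re.compile(r'<[^>]+>')
# One <p>...</p> element (assumption: each running header sits in its own paragraph).
PARAGRAPH = re.compile(r'<p\b[^>]*>.*?</p>', re.DOTALL | re.IGNORECASE)
# The same pattern as above, applied to the paragraph's text after tags are stripped.
HEADER_TEXT = re.compile(r'Title of the Book by Author ?[0-9]*')


def drop_running_headers(html):
    """Remove paragraphs whose visible text is nothing but the header line."""
    def replace(match):
        # Replace tags with spaces so words on either side of a tag stay separate,
        # then collapse all whitespace runs to single spaces.
        text = TAG.sub(' ', match.group(0)).replace('&nbsp;', ' ')
        text = ' '.join(text.split())
        # Only drop the paragraph if the header is its entire text.
        return '' if HEADER_TEXT.fullmatch(text) else match.group(0)
    return PARAGRAPH.sub(replace, html)


sample = (
    '<p>The text mentions Title of the Book by Author 1 in passing.</p>\n'
    '<p><b>Title of the Book</b> by <i>Author</i> 13</p>\n'
    '<p>Title of the Book<br/>by Author&nbsp;14</p>\n'
    '<p>More text.</p>'
)
print(drop_running_headers(sample))
```

Because the comparison is done on the stripped text, the pattern does not need to know which tags were wrapped around the words. A paragraph that merely mentions the title inside a longer sentence is left alone, since the header has to be the whole paragraph.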