Quote:
Originally Posted by vulcan_girl
I didn't want to start a new thread to ask almost the same question.
I found one formula to remove headers from ABC Converter pdfs, and it works fine. Now I've unearthed an older pdf with a slightly different header and I can't figure out what I need to change to make it work. If someone could help me, I'd be very appreciative.
Here is the new header:
ABC Amber Text Converter Trial version, http://www.processtext.com/abctxt.html
Here is the header and formula that works:
[Generated by ABC Amber LIT Converter,
http://www.processtext.com/abclit.html]
(<A name=\d+>\s*</a>)?\s*(<[biu][^>]*>)?\s*Generated\s+by\s+(ABC)?\s
+Amber[^<]*(<a\shref=.*?processtext.*?>)?\s*(.*?processtext. *?</a>)?(</
[ibu]>)?\s*(<br>\s*)?
What do I need to change? I have no idea how to create one of these.
|
These "formulas" are called regular expressions and are, generally, just a way to describe texts. Your problem might be that you need to describe what the header looks like in the XHTML intermediate stage Calibre produces while converting. Personally, I'd recommend that you try to follow the
tutorial from the manual. If you still have questions after that, ask.