View Single Post
Old 12-25-2010, 08:46 AM   #29
Manichean
Wizard
Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.Manichean is the 'tall, dark, handsome stranger' all the fortune-tellers are referring to.
 
Manichean's Avatar
 
Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
Quote:
Originally Posted by vulcan_girl View Post
I didn't want to start a new thread to ask almost the same question.

I found one formula to remove headers from ABC Converter pdfs, and it works fine. Now I've unearthed an older pdf with a slightly different header and I can't figure out what I need to change to make it work. If someone could help me, I'd be very appreciative.

Here is the new header:
ABC Amber Text Converter Trial version, http://www.processtext.com/abctxt.html

Here is the header and formula that works:

[Generated by ABC Amber LIT Converter,
http://www.processtext.com/abclit.html]


(<A name=\d+>\s*</a>)?\s*(<[biu][^>]*>)?\s*Generated\s+by\s+(ABC)?\s
+Amber[^<]*(<a\shref=.*?processtext.*?>)?\s*(.*?processtext. *?</a>)?(</
[ibu]>)?\s*(<br>\s*)?

What do I need to change? I have no idea how to create one of these.
These "formulas" are called regular expressions and are, generally, just a way to describe texts. Your problem might be that you need to describe what the header looks like in the XHTML intermediate stage Calibre produces while converting. Personally, I'd recommend that you try to follow the tutorial from the manual. If you still have questions after that, ask.
Manichean is offline   Reply With Quote