|05-26-2011, 11:53 AM||#1|
Join Date: Sep 2010
Regex for removing headers and footers
I use Calibre to convert PDF to RTF - I have sort of got the hang of removing headers and footers with regex. I go in and put in the actual header and footer in the regex (search and replace now). But what happens if I want to do bulk convert? Then how does this work? Is there a "generic" term in regex to remove headers footers in bulk convert?
|05-26-2011, 12:12 PM||#2|
Join Date: Feb 2008
Device: Cybook Gen3
Not really. You could have a look at the books and try to fit what you see into a generalized regex, but I doubt that will work very well. Your best bet would be to set each books' conversion options individually.
|05-26-2011, 12:32 PM||#3|
Sigil & calibre developer
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
If the book are from the same sourxe and have a consistant format you should be fine doing single or bulk. For instance the free PDFs Tor put out a yearish ago all have consistant headers and footers and the same regex would work on all of them.
Im most cases there is enough variation per book that one regex won't work across multiple titles. In this case you will need to do the conversion on a per book basis.
|Thread Tools||Search this Thread|
|Thread||Thread Starter||Forum||Replies||Last Post|
|Removing Headers and Footers Here's What I Did||allowingtoo||Workshop||0||02-16-2011 08:46 PM|
|Removing Headers/Footers Help?||Anarel||Workshop||10||11-09-2010 12:53 PM|
|Pls help with removing headers /footers||Mamaijee||Calibre||0||09-19-2010 01:29 PM|
|Scanning and removing footers/headers||monsieurms||Workshop||8||12-14-2009 06:12 PM|
|page headers/footers||daesdaemar||Workshop||20||12-12-2008 09:22 PM|