Okay... there is no doubt an answer to this somewhere around here, but I wouldn't know how to begin searching for it, so pardon the repetition.
As I've mentioned in other places, I've been scanning many of my books. For some of them (research/school), I like to keep the page numbers in for citation purposes, and the OCR process does that well enough.
However, with fiction, this isn't working out too well. I can get rid of the hard returns and unwrap the text, but I haven't worked out an efficient way to remove the page number/title/author line at the top of each scanned page.
So, take this chunk, for instance:
Code:
Ser Waymar Royce glanced at the sky with disinterest. "It does that every
day
about this time. Are you unmanned by the dark, Gared?"
Will could see the tightness around Gared's mouth, the barely sup
2 <authors name>
pressed anger in his eyes under the thick black hood of his cloak. Gared had
spent forty years in the Night's Watch, man and boy, and he was not
accustomed
to being made light of. Yet it was more than that. Under the wounded pride,
Will could sense something else in the older man. Yonervous tension that
came perilous close to fear.
So I need a way to remove the page number line on every page, and to remove the hard returns around it, so that I can then reflow the text normally.
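For illustration, a minimal Python sketch of this kind of cleanup might look like the following. It is not a TextMate solution, and it rests on assumptions that may not hold for every scan: the header is always a line containing only a page number plus the same running text (before or after the number), and paragraphs are separated by blank lines. The function names and the `running_text` parameter are made up for this example.

```python
import re


def make_header_re(running_text):
    """Build a regex for a line holding only a page number and the running header."""
    text = re.escape(running_text)
    # Accept "2 Header" as well as "Header 2", with optional surrounding spaces.
    return re.compile(rf"^\s*(?:\d+\s+{text}|{text}\s+\d+)\s*$", re.IGNORECASE)


def strip_and_unwrap(raw, running_text):
    """Drop page-header lines, then join hard-wrapped lines into paragraphs."""
    header_re = make_header_re(running_text)
    paragraphs, current = [], []
    for line in raw.splitlines():
        if header_re.match(line):
            continue  # skip the header so the text carries on across the page break
        if not line.strip():
            # Assumption: a blank line marks the end of a paragraph.
            if current:
                paragraphs.append(" ".join(current))
                current = []
            continue
        current.append(line.strip())
    if current:
        paragraphs.append(" ".join(current))
    return "\n\n".join(paragraphs)
```

Calling `strip_and_unwrap(text, "<authors name>")` on the chunk above would drop the `2 <authors name>` line and join the rest. A word split across the page break with no hyphen, like "sup" / "pressed", would still come out as "sup pressed" and need a manual or dictionary-based pass.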
I'm on a Mac, and would think TextMate is the answer, but I also have Windows installed if anyone knows of any free Windows software that can help.
I also have an Ubuntu installation available, but I'm not comfortable enough with Linux to like that option if I can avoid it.
Thanks!