Ah pandoc looks like an excellent general purpose tool! Thanks! I guess trying to normalize plain text is easier than html so that might be the way to go.
@rkomar, I'm actually an elinks user and I've tried the dump command before. My problem with it was that it tried to preserve too much formatting from the html. I don't think elinks supports a nomargin option unfortunately. That's a good tip.
|