I've found the Archive to be a decent place for PDFs (although I mostly go there for the live music recordings). The other formats are good raw material for doing your own clean-up. I think Project Gutenberg sometimes uses Archive scans as starting points for their own conversions.
|