Yes, I've seen the invisible text layer on some PDFs from other sources. From what I recall, the Internet Archive uses LuraTech's brand of
mixed-raster content compression. Basically there's several different images and the text all layered on each page. It's pretty efficient in filesize, but slow to render and can look pretty terrible if done poorly.