Quote:
Originally Posted by NullNix
FYI, there is a file format that is meant for this stuff (pages that are images with a searchable textual layer: intended for archival of electronic copies of paper books by people like the Internet Archive who do a massive amount of archival work of this sort). Look for 'DjVu'. It's *massively* more space-efficient than storing individual pages in a PDF.
|
Sounds kinda interesting, but again I mostly did this as proof of concept, cant really see copying more than three or four books a year. It actually was tricky enough getting Image Magick to bundle all those png into one pdf. Had to go into its configuration twice to force it to do this optional conversion. Lot of these conversion programs are either $$$ or online where you waste data uploading and downloading again. But Image Magick can do it. And of course Calibre can convert pdf to epub. Why do I guess DjVu is not free?