Old 02-07-2011, 04:16 PM   #4
BobC
Addict
Posts: 342
Karma: 245756
Join Date: Dec 2008
Location: Lancashire, U.K.
Device: BeBook 1, BeBook Pure, Kobo Glo, Various Android Apps
Quote:
Originally Posted by charleski
DjVu doesn't contain text, it works on images, and that's your problem. It uses an image-compression technology that's highly optimised for text and allows far smaller file sizes than other formats that target more general image types.
DjVu files can contain a hidden text layer (this is what the search feature uses). That layer can be extracted and used as the basis for any other conversion.

For example, most of the DjVu files on The Internet Archive (TIA) contain such a layer, and I have used them as the basis for FB2 books.

Of course, the files the OP is working on may not have such a layer, as the pages may never have been OCR'd and the resulting text associated with the image layer.
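
One way to check is to try dumping the layer with djvutxt, the command-line tool that ships with DjVuLibre. Here is a rough Python sketch, assuming DjVuLibre is installed and djvutxt is on your PATH. The function names are just illustrative, and the file names are placeholders:

```python
import shutil
import subprocess
from pathlib import Path


def build_djvutxt_command(djvu_path, txt_path):
    """Return the argument list for: djvutxt <input.djvu> <output.txt>."""
    return ["djvutxt", str(djvu_path), str(txt_path)]


def has_text(extracted):
    """True if the dump holds anything besides whitespace.

    Assumption: a file with no text layer yields an empty dump or
    only page separators (form feeds), which strip() removes.
    """
    return bool(extracted.strip())


def extract_text_layer(djvu_path, txt_path):
    """Dump the hidden text layer to txt_path; return True if text was found."""
    if shutil.which("djvutxt") is None:
        raise RuntimeError("djvutxt not found; install DjVuLibre first")
    # check=True raises CalledProcessError if djvutxt exits with an error.
    subprocess.run(build_djvutxt_command(djvu_path, txt_path), check=True)
    text = Path(txt_path).read_text(encoding="utf-8", errors="replace")
    return has_text(text)
```

Calling extract_text_layer("book.djvu", "book.txt") should leave the plain text in book.txt if a layer exists. If it returns False, the file is probably image-only and would need OCR before any conversion.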

BobC