View Full Version : DJVU to PDF but not as a lot of images


mersault
09-17-2011, 03:02 PM
Hi,

I have a djvu file in which I can select words, copy etc I Mean there are not a lot of images. I want to convert this djvu to a pdf file trying to maintain the selectable word capability. I have tried several djvu-2-pdf converters but all of them work in the same way: Convert the djvu to images and then print them all into a PDF file (a big size PDF).
Do you know about a converter for djvu files that could make what I want (print as a document not as an image)?

Best regards.

DaleDe
09-21-2011, 04:53 PM
It is possible to get the text out of a djvu document but you would lose all the formatting. Read about DJVU in our wiki and you will understand why you can't do what you want directly. You can also OCR the pages.

Dale

BobC
09-24-2011, 03:02 PM
It is possible to get the text out of a djvu document but you would lose all the formatting.

You can also OCR the pages.
Dale

One problem with the "text layer" in many DJVUs is that it contains mis-spellings and sometimes horrendous mangling of the text, caused by the original OCR. I've had some with entire pages of the text layer missing. Often o.k for the intended purpose of locating text in the main layer but needs massive work to get recover the "original" text.

Re-OCRing the pages might produce better results but that will depend on the quality of the images that make up the DJVU.

I have done a couple of these ( building FB2 and EPUB rather than PDF but the principle is the same) and I can say it is not a job for the faint-hearted - there is an awful lot of manual work needed.

BobC

ab7vf
01-11-2012, 01:40 PM
maybe try djview4?

jim

Floghi
01-11-2012, 05:11 PM
Just use SumatraPDF reader to open your file,
then save as pdf format,

that's done a well pdf file
cu