View Single Post
Old 12-03-2017, 01:46 PM   #17
dwig
Wizard
dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.
 
dwig's Avatar
 
Posts: 1,613
Karma: 6718541
Join Date: Dec 2004
Location: Paradise (Key West, FL)
Device: Current:Surface Go & Kindle 3 - Retired: DellV8p, Clie UX50, ...
Quote:
Originally Posted by j.p.s View Post
One of the design goals of djvu is efficient compression of scanned images of text, so it would be expected that a conversion from djvu to pdf would be larger.
Correct.

You won't find a more efficient compressed format than DJVU when dealing with conversions containing scanned "text", whether there is an additional OCR real text layer or not. In fact, almost nothing will even come close to DJVU's compact file size.

The only way to get anything as small or smaller than the DJVU version would be to OCR the text and delete all images, leaving only the text. This is very difficult to do accurately and requires human oversight and manual editing to do well.
dwig is offline   Reply With Quote