11-26-2017, 12:42 AM | #1 |
Member
Posts: 10
Karma: 564
Join Date: Nov 2017
Device: lenovo a3000
|
Best document format for reading with smallest file size.
Hi I am Rupesh from India and I have the habit of reading ebooks on android tab. I am having upto 25 GB of PDF books.
I have downloaded the PDF books legally and with the permission of site owner if you don't trust me I am ready to provide it's address. Previously I have downloaded a djvu file of size 7 mb from some web site and at that time I can't find any reader for opening and reading it so I have converted it to PDF and surprisingly the PDF file generated was 300 mb. Upon analyzing the above anyone can say that there is another format for document reading with lowest file size. Upon compressing to another format I think that total size of files may be reduced to 5 to 6 GB. Actually the PDF files I have consists of scanned images from a text book. I think that djvu is the best format for storing scanned images at lowest file size. If you know any other format for document reading with smallest file size please suggest it and also the software which converts PDF to it. If you think djvu is the best please suggest a converter which converts in batch from PDF to djvu. Regards, Rupesh. |
11-27-2017, 04:21 PM | #2 |
Wizard
Posts: 2,977
Karma: 18343081
Join Date: Oct 2010
Location: Sudbury, ON, Canada
Device: PRS-505, PB 902, PRS-T1, PB 623, PB 840, PB 633
|
The 300MB version probably uses no or little compression of the images. You can compress images in PDF documents in various ways. If your application supports them, using JBIG/JPEG-2000 compression can bring the PDF size down substantially. Try using the pdfbeads application to create PDF files from images; it is usually quite good at reducing the size.
|
Advert | |
|
11-27-2017, 05:39 PM | #3 |
Resident Curmudgeon
Posts: 73,660
Karma: 127838196
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Forget it. Converting from PDF is going to be a hell of a mess and it's going to be not easy to clean up this mess. 25 PDF is not worth converting from. Just go to an eBookstore online and buy the ePub versions.
|
11-27-2017, 08:50 PM | #4 | |
....
Posts: 1,547
Karma: 18068960
Join Date: May 2012
Device: ....
|
Quote:
If the problem is that the material is available in djvu and you cannot find a reading app for an Android tablet, then in my experience the PocketBook app from the Google Play Store reads djvu just fine (and most other formats as well). |
|
12-02-2017, 09:38 AM | #5 |
Grand Sorcerer
Posts: 6,111
Karma: 34000001
Join Date: Mar 2008
Device: KPW1, KA1
|
"djvu" is an intermediate, device-independent format produced by several applications, one of them being the LaTeX typesetter/compiler. There are a lot of programs available for converting djvu into pdf, but you'll (probably) need a computer for it.
Converting djvu or PDF into EPUB isn't worth it, as JSWolf says. PDF is (mostly) a fixed layout format for viewing on large screens and printing, and trying to turn it into a reflowable format is not going to end well. |
Advert | |
|
12-02-2017, 11:20 AM | #6 |
Wizard
Posts: 2,459
Karma: 68781975
Join Date: Oct 2012
Location: Arkansas
Device: Paperwhite 4
|
I convert PDF to epub from time to time. Sometimes the results aren't good. Other times it works just fine.
One thing to check before you start is the file size of the PDF. If it's half a meg or a meg or even 2 meg it's probably text. If it's 20 or 30 meg or larger it's probably scanned images and it won't convert properly. You'll have to OCR it first and that's a long process. Most text PDFs will convert to readable epub just fine. Sometimes I do run into problems but more often than not I get good results as long as the PDF is text. Barry |
12-02-2017, 11:47 AM | #7 | |
Grand Sorcerer
Posts: 5,263
Karma: 98804578
Join Date: Apr 2011
Device: pb360
|
Quote:
|
|
12-02-2017, 12:45 PM | #8 |
Astronomy Nut
Posts: 519
Karma: 3700000
Join Date: Oct 2017
Location: Reno, NV
Device: Kindle (All), Kobo (Multiple), Sony (most) and Nook Glowlight Plus
|
IMO a lot depends on what you want to read a file on. For reading on a PC or Mac the PDF format is versatile and the features of the Adobe reader are excellent. If you want to read on a tablet or E-Reader the choice might not be so clear and could vary depending on where you are. In the USA Kindle formats are the way to go as Kindle has the majority of the Ebook market and makes free reader apps for most common computers and devices from smart phones to computers. In other countries epub is probably the commonest format. For pure text it is hard to beat RTF or TXT files for size if you have a compatible reader or program.
|
12-02-2017, 02:26 PM | #9 | |
Addict
Posts: 281
Karma: 7724454
Join Date: Sep 2017
Location: Bethesda, MD, USA
Device: Kobo Aura H20, Kobo Clara HD
|
Quote:
DjVu is a fixed-layout format based around multiple layers of images (which may be mapped to text characters to allow searching/selection/etc) that's usually output by document scanners (OCR packages and the like). |
|
12-02-2017, 07:38 PM | #10 |
Grand Sorcerer
Posts: 6,111
Karma: 34000001
Join Date: Mar 2008
Device: KPW1, KA1
|
Thanks for the correction, sjfan.
I seem to have swapped the two formats in my head |
12-02-2017, 10:02 PM | #11 |
Karma Kameleon
Posts: 2,908
Karma: 26616647
Join Date: Aug 2009
Device: iPad Mini, iPhone X, Kindle Fire Tab HD 8, Walmart Onn
|
FYI, the legality of a book doesn’t come from the file hosting site, but from rights of the Author/Publisher.
That aside...when I have legitimately purchased a work (or it is legitimately free) - and it’s not in the format I want...that’s when I feel just fine about availing myself of any source on the internet that has it in the format desired. Without repurchasing it. That’s my personal ethics...not trying to bind them on anyone else. Just saying I don’t bother trying to scan books or do my own conversions when it’s fairly easy to find what I want out on the inter webs. |
12-02-2017, 10:47 PM | #12 | |
Just a Yellow Smiley.
Posts: 19,161
Karma: 83862859
Join Date: Jul 2015
Location: Texas
Device: K4, K5, fire, kobo, galaxy
|
Quote:
Yes: there are always free at Amazon. Quick way to tell the difference, if it is in KU, it is on sale. If it isn't, it was price matched free from usually Kobo. Last edited by Cinisajoy; 12-02-2017 at 10:50 PM. |
|
12-03-2017, 02:17 AM | #13 |
cacoethes scribendi
Posts: 5,809
Karma: 137770742
Join Date: Nov 2010
Location: Australia
Device: Kobo Aura One & H2Ov2, Sony PRS-650
|
To answer the subject line, without regard for how hard it might be to do this if converting from other formats, the answer is: EPUB and drop the cover image (or replace them with very small versions), and be sure not to embed any fonts unless critical to the content. With epub the cover typically uses more space than the text, even on very long books.
If the books use internal (non-cover) images then care is needed in how you handle these. (Choosing PNG, GIF or JPG as best suits each image can make a big difference - stay away from bitmaps.) You converted from djvu to PDF - the resulting blow out in size may suggest either a poor tool or bad choice of options. It may have generated images of the pages. Another thing that can really blow out PDF size is choosing Print optimisation options - for on-screen viewing you don't need that, even for printing it can be of dubious value. Conversion is something you have to play with until you get the compromises that best suit your purpose. ... And even so, conversion to PDF would not be my first choice for on-screen reading. |
12-03-2017, 12:13 PM | #14 | |
Resident Curmudgeon
Posts: 73,660
Karma: 127838196
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
https://en.wikipedia.org/wiki/Digital_Visual_Interface |
|
12-03-2017, 01:19 PM | #15 | |
Grand Sorcerer
Posts: 5,263
Karma: 98804578
Join Date: Apr 2011
Device: pb360
|
Quote:
The device independent file extension for TeX, dvi, has been around since 1982, predating Digital Visual Interface. https://en.wikipedia.org/wiki/Device...nt_file_format |
|
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Smallest text size | stethorn | Amazon Kindle | 1 | 10-12-2014 10:19 AM |
K3 Smallest Font Size Readability? | grownupboy | Amazon Kindle | 6 | 09-06-2010 08:19 PM |
Smallest font size in Mobi | hkdorama | Kindle Formats | 0 | 07-26-2010 04:07 AM |
What format for a Document ? | artemisblossom | Sony Reader Dev Corner | 8 | 11-14-2009 04:21 AM |
Favour requested - Smallest font size photo? | murraypaul | Sony Reader | 5 | 08-16-2008 07:43 AM |