12-06-2019, 06:09 PM | #1 |
Grand Sorcerer
Posts: 5,421
Karma: 99236514
Join Date: Apr 2011
Device: pb360
|
Bytes per page data for a sampling of books
This thread is about bytes per page data based on amazon supplied apnx files for a sampling of books. It is not about whether or not page numbers make sense for ebooks or which page numbering scheme is better or worse than any other scheme. Attached are plots showing byte counts on a page by page basis, which varies within a book and from book to book. Plots for several books are attached.
The variation between books is mostly because of differences in formatting, some of which might improve rendering or might be useless boiler plate. Within a book, variation comes from partial pages at the end of chapters, formatting for first pages of chapters, and the presence of tables, figures, and images. In most cases, a horizontal line can be imagined going through a typical value for a particular book. But end matter such as end notes, bibliographies, and indices can have their own typical bytes per page, which might be quite different from the rest of the book. The byte size of these sections can make them appear to make up more of the book than they actually do. For example, Code:
Title Pages Locations
A Brief History of Everyone 363 / 402 = 90% 5044 / 7586 = 66%
Bad Blood 299 / 341 = 88% 4695 / 5462 = 86%
The Hidden Life of Trees 245 / 271 = 90% 2683 / 3506 = 77%
Hidden figures 271 / 350 = 77% 4318 / 8009 = 54%
Silent Spring 296 / 297 =100% 3975 / 5653 = 70%
The Fifties 732 / 801 = 91% 13167 / 17018 = 77%
Last edited by j.p.s; 12-06-2019 at 06:32 PM. Reason: Forgot to attach plots |
12-06-2019, 06:33 PM | #2 |
Grand Sorcerer
Posts: 5,421
Karma: 99236514
Join Date: Apr 2011
Device: pb360
|
The table in post #1 took so long that I forgot to attach and comment on the data plots. Most books seem to be between 2500 and 3000 bytes per page, but there can be quite a bit of variation, with Ford's autobiography is around 2000 and Utopia For Realists 1500.
Hidden Figures is 2500ish in the main text, but on end matter section run over 5000 and another well over 25,000. Last edited by j.p.s; 12-06-2019 at 06:40 PM. |
12-06-2019, 10:56 PM | #3 |
Grand Sorcerer
Posts: 6,670
Karma: 86234809
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
That is interesting. But it makes me wonder, are working toward something in this analysis?
|
12-07-2019, 11:19 AM | #4 |
Grand Sorcerer
Posts: 5,421
Karma: 99236514
Join Date: Apr 2011
Device: pb360
|
Maybe. It's possible I might update if anything interesting shows up based on a larger number books. This is mainly a result of exploring apnx files, and is posted for future reference and in the hope that others might find it informative.
|
Tags |
apnx, page numbering, page numbers, reading position, reading progress |
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Image up-sampling | Norah | Workshop | 27 | 02-29-2016 09:46 AM |
Buying book after sampling keeps sample but not last read page | amoroso | Amazon Kindle | 7 | 05-11-2011 05:35 PM |
First page data | travger | Conversion | 2 | 05-06-2011 01:08 PM |
Purchasing after sampling suggestion | hhdumpling | Amazon Kindle | 4 | 02-28-2009 10:52 AM |