View Single Post
Old 01-19-2018, 09:36 PM   #38
jackie_w
Grand Sorcerer
jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.
 
Posts: 6,252
Karma: 16544692
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
Quote:
Originally Posted by davidfor View Post
I've always assumed it was the same algorithm, but possibly with a different factor than Adobe uses. With all the extra spans, it probably makes sense to use a number a little higher than 1024. But, the page count is always higher than for epubs. Of course, there are other issues such as compression ratios affecting these counts.

I might have a little play the Count Pages plugin to see a comparison between epubs and kepubs.
Here are the results of my little experiment using a random single plain epub from my calibre library:
  1. Create 4 "versions" of the same book:

    Code:
    book1.epub       - the original plain epub (it's epub2 not epub3)
    book2.kepub.epub - book1 format-shifted to kepub using KoboTouchExtended not a calibre full conversion
    book3.epub       - copy of book2 with file ext manually changed to .epub
    book4.kepub.epub - copy of book1 with file ext manually changed to .kepub.epub
  2. Drag-drop all 4 into calibre and run Count Pages on all of them to get ADE page count and word count.
  3. Drag-drop all 4 onto my KA1. I deliberately avoided using calibre to do the transfer to avoid any auto file changes calibre might make in-transit.
    Open all 4 on the KA1 to see total page count.
    Books 1 & 3 open in the Adobe epub renderer.
    Books 2 & 4 open in the Kobo kepub renderer.
Results:
Code:
                            <-Count Pages ->  <--- Kobo entire book --->
File             Filesize   Pages      Words  Adobe Pages   Kepub Pages
book1.epub       1614709    277        98574  277           -
book2.kepub.epub 1665258    321        98574  -             325
book3.epub       1665258    321        98574  321           -
book4.kepub.epub 1614709    277        98574  -             325
Conclusions:
  • Comparing books 1 & 3. The extra Kobo spans/divs inflate the ADE page count from 277 --> 321
  • Comparing books 2 & 3. These are the same file with 2 different file extensions. ADE pages 321, kepub pages 325. Not the same value, so algorithms are different.
  • Similarly, comparing books 1 & 4. The same file with 2 different file extensions. ADE pages 277, kepub pages 325. Not the same value, so algorithms are different.
  • Comparing books 2 & 4. The "true" kepub (book2) and the "fake" kepub (book4) have exactly the same kepub page count, 325. The lack of Kobo spans/divs in the "fake" kepub does not affect the kepub page count, so I surmise filesizes are not a factor in calculating kepub page count. My guess is that the algorithm uses wordcount rather than filesizes.

If you're still reading ... thanks for staying awake and feel free to disagree! I'll be interested to read any of your own test results.
jackie_w is offline   Reply With Quote