View Single Post
Old 10-11-2023, 01:56 AM   #1702
kiwidude
Calibre Plugins Developer
kiwidude ought to be getting tired of karma fortunes by now.kiwidude ought to be getting tired of karma fortunes by now.kiwidude ought to be getting tired of karma fortunes by now.kiwidude ought to be getting tired of karma fortunes by now.kiwidude ought to be getting tired of karma fortunes by now.kiwidude ought to be getting tired of karma fortunes by now.kiwidude ought to be getting tired of karma fortunes by now.kiwidude ought to be getting tired of karma fortunes by now.kiwidude ought to be getting tired of karma fortunes by now.kiwidude ought to be getting tired of karma fortunes by now.kiwidude ought to be getting tired of karma fortunes by now.
 
Posts: 4,733
Karma: 2197770
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
Quote:
Originally Posted by JSWolf View Post
And when you convert them to text, you get rid of all of the HTML code. Plus, you convert the entities to symbols. Then you count the words and there you go.
The plugin would be orders of magnitude slower, with all the additional disk activity, temp files etc. It won't be changing to doing that

Quote:
Originally Posted by DNSB View Post
I've attached an epub with what the Adobe SPN algorithm says contains 27 words and 3,115 pages.

A pathological case.

Personally, I find Count Pages to be good enough if not perfect.
Therein lies the weakness in the simplistic approach of the ADE algorithm - inlined Base64 images result in enormous html file sizes which is what ADE calculates page estimates based on. And also why I don't use it personally myself - because unless you are in the habit of opening every file to inspect whether it has been edited this way (rather than external image files via links) you would never know if it is a genuinely big book or from this flaw in approach.
kiwidude is offline   Reply With Quote