Quote:
Originally Posted by JSWolf
And when you convert them to text, you get rid of all of the HTML code. Plus, you convert the entities to symbols. Then you count the words and there you go.
|
The plugin would be orders of magnitude slower with all the additional disk activity, temp files, etc. It won't be changing to do that.
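For anyone curious what the quoted approach amounts to, here is a rough in-memory sketch in Python. It is not how the plugin works: the function name `count_words` and the regex-based tag stripping are my own assumptions for illustration, and a real converter would also have to deal with scripts, styles and malformed markup.

```python
import html
import re

# Naive tag matcher (assumption for illustration; not a full HTML parser).
TAG_RE = re.compile(r"<[^>]+>")

def count_words(markup: str) -> int:
    """Strip tags, convert entities to symbols, then count words."""
    text = TAG_RE.sub(" ", markup)   # get rid of the HTML code
    text = html.unescape(text)       # &quot; -> ", &amp; -> &, etc.
    return len(text.split())         # whitespace-separated tokens

sample = "<p>Hello &quot;big&quot; world</p>"
print(count_words(sample))
```

Doing this for every HTML file in every book is where the extra cost would come from once temp files and disk writes are involved.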
Quote:
Originally Posted by DNSB
I've attached an epub with what the Adobe SPN algorithm says contains 27 words and 3,115 pages.
A pathological case.
Personally, I find Count Pages to be good enough if not perfect.
|
Therein lies the weakness in the simplistic approach of the ADE algorithm: inlined Base64 images result in enormous HTML file sizes, and file size is what ADE bases its page estimates on. That is also why I don't use it myself. Unless you are in the habit of opening every file to check whether its images have been inlined this way (rather than linked as external image files), you would never know whether a high page count reflects a genuinely big book or this flaw in the approach.
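To make the effect concrete, here is a small Python sketch of a purely size-based estimate. The `BYTES_PER_PAGE` value and the `estimate_pages` function are assumptions for illustration only, not the actual ADE constant or code, and the "image" is just dummy bytes.

```python
import base64

# Assumed bytes-per-page figure, for illustration only.
BYTES_PER_PAGE = 1024

def estimate_pages(html_bytes: bytes) -> int:
    """Size-based estimate: pages grow with file size, not word count."""
    return max(1, -(-len(html_bytes) // BYTES_PER_PAGE))  # ceiling division

# A 27-word document with no images.
text_only = b"<html><body><p>" + b"word " * 27 + b"</p></body></html>"

# The same document with a dummy 2 MB image inlined as Base64.
image = bytes(2_000_000)
inlined = b'<img src="data:image/png;base64,' + base64.b64encode(image) + b'"/>'
with_image = text_only.replace(b"</body>", inlined + b"</body>")

print(estimate_pages(text_only), estimate_pages(with_image))
```

The word count is identical in both cases, yet the second estimate runs into the thousands of pages purely because of the encoded image data sitting inside the HTML.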