Quote:
Originally Posted by kaufman
I don't think I would use this feature, but as everyone knows, I have opinions about everything.
I think you should take that highest 5% of your books and make that into a range of the lowest one in the range (rounded) or higher.
So if you have 1000 books, and 95% of them have less than 150,000 words, use 150,000->Infinity as your last range. I'm not sure if this is doable based on how the books are stored, but I think any solution that doesn't do something like this is going to have a lot of sparse categories at the right end of the bell curve.
Depending on how this works out, it might also make sense to do the lowest 5% as one range as well.
For really large libraries, it might make sense to do the last 3% instead of 5%.
|
The algorithm cannot depend on actual values. The decision must be made as soon as the first book is processed, and that decision is final. Changing it would require reprocessing all books.