View Single Post
Old 01-06-2020, 07:22 AM   #91
davidfor
Grand Sorcerer
davidfor ought to be getting tired of karma fortunes by now.davidfor ought to be getting tired of karma fortunes by now.davidfor ought to be getting tired of karma fortunes by now.davidfor ought to be getting tired of karma fortunes by now.davidfor ought to be getting tired of karma fortunes by now.davidfor ought to be getting tired of karma fortunes by now.davidfor ought to be getting tired of karma fortunes by now.davidfor ought to be getting tired of karma fortunes by now.davidfor ought to be getting tired of karma fortunes by now.davidfor ought to be getting tired of karma fortunes by now.davidfor ought to be getting tired of karma fortunes by now.
 
Posts: 24,905
Karma: 47303824
Join Date: Jul 2011
Location: Sydney, Australia
Device: Kobo:Touch,Glo, AuraH2O, GloHD,AuraONE, ClaraHD, Libra H2O; tolinoepos
Quote:
Originally Posted by kovidgoyal View Post
I haven't looked into it yet but almost certainly this i because in windows strings are natively utf-16 which is what they are in python 2 as well and in ICU, so there is no conversion required. With python3 one has to copy/convert strings to and fro
I have just updated most of the Count Pages plugin for Python 3, and I can see this happening. The non-ICU count for a 1.1 million word book took 11 seconds. Using the ICU library it was running for 10 minutes before I killed it. With 4.8, it was less than 20 seconds.

There were similar timings reported on some Linux boxes late last year. I couldn't dig into it, but, I assumed it was something like this as well.
davidfor is offline   Reply With Quote