![]() |
#1 |
Member
![]() Posts: 23
Karma: 10
Join Date: May 2014
Device: Paperwhite
|
Word Count/Unique Words
Hey guys,
What is the best way to count words and unique words in EPUB/AZW3 files? To demonstrate what I mean I will use following book: http://www.feedbooks.com/book/673/david-copperfield Report function in Calibre shows 357 436 words and 16 519 unique words. feedbooks.com shows 358,632 words (doesn't provide number of unique words) wordcounttools.com shows 359 260 words (small error is probably caused by inclusion of "About the Author, etc.") and 29 406 unique words. easycalculation.com shows 359 408 words and 20 466 unique words. planetcalc.com shows 364 872 words and 17 858 unique words. My question is, which source provides most accurate number of unique words? I suspect there might be different methodology to determine number of unique words. |
![]() |
![]() |
![]() |
#2 |
Junior Member
![]() Posts: 1
Karma: 10
Join Date: Apr 2015
Device: ipad
|
There are many alternatives as you listed. I personally like the website Word Count Tools because it provides many information about the text being put. But how you determine the number of words from there with the epub format since the tool only allow to input text string?
|
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Bookish
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,017
Karma: 2003162
Join Date: Jun 2011
Device: PC, t1, t2, t3, Clara BW, Clara HD, Libra 2, Libra Color, Nxtpaper 11
|
Use "Edit Book" -> Tools -> Reports to get some basic idea about the used words in your document.
|
![]() |
![]() |
![]() |
#4 |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,047
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Count Pages PI (does more than just Pages)
|
![]() |
![]() |
![]() |
#5 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 24,905
Karma: 47303824
Join Date: Jul 2011
Location: Sydney, Australia
Device: Kobo:Touch,Glo, AuraH2O, GloHD,AuraONE, ClaraHD, Libra H2O; tolinoepos
|
There is also the Count Pages plugin which also has a word count. From memory, there was a discussion sometime last year about counting unique words. Someone modified the plugin to do this, but I don't remember if they published it.
|
![]() |
![]() |
Advert | |
|
![]() |
#6 |
Member
![]() Posts: 23
Karma: 10
Join Date: May 2014
Device: Paperwhite
|
to harryngh: I converted the book to .rtf and copy/paste
to DrCipher: I have mentioned this feature, my question was which unique word count tool is most reliable/accurate. to theducks and davidfor: thanks for the tips, will check it out |
![]() |
![]() |
![]() |
#7 |
Junior Member
![]() Posts: 1
Karma: 10
Join Date: Jan 2019
Device: Kindle
|
try this website word count tool on a Russian translator website. It gives an accurate calculation of new words and repetitions.
|
![]() |
![]() |
![]() |
#8 |
Junior Member
![]() Posts: 1
Karma: 10
Join Date: Apr 2019
Device: Amazon Kindle Paperwhite 2018
|
Nice post!
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Feature Request: Get word count for current article/chapter | truth1ness | Calibre | 0 | 04-02-2015 05:35 PM |
word count | Tanjamuse | Editor | 5 | 11-09-2014 06:31 AM |
Custom Column with word count index | Tanjamuse | Library Management | 22 | 05-11-2014 08:18 PM |
New DRM method changes the author's words to make each copy unique and traceable | zigzagz | News | 65 | 01-17-2014 07:54 PM |
Word Count | leebase | Calibre | 34 | 06-07-2011 11:53 PM |