|
|
#1 |
|
Groupie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 165
Karma: 2000
Join Date: Nov 2025
Device: none
|
Word count feature
Does Sigil have word count feature?
Or is there any plugin can do that? Or any other software can count word in epub? |
|
|
|
|
|
#2 |
|
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,837
Karma: 64181416
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Indirectly.
![]() Tools: Reports: Words. The summary is at the bottom |
|
|
|
| Advert | |
|
|
|
|
#3 |
|
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 52,635
Karma: 180945224
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
|
|
|
|
|
|
#4 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 29,528
Karma: 212177546
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
He uses both so much, he's always getting them confused.
|
|
|
|
|
|
#5 |
|
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,837
Karma: 64181416
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
|
|
|
|
| Advert | |
|
|
|
|
#6 |
|
Sigil Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 9,741
Karma: 6774572
Join Date: Nov 2009
Device: many
|
The fastest easiest way to get the word count in a single xhtml file in Sigil is to highlight that file in BookBrowser and invoke GetInfo from the pop-up menu. It has the wordcount for that file. But if you need it for the entire epub, only Reports will generate that.
|
|
|
|
|
|
#7 |
|
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 52,635
Karma: 180945224
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
The few times I've wanted to check a word count in Sigil, I opened Reports, select the HTML Files item and then browsed to the bottom of the All Words column.
|
|
|
|
|
|
#8 |
|
A Hairy Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,489
Karma: 21099999
Join Date: Dec 2012
Location: Charleston, SC today
Device: iPhone 15/11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire
|
|
|
|
|
|
|
#9 |
|
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 52,635
Karma: 180945224
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Close but not close enough to be quickly transferable.
|
|
|
|
|
|
#10 |
|
Groupie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 165
Karma: 2000
Join Date: Nov 2025
Device: none
|
There is an issue: the word count is incorrect, it counts strings end with space or line break, which is not correct to count Chinese.
Last edited by icearch; Yesterday at 09:04 AM. |
|
|
|
|
|
#11 |
|
Sigil Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 9,741
Karma: 6774572
Join Date: Nov 2009
Device: many
|
Then create your own regex to define a Chinese word in Find and make sure Regex is enabled, that Unicode us checked, and the Text check box is checked and then get count of number of matches across all xhtml.
Last edited by KevinH; Yesterday at 03:18 PM. |
|
|
|
|
|
#12 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,831
Karma: 24222221
Join Date: Dec 2010
Device: Kindle PW2
|
I don't speak Chinese, but, AFAIK, they don't use spaces between words. The only way to add word count support for Chinese would be to bundle Sigil with a Chinese word segmentation library.
|
|
|
|
|
|
#13 | |
|
Sigil Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 9,741
Karma: 6774572
Join Date: Nov 2009
Device: many
|
Quote:
Wow, how to dictionaries and spellchecking even work then? |
|
|
|
|
|
|
#14 |
|
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 52,635
Karma: 180945224
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
For the most part, the word counts I've seen in Chinese texts seem to count every displayed character and then divide by a bugger factor. Modern Chinese uses quite a few words that use 2 or more characters. If you want real fun, try sorting in Japanese where quite a few Kanji glyphs have multiple pronunciations.
The horrible example from one page on the issue was:
And yes, the kanji version of the names are the same, just multiple pronunciations. This is a reason that Japanese ID tends to have a ruby (furigana) version of the name in hiragana which is the primary phonetic syllabary used in Japan though you can find katakana and romaji (Latin alphabet) used. Last edited by DNSB; Yesterday at 05:14 PM. Reason: bugger factor not bugger factory... fat fingers cause typos... |
|
|
|
|
|
#15 |
|
Sigil Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 9,741
Karma: 6774572
Join Date: Nov 2009
Device: many
|
I checked and there does appear to be github python based repos to do pseudo words counts for Chinese text that do seem to mainly count single characters and then "adjust" that figure. So this would be a job for a Chinese Sigil plugin developer as I have no way to evaluate which of these github python programs is the best one to use.
|
|
|
|
![]() |
| Thread Tools | Search this Thread |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Word Count? | bdub | Calibre | 3 | 12-20-2025 06:49 PM |
| Add page count, word count and reading time | ZodWallop | Kobo Reader | 4 | 08-12-2024 05:56 AM |
| Word Count and Page Count? | CrossReach | Library Management | 2 | 07-19-2018 05:44 PM |
| Feature Request: Get word count for current article/chapter | truth1ness | Calibre | 0 | 04-02-2015 05:35 PM |
| Word Count | leebase | Calibre | 34 | 06-07-2011 11:53 PM |