11-12-2014, 08:29 PM | #691 |
Junior Member
Posts: 7
Karma: 10
Join Date: Nov 2014
Device: Kindle Paperwhite
|
Yeah, I just tried looking at the source. Way too complicated to figure out what's going on there. I think I'm just going to convert all my epubs to .txt and then make a command-line script in Python to do the unique word counts.
|
11-12-2014, 09:31 PM | #692 |
null operator (he/him)
Posts: 20,565
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
@stevenlebeau - you could shovel the results from parsing the text files into a custom column in the metadata database via the Import List PI - I don't think the calibredb ebook-meta command allows updating of custom columns.
BR |
Advert | |
|
11-12-2014, 10:03 PM | #693 |
Junior Member
Posts: 7
Karma: 10
Join Date: Nov 2014
Device: Kindle Paperwhite
|
That could work! I'll give it a shot.
|
11-13-2014, 12:07 AM | #694 | |
Grand Sorcerer
Posts: 24,907
Karma: 47303748
Join Date: Jul 2011
Location: Sydney, Australia
Device: Kobo:Touch,Glo, AuraH2O, GloHD,AuraONE, ClaraHD, Libra H2O; tolinoepos
|
Quote:
There's probably a more efficient way to do it than that, plus maybe you want to filter some words. Should "a" and "I" be counted? Or "the", "and" etc. Do plurals get counted separately? What about proper nouns? Numbers? Adding it as an option and using a separate custom column is, to me anyway, the hard bit. But, it is mainly copying similar sections and changing the names to protect the innocent. Getting the GUI all looking good with decent field descriptions is probably the hardest bit. And I'm regretting typing this. I'm giving myself a "If it's so easy, why don't you do it" feeling |
|
11-13-2014, 12:30 AM | #695 |
Junior Member
Posts: 7
Karma: 10
Join Date: Nov 2014
Device: Kindle Paperwhite
|
Based on what you wrote, I actually changed the lines in statistics.py to make this happen. Only thing is, calibre won't let me install it. I re-zipped and tried to install the plug-in manually, and it complained about not having a __init__.py file (which it DOES have, so I'm not sure what to do next).
I suppose I could simply change the already-installed statistics.py to do the work for me, but I haven't been able to figure out where calibre stores its' plug-ins in OS X. |
Advert | |
|
11-13-2014, 03:40 AM | #696 | |
null operator (he/him)
Posts: 20,565
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
BR |
|
11-13-2014, 04:16 AM | #697 | ||
Grand Sorcerer
Posts: 24,907
Karma: 47303748
Join Date: Jul 2011
Location: Sydney, Australia
Device: Kobo:Touch,Glo, AuraH2O, GloHD,AuraONE, ClaraHD, Libra H2O; tolinoepos
|
Quote:
Quote:
|
||
11-13-2014, 05:54 AM | #698 |
Junior Member
Posts: 7
Karma: 10
Join Date: Nov 2014
Device: Kindle Paperwhite
|
Okay, I figured it all out. I did archive the root directory and not just its contents. Once I fixed that, it worked like a charm.
So basically I replaced the word count with code that set the book text to lowercase, split it into a list, and then made a set of the list and counted the length of the set. I am actually shocked at how many different words are even in the simplest of children's books! But I have a much better idea of where I'll be starting now. Thanks to everyone for all their help! |
11-18-2014, 01:30 PM | #699 |
Junior Member
Posts: 1
Karma: 10
Join Date: Nov 2014
Device: Kindle Paperwhite
|
how do you fix (No goodreads id)%s
getting it for 99% of my books. also getting false page numbers on the majority of my books using estimate. |
11-24-2014, 08:06 PM | #700 |
Fear The Turtle!
Posts: 866
Karma: 4035032
Join Date: Sep 2009
Location: Margaritaville
Device: KV, Kobo Forma, Kobo A1LE, KO3, K3
|
I'm having same issue as crumba in post #699. Any help to troubleshoot is appreciated.
|
11-26-2014, 05:33 AM | #701 | ||
Grand Sorcerer
Posts: 24,907
Karma: 47303748
Join Date: Jul 2011
Location: Sydney, Australia
Device: Kobo:Touch,Glo, AuraH2O, GloHD,AuraONE, ClaraHD, Libra H2O; tolinoepos
|
Quote:
Quote:
|
||
11-26-2014, 05:34 AM | #702 |
Grand Sorcerer
Posts: 24,907
Karma: 47303748
Join Date: Jul 2011
Location: Sydney, Australia
Device: Kobo:Touch,Glo, AuraH2O, GloHD,AuraONE, ClaraHD, Libra H2O; tolinoepos
|
|
12-24-2014, 10:30 PM | #703 |
Resident Curmudgeon
Posts: 73,931
Karma: 128903250
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
I've found a bug. Count Pages treats words with an em dash between them as one word.
So word1—word2 is counted as one word. I'm betting it's that way for ... and en dash and so forth. Can this be fixed? Thanks. |
12-25-2014, 01:47 AM | #704 |
Trouble
Posts: 12
Karma: 10
Join Date: Dec 2014
Location: Quebec City
Device: Nook
|
Page count plugins
Hello, I'm new at this so please bear with me. I have installed the Count Pages plugins. I have created the custom columns. I have customized the preferences of the plugins so the columns have been selected.
Calibre is installed on the following path: 'K:\10 - Library' , which is an external hard drive. My books are stored using the following path: 'K:\10 - Library\Calibre Library' I run the plugins and it looks like it is working fine. I look up the Count log and I can see the appropriate and correct information. When running the plugins and clicking yes to update the columns, a '1' appears in each columns. In the count log the following information can be read (something tells me you might need to know that): "InputFormatPlugin: EPUB Input running on C:\Users\Turgeon\AppData\Local\Temp\calibre_onffru \a3vesk_count_pages\2759.epub Found HTML cover titlepage.xhtml" What am I doing wrong. Please keep in mind: I have absolutely no programming knowledge/understanding. I know just enough about computers to get myself in trouble. Your help would be greatly appreciated. Last edited by sophieturgeon; 12-25-2014 at 01:55 AM. Reason: Additional Information |
12-25-2014, 10:45 AM | #705 | |
Well trained by Cats
Posts: 29,791
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
What page count Algorithm are you using? |
|
Tags |
count, count pages, page count, pages, plugin |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
[GUI Plugin] Quality Check | kiwidude | Plugins | 1184 | 04-17-2024 06:17 PM |
[GUI Plugin] Open With | kiwidude | Plugins | 403 | 04-01-2024 08:39 AM |
[GUI Plugin] Quick Preferences | kiwidude | Plugins | 62 | 03-16-2024 11:47 PM |
[GUI Plugin] Kindle Collections (old) | meme | Plugins | 2070 | 08-11-2014 12:02 AM |
[GUI Plugin] Plugin Updater **Deprecated** | kiwidude | Plugins | 159 | 06-19-2011 12:27 PM |