It was supposed to be my 4500'th post... but close enough
Pocketsphinx: Audio Recognition and Programming API, built and tested.
Upstream: https://sourceforge.net/projects/cmusphinx/
GITHUB: https://github.com/twobob/pocketsphinx (testing extensions)
Developer API info http://cmusphinx.sourceforge.net/wiki/develop/
(how sexy is that!!!)
Building with 2012.03:
do much wangling...
Testing on a kindle touch:
SAYS
TESTING TO KINDLE TOUCH:
Spoiler:
Quote:
Listening...
Recording is stopped, start recording with ad_start_rec
Stopped listening, please wait...
INFO: cmn_prior.c(121): cmn_prior_update: from < 18.05 -5.35 1.02 -0.73 -0.23 -0.56 -0.04 0.09 -0.23 0.01 0.03 -0.05 -0.05 >
INFO: cmn_prior.c(139): cmn_prior_update: to < 18.14 -5.15 1.14 -0.72 -0.32 -0.59 0.05 0.13 -0.25 0.03 0.04 -0.12 -0.02 >
INFO: ngram_search_fwdtree.c(1549): 3517 words recognized (60/fr)
INFO: ngram_search_fwdtree.c(1551): 200694 senones evaluated (3402/fr)
INFO: ngram_search_fwdtree.c(1553): 400108 channels searched (6781/fr), 25135 1st, 105110 last
INFO: ngram_search_fwdtree.c(1557): 5882 words for which last channels evaluated (99/fr)
INFO: ngram_search_fwdtree.c(1560): 37237 candidate words for entering last phone (631/fr)
INFO: ngram_search_fwdtree.c(1562): fwdtree 1.69 CPU 2.864 xRT
INFO: ngram_search_fwdtree.c(1565): fwdtree 1.71 wall 2.898 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 170 words
INFO: ngram_search_fwdflat.c(937): 2139 words recognized (36/fr)
INFO: ngram_search_fwdflat.c(939): 94138 senones evaluated (1596/fr)
INFO: ngram_search_fwdflat.c(941): 191492 channels searched (3245/fr)
INFO: ngram_search_fwdflat.c(943): 9414 words searched (159/fr)
INFO: ngram_search_fwdflat.c(945): 8057 word transitions (136/fr)
INFO: ngram_search_fwdflat.c(948): fwdflat 0.87 CPU 1.475 xRT
INFO: ngram_search_fwdflat.c(951): fwdflat 0.87 wall 1.477 xRT
INFO: ngram_search.c(1214): </s> not found in last frame, using testing.57 instead
INFO: ngram_search.c(1266): lattice start node <s>.0 end node testing.20
INFO: ngram_search.c(1294): Eliminated 25 nodes before end node
INFO: ngram_search.c(1399): Lattice has 310 nodes, 68 links
INFO: ps_lattice.c(1365): Normalizer P(O) = alpha(testing:20:57) = -478384
INFO: ps_lattice.c(1403): Joint P(O,S) = -481774 P(S|O) = -3390
INFO: ngram_search.c(888): bestpath 0.01 CPU 0.017 xRT
INFO: ngram_search.c(891): bestpath 0.01 wall 0.011 xRT
000000037: testing
READY....
|
Quote:
INFO: ngram_search_fwdtree.c(430): TOTAL fwdtree 200.77 CPU 3.022 xRT
INFO: ngram_search_fwdtree.c(433): TOTAL fwdtree 206.75 wall 3.112 xRT
INFO: ngram_search_fwdflat.c(174): TOTAL fwdflat 57.46 CPU 0.865 xRT
INFO: ngram_search_fwdflat.c(177): TOTAL fwdflat 57.92 wall 0.872 xRT
INFO: ngram_search.c(317): TOTAL bestpath 74.00 CPU 1.114 xRT
INFO: ngram_search.c(320): TOTAL bestpath 74.66 wall 1.124 xRT
|
Special thanks to knc1 and nhsm from the support irc without whom this would never had got built.
YAY!
https://github.com/twobob/sphinxbase-kindle
https://github.com/twobob/pocketsphinx-kindle
expect a KUAL extension RSN EDIT: And now it is completed for kindle touch.
And a build for ALL KINDLES
EDIT:
In order to EXTEND THE VOCABULARY of the application.
1) Grab this corpus file. (or just make you own)
2) push it to the
lmtool webservice
3) grab the tgz. Extract out the lm and dic files... Rename them
gui.lm and
gui.dic
replace the default ones in /mnt/us/extension/ with yours... That should be all..