pocketsphinx_continuous: "Warning: Could not find Mic element"

geekbynature · 02-09-2010, 09:15 PM

I'm having some trouble using pocketsphinx. I'm running Fedora 9 on a Dell Latitude D630 and I know the built-in microphone is fine, as I can test it in the sound preferences as well as using `arecord`. I get the following output from pocketsphinx_continuous:

Code:

pts/31$ pocketsphinx_continuous -verbose yes -hmmusr/local/share/pocketsphinx/model/hmm/wsj1 -lm 3400.lm -dict 3400.dic

INFO: cmd_ln.c(506): Parsing command line:
pocketsphinx_continuous \
	-verbose yes \
	-hmm /usr/local/share/pocketsphinx/model/hmm/wsj1 \
	-lm 3400.lm \
	-dict 3400.dic 

Current configuration:
[NAME]		[DEFLT]		[VALUE]
-adcdev				
-agc		none		none
-agcthresh	2.0		2.000000e+00
-alpha		0.97		9.700000e-01
-argfile			
-ascale		20.0		2.000000e+01
-backtrace	no		no
-beam		1e-48		1.000000e-48
-bestpath	yes		yes
-bestpathlw	9.5		9.500000e+00
-cep2spec	no		no
-ceplen		13		13
-cmn		current		current
-cmninit	8.0		8.0
-compallsen	no		no
-dict				3400.dic
-dictcase	no		no
-dither		no		no
-doublebw	no		no
-ds		1		1
-fdict				
-feat		1s_c_d_dd	1s_c_d_dd
-featparams			
-fillprob	1e-8		1.000000e-08
-frate		100		100
-fsg				
-fsgusealtpron	yes		yes
-fsgusefiller	yes		yes
-fwdflat	yes		yes
-fwdflatbeam	1e-64		1.000000e-64
-fwdflatefwid	4		4
-fwdflatlw	8.5		8.500000e+00
-fwdflatsfwin	25		25
-fwdflatwbeam	7e-29		7.000000e-29
-fwdtree	yes		yes
-hmm				/usr/local/share/pocketsphinx/model/hmm/wsj1
-input_endian	little		little
-jsgf				
-kdmaxbbi	-1		-1
-kdmaxdepth	0		0
-kdtree				
-latsize	5000		5000
-lda				
-ldadim		0		0
-lifter		0		0
-lm				3400.lm
-lmctl				
-lmname		default		default
-logbase	1.0001		1.000100e+00
-logfn				
-logspec	no		no
-lowerf		133.33334	1.333333e+02
-lpbeam		1e-40		1.000000e-40
-lponlybeam	7e-29		7.000000e-29
-lw		6.5		6.500000e+00
-maxhistpf	100		100
-maxhmmpf	-1		-1
-maxnewoov	20		20
-maxwpf		-1		-1
-mdef				
-mean				
-mfclogdir			
-mixw				
-mixwfloor	0.0000001	1.000000e-07
-mmap		yes		yes
-ncep		13		13
-nfft		512		512
-nfilt		40		40
-nwpen		1.0		1.000000e+00
-pbeam		1e-48		1.000000e-48
-pip		1.0		1.000000e+00
-rawlogdir			
-remove_dc	no		no
-round_filters	yes		yes
-samprate	16000		1.600000e+04
-sdmap				
-seed		-1		-1
-sendump			
-silprob	0.005		5.000000e-03
-smoothspec	no		no
-spec2cep	no		no
-svspec				
-tmat				
-tmatfloor	0.0001		1.000000e-04
-topn		4		4
-toprule			
-transform	legacy		legacy
-unit_area	yes		yes
-upperf		6855.4976	6.855498e+03
-usewdphones	no		no
-uw		1.0		1.000000e+00
-var				
-varfloor	0.0001		1.000000e-04
-varnorm	no		no
-verbose	no		yes
-warp_params			
-warp_type	inverse_linear	inverse_linear
-wbeam		7e-29		7.000000e-29
-wip		0.65		6.500000e-01
-wlen		0.025625	2.562500e-02

INFO: cmd_ln.c(506): Parsing command line:
\
	-lowerf 1 \
	-upperf 4000 \
	-nfilt 20 \
	-transform dct \
	-round_filters no \
	-remove_dc yes \
	-feat s2_4x 

Current configuration:
[NAME]		[DEFLT]		[VALUE]
-agc		none		none
-agcthresh	2.0		2.000000e+00
-alpha		0.97		9.700000e-01
-cep2spec	no		no
-ceplen		13		13
-cmn		current		current
-cmninit	8.0		8.0
-dither		no		no
-doublebw	no		no
-feat		1s_c_d_dd	s2_4x
-frate		100		100
-input_endian	little		little
-lda				
-ldadim		0		0
-lifter		0		0
-logfn				
-logspec	no		no
-lowerf		133.33334	1.000000e+00
-mfclogdir			
-ncep		13		13
-nfft		512		512
-nfilt		40		20
-rawlogdir			
-remove_dc	no		yes
-round_filters	yes		no
-samprate	16000		1.600000e+04
-seed		-1		-1
-smoothspec	no		no
-spec2cep	no		no
-svspec				
-transform	legacy		dct
-unit_area	yes		yes
-upperf		6855.4976	4.000000e+03
-varnorm	no		no
-verbose	no		yes
-warp_params			
-warp_type	inverse_linear	inverse_linear
-wlen		0.025625	2.562500e-02

INFO: acmod.c(82): Parsed model-specific feature parameters from /usr/local/share/pocketsphinx/model/hmm/wsj1/feat.params
INFO: mdef.c(520): Reading model definition: /usr/local/share/pocketsphinx/model/hmm/wsj1/mdef
INFO: mdef.c(531): Found byte-order mark BMDF, assuming this is a binary mdef file
INFO: bin_mdef.c(301): Reading binary model definition: /usr/local/share/pocketsphinx/model/hmm/wsj1/mdef
INFO: bin_mdef.c(480): 44 CI-phone, 66516 CD-phone, 5 emitstate/phone, 220 CI-sen, 5220 Sen, 18660 Sen-Seq
INFO: tmat.c(204): Reading HMM transition probability matrices: /usr/local/share/pocketsphinx/model/hmm/wsj1/transition_matrices
INFO: acmod.c(114): Attempting to use SCGMM computation module
INFO: s2_semi_mgau.c(981): Reading S3 mixture gaussian file '/usr/local/share/pocketsphinx/model/hmm/wsj1/means'
INFO: s2_semi_mgau.c(1080): 1 mixture Gaussians, 256 components, 4 feature streams, veclen 51
INFO: s2_semi_mgau.c(981): Reading S3 mixture gaussian file '/usr/local/share/pocketsphinx/model/hmm/wsj1/variances'
INFO: s2_semi_mgau.c(1080): 1 mixture Gaussians, 256 components, 4 feature streams, veclen 51
INFO: s2_semi_mgau.c(748): Loading senones from dump file /usr/local/share/pocketsphinx/model/hmm/wsj1/sendump
INFO: s2_semi_mgau.c(764): BEGIN FILE FORMAT DESCRIPTION
INFO: s2_semi_mgau.c(793): Rows: 256, Columns: 5220
INFO: s2_semi_mgau.c(801): Using memory-mapped I/O for senones
INFO: kdtree.c(231): Reading tree for feature 0
INFO: kdtree.c(249): n_density 256 n_comp 12 n_level 8 threshold 0.200000
INFO: kdtree.c(186): Read 255 nodes
INFO: kdtree.c(231): Reading tree for feature 1
INFO: kdtree.c(249): n_density 256 n_comp 24 n_level 8 threshold 0.200000
INFO: kdtree.c(186): Read 255 nodes
INFO: kdtree.c(231): Reading tree for feature 2
INFO: kdtree.c(249): n_density 256 n_comp 3 n_level 8 threshold 0.200000
INFO: kdtree.c(186): Read 255 nodes
INFO: kdtree.c(231): Reading tree for feature 3
INFO: kdtree.c(249): n_density 256 n_comp 12 n_level 8 threshold 0.200000
INFO: kdtree.c(186): Read 255 nodes
INFO: fe_interface.c(162): Current FE Parameters:
INFO: fe_interface.c(163): 	Sampling Rate:             16000.000000
INFO: fe_interface.c(164): 	Frame Size:                410
INFO: fe_interface.c(165): 	Frame Shift:               160
INFO: fe_interface.c(166): 	FFT Size:                  512
INFO: fe_interface.c(167): 	Lower Frequency:           1
INFO: fe_interface.c(169): 	Upper Frequency:           4000
INFO: fe_interface.c(171): 	Number of filters:         20
INFO: fe_interface.c(172): 	Number of Overflow Samps:  0
INFO: fe_interface.c(173): 	Start Utt Status:          0
INFO: fe_interface.c(174): Will remove DC offset at frame level
INFO: fe_interface.c(181): Will not add dither to audio
INFO: fe_interface.c(187): Will normalize filters to unit area
INFO: fe_interface.c(189): Will not round filter frequencies to DFT points
INFO: fe_interface.c(191): Will not use double bandwidth in mel filter
INFO: feat.c(849): Initializing feature stream to type: 's2_4x', ceplen=13, CMN='current', VARNORM='no', AGC='none'
INFO: cmn.c(142): mean[0]= 12.00, mean[1..12]= 0.0
INFO: dict.c(232): Allocating 20 placeholders for new OOVs
INFO: dict.c(494):      6 = words in file [3400.dic]
WARNING: "dict.c", line 435: Skipping duplicate definition of <s>
WARNING: "dict.c", line 435: Skipping duplicate definition of </s>
WARNING: "dict.c", line 435: Skipping duplicate definition of <sil>
INFO: dict.c(494):      3 = words in file [/usr/local/share/pocketsphinx/model/hmm/wsj1/noisedict]
INFO: dict.c(349): LEFT CONTEXT TABLES
INFO: dict.c(1013): Entry Context table contains
	     6 entries
INFO: dict.c(1014): 	   264 possible cross word triphones.
INFO: dict.c(1052): 	   200 triphones
	    20 pseudo diphones
	    44 uniphones
INFO: dict.c(1099): Exit Context table contains
	     6 entries
INFO: dict.c(1100): 	   264 possible cross word triphones.
INFO: dict.c(1166): 	   200 triphones
	    20 pseudo diphones
	    44 uniphones
INFO: dict.c(1168): 	    94 right context entries
INFO: dict.c(1169): 	    15 ave entries per exit context
INFO: dict.c(355): RIGHT CONTEXT TABLES
INFO: dict.c(1013): Entry Context table contains
	     7 entries
INFO: dict.c(1014): 	   308 possible cross word triphones.
INFO: dict.c(1052): 	   252 triphones
	    12 pseudo diphones
	    44 uniphones
INFO: dict.c(1099): Exit Context table contains
	     7 entries
INFO: dict.c(1100): 	   308 possible cross word triphones.
INFO: dict.c(1166): 	   252 triphones
	    12 pseudo diphones
	    44 uniphones
INFO: dict.c(1168): 	   154 right context entries
INFO: dict.c(1169): 	    22 ave entries per exit context
INFO: ngram_model_arpa.c(539): ngrams 1=7, 2=8, 3=6
INFO: ngram_model_arpa.c(204): Reading unigrams
INFO: ngram_model_arpa.c(578):        7 = #unigrams created
INFO: ngram_model_arpa.c(260): Reading bigrams
INFO: ngram_model_arpa.c(594):        8 = #bigrams created
INFO: ngram_model_arpa.c(595):        5 = #prob2 entries
INFO: ngram_model_arpa.c(602):        3 = #bo_wt2 entries
INFO: ngram_model_arpa.c(358): Reading trigrams
INFO: ngram_model_arpa.c(615):        6 = #trigrams created
INFO: ngram_model_arpa.c(616):        3 = #prob3 entries
INFO: ngram_search_fwdtree.c(156): 0 root, 0 non-root channels, 26 single-phone words
INFO: ngram_search_fwdtree.c(195): Creating search tree
INFO: ngram_search_fwdtree.c(203): 0 root, 0 non-root channels, 26 single-phone words
INFO: ngram_search_fwdtree.c(325): max nonroot chan increased to 144
INFO: ngram_search_fwdtree.c(334): 5 root, 16 non-root channels, 6 single-phone words
INFO: ngram_search_fwdflat.c(95): fwdflat: min_ef_width = 4, max_sf_win = 25
Warning: Could not find Mic element
INFO: continuous.c(261): pocketsphinx_continuous COMPILED ON: Feb  9 2010, AT: 19:35:47

READY....
Input overrrun (non-fatal)
Input overrrun (non-fatal)
Input overrrun (non-fatal)
Input overrrun (non-fatal)
Input overrrun (non-fatal)

The "Input overrrun" messages continue until I exit manually. I don't have ALSA, pulseadio or esd running.

I'm using sphinxbase 0.4.1 with sphinxbase 0.5.1 and am very new to all this. I'm thinking there must be a configuration file where I define my mic hardware, but I can't seem to find anything on it, and Google is entirely unhelpful.

geekbynature · 02-14-2010, 07:50 PM

Seems sphinx3_livedecode works with pre-recorded files; this is a good temporary workaround. For some reason sphinx3_livedecode or other options don't work.