Forward decoding ( New spotting algorithm )

次へ: Continuous Speech Experiments 上へ: Decoding for optimal state 戻る: Decoding for optimal state

Forward decoding ( New spotting algorithm )

In this algorithm, first, a score is calculated for summing each score of a state. Next, the optimal state sequence is assigned for the best score of each frame. This decoding algorithm is given as follows.

: optimal state sequence at time ;
$\delta_t(j)$ : score at time and state ;
: number of states for Ergodic HMM ;
$a_{ij}$ ; transition probability at state to ;
$b_j(\mbox{\boldmath$o$}_t)$ ; output probability at state
when the observation vector is $(\mbox{\boldmath$o$}_t)$ ;

For all states $i\in\{1,...,N\}$
$\delta_1(i) = \pi_i \times b_i(\mbox{\boldmath$o$}_1)$
$s_1^*= arg \max_i\delta_1(i)$
Calculate $j\in\{1,...,N\}$ along the time axis for and for all states j, compute:

$\delta_t(j) = \sum_i[\delta_{t-1}(i) \times a_{ij} \times b_j(\mbox{\boldmath$o$}_t)]$
$s_t^*= arg \max_j\delta_t(j)$
The final score is given by :
$\sum_{\mbox{\boldmath$S$}}P(\mbox{\boldmath$O$},\mbox{\boldmath$S$}\mid\mbox{\boldmath$M$}) = \sum_j\delta_T(j)$
$s_T^* = arg \max_j\delta_T(j)$

Finally, if the optimal state sequence is the last state of any word, it is assumed that this word is recognized.

This decoding algorithm is similar to the forward-backward algorithm[5]. However, it can recognize the word for each frame, and therefore, can be regarded as a kind of word spotting algorithm.

次へ: Continuous Speech Experiments 上へ: Decoding for optimal state 戻る: Decoding for optimal state

Jin'ichi Murakami 平成13年1月19日