Even though the optimal state sequence is obtained using forward decoding or Viterbi decoding, the relation between the classification and the speaker is still unknown. Therefore, we calculated the classification rate by the following expression.
In the above, is the optimal state sequence.
is an arbitrary
permutation of
, and
is the correct speaker of the
utterance.
is the variable that takes value 1 if the values agree,
and 0 if otherwise.
In this study, the classification numbers are related to , the
correct classification rate is calculated for each of
permutations, and the maximum is defined as the
classification rate. Consequently, in the case of 4 speakers,
combinations are examined.