Discrete-utterance recognition with a fast match based on total data reduction

Nouza, Jan

Discrete-utterance recognition with a fast match based on total data reduction

Files

2-s2.0-0030355991-o.pdf(695.71 KB)

Date

1996-01-01

Authors

Nouza, Jan

Publisher

IEEE, Piscataway, NJ, United States

Abstract

In the paper, a two-level classification scheme applicable to practical discrete-utterance recognition systems is presented. Both the fast and fine match employ CDHMM whole-word models. The fast match is based on total data reduction, which includes both the minimalization of the acoustic data flow (the numbers of speech frames and features) and the reduction of the basic HMM parameters (the numbers of states and mixtures). The optimal choice of the fast match parameters is a subject of the procedure that aims at minimizing the total classification time while preserving the maximum available recognition accuracy. On a medium-size vocabulary task (121 city names) the fast match reduced recognition time to approx. 20% (compared with the original one-level system) with a negligible loss of accuracy. The time savings were even more considerable in case of a system with multi-mixture HMMs.

Item identifier

https://dspace.tul.cz/handle/15240/16587

Show full item record