Comparison of generative and discriminative approaches for speaker recognition with limited data

Silovský, Jan

Comparison of generative and discriminative approaches for speaker recognition with limited data

Files

09_03_307_316.pdf(3.56 MB)

Date

2009-01-01

Authors

Silovský, Jan

Červa, Petr

Žďánský, Jindřich

Abstract

This paper presents a comparison of three different speaker recognition methods deployed in a broadcast news processing system. We focus on how the generative and discriminative nature of these methods affects the speaker recognition framework and we also deal with intersession variability compensation techniques in more detail, which are of great interest in broadcast processing domain. Performed experiments are specific particularly for the very limited amount of data used for both speaker enrollment (typically ranging from 30 to 60 seconds) and recognition (typically ranging from 5 to 15 seconds). Our results show that the system based on Gaussian Mixture Models (GMMs) outperforms both systems based on Support Vector Machines (SVMs) but its drawback is higher computational cost.

Subject(s)

Broadcast processing, Gaussian Mixture Models (GMM), Speaker recognition, Support Vector Machines (SVM)

Item identifier

https://dspace.tul.cz/handle/15240/16641

ISSN

1210-2512

Show full item record