Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream

Silovský, Jan

Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream

dc.contributor.author	Silovský, Jan
dc.contributor.author	Nouza, Jan
dc.date.accessioned	2016-05-24
dc.date.available	2016-05-24
dc.date.issued	2006-01-01
dc.description.abstract	This paper presents a set of techniques for classification of audiosegments in a system for automatic transcription of broadcast programs. The task consists in deciding a) whether the segment is to be labeled as speech or a non-speech one, and in the former case, b) whether the talking person is one of the speakers in the database, and if not, c) which gender the speaker belongs to. The result of the classification is used to extend the information provided by the transcription system and also to enhance the performance of the speech recognition module. Like the most of the state-of-the-art speaker recognition systems, the proposed one is based on Gaussian Mixture Models (GMM). As the number of the database speakers can be large, we introduce a technique that speeds up the identification process in significant way. Furthermore, we compare several approaches to the estimation of GMM parameters. Finally, we present the results achieved in classification of 230 minutes of real broadcast data.	en
dc.description.sponsorship	Grant Agency of the Czech Academy of Sciences [1QS108040569]
dc.format	text
dc.identifier.issn	1210-2512
dc.identifier.scopus	2-s2.0-74549191740
dc.identifier.uri	https://dspace.tul.cz/handle/15240/16364
dc.identifier.uri	https://www.researchgate.net/publication/26511701_Speech_Speaker_and_Speaker's_Gender_Identification_in_Automatically_Processed_Broadcast_Stream
dc.language.iso	en
dc.publisher	Spolecnost Pro Radioelektronicke Inzenyrstvi
dc.publisher	Technická Univerzita v Liberci	cs
dc.publisher	Technical university of Liberec, Czech Republic	en
dc.relation.ispartof	Radioengineering	en
dc.source	j-scopus
dc.source	j-wok
dc.subject	Speaker recognition	en
dc.subject	Gaussian mixture models	en
dc.subject	broadcast speech transcription	en
dc.title	Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream	en
dc.type	Article
local.access	οpen
local.citation.epage	48
local.citation.spage	42
local.department	SpeechLab
local.faculty	Faculty of Mechatronics, Informatics and Interdisciplinary Studies
local.fulltext	yes
local.identifier.stag	RIV/46747885:24220/06:#0000026
local.identifier.wok	208050900008
local.relation.issue	3
local.relation.volume	15

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 2-s2.0-74549191740-o.pdf
Size:: 264.99 KB
Format:: Adobe Portable Document Format
Description:: Článek

Download

Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream

Files

Original bundle

Collections