Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream

dc.contributor.author: Silovský, Jan
dc.contributor.author: Nouza, Jan
dc.date.accessioned: 2016-05-24
dc.date.available: 2016-05-24
dc.date.issued: 2006
dc.description.abstract: This paper presents a set of techniques for classifying audio segments in a system for automatic transcription of broadcast programs. The task consists of deciding a) whether the segment is to be labeled as speech or non-speech, and in the former case, b) whether the talking person is one of the speakers in the database, and if not, c) which gender the speaker belongs to. The result of the classification is used to extend the information provided by the transcription system and also to enhance the performance of the speech recognition module. Like most state-of-the-art speaker recognition systems, the proposed one is based on Gaussian Mixture Models (GMM). As the number of database speakers can be large, we introduce a technique that significantly speeds up the identification process. Furthermore, we compare several approaches to the estimation of GMM parameters. Finally, we present the results achieved in classifying 230 minutes of real broadcast data. [en]
dc.description.sponsorship: Grant Agency of the Czech Academy of Sciences [1QS108040569]
dc.format: text
dc.identifier.issn: 1210-2512
dc.identifier.scopus: 2-s2.0-74549191740
dc.identifier.uri: https://dspace.tul.cz/handle/15240/16364
dc.identifier.uri: https://www.researchgate.net/publication/26511701_Speech_Speaker_and_Speaker's_Gender_Identification_in_Automatically_Processed_Broadcast_Stream
dc.language.iso: en
dc.publisher: Spolecnost Pro Radioelektronicke Inzenyrstvi
dc.publisher: Technická Univerzita v Liberci [cs]
dc.publisher: Technical university of Liberec, Czech Republic [en]
dc.relation.ispartof: Radioengineering [en]
dc.source: j-scopus
dc.source: j-wok
dc.subject: Speaker recognition [en]
dc.subject: Gaussian mixture models [en]
dc.subject: broadcast speech transcription [en]
dc.title: Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream [en]
dc.type: Article
local.access: open
local.citation.epage: 48
local.citation.spage: 42
local.department: SpeechLab
local.faculty: Faculty of Mechatronics, Informatics and Interdisciplinary Studies
local.fulltext: yes
local.identifier.stag: RIV/46747885:24220/06:#0000026
local.identifier.wok: 208050900008
local.relation.issue: 3
local.relation.volume: 15
Files
Original bundle
Name: 2-s2.0-74549191740-o.pdf
Size: 264.99 KB
Format: Adobe Portable Document Format
Description: Article