Browsing by Author "Boháč Marek"
Now showing 1 - 10 of 10
Results Per Page
Sort Options
- ItemAutomatic Syllabification and Syllable Timing of Automatically Recognized Speech - for Czech(Springer International Publishing, 2016-01-01) Boháč Marek; Matějů Lukáš; Rott Michal; Šafařík Radek
- ItemBlock-Online Multi-Channel Speech Enhancement Using DNN-Supported Relative Transfer Function Estimates(Institution of Engineering and Technology, 2020-01-01) Málek Jiří; Koldovský Zbyněk; Boháč Marek
- ItemCHiME4: Multichannel Enhancement Using Beamforming Driven by a DNN-based Voice Activity Detector(2016-01-01) Koldovský Zbyněk; Málek Jiří; Boháč Marek; Janský Jakub
- ItemExploiting of the timing information in subtitle-like parallel multilingual data(Fundancja Uniwersytetu im. Adama Mickiewicza w Poznaniu, 2015-01-01) Boháč Marek; Rott Michal
- ItemLinear acoustic echo cancellation using deep neural networks and convex reconstruction of incomplete transfer function(Institute of Electrical and Electronics Engineers Inc., 2017-01-01) Janský Jakub; Boháč Marek; Koldovský Zbyněk; Müller MichaelLinear acoustic path estimation for acoustic echo cancellation is difficult during periods where the near-end signal (speech) is active. In this paper, we assume that the impulse response is sparse. There are many algorithms that solve the problem of estimating sparse impulse response in the time domain. In this paper, we propose algorithms working in the time-frequency domain. In our approach, it is assumed that the respective transfer function can be estimated only for those frequencies where the near-end signal is not active. First, a deep neural network trained on mixed signals is used to detect the activity of the near-end signal. In frequencies where no activity is detected, the acoustic transfer function is estimated using conventional frequency domain least squares. This results in an incomplete transfer function (ITF) estimate. The completion is done through finding the sparsest representation of the ITF in the time domain. This can be done adaptively using the soft-threshold function, which is applied in the time domain. To achieve improved accuracy, oversampling can be used.
- ItemOn Automatic Cross-Lingual Subtitle Timing(IEEE, 2015-01-01) Boháč Marek; Rott Michal; Blavka Karel
- ItemSW for Automatic Pre-clustering of theText Strings in a Specific Consumer Price Index Data Collection Format - DataClassAnalyzer(2015-01-01) Rozkovec Jiří; Boháč Marek; Šeps Ladislav
- ItemSystem for Producing Subtitles to Internet Audio-Visual Documents(Institute of Electrical and Electronics Engineers Inc., 2015-01-01) Nouza Jan; Blavka Karel; Boháč Marek; Červa Petr; Málek Jiří
- ItemText Punctuation: the Inter-Annotator Agreement Study(Springer Verlag, 2017-01-01) Boháč Marek; Rott Michal; Kovář Vojtěch
- ItemUnique Software Technological Platform for Re-scripting of Archives of Historical And Contemporary Relations Čro And Their Opening Up by the Web(2014-01-01) Nouza Jan; Červa Petr; Žďánský Jindřich; Blavka Karel; Boháč Marek; Silovský Jan; Chaloupka Josef; Kuchařová Michaela; Málek Jiří