Deep Neural Network Approach for Single Channel Speech Enhancement Processing

Li, Dongfu

Deep Neural Network Approach for Single Channel Speech Enhancement Processing

Fichiers

Principal Li_Dongfu_2016_thesis.pdf (1.28 MB)

Date

2016

Authors

Li, Dongfu

Éditeur

Université d'Ottawa / University of Ottawa

Résumé

Speech intelligibility represents how comprehensible a speech is. It is more important than speech quality in some applications. Single channel speech intelligibility enhancement is much more difficult than multi-channel intelligibility enhancement. It has recently been reported that training-based single channel speech intelligibility enhancement algorithms perform better than Signal to Noise Ratio (SNR) based algorithm. In this thesis, a training-based Deep Neural Network (DNN) is used to improve single channel speech intelligibility. To increase the performance of the DNN, the Multi-Resolution Cochlea Gram (MRCG) feature set is used as the input of the DNN. MATLAB objective test results show that the MRCG-DNN approach is more robust than a Gaussian Mixture Model (GMM) approach. The MRCG-DNN also works better than other DNN training algorithms. Various conditions such as different speakers, different noise conditions and reverberation were tested in the thesis.

Mots-clés

DNN, GMM, MRCG, Single-channel speech processing

URI

http://hdl.handle.net/10393/34472
http://dx.doi.org/10.20381/ruor-5532

Collections

- Thèses, 2011 - // Theses, 2011 -

Notice complète

Deep Neural Network Approach for Single Channel Speech Enhancement Processing

Fichiers

Date

Authors

Nom de la revue

ISSN de la revue

Titre du volume

Éditeur

Résumé

Description

Mots-clés

Citation

URI

Collections

Approbation

Évaluation

Complété par

Référencé par