Deep Neural Network Approach for Single Channel Speech Enhancement Processing
| dc.contributor.author | Li, Dongfu | |
| dc.contributor.supervisor | Bouchard, Martin | |
| dc.date.accessioned | 2016-04-08T19:47:27Z | |
| dc.date.available | 2016-04-08T19:47:27Z | |
| dc.date.issued | 2016 | |
| dc.description.abstract | Speech intelligibility represents how comprehensible a speech is. It is more important than speech quality in some applications. Single channel speech intelligibility enhancement is much more difficult than multi-channel intelligibility enhancement. It has recently been reported that training-based single channel speech intelligibility enhancement algorithms perform better than Signal to Noise Ratio (SNR) based algorithm. In this thesis, a training-based Deep Neural Network (DNN) is used to improve single channel speech intelligibility. To increase the performance of the DNN, the Multi-Resolution Cochlea Gram (MRCG) feature set is used as the input of the DNN. MATLAB objective test results show that the MRCG-DNN approach is more robust than a Gaussian Mixture Model (GMM) approach. The MRCG-DNN also works better than other DNN training algorithms. Various conditions such as different speakers, different noise conditions and reverberation were tested in the thesis. | en |
| dc.identifier.uri | http://hdl.handle.net/10393/34472 | |
| dc.identifier.uri | http://dx.doi.org/10.20381/ruor-5532 | |
| dc.language.iso | en | en |
| dc.publisher | Université d'Ottawa / University of Ottawa | en |
| dc.subject | DNN | en |
| dc.subject | GMM | en |
| dc.subject | MRCG | en |
| dc.subject | Single-channel speech processing | en |
| dc.title | Deep Neural Network Approach for Single Channel Speech Enhancement Processing | en |
| dc.type | Thesis | en |
| thesis.degree.discipline | Génie / Engineering | en |
| thesis.degree.level | Masters | en |
| thesis.degree.name | MASc | en |
| uottawa.department | Science informatique et génie électrique / Electrical Engineering and Computer Science | en |
