Speech feature estimation under the presence of noise with switching Kalman Filter methods

Deng, Jianping

Speech feature estimation under the presence of noise with switching Kalman Filter methods

dc.contributor.author	Deng, Jianping
dc.date.accessioned	2013-11-08T16:08:18Z
dc.date.available	2013-11-08T16:08:18Z
dc.date.created	2007
dc.date.issued	2007
dc.degree.level	Doctoral
dc.description.abstract	The performance degradation of a speech recognizer in the presence of additive noise is one of the major problems that still remain unsolved in the application of speech recognition technology. This thesis develops speech enhancement schemes where a noisy speech signal is processed in the feature extraction stage. Since the most popular speech features for speech recognition are Mel-Frequency Cepstral Coefficients (MFCC), vectors of Mel-scaled log-spectrum coefficients or cepstrum coefficients are enhanced. Three different speech feature enhancement schemes based on switching linear dynamic models (SLDMs) are proposed. The switching linear dynamic model describes the nonlinear and non-stationary time trajectory of speech features by switching among a set of linear dynamic models over time. With the resulting SLDMs as a speech model and a model for noise, speech and noise can be tracked jointly by means of switching Kalman filtering, which involves a weighted sum of filters operating interactively in parallel. Since the distortion caused by additive ambient noises is highly non-linear in the feature domain, the Extended Kalman Filter algorithm (EKF) and the Unscented Kalman Filter algorithm (UKF) have been used to deal with the nonlinear distortion caused by noise in the feature domain. Comprehensive experiments have been carried out to evaluate the proposed schemes with commonly used databases. The simulation results are presented and compared with other model-based feature enhancement systems in the literature in terms of speech recognition accuracy. Compared with the best results based on the Aurora2 database that we could find in the literature, our approach offers a similar performance (85.96% vs. 86.72%) when the EKF is used in our proposed method, resulting in a smaller complexity. When the UKF is used with our method, our approach achieves a better performance (89.60% vs. 86.72%).
dc.format.extent	174 p.
dc.identifier.citation	Source: Dissertation Abstracts International, Volume: 70-07, Section: B, page: 4375.
dc.identifier.uri	http://hdl.handle.net/10393/29634
dc.identifier.uri	http://dx.doi.org/10.20381/ruor-13067
dc.language.iso	en
dc.publisher	University of Ottawa (Canada)
dc.subject.classification	Engineering, Electronics and Electrical.
dc.title	Speech feature estimation under the presence of noise with switching Kalman Filter methods
dc.type	Thesis

Fichiers

Trousse originale

Voici les éléments 1 - 1 sur 1

Nom:: NR49335.PDF
Taille:: 9.91 MB
Format:: Adobe Portable Document Format

Télécharger

Collections

Thèses, 1910 - 2010 // Theses, 1910 - 2010