Repository logo

Speech feature estimation under the presence of noise with switching Kalman Filter methods

dc.contributor.authorDeng, Jianping
dc.date.accessioned2013-11-08T16:08:18Z
dc.date.available2013-11-08T16:08:18Z
dc.date.created2007
dc.date.issued2007
dc.degree.levelDoctoral
dc.description.abstractThe performance degradation of a speech recognizer in the presence of additive noise is one of the major problems that still remain unsolved in the application of speech recognition technology. This thesis develops speech enhancement schemes where a noisy speech signal is processed in the feature extraction stage. Since the most popular speech features for speech recognition are Mel-Frequency Cepstral Coefficients (MFCC), vectors of Mel-scaled log-spectrum coefficients or cepstrum coefficients are enhanced. Three different speech feature enhancement schemes based on switching linear dynamic models (SLDMs) are proposed. The switching linear dynamic model describes the nonlinear and non-stationary time trajectory of speech features by switching among a set of linear dynamic models over time. With the resulting SLDMs as a speech model and a model for noise, speech and noise can be tracked jointly by means of switching Kalman filtering, which involves a weighted sum of filters operating interactively in parallel. Since the distortion caused by additive ambient noises is highly non-linear in the feature domain, the Extended Kalman Filter algorithm (EKF) and the Unscented Kalman Filter algorithm (UKF) have been used to deal with the nonlinear distortion caused by noise in the feature domain. Comprehensive experiments have been carried out to evaluate the proposed schemes with commonly used databases. The simulation results are presented and compared with other model-based feature enhancement systems in the literature in terms of speech recognition accuracy. Compared with the best results based on the Aurora2 database that we could find in the literature, our approach offers a similar performance (85.96% vs. 86.72%) when the EKF is used in our proposed method, resulting in a smaller complexity. When the UKF is used with our method, our approach achieves a better performance (89.60% vs. 86.72%).
dc.format.extent174 p.
dc.identifier.citationSource: Dissertation Abstracts International, Volume: 70-07, Section: B, page: 4375.
dc.identifier.urihttp://hdl.handle.net/10393/29634
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-13067
dc.language.isoen
dc.publisherUniversity of Ottawa (Canada)
dc.subject.classificationEngineering, Electronics and Electrical.
dc.titleSpeech feature estimation under the presence of noise with switching Kalman Filter methods
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
NR49335.PDF
Size:
9.91 MB
Format:
Adobe Portable Document Format