
Investigating speed issues in acoustic-phonetic models for continuous speech recognition

dc.contributor.author: Agbago, Akakpo
dc.date.accessioned: 2013-11-07T17:24:58Z
dc.date.available: 2013-11-07T17:24:58Z
dc.date.created: 2004
dc.date.issued: 2004
dc.degree.level: Masters
dc.degree.name: M.A.Sc.
dc.description.abstract: Automatic Speech Recognition applications face two challenges: accuracy and speed. For good accuracy, Dynamic Programming and Hidden Markov Model (HMM) algorithms are widely used despite their heavy computational load. To solve the speed problem, this thesis uses a Three-Stage-Architecture (TSA) in which Stage.1 enhances the input speech signal and extracts features from it, Stage.2 performs phonetic-acoustic level recognition and outputs strings of phonemes, and Stage.3 completes the recognition into valid words by applying HMMs to those strings rather than to whole utterances. We designed two algorithms for Stage.2: Fast Two-Level Dynamic Programming (FTLDP), which is 20 times faster than a standard Two-Level DP, and ParrallelRecognizer, which performs 320 times faster than the standard Two-Level DP. Both algorithms are combined with a heuristic silence detection feature based on the Cepstrum Gain Envelop Profile (CGEP), which shortens the input speech, and with clustering, which reduces the search space in the reference phonetic models.
dc.format.extent: 143 p.
dc.identifier.citation: Source: Masters Abstracts International, Volume: 43-06, page: 2324.
dc.identifier.uri: http://hdl.handle.net/10393/26559
dc.identifier.uri: http://dx.doi.org/10.20381/ruor-18244
dc.language.iso: en
dc.publisher: University of Ottawa (Canada)
dc.subject.classification: Engineering, Electronics and Electrical.
dc.title: Investigating speed issues in acoustic-phonetic models for continuous speech recognition
dc.type: Thesis

Files

Original bundle

Name: MR01394.PDF
Size: 5.55 MB
Format: Adobe Portable Document Format