Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition

Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition
Author :
Publisher :
Total Pages : 222
Release :
ISBN-10 : OCLC:55646957
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition by : Xiaodong He

Download or read book Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition written by Xiaodong He and published by . This book was released on 2003 with total page 222 pages. Available in PDF, EPUB and Kindle. Book excerpt: Rapid globalization requires speech recognition systems to handle not only speech spoken by native speakers, but also speech spoken by foreign speakers. Currently, most American English speech recognition systems are built from speech data of American native English speakers. Although these systems work very well for native speakers, their performances degrade dramatically on recognition of foreign accented speech. Moreover, due to wide varieties of foreign accents, different speaking proficiency levels of English and limited data, in general it is difficult to train a specific acoustic model for each foreign accent. Therefore a practically feasible way to improve the performance of nonnative speech recognition is fast model adaptation. In this dissertation, the problem of adapting acoustic models of native English speech to nonnative speakers is addressed from the perspective of adaptive model selection. The goal is to dynamically select the optimal model for each nonnative talker so as to balance model robustness to pronunciation variations and model details for discrimination of speech sounds. A maximum expected likelihood (MEL) based technique is proposed for reliable model selection when adaptation data is sparse, where expectation of log-likelihood (EL) of adaptation data is computed based on distributions of mismatch biases between model and data, and model is selected to maximize EL. Moreover, in order to obtain reliable results when the available data is very limited, an improved prior knowledge guided MEL (P-MEL) approach is also proposed by using maximum a posteriori (MAP) estimation of bias distributions. These model selection methods are further combined with Maximum likelihood linear regression (MLLR) to enable adaptation of both structure and parameters of acoustic models. Experiments were performed on data of speakers with a wide range of foreign accents. Results show that the MEL based model selection can dynamically select proper model according to the available adaptation data, and the P-MEL approach can achieve a good performance even when the data amount is very small. Compared with the standard MLLR, the MEL+MLLR and the P-MEL + MLLR methods led to consistent and significant improvement to recognition accuracy on nonnative speakers, without performance degradation on native speakers.


Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition Related Books

Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition
Language: en
Pages: 222
Authors: Xiaodong He
Categories: Automatic speech recognition
Type: BOOK - Published: 2003 - Publisher:

DOWNLOAD EBOOK

Rapid globalization requires speech recognition systems to handle not only speech spoken by native speakers, but also speech spoken by foreign speakers. Current
Robust Adaptation to Non-Native Accents in Automatic Speech Recognition
Language: en
Pages: 135
Authors: Silke Goronzy
Categories: Computers
Type: BOOK - Published: 2003-07-01 - Publisher: Springer

DOWNLOAD EBOOK

Speech recognition technology is being increasingly employed in human-machine interfaces. A remaining problem however is the robustness of this technology to no
Self-Learning Speaker Identification
Language: en
Pages: 178
Authors: Tobias Herbig
Categories: Technology & Engineering
Type: BOOK - Published: 2011-06-18 - Publisher: Springer Science & Business Media

DOWNLOAD EBOOK

Current speech recognition systems are based on speaker independent speech models and suffer from inter-speaker variations in speech signal characteristics. Thi
Automatic Speech and Speaker Recognition
Language: en
Pages: 524
Authors: Chin-Hui Lee
Categories: Technology & Engineering
Type: BOOK - Published: 2012-12-06 - Publisher: Springer Science & Business Media

DOWNLOAD EBOOK

Research in the field of automatic speech and speaker recognition has made a number of significant advances in the last two decades, influenced by advances in s
Advances in Speech Recognition
Language: en
Pages: 177
Authors: Noam Shabtai
Categories: Computers
Type: BOOK - Published: 2010-08-16 - Publisher: BoD – Books on Demand

DOWNLOAD EBOOK

In the last decade, further applications of speech processing were developed, such as speaker recognition, human-machine interaction, non-English speech recogni