Discriminative learning for speech recognition theory and practice /

In this book, we introduce the background and mainstream methods of probabilistic modeling and discriminative parameter optimization for speech recognition. The specific models treated in depth include the widely used exponential-family distributions and the hidden Markov model. A detailed study is...

Full description

Bibliographic Details
Main Author: He, Xiaodong, 1973-
Other Authors: Deng, Li, 1958-
Format: Electronic
Language:English
Published: San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) : Morgan & Claypool Publishers, c2008.
Series:Synthesis lectures on speech and audio processing (Online), #4.
Subjects:
Online Access:View fulltext via EzAccess
Description
Summary:In this book, we introduce the background and mainstream methods of probabilistic modeling and discriminative parameter optimization for speech recognition. The specific models treated in depth include the widely used exponential-family distributions and the hidden Markov model. A detailed study is presented on unifying the common objective functions for discriminative learning in speech recognition, namely maximum mutual information (MMI), minimum classification error, and minimum phone/word error. The unification is presented, with rigorous mathematical analysis, in a common rational-function form. This common form enables the use of the growth transformation (or extended Baum-Welch) optimization framework in discriminative learning of model parameters. In addition to all the necessary introduction of the background and tutorial material on the subject, we also included technical details on the derivation of the parameter optimization formulas for exponential-family distributions, discrete hidden Markov models (HMMs), and continuous-density HMMs in discriminative learning. Selected experimental results obtained by the authors in firsthand are presented to show that discriminative learning can lead to superior speech recognition performance over conventional parameter learning. Details on major algorithmic implementation issues with practical significance are provided to enable the practitioners to directly reduce the theory in the earlier part of the book into engineering practice.
Item Description:Part of: Synthesis digital library of engineering and computer science.
Title from PDF t.p. (viewed on Oct. 24, 2008).
Series from website.
Physical Description:1 electronic text (vii, 112 p. : ill.) : digital file.
Also available in print.
Format:Mode of access: World Wide Web.
System requirements: Adobe Acrobat Reader.
Bibliography:Includes bibliographical references (p. 107-110).
ISBN:9781598293098 (ebook)
9781598293081 (pbk.)
ISSN:1932-1678 ;
Access:Abstract freely available; full-text restricted to subscribers or individual document purchasers.