Electrical and Computer Engineering Faculty Research and Publications

Phone Lattice Reconstruction for Embedded Language Recognition in LVCSR

Document Type

Article

Language

eng

Publication Date

4-13-2012

Publisher

Springer

Source Publication

EURASIP Journal on Audio, Speech, and Music Processing

Source ISSN

1687-4722

Original Item ID

DOI: 10.1186/1687-4722-2012-15

Abstract

An increasing number of multilingual applications require language recognition (LRE) as a frontend, but desire low additional computational cost. This article demonstrates a novel architecture for embedding phone based language recognition into a large vocabulary continuous speech recognition (LVCSR) decoder by sharing the same decoding process but generating separate lattices. To compensate for the prior bias introduced by the pronunciation dictionary and the language model of the LVCSR decoder, three different phone lattice reconstruction algorithms are proposed. The underlying goals of these algorithms are to override pronunciation and grammar restrictions to provide richer phonetic information. All of the new algorithms incorporate a vector space modeling backend for improved LRE accuracy. Evaluated on a Mandarin/English detection task, the proposed integrated LVCSR-LRE system using frame-expanded N-best phone lattice achieves comparable performance to a state-of-the-art phone recognition-vector space modeling (PRVSM) system, but with an added computational cost three times lower than that of a separate PRVSM system.

Comments

Published version. EURASIP Journal on Audio, Speech, and Music Processing, Vol. 2012, No. 15 (April 13, 2012). DOI. Published under Creative Commons Attribution License 2.0.

Recommended Citation

Shan, Yuxiang; Deng, Yan; Liu, Jia; and Johnson, Michael T., "Phone Lattice Reconstruction for Embedded Language Recognition in LVCSR" (2012). Electrical and Computer Engineering Faculty Research and Publications. 45.
https://epublications.marquette.edu/electric_fac/45

Download

Find in your library

Included in

Computer Engineering Commons, Electrical and Computer Engineering Commons

COinS

e-Publications@Marquette

Electrical and Computer Engineering Faculty Research and Publications

Phone Lattice Reconstruction for Embedded Language Recognition in LVCSR

Document Type

Language

Publication Date

Publisher

Source Publication

Source ISSN

Original Item ID

Abstract

Comments

Recommended Citation

Included in

Browse

Information about e-Pubs@MU

Links

e-Publications@Marquette

Electrical and Computer Engineering Faculty Research and Publications

Phone Lattice Reconstruction for Embedded Language Recognition in LVCSR

Authors

Document Type

Language

Publication Date

Publisher

Source Publication

Source ISSN

Original Item ID

Abstract

Comments

Recommended Citation

Included in

Share

Browse

Information about e-Pubs@MU

Links