A Comparison of Reconstructed Phase Spaces and Cepstral Coefficients for Multi-Band Phoneme Classification
Format of Original
Institute of Electrical and Electronics Engineers (IEEE)
2004 7th International Conference on Signal Processing. Proceedings. ICSP '04
This paper examines the use of multi-band reconstructed phase spaces as models for phoneme classification. Sub-banding reconstructed phase spaces combines linear, frequency-based techniques with a nonlinear modeling approach to speech recognition. Experiments comparing the effects of filtering speech signals for both reconstructed phase space and traditional speech recognition approaches are presented. These experiments study the use of two non-overlapping subbands for isolated phoneme classification on the TIMIT corpus. It is shown that while classification accuracy using MeI frequency cepstral coefficients as features does not improve with sub-banding, the accuracy increases from 36.1% to 42.0% using sub-banded reconstructed phase spaces to model the phonemes.