Vocal Source Features for Bilingual Speaker Identification
Format of Original
Institute of Electrical and Electronics Engineers
2013 IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP)
Original Item ID
This paper introduces the use of two new features for speaker identification, Residual Phase Cepstrum Coefficients (RPCC) and Glottal Flow Cepstrum Coefficients (GLFCC), to capture speaker-specific characteristics from their vocal excitation patterns. Results on a cross-lingual speaker identification task taken from the NIST 2004 SRE demonstrate that these RPCC and GLFCC features are significantly more accurate than traditional melfrequency cepstral coefficients (MFCC). In particular, these two new features give better results with smaller amounts of training data, due to lower model complexity.