Document Type
Conference Proceeding
Language
eng
Format of Original
5 p.
Publication Date
2012
Publisher
Institute of Electrical and Electronics Engineers
Source Publication
Proceedings of the Third International Conference on Audio, Language, and Image Processing
Source ISSN
978-1-4673-0173-2
Original Item ID
doi: 10.1109/ICALIP.2012.6376789
Abstract
Multichannel fusion strategies are presented for the distributed microphone recognition environment, for the task of song-type recognition in a multichannel songbird dataset. The signals are first fused together based on various heuristics, including their amplitudes, variances, physical distance, or squared distance, before passing the enhanced single-channel signal into the speech recognition system. The intensity-weighted fusion strategy achieved the highest overall recognition accuracy of 94.4%. By combining the noisy distributed microphone signals in an intelligent way that is proportional to the information contained in the signals, speech recognition systems can achieve higher recognition accuracies.
Recommended Citation
Trawicki, Marek B.; Johnson, Michael T.; Ji, An; and Osiejuk, Tomasz S., "Multichannel Speech Recognition Using Distributed Microphone Signal Fusion Strategies" (2012). Electrical and Computer Engineering Faculty Research and Publications. 15.
https://epublications.marquette.edu/electric_fac/15
Comments
Accepted version. Published as part of the proceedings of the conference, Multichannel Speech Recognition using Distributed Microphone Signal Fusion Strategies, 2012: 1146-1150. DOI: 10.1109/ICALIP.2012.6376789. © 2012 IEEE. Used with permission.