Electrical, Computer, and Biomedical Engineering Faculty Publications

Comprehensive modulation representation for automatic speech recognition

Yadong Wang, University of Maryland, College Park
Steven Greenberg, Danmarks Tekniske Universitet
Jayaganesh Swaminathan, University of Rhode Island
Ramdas Kumaresan, University of Rhode IslandFollow
David Poeppel, University of Maryland, College Park

Document Type

Conference Proceeding

Date of Original Version

12-1-2005

Abstract

We present a new feature representation for speech recognition based on both amplitude modulation spectra (AMS) and frequency modulation spectra (FMS). A comprehensive modulation spectral (CMS) approach is defined and analyzed based on a modulation model of the band-pass signal. The speech signal is processed first by a bank of specially designed auditory band-pass filters. CMS are extracted from the output of the filters as the features for automatic speech recognition (ASR). A significant improvement is demonstrated in performance on noisy speech. On the Aurora 2 task the new features result in an improvement of 23.43% relative to traditional mel-cepstrum front-end features using a 3 GMM HMM back-end. Although the improvements are relatively modest, the novelty of the method and its potential for performance enhancement warrants serious attention for future-generation ASR applications.

Publication Title, e.g., Journal

9th European Conference on Speech Communication and Technology

Citation/Publisher Attribution

Wang, Yadong, Steven Greenberg, Jayaganesh Swaminathan, Ramdas Kumaresan, and David Poeppel. "Comprehensive modulation representation for automatic speech recognition." 9th European Conference on Speech Communication and Technology (2005): 3025-3028. https://digitalcommons.uri.edu/ele_facpubs/658

This document is currently not available here.

COinS

Electrical, Computer, and Biomedical Engineering Faculty Publications

Comprehensive modulation representation for automatic speech recognition

Document Type

Date of Original Version

Abstract

Publication Title, e.g., Journal

Citation/Publisher Attribution

Search

Browse

Author Corner

Electrical, Computer, and Biomedical Engineering Faculty Publications

Comprehensive modulation representation for automatic speech recognition

Authors

Document Type

Date of Original Version

Abstract

Publication Title, e.g., Journal

Citation/Publisher Attribution

Share

Search

Browse

Author Corner