Electrical, Computer, and Biomedical Engineering Faculty Publications

Cochannel speaker separation by harmonic enhancement and suppression

David P. Morgan, Lockheed Martin Corporation
E. Bryan George, Fidelity Voice Systems Integration
Leonard T. Lee, Texas Instruments
Steven M. Kay, University of Rhode Island

Document Type

Article

Date of Original Version

12-1-1997

Abstract

This paper presents a system for separating the cochannel speech of two talkers. The proposed harmonic enhancement and suppression (HES) system is based on a frame-by-frame speaker separation algorithm that exploits the pitch estimate of the stronger talker derived from the cochannel signal. The idea behind this approach is to recover the stronger talker's speech by enhancing that talker's harmonic frequencies and formants, given a multiresolution pitch estimate. The weaker talker's speech is obtained from the residual signal created when the harmonics and formants of the stronger talker are suppressed. An automatic speaker assignment algorithm is used to place recovered frames from the target and interfering talkers in separate channels. Automatic speaker assignment performs reasonably well in most cochannel environments, including voiced-on-voiced, voiced-on-unvoiced, and unvoiced-on-unvoiced speech, assignment after processing silence intervals, and single-talker speech (no cochannel interference). The HES system has been tested at target-to-interferer ratios (TIRs) from -18 to 18 dB with widely available databases. It has demonstrated improved performance in keyword spotting tests for TIR values of 6, 12, and 18 dB, and in human listening tests for TIR values of -6 and -18 dB. © 1997 IEEE.
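The enhancement and suppression idea in the abstract can be illustrated with a minimal single-frame sketch. It is not the paper's HES algorithm: it assumes the stronger talker's pitch is already known (the paper derives a multiresolution estimate from the cochannel signal), it uses a plain binary FFT mask around each harmonic, and it ignores formants and speaker assignment. All function names, the 15 Hz mask half-width, and the synthetic test tones are hypothetical choices made for illustration.

```python
import numpy as np

def harmonic_tone(f0, n_harmonics, n, fs):
    """Synthetic 'voiced' frame: a sum of harmonics of f0 (illustrative stand-in for speech)."""
    t = np.arange(n) / fs
    return sum(np.sin(2 * np.pi * k * f0 * t) / k for k in range(1, n_harmonics + 1))

def scale_to_tir(target, interferer, tir_db):
    """Scale the interferer so that 10*log10(P_target / P_interferer) equals tir_db."""
    p_target = np.mean(target ** 2)
    p_interf = np.mean(interferer ** 2)
    gain = np.sqrt(p_target / (p_interf * 10 ** (tir_db / 10)))
    return interferer * gain

def harmonic_mask(n_fft, fs, f0, half_width_hz):
    """Boolean mask over rfft bins lying within half_width_hz of a harmonic of f0."""
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)
    k = np.rint(freqs / f0)  # index of the nearest harmonic for each bin
    return (np.abs(freqs - k * f0) <= half_width_hz) & (k >= 1)

def split_frame(frame, fs, f0, half_width_hz=15.0):
    """Return (enhanced, residual): bins near the harmonics of f0, and everything else."""
    spectrum = np.fft.rfft(frame)
    mask = harmonic_mask(len(frame), fs, f0, half_width_hz)
    enhanced = np.fft.irfft(spectrum * mask, n=len(frame))   # stronger talker estimate
    residual = np.fft.irfft(spectrum * ~mask, n=len(frame))  # weaker talker estimate
    return enhanced, residual

# Usage: a 100 Hz "stronger talker" mixed with a 140 Hz "weaker talker" at 6 dB TIR.
fs, n = 8000, 800
strong = harmonic_tone(100.0, 5, n, fs)
weak = scale_to_tir(strong, harmonic_tone(140.0, 3, n, fs), tir_db=6.0)
enhanced, residual = split_frame(strong + weak, fs, f0=100.0)
```

In this idealized case every harmonic falls exactly on an FFT bin and the two talkers' harmonics do not overlap, so the split is essentially exact. Real cochannel speech has time-varying pitch, overlapping harmonics, and unvoiced segments, which is where the pitch estimation and speaker assignment stages described in the abstract come in.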

Publication Title

IEEE Transactions on Speech and Audio Processing

Volume

5

Issue

5

Citation/Publisher Attribution

Morgan, David P., E. Bryan George, Leonard T. Lee, and Steven M. Kay. "Cochannel speaker separation by harmonic enhancement and suppression." IEEE Transactions on Speech and Audio Processing 5, 5 (1997): 407-424. doi: 10.1109/89.622561.

DOI

https://doi.org/10.1109/89.622561
