Abstract
Real-world speech and speaker recognition systems are often subject to ambient noise which results in significant performance loss, more so when the noise types and noise levels are different between training and testing. This paper presents a new preprocessing technique, Coherent Spectral Modification, that aims to reduce distortion due to noise by modifying the complex speech spectrum using information from non-speech regions of the spectral time trajectories. A refinement process is also proposed that further reduces noise mismatch. This combined technique was evaluated on a speaker verification task where the test data was corrupted with varying levels of white noise and pink noise. The new method yielded significant reduction in error rates and performed better than conventional spectral subtraction, particularly at moderate SNRs.
Original language | English (US) |
---|---|
Pages | 791-794 |
Number of pages | 4 |
State | Published - 1999 |
Event | 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999 - Budapest, Hungary Duration: Sep 5 1999 → Sep 9 1999 |
Conference
Conference | 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999 |
---|---|
Country/Territory | Hungary |
City | Budapest |
Period | 9/5/99 → 9/9/99 |
ASJC Scopus subject areas
- Computer Science Applications
- Software
- Linguistics and Language
- Communication