Geometric Information Based Monaural Speech Separation Using Deep Neural Network

Yang Xian,Yang Sun,Jonathon A. Chambers,Syed Mohsen Naqvi

Geometric Information Based Monaural Speech Separation Using Deep Neural Network

2018

Yang Xian
Yang Sun
Jonathon A. Chambers
Syed Mohsen Naqvi

The performance of deep neural network (DNN) based monaural speech separation methods is limited in reverberant and noisy room environments. In this paper, we propose a new DNN training target which incorporates geometric information describing the target speaker and microphone to improve the performance in reverberant and noisy room environments. The experiments are based on the IEEE corpus and the NOISEX database and real impulse responses (RIRs). The objective evaluations, short-time objective intelligibility (STOI) and perceptual evaluation of speech quality (PESQ) confirm the efficiency of the proposed direct path ratio mask (DRM).

Keywords:

Noise measurement
Artificial intelligence
Impulse (physics)
Artificial neural network
Intelligibility (communication)
Pattern recognition
Computer science
PESQ
Microphone
Signal-to-noise ratio
Monaural

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations