Listening Enhancement in Noisy Environments: Solutions in Time and Frequency Domain

2021 
The intelligibility of speech from a telephone or a public address system is often affected by acoustical background noise in the near-end listening environment. Speech intelligibility and listening effort can be improved by adaptive pre-processing of the loudspeaker signal. This is called Near-End Listening Enhancement (NELE). The speech spectrum is dynamically modified, taking the acoustical background noise at the near-end into account. In this paper, two opposite NELE strategies with either Noise-Masking-Proportional Shaping or Noise-Masking-Inverse Shaping are proposed which are appropriate for different noise characteristics. Both strategies are formulated in closed form in the frequency domain. They do not require to optimize an intelligibility measure but use explicitly the masking threshold. Motivated by the frequency domain approach, a simpler time domain solution is derived which is based on linear prediction techniques and does not need the masking calculations. The proposed NELE solutions outperform state-of-the-art in terms of computational complexity, memory requirement, continuous processor load, and latency.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    24
    References
    0
    Citations
    NaN
    KQI
    []