Skip to main content
placeholder image

Linear predictive perceptual filtering for acoustic vector sensors: exploiting directional recordings for high quality speech enhancement

Conference Paper


Abstract


  • This paper investigates the performance of a new technique for speech enhancement which combines Linear Predictive (LP) spectrum-based perceptual filtering to the recordings obtained from an Acoustic Vector Sensor (AVS). The technique takes advantage of the directional polar responses of the AVS to obtain a significantly more accurate representation of the LP spectrum of a target speech signal in the presence of noise when compared to single channel, omni-directional recordings. Comparisons between the speech quality obtained from the proposed technique and existing beamforming-based speech enhancement techniques for the AVS are made through Perceptual Evaluation of Speech Quality (PESQ) tests and Mean Opinion Score (MOS) listening tests. Results show significant improvements in PESQ and MOS scores of 0.2 and 1.6, respectively, for the proposed enhancement technique. Being based on a miniature microphone array, the approach is particular suitable for hands free communication applications in mobile telephony.

Authors


  •   Shujau, Muawiyath (external author)
  •   Ritz, Christian H.
  •   Burnett, Ian S. (external author)

Publication Date


  • 2011

Citation


  • Shujau, M., Ritz, C. H. & Burnett, I. S. (2011). Linear predictive perceptual filtering for acoustic vector sensors: exploiting directional recordings for high quality speech enhancement. Proceeding of 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp. 5068-5071). USA: IEEE.

Scopus Eid


  • 2-s2.0-80051661600

Ro Metadata Url


  • http://ro.uow.edu.au/engpapers/4307

Start Page


  • 5068

End Page


  • 5071

Place Of Publication


  • http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5947496

Abstract


  • This paper investigates the performance of a new technique for speech enhancement which combines Linear Predictive (LP) spectrum-based perceptual filtering to the recordings obtained from an Acoustic Vector Sensor (AVS). The technique takes advantage of the directional polar responses of the AVS to obtain a significantly more accurate representation of the LP spectrum of a target speech signal in the presence of noise when compared to single channel, omni-directional recordings. Comparisons between the speech quality obtained from the proposed technique and existing beamforming-based speech enhancement techniques for the AVS are made through Perceptual Evaluation of Speech Quality (PESQ) tests and Mean Opinion Score (MOS) listening tests. Results show significant improvements in PESQ and MOS scores of 0.2 and 1.6, respectively, for the proposed enhancement technique. Being based on a miniature microphone array, the approach is particular suitable for hands free communication applications in mobile telephony.

Authors


  •   Shujau, Muawiyath (external author)
  •   Ritz, Christian H.
  •   Burnett, Ian S. (external author)

Publication Date


  • 2011

Citation


  • Shujau, M., Ritz, C. H. & Burnett, I. S. (2011). Linear predictive perceptual filtering for acoustic vector sensors: exploiting directional recordings for high quality speech enhancement. Proceeding of 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp. 5068-5071). USA: IEEE.

Scopus Eid


  • 2-s2.0-80051661600

Ro Metadata Url


  • http://ro.uow.edu.au/engpapers/4307

Start Page


  • 5068

End Page


  • 5071

Place Of Publication


  • http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5947496