Investigating spatial audio coding cues for meeting audio segmentation

Conference Paper


Abstract


  • As multiparty meetings involve participants that are generally stationary when actively speaking, participant location information can be used to segment the recorded meeting audio into speaker 'turns.' In this paper, speaker location information derived from 'spatial cues' generated by spatial audio coding techniques is investigated. The validity of using spatial cues for meeting audio segmentation is explored through investigating multiple microphone meeting audio recording techniques and different spatial audio coders. Experimental results show that the statistical relationship between speaker location and interchannel level and phase-based spatial cues strongly depends on the microphone pattern. Results also indicate that interchannel correlation-based spatial cues represent location information that is ambiguous for meeting audio segmentation.

Publication Date


  • 2006

Citation


  • Cheng, E., Burnett, I., & Ritz, C. (2006). Investigating spatial audio coding cues for meeting audio segmentation. In Audio Engineering Society - 120th Convention Spring Preprints 2006 Vol. 4 (pp. 2175-2186).

Scopus Eid


  • 2-s2.0-84866516557

Start Page


  • 2175

End Page


  • 2186

Volume


  • 4
