Skip to main content
placeholder image

Towards real-time source counting by estimation of coherent-to-diffuse ratios from ad-hoc microphone array recordings

Conference Paper


Abstract


  • Coherent-to-diffuse ratio (CDR) estimates over short time frames are utilized for source counting using ad-hoc microphone arrays to record speech from multiple participants in scenarios such as a meeting. It is shown that the CDR estimates obtained at ad-hoc dual (two channel) microphone nodes, located at unknown locations within an unknown reverberant room, can detect time frames with more than one active source and are informative for source counting applications. Results show that interfering sources can be detected with accuracies ranging from 69% to 89% for delays ranging from 20 ms to 300 ms, with source counting accuracies ranged from 61% to 81% for two sources and the same range of delays.

Authors


  •   Pasha, Shahab (external author)
  •   Donley, Jacob (external author)
  •   Ritz, Christian H.
  •   Zou, Yue-Xian (external author)

Publication Date


  • 2017

Citation


  • S. Pasha, J. Donley, C. Ritz & Y. Zou, "Towards real-time source counting by estimation of coherent-to-diffuse ratios from ad-hoc microphone array recordings," in 2017 Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017 - Proceedings, 2017, pp. 161-165.

Scopus Eid


  • 2-s2.0-85018748039

Start Page


  • 161

End Page


  • 165

Abstract


  • Coherent-to-diffuse ratio (CDR) estimates over short time frames are utilized for source counting using ad-hoc microphone arrays to record speech from multiple participants in scenarios such as a meeting. It is shown that the CDR estimates obtained at ad-hoc dual (two channel) microphone nodes, located at unknown locations within an unknown reverberant room, can detect time frames with more than one active source and are informative for source counting applications. Results show that interfering sources can be detected with accuracies ranging from 69% to 89% for delays ranging from 20 ms to 300 ms, with source counting accuracies ranged from 61% to 81% for two sources and the same range of delays.

Authors


  •   Pasha, Shahab (external author)
  •   Donley, Jacob (external author)
  •   Ritz, Christian H.
  •   Zou, Yue-Xian (external author)

Publication Date


  • 2017

Citation


  • S. Pasha, J. Donley, C. Ritz & Y. Zou, "Towards real-time source counting by estimation of coherent-to-diffuse ratios from ad-hoc microphone array recordings," in 2017 Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017 - Proceedings, 2017, pp. 161-165.

Scopus Eid


  • 2-s2.0-85018748039

Start Page


  • 161

End Page


  • 165