
Identifying Sound Source Node Locations Using Neural Networks Trained with Phasograms

Conference Paper


Abstract


  • This work examines the best approximation of the sound source location through neural networks. The majority of related work either omits the phase information from the Short Time Fourier Transform (STFT) or uses it solely to restore irregularities in spectrograms. Our approach differs in that it focuses on the phase component of the STFT coefficients to estimate the sound source location by classifying the closest microphone array (node). Mapping the phase difference information within the time-frequency domain produces images that we call phasograms, which are used as inputs to the neural network. Experiments use recordings from the first four nodes of the SINS database. Phase differences across adjacent microphones, as well as against the first microphone, were examined. Under five-fold cross-validation, these resulted in F1-scores of 99.68% for the former and 99.31% for the latter. A real-world application of this work is healthcare monitoring systems, when integrated with a sound scene classification system.
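
The two phase-difference variants named in the abstract (adjacent microphones versus the first microphone as reference) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `stft` and `phase_differences`, the Hann window, and the `n_fft` and `hop` values are all assumptions chosen for illustration, and the abstract does not state how the phase maps are rendered into images.

```python
import numpy as np

def stft(x, n_fft=256, hop=128):
    """Minimal Hann-windowed STFT (illustrative, not the paper's settings).

    Returns a complex array of shape (n_fft // 2 + 1, n_frames).
    """
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1).T

def phase_differences(signals, mode="adjacent", n_fft=256, hop=128):
    """Per-bin phase differences between microphone channels.

    signals: array of shape (n_mics, n_samples).
    mode:    "adjacent" compares mic m+1 with mic m;
             "first" compares every other mic with mic 0.
    Returns an array of shape (n_mics - 1, n_freq, n_frames) with values
    wrapped to (-pi, pi].
    """
    specs = np.stack([stft(ch, n_fft, hop) for ch in signals])
    if mode == "adjacent":
        ref = specs[:-1]
    elif mode == "first":
        ref = np.broadcast_to(specs[0], specs[1:].shape)
    else:
        raise ValueError("mode must be 'adjacent' or 'first'")
    # The angle of the cross-spectrum is the wrapped phase difference.
    return np.angle(specs[1:] * np.conj(ref))
```

Each slice along the first axis is a time-frequency map of phase differences for one microphone pair. Taking the angle of the cross-spectrum, rather than subtracting two separately computed angles, keeps the result wrapped to a single interval without an extra unwrapping step.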

Publication Date


  • 2020

Citation


  • Copiaco, A., Ritz, C., Fasciani, S., & Abdulaziz, N. (2020). Identifying Sound Source Node Locations Using Neural Networks Trained with Phasograms. In 2020 IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2020. doi:10.1109/ISSPIT51521.2020.9408643

Scopus EID


  • 2-s2.0-85105349297
