Exploiting the Overtones in Online Localization of Sound Sources
(2022) In Master's Theses in Mathematical Sciences FMSM01 20221Mathematical Statistics
- Abstract
- Localization of multiple sound sources from a microphone array is a challenging task that has been a research topic for decades. The challenges stem from the diversity of acoustic contexts due to reverberation and disturbances. Therefore, recent approaches that impose structure on the sound sources, rather than on the noise, have shown promising results. The offline method HALO was proposed in 2016 for sparse localization of stationary sources by exploiting the overtone structure. In this thesis, an online recursive localization method is proposed inspired by a similar signal model.
The proposed method consists of a two step procedure. First, the pitch estimator named PEACE estimates the fundamental pitches along with their harmonics in... (More) - Localization of multiple sound sources from a microphone array is a challenging task that has been a research topic for decades. The challenges stem from the diversity of acoustic contexts due to reverberation and disturbances. Therefore, recent approaches that impose structure on the sound sources, rather than on the noise, have shown promising results. The offline method HALO was proposed in 2016 for sparse localization of stationary sources by exploiting the overtone structure. In this thesis, an online recursive localization method is proposed inspired by a similar signal model.
The proposed method consists of a two step procedure. First, the pitch estimator named PEACE estimates the fundamental pitches along with their harmonics in an adaptive dictionary. Secondly, the method named PLEASE estimates the positions of every estimated pitch. An adaptive scheme for setting the sparsity inducing regularization parameters in PEACE is also proposed by exploiting the spatial dynamics. As a result, the compound method PEACE-PLEASE is only left with physically meaningful user defined parameters that are trivial to set for a given application. The proposed method is compared to GCC-Phat on both simulated data and recordings from an anechoic chamber. The results indicate that PEACE-PLEASE outperforms GCC-Phat on both the anechoic dataset and the simulated data. At last, potential directions in research are highlighted and discussed. (Less)
Please use this url to cite or link to this publication:
http://lup.lub.lu.se/student-papers/record/9090620
- author
- Sundström, David LU
- supervisor
- organization
- course
- FMSM01 20221
- year
- 2022
- type
- H2 - Master's Degree (Two Years)
- subject
- keywords
- Multi-pitch estimation, localization, block sparsity, sparse recursive least squares, online estimation, ADMM, adaptive dictionary
- publication/series
- Master's Theses in Mathematical Sciences
- report number
- LUTFMS-3454-2022
- ISSN
- 1404-6342
- other publication id
- 2022:E56
- language
- English
- id
- 9090620
- date added to LUP
- 2022-06-28 11:38:36
- date last changed
- 2022-07-20 13:26:34
@misc{9090620, abstract = {{Localization of multiple sound sources from a microphone array is a challenging task that has been a research topic for decades. The challenges stem from the diversity of acoustic contexts due to reverberation and disturbances. Therefore, recent approaches that impose structure on the sound sources, rather than on the noise, have shown promising results. The offline method HALO was proposed in 2016 for sparse localization of stationary sources by exploiting the overtone structure. In this thesis, an online recursive localization method is proposed inspired by a similar signal model. The proposed method consists of a two step procedure. First, the pitch estimator named PEACE estimates the fundamental pitches along with their harmonics in an adaptive dictionary. Secondly, the method named PLEASE estimates the positions of every estimated pitch. An adaptive scheme for setting the sparsity inducing regularization parameters in PEACE is also proposed by exploiting the spatial dynamics. As a result, the compound method PEACE-PLEASE is only left with physically meaningful user defined parameters that are trivial to set for a given application. The proposed method is compared to GCC-Phat on both simulated data and recordings from an anechoic chamber. The results indicate that PEACE-PLEASE outperforms GCC-Phat on both the anechoic dataset and the simulated data. At last, potential directions in research are highlighted and discussed.}}, author = {{Sundström, David}}, issn = {{1404-6342}}, language = {{eng}}, note = {{Student Paper}}, series = {{Master's Theses in Mathematical Sciences}}, title = {{Exploiting the Overtones in Online Localization of Sound Sources}}, year = {{2022}}, }