Skip to main content

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Exploiting the Overtones in Online Localization of Sound Sources

Sundström, David LU (2022) In Master's Theses in Mathematical Sciences FMSM01 20221
Mathematical Statistics
Abstract
Localization of multiple sound sources from a microphone array is a challenging task that has been a research topic for decades. The challenges stem from the diversity of acoustic contexts due to reverberation and disturbances. Therefore, recent approaches that impose structure on the sound sources, rather than on the noise, have shown promising results. The offline method HALO was proposed in 2016 for sparse localization of stationary sources by exploiting the overtone structure. In this thesis, an online recursive localization method is proposed inspired by a similar signal model.

The proposed method consists of a two step procedure. First, the pitch estimator named PEACE estimates the fundamental pitches along with their harmonics in... (More)
Localization of multiple sound sources from a microphone array is a challenging task that has been a research topic for decades. The challenges stem from the diversity of acoustic contexts due to reverberation and disturbances. Therefore, recent approaches that impose structure on the sound sources, rather than on the noise, have shown promising results. The offline method HALO was proposed in 2016 for sparse localization of stationary sources by exploiting the overtone structure. In this thesis, an online recursive localization method is proposed inspired by a similar signal model.

The proposed method consists of a two step procedure. First, the pitch estimator named PEACE estimates the fundamental pitches along with their harmonics in an adaptive dictionary. Secondly, the method named PLEASE estimates the positions of every estimated pitch. An adaptive scheme for setting the sparsity inducing regularization parameters in PEACE is also proposed by exploiting the spatial dynamics. As a result, the compound method PEACE-PLEASE is only left with physically meaningful user defined parameters that are trivial to set for a given application. The proposed method is compared to GCC-Phat on both simulated data and recordings from an anechoic chamber. The results indicate that PEACE-PLEASE outperforms GCC-Phat on both the anechoic dataset and the simulated data. At last, potential directions in research are highlighted and discussed. (Less)
Please use this url to cite or link to this publication:
author
Sundström, David LU
supervisor
organization
course
FMSM01 20221
year
type
H2 - Master's Degree (Two Years)
subject
keywords
Multi-pitch estimation, localization, block sparsity, sparse recursive least squares, online estimation, ADMM, adaptive dictionary
publication/series
Master's Theses in Mathematical Sciences
report number
LUTFMS-3454-2022
ISSN
1404-6342
other publication id
2022:E56
language
English
id
9090620
date added to LUP
2022-06-28 11:38:36
date last changed
2022-07-20 13:26:34
@misc{9090620,
  abstract     = {{Localization of multiple sound sources from a microphone array is a challenging task that has been a research topic for decades. The challenges stem from the diversity of acoustic contexts due to reverberation and disturbances. Therefore, recent approaches that impose structure on the sound sources, rather than on the noise, have shown promising results. The offline method HALO was proposed in 2016 for sparse localization of stationary sources by exploiting the overtone structure. In this thesis, an online recursive localization method is proposed inspired by a similar signal model.

The proposed method consists of a two step procedure. First, the pitch estimator named PEACE estimates the fundamental pitches along with their harmonics in an adaptive dictionary. Secondly, the method named PLEASE estimates the positions of every estimated pitch. An adaptive scheme for setting the sparsity inducing regularization parameters in PEACE is also proposed by exploiting the spatial dynamics. As a result, the compound method PEACE-PLEASE is only left with physically meaningful user defined parameters that are trivial to set for a given application. The proposed method is compared to GCC-Phat on both simulated data and recordings from an anechoic chamber. The results indicate that PEACE-PLEASE outperforms GCC-Phat on both the anechoic dataset and the simulated data. At last, potential directions in research are highlighted and discussed.}},
  author       = {{Sundström, David}},
  issn         = {{1404-6342}},
  language     = {{eng}},
  note         = {{Student Paper}},
  series       = {{Master's Theses in Mathematical Sciences}},
  title        = {{Exploiting the Overtones in Online Localization of Sound Sources}},
  year         = {{2022}},
}