Advanced

Sparse Localization of Harmonic Audio Sources

Adalbjörnsson, Stefan Ingi LU ; Kronvall, Ted LU ; Burgess, Simon LU ; Åström, Karl LU and Jakobsson, Andreas LU (2016) In IEEE/ACM Transactions on Audio, Speech, and Language Processing 24(1). p.117-129
Abstract
In this paper, we propose a novel method for estimating the locations of near- and/or far-field harmonic audio sources impinging on an arbitrary, but calibrated, sensor array. Using a joint pitch and location estimation formed in two steps, we first estimate the fundamental frequencies and complex amplitudes under a sinusoidal model assumption, whereafter the location of each source is found by utilizing both the difference in phase and the relative attenuation of the magnitude estimates. As audio recordings often consist of multi-pitch signals exhibiting some degree of reverberation, where both the number of pitches and the source locations are unknown, we propose to use sparse heuristics to avoid the necessity of detailed a priori... (More)
In this paper, we propose a novel method for estimating the locations of near- and/or far-field harmonic audio sources impinging on an arbitrary, but calibrated, sensor array. Using a joint pitch and location estimation formed in two steps, we first estimate the fundamental frequencies and complex amplitudes under a sinusoidal model assumption, whereafter the location of each source is found by utilizing both the difference in phase and the relative attenuation of the magnitude estimates. As audio recordings often consist of multi-pitch signals exhibiting some degree of reverberation, where both the number of pitches and the source locations are unknown, we propose to use sparse heuristics to avoid the necessity of detailed a priori assumptions on the spectral and spatial model orders. The method’s performance is evaluated using both simulated and measured audio data, with the former showing that the proposed method achieves near-optimal performance, whereas the latter confirms the method’s feasibility when used with real recordings. (Less)
Please use this url to cite or link to this publication:
author
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
Multi-pitch estimation, near- and far-field localization, TDOA, block sparsity, convex optimization, ADMM, non-convex sparsity
in
IEEE/ACM Transactions on Audio, Speech, and Language Processing
volume
24
issue
1
pages
117 - 129
publisher
Piscataway, NJ : Institute of Electrical and Electronics Engineers
external identifiers
  • scopus:84957545669
ISSN
2329-9290
DOI
10.1109/TASLP.2015.2497798
language
English
LU publication?
yes
id
3b47fdaa-5d51-4c2e-bfba-2b0e0b08345a (old id 8409978)
date added to LUP
2015-12-17 16:24:00
date last changed
2017-07-30 04:24:18
@article{3b47fdaa-5d51-4c2e-bfba-2b0e0b08345a,
  abstract     = {In this paper, we propose a novel method for estimating the locations of near- and/or far-field harmonic audio sources impinging on an arbitrary, but calibrated, sensor array. Using a joint pitch and location estimation formed in two steps, we first estimate the fundamental frequencies and complex amplitudes under a sinusoidal model assumption, whereafter the location of each source is found by utilizing both the difference in phase and the relative attenuation of the magnitude estimates. As audio recordings often consist of multi-pitch signals exhibiting some degree of reverberation, where both the number of pitches and the source locations are unknown, we propose to use sparse heuristics to avoid the necessity of detailed a priori assumptions on the spectral and spatial model orders. The method’s performance is evaluated using both simulated and measured audio data, with the former showing that the proposed method achieves near-optimal performance, whereas the latter confirms the method’s feasibility when used with real recordings.},
  author       = {Adalbjörnsson, Stefan Ingi and Kronvall, Ted and Burgess, Simon and Åström, Karl and Jakobsson, Andreas},
  issn         = {2329-9290},
  keyword      = {Multi-pitch estimation,near- and far-field localization,TDOA,block sparsity,convex optimization,ADMM,non-convex sparsity},
  language     = {eng},
  number       = {1},
  pages        = {117--129},
  publisher    = {Piscataway, NJ : Institute of Electrical and Electronics Engineers},
  series       = {IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  title        = {Sparse Localization of Harmonic Audio Sources},
  url          = {http://dx.doi.org/10.1109/TASLP.2015.2497798},
  volume       = {24},
  year         = {2016},
}