Sparse Localization of Harmonic Audio Sources
(2016) In IEEE/ACM Transactions on Audio, Speech, and Language Processing 24(1). p.117-129- Abstract
- In this paper, we propose a novel method for estimating the locations of near- and/or far-field harmonic audio sources impinging on an arbitrary, but calibrated, sensor array. Using a joint pitch and location estimation formed in two steps, we first estimate the fundamental frequencies and complex amplitudes under a sinusoidal model assumption, whereafter the location of each source is found by utilizing both the difference in phase and the relative attenuation of the magnitude estimates. As audio recordings often consist of multi-pitch signals exhibiting some degree of reverberation, where both the number of pitches and the source locations are unknown, we propose to use sparse heuristics to avoid the necessity of detailed a priori... (More)
- In this paper, we propose a novel method for estimating the locations of near- and/or far-field harmonic audio sources impinging on an arbitrary, but calibrated, sensor array. Using a joint pitch and location estimation formed in two steps, we first estimate the fundamental frequencies and complex amplitudes under a sinusoidal model assumption, whereafter the location of each source is found by utilizing both the difference in phase and the relative attenuation of the magnitude estimates. As audio recordings often consist of multi-pitch signals exhibiting some degree of reverberation, where both the number of pitches and the source locations are unknown, we propose to use sparse heuristics to avoid the necessity of detailed a priori assumptions on the spectral and spatial model orders. The method’s performance is evaluated using both simulated and measured audio data, with the former showing that the proposed method achieves near-optimal performance, whereas the latter confirms the method’s feasibility when used with real recordings. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/8409978
- author
- Adalbjörnsson, Stefan Ingi LU ; Kronvall, Ted LU ; Burgess, Simon LU ; Åström, Karl LU and Jakobsson, Andreas LU
- organization
-
- Mathematics (Faculty of Engineering)
- Lund University Humanities Lab
- Mathematical Statistics
- Centre for Mathematical Sciences
- Statistical Signal Processing Group (research group)
- Biomedical Modelling and Computation (research group)
- ELLIIT: the Linköping-Lund initiative on IT and mobile communication
- eSSENCE: The e-Science Collaboration
- publishing date
- 2016
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- Multi-pitch estimation, near- and far-field localization, TDOA, block sparsity, convex optimization, ADMM, non-convex sparsity
- in
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
- volume
- 24
- issue
- 1
- pages
- 117 - 129
- publisher
- Piscataway, NJ : Institute of Electrical and Electronics Engineers
- external identifiers
-
- scopus:84957545669
- ISSN
- 2329-9290
- DOI
- 10.1109/TASLP.2015.2497798
- language
- English
- LU publication?
- yes
- id
- 3b47fdaa-5d51-4c2e-bfba-2b0e0b08345a (old id 8409978)
- date added to LUP
- 2016-04-01 15:02:42
- date last changed
- 2022-03-06 22:31:13
@article{3b47fdaa-5d51-4c2e-bfba-2b0e0b08345a, abstract = {{In this paper, we propose a novel method for estimating the locations of near- and/or far-field harmonic audio sources impinging on an arbitrary, but calibrated, sensor array. Using a joint pitch and location estimation formed in two steps, we first estimate the fundamental frequencies and complex amplitudes under a sinusoidal model assumption, whereafter the location of each source is found by utilizing both the difference in phase and the relative attenuation of the magnitude estimates. As audio recordings often consist of multi-pitch signals exhibiting some degree of reverberation, where both the number of pitches and the source locations are unknown, we propose to use sparse heuristics to avoid the necessity of detailed a priori assumptions on the spectral and spatial model orders. The method’s performance is evaluated using both simulated and measured audio data, with the former showing that the proposed method achieves near-optimal performance, whereas the latter confirms the method’s feasibility when used with real recordings.}}, author = {{Adalbjörnsson, Stefan Ingi and Kronvall, Ted and Burgess, Simon and Åström, Karl and Jakobsson, Andreas}}, issn = {{2329-9290}}, keywords = {{Multi-pitch estimation; near- and far-field localization; TDOA; block sparsity; convex optimization; ADMM; non-convex sparsity}}, language = {{eng}}, number = {{1}}, pages = {{117--129}}, publisher = {{Piscataway, NJ : Institute of Electrical and Electronics Engineers}}, series = {{IEEE/ACM Transactions on Audio, Speech, and Language Processing}}, title = {{Sparse Localization of Harmonic Audio Sources}}, url = {{https://lup.lub.lu.se/search/files/4311040/8410148.pdf}}, doi = {{10.1109/TASLP.2015.2497798}}, volume = {{24}}, year = {{2016}}, }