Estimation of Impulse Responses for a Moving Source Using Optimal Transport Regularization
(2024) 49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings p.921-925- Abstract
The estimation of impulse responses (IRs) is fundamental to various audio applications, including active noise control, telecommunication, and sound zone control. Despite its long history, estimating impulse responses remains challenging when dealing with short signals and with signals having poor spectral excitation. However, in many applications the source is moving such that one has access to several input-output signal pairs corresponding to closely spaced source positions. Intuitively, exploiting this spatial proximity when jointly estimating the full set of IRs should allow for improved estimation performance. In this work, we propose to leverage the information shared between the closely spaced source positions by means of an... (More)
The estimation of impulse responses (IRs) is fundamental to various audio applications, including active noise control, telecommunication, and sound zone control. Despite its long history, estimating impulse responses remains challenging when dealing with short signals and with signals having poor spectral excitation. However, in many applications the source is moving such that one has access to several input-output signal pairs corresponding to closely spaced source positions. Intuitively, exploiting this spatial proximity when jointly estimating the full set of IRs should allow for improved estimation performance. In this work, we propose to leverage the information shared between the closely spaced source positions by means of an optimal transport regularizer when estimating IRs from noisy input-output relations. In particular, the proposed transport formulation allows for modeling shifts in time-delays, corresponding to the IR filter taps, caused by the spatial displacement. The method is validated through numerical experiments using a real voice recording as input signal, demonstrating its superior performance in the challenging scenario.
(Less)
- author
- Sundström, David LU ; Elvander, Filip LU and Jakobsson, Andreas LU
- organization
- publishing date
- 2024
- type
- Chapter in Book/Report/Conference proceeding
- publication status
- published
- subject
- keywords
- audio signal processing, dynamic systems, Impulse response estimation, optimal transport
- host publication
- 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings
- series title
- ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
- pages
- 5 pages
- publisher
- IEEE - Institute of Electrical and Electronics Engineers Inc.
- conference name
- 49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
- conference location
- Seoul, Korea, Republic of
- conference dates
- 2024-04-14 - 2024-04-19
- external identifiers
-
- scopus:85188722243
- ISSN
- 1520-6149
- ISBN
- 9798350344851
- DOI
- 10.1109/ICASSP48485.2024.10446838
- language
- English
- LU publication?
- yes
- additional info
- Publisher Copyright: © 2024 IEEE.
- id
- f1235e35-8dac-4fb7-909e-5eb9ea90b701
- date added to LUP
- 2024-08-22 08:10:40
- date last changed
- 2024-09-23 17:00:38
@inproceedings{f1235e35-8dac-4fb7-909e-5eb9ea90b701, abstract = {{<p>The estimation of impulse responses (IRs) is fundamental to various audio applications, including active noise control, telecommunication, and sound zone control. Despite its long history, estimating impulse responses remains challenging when dealing with short signals and with signals having poor spectral excitation. However, in many applications the source is moving such that one has access to several input-output signal pairs corresponding to closely spaced source positions. Intuitively, exploiting this spatial proximity when jointly estimating the full set of IRs should allow for improved estimation performance. In this work, we propose to leverage the information shared between the closely spaced source positions by means of an optimal transport regularizer when estimating IRs from noisy input-output relations. In particular, the proposed transport formulation allows for modeling shifts in time-delays, corresponding to the IR filter taps, caused by the spatial displacement. The method is validated through numerical experiments using a real voice recording as input signal, demonstrating its superior performance in the challenging scenario.</p>}}, author = {{Sundström, David and Elvander, Filip and Jakobsson, Andreas}}, booktitle = {{2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings}}, isbn = {{9798350344851}}, issn = {{1520-6149}}, keywords = {{audio signal processing; dynamic systems; Impulse response estimation; optimal transport}}, language = {{eng}}, pages = {{921--925}}, publisher = {{IEEE - Institute of Electrical and Electronics Engineers Inc.}}, series = {{ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings}}, title = {{Estimation of Impulse Responses for a Moving Source Using Optimal Transport Regularization}}, url = {{http://dx.doi.org/10.1109/ICASSP48485.2024.10446838}}, doi = {{10.1109/ICASSP48485.2024.10446838}}, year = {{2024}}, }