Integrating smooth motion assumptions with RANSAC-based sound source localization
(2025) 33rd European Signal Processing Conference (EUSIPCO 2025) p.226-230- Abstract
- The momentary localization of a single sound source, in an environment with microphones distributed at known positions, can be done with multilateration, using time-difference-of-arrival (TDOA) estimates. However, TDOA estimates from cross-correlation are noisy in real environments, and a robust multilateration method must handle outliers. Assuming constraints on the smoothness of movement over time, the location estimate can be improved for both stationary and moving sources. In this work, the smooth motion assumption is explored in different stages of a RANSAC-based (Random Sample Consensus) implementation. The evaluation is done on real recordings from the public LuViRA dataset, giving the first 3D baseline result on the dataset. Each... (More)
- The momentary localization of a single sound source, in an environment with microphones distributed at known positions, can be done with multilateration, using time-difference-of-arrival (TDOA) estimates. However, TDOA estimates from cross-correlation are noisy in real environments, and a robust multilateration method must handle outliers. Assuming constraints on the smoothness of movement over time, the location estimate can be improved for both stationary and moving sources. In this work, the smooth motion assumption is explored in different stages of a RANSAC-based (Random Sample Consensus) implementation. The evaluation is done on real recordings from the public LuViRA dataset, giving the first 3D baseline result on the dataset. Each of the proposed steps is shown to reduce the localization error compared to the benchmark method. (Less)
- Abstract (Swedish)
- Att vid ett tillfälle avgöra positionen för en ensam ljudkälla, i en omgivning med mikrofoner utspridda på kända positioner, kan göras med multilaterering, genom uppskattade ankomsttidskillnader (TDOA, time-difference-of-arrival).
TDOA uppskattad med korskorrelation är emellertid brusig i verkliga miljöer, och metoder för robust multilaterering måste kunna hantera avvikande värden. Att som begränsning anta en mjuk rörelse över tid kan förbättra den uppskattade positionen för både stillastående och rörliga ljudkällor. I detta arbete utforskas hur antagandet om mjuk rörelse kan tas med i olika steg av en RANSAC-baserad (Random Sample Consensus) implementation. Utvärderingen görs på riktiga inspelningar från det publika LuViRA datasetet,... (More) - Att vid ett tillfälle avgöra positionen för en ensam ljudkälla, i en omgivning med mikrofoner utspridda på kända positioner, kan göras med multilaterering, genom uppskattade ankomsttidskillnader (TDOA, time-difference-of-arrival).
TDOA uppskattad med korskorrelation är emellertid brusig i verkliga miljöer, och metoder för robust multilaterering måste kunna hantera avvikande värden. Att som begränsning anta en mjuk rörelse över tid kan förbättra den uppskattade positionen för både stillastående och rörliga ljudkällor. I detta arbete utforskas hur antagandet om mjuk rörelse kan tas med i olika steg av en RANSAC-baserad (Random Sample Consensus) implementation. Utvärderingen görs på riktiga inspelningar från det publika LuViRA datasetet, det första publicerade 3D resultatet för datat. Vardera av de föreslagna stegen visas förbättra lokaliseringsfelet i jämförelse mot grundmetoden.
(Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/015f9c5d-3d07-4059-aba8-09d5ef102682
- author
- Gulin, Jens
LU
; Åström, Kalle LU
and Aminifar, Amir LU
- organization
-
- Integrated Electronic Systems
- Computer Vision and Machine Learning (research group)
- LTH Profile Area: AI and Digitalization
- Stroke Imaging Research group (research group)
- ELLIIT: the Linköping-Lund initiative on IT and mobile communication
- eSSENCE: The e-Science Collaboration
- Mathematical Imaging Group (research group)
- NEXTG2COM – a Vinnova Competence Centre in Advanced Digitalisation
- Secure and Networked Systems
- alternative title
- Integration av antagande om mjuk rörelse i RANSAC-baserad lokalisering av ljudkälla
- publishing date
- 2025-09-08
- type
- Chapter in Book/Report/Conference proceeding
- publication status
- published
- subject
- keywords
- TDOA, GCC-PHAT, Smooth motion, RANSAC, SSL
- host publication
- 33rd European Signal Processing Conference : EUSIPCO 2025. 8 - 12 September, 2025, Palermo, Italy - EUSIPCO 2025. 8 - 12 September, 2025, Palermo, Italy
- pages
- 5 pages
- conference name
- 33rd European Signal Processing Conference (EUSIPCO 2025)
- conference location
- Palermo, Italy
- conference dates
- 2025-09-08 - 2025-09-12
- ISBN
- 978-9-46-459362-4
- language
- English
- LU publication?
- yes
- id
- 015f9c5d-3d07-4059-aba8-09d5ef102682
- alternative location
- https://eusipco2025.org/wp-content/uploads/pdfs/0000226.pdf
- date added to LUP
- 2025-09-04 08:26:39
- date last changed
- 2025-09-25 08:51:03
@inproceedings{015f9c5d-3d07-4059-aba8-09d5ef102682, abstract = {{The momentary localization of a single sound source, in an environment with microphones distributed at known positions, can be done with multilateration, using time-difference-of-arrival (TDOA) estimates. However, TDOA estimates from cross-correlation are noisy in real environments, and a robust multilateration method must handle outliers. Assuming constraints on the smoothness of movement over time, the location estimate can be improved for both stationary and moving sources. In this work, the smooth motion assumption is explored in different stages of a RANSAC-based (Random Sample Consensus) implementation. The evaluation is done on real recordings from the public LuViRA dataset, giving the first 3D baseline result on the dataset. Each of the proposed steps is shown to reduce the localization error compared to the benchmark method.}}, author = {{Gulin, Jens and Åström, Kalle and Aminifar, Amir}}, booktitle = {{33rd European Signal Processing Conference : EUSIPCO 2025. 8 - 12 September, 2025, Palermo, Italy}}, isbn = {{978-9-46-459362-4}}, keywords = {{TDOA; GCC-PHAT; Smooth motion; RANSAC; SSL}}, language = {{eng}}, month = {{09}}, pages = {{226--230}}, title = {{Integrating smooth motion assumptions with RANSAC-based sound source localization}}, url = {{https://eusipco2025.org/wp-content/uploads/pdfs/0000226.pdf}}, year = {{2025}}, }