Advanced

Direction of Arrival Estimation for Speech Sources using Fourth Order Cross Cumulant

Swartling, Mikael LU ; Sällberg, Benny and Grbic, Nedelko LU (2008) IEEE International Symposium on Circuits and Systems (ISCAS), 2008 In 2008 IEEE International Symposium on Circuits and Systems, Proc. p.1696-1699
Abstract
In many applications where speech separation and enhancement is of interest, e.g. conferencing systems, mobile phones and hearing aids, accurate speaker localization is important. This paper presents an alternative criteria for the well known steered response power with phase transform (SRP-PHAT) algorithm, in which the steered response relates to peaks in the fourth order cross cumulant, rather than peaks in the second order cross cumulant, i.e. the cross power spectrum. Since speech sources have a probability density function (PDF) close to the Laplacian distribution and noise are generally closer to the Gaussian distribution, the fourth order cumulant becomes a good alternative for the steered response search for speech sources. The... (More)
In many applications where speech separation and enhancement is of interest, e.g. conferencing systems, mobile phones and hearing aids, accurate speaker localization is important. This paper presents an alternative criteria for the well known steered response power with phase transform (SRP-PHAT) algorithm, in which the steered response relates to peaks in the fourth order cross cumulant, rather than peaks in the second order cross cumulant, i.e. the cross power spectrum. Since speech sources have a probability density function (PDF) close to the Laplacian distribution and noise are generally closer to the Gaussian distribution, the fourth order cumulant becomes a good alternative for the steered response search for speech sources. The proposed method is evaluated and compared to the original SRP-PHAT algorithm and shows significant improvements in localization performance for speech sources. (Less)
Please use this url to cite or link to this publication:
author
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
in
2008 IEEE International Symposium on Circuits and Systems, Proc.
pages
4 pages
conference name
IEEE International Symposium on Circuits and Systems (ISCAS), 2008
external identifiers
  • Scopus:51749112870
ISBN
978-1-4244-1683-7
DOI
10.1109/ISCAS.2008.4541763
language
English
LU publication?
no
id
812fe276-2888-47fb-8116-710b10b75bd6
date added to LUP
2016-06-23 16:53:27
date last changed
2017-01-01 08:29:04
@inproceedings{812fe276-2888-47fb-8116-710b10b75bd6,
  abstract     = {In many applications where speech separation and enhancement is of interest, e.g. conferencing systems, mobile phones and hearing aids, accurate speaker localization is important. This paper presents an alternative criteria for the well known steered response power with phase transform (SRP-PHAT) algorithm, in which the steered response relates to peaks in the fourth order cross cumulant, rather than peaks in the second order cross cumulant, i.e. the cross power spectrum. Since speech sources have a probability density function (PDF) close to the Laplacian distribution and noise are generally closer to the Gaussian distribution, the fourth order cumulant becomes a good alternative for the steered response search for speech sources. The proposed method is evaluated and compared to the original SRP-PHAT algorithm and shows significant improvements in localization performance for speech sources.},
  author       = {Swartling, Mikael and Sällberg, Benny and Grbic, Nedelko},
  booktitle    = {2008 IEEE International Symposium on Circuits and Systems, Proc.},
  isbn         = {978-1-4244-1683-7},
  language     = {eng},
  pages        = {1696--1699},
  title        = {Direction of Arrival Estimation for Speech Sources using Fourth Order Cross Cumulant},
  url          = {http://dx.doi.org/10.1109/ISCAS.2008.4541763},
  year         = {2008},
}