Skip to main content

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Diffusion Modelling approaches to EEG-based Auditory Attention Decoding

Rannaleet, David and Gunnarsson, Victor (2024)
Department of Automatic Control
Abstract
Machine learning models can analyze physiological data, such as electroencephalography (EEG), for various classification tasks. One such task is Auditory Attention Decoding (AAD), aimed at identifying the sound a person is actively attending to, offering significant benefits for users of hearing aids. However, EEG data often exhibits a low signal-to-noise ratio, and its collection is often expensive, cumbersome and requires trained specialists. Additionally, gathering EEG data over prolonged periods of time presents challenges. The resulting scarcity of available EEG data to train models can be alleviated by using generative models, which generate new examples from the data they were trained on.
Diffusion Probabilistic Models (DPMs) have... (More)
Machine learning models can analyze physiological data, such as electroencephalography (EEG), for various classification tasks. One such task is Auditory Attention Decoding (AAD), aimed at identifying the sound a person is actively attending to, offering significant benefits for users of hearing aids. However, EEG data often exhibits a low signal-to-noise ratio, and its collection is often expensive, cumbersome and requires trained specialists. Additionally, gathering EEG data over prolonged periods of time presents challenges. The resulting scarcity of available EEG data to train models can be alleviated by using generative models, which generate new examples from the data they were trained on.
Diffusion Probabilistic Models (DPMs) have in recent years emerged as the state-of-the-art of generative models within the image domain, showing widespread success in models such as Stable Diffusion and DALL-E. This work investigates whether this success can extend to the domain of multichannel time series data, specifically EEG data. The diffusion models were trained on 1-second EEG data segments and were used as a data augmentation tool for 3 different classification tasks, including AAD. Our findings indicate that diffusion models can effectively generate realistic EEG data, supported by both a visual comparison and a measure of Jensen-Shannon divergence to the real EEG data distribution. In addition to this, a significant improvement in mean performance was achieved in our Locus of Attention (LoA) task, where we classify between a test subject attending to a left or right speaker. Here an approximate classification accuracy of 71% was achieved compared to our baseline of 70.4%. (Less)
Please use this url to cite or link to this publication:
author
Rannaleet, David and Gunnarsson, Victor
supervisor
organization
year
type
H3 - Professional qualifications (4 Years - )
subject
report number
TFRT-6227
other publication id
0280-5316
language
English
id
9150900
date added to LUP
2024-04-17 14:03:02
date last changed
2024-04-17 14:03:02
@misc{9150900,
  abstract     = {{Machine learning models can analyze physiological data, such as electroencephalography (EEG), for various classification tasks. One such task is Auditory Attention Decoding (AAD), aimed at identifying the sound a person is actively attending to, offering significant benefits for users of hearing aids. However, EEG data often exhibits a low signal-to-noise ratio, and its collection is often expensive, cumbersome and requires trained specialists. Additionally, gathering EEG data over prolonged periods of time presents challenges. The resulting scarcity of available EEG data to train models can be alleviated by using generative models, which generate new examples from the data they were trained on.
 Diffusion Probabilistic Models (DPMs) have in recent years emerged as the state-of-the-art of generative models within the image domain, showing widespread success in models such as Stable Diffusion and DALL-E. This work investigates whether this success can extend to the domain of multichannel time series data, specifically EEG data. The diffusion models were trained on 1-second EEG data segments and were used as a data augmentation tool for 3 different classification tasks, including AAD. Our findings indicate that diffusion models can effectively generate realistic EEG data, supported by both a visual comparison and a measure of Jensen-Shannon divergence to the real EEG data distribution. In addition to this, a significant improvement in mean performance was achieved in our Locus of Attention (LoA) task, where we classify between a test subject attending to a left or right speaker. Here an approximate classification accuracy of 71% was achieved compared to our baseline of 70.4%.}},
  author       = {{Rannaleet, David and Gunnarsson, Victor}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{Diffusion Modelling approaches to EEG-based Auditory Attention Decoding}},
  year         = {{2024}},
}