Diffusion Modelling approaches to EEG-based Auditory Attention Decoding

Rannaleet, David; Gunnarsson, Victor

Diffusion Modelling approaches to EEG-based Auditory Attention Decoding

Mark

Rannaleet, David and Gunnarsson, Victor (2024)
Department of Automatic Control

Abstract: Machine learning models can analyze physiological data, such as electroencephalography (EEG), for various classification tasks. One such task is Auditory Attention Decoding (AAD), aimed at identifying the sound a person is actively attending to, offering significant benefits for users of hearing aids. However, EEG data often exhibits a low signal-to-noise ratio, and its collection is often expensive, cumbersome and requires trained specialists. Additionally, gathering EEG data over prolonged periods of time presents challenges. The resulting scarcity of available EEG data to train models can be alleviated by using generative models, which generate new examples from the data they were trained on.
Diffusion Probabilistic Models (DPMs) have... (More); Machine learning models can analyze physiological data, such as electroencephalography (EEG), for various classification tasks. One such task is Auditory Attention Decoding (AAD), aimed at identifying the sound a person is actively attending to, offering significant benefits for users of hearing aids. However, EEG data often exhibits a low signal-to-noise ratio, and its collection is often expensive, cumbersome and requires trained specialists. Additionally, gathering EEG data over prolonged periods of time presents challenges. The resulting scarcity of available EEG data to train models can be alleviated by using generative models, which generate new examples from the data they were trained on.
Diffusion Probabilistic Models (DPMs) have in recent years emerged as the state-of-the-art of generative models within the image domain, showing widespread success in models such as Stable Diffusion and DALL-E. This work investigates whether this success can extend to the domain of multichannel time series data, specifically EEG data. The diffusion models were trained on 1-second EEG data segments and were used as a data augmentation tool for 3 different classification tasks, including AAD. Our findings indicate that diffusion models can effectively generate realistic EEG data, supported by both a visual comparison and a measure of Jensen-Shannon divergence to the real EEG data distribution. In addition to this, a significant improvement in mean performance was achieved in our Locus of Attention (LoA) task, where we classify between a test subject attending to a left or right speaker. Here an approximate classification accuracy of 71% was achieved compared to our baseline of 70.4%. (Less)

- Open Access
- |
- PDF

Links

Document download statistics

Related Materials

Related object is popular science:
Popular Science summary

Please use this url to cite or link to this publication: http://lup.lub.lu.se/student-papers/record/9150900

author

Rannaleet, David and Gunnarsson, Victor

supervisor

organization

Department of Automatic Control

year

2024

type

H3 - Professional qualifications (4 Years - )

subject

Technology and Engineering

report number

TFRT-6227

other publication id

0280-5316

language

English

id

9150900

date added to LUP

2024-04-17 14:03:02

date last changed

2024-04-17 14:03:02

@misc{9150900,
  abstract     = {{Machine learning models can analyze physiological data, such as electroencephalography (EEG), for various classification tasks. One such task is Auditory Attention Decoding (AAD), aimed at identifying the sound a person is actively attending to, offering significant benefits for users of hearing aids. However, EEG data often exhibits a low signal-to-noise ratio, and its collection is often expensive, cumbersome and requires trained specialists. Additionally, gathering EEG data over prolonged periods of time presents challenges. The resulting scarcity of available EEG data to train models can be alleviated by using generative models, which generate new examples from the data they were trained on.
 Diffusion Probabilistic Models (DPMs) have in recent years emerged as the state-of-the-art of generative models within the image domain, showing widespread success in models such as Stable Diffusion and DALL-E. This work investigates whether this success can extend to the domain of multichannel time series data, specifically EEG data. The diffusion models were trained on 1-second EEG data segments and were used as a data augmentation tool for 3 different classification tasks, including AAD. Our findings indicate that diffusion models can effectively generate realistic EEG data, supported by both a visual comparison and a measure of Jensen-Shannon divergence to the real EEG data distribution. In addition to this, a significant improvement in mean performance was achieved in our Locus of Attention (LoA) task, where we classify between a test subject attending to a left or right speaker. Here an approximate classification accuracy of 71% was achieved compared to our baseline of 70.4%.}},
  author       = {{Rannaleet, David and Gunnarsson, Victor}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{Diffusion Modelling approaches to EEG-based Auditory Attention Decoding}},
  year         = {{2024}},
}

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Diffusion Modelling approaches to EEG-based Auditory Attention Decoding