Classification of Age-Related Hearing Loss using Conversational Speech Biomarkers

Bergström, Sanna; Ekre, Fremja

Classification of Age-Related Hearing Loss using Conversational Speech Biomarkers

Mark

Bergström, Sanna and Ekre, Fremja (2025)
Department of Automatic Control

Abstract: Hearing loss is increasing with an ageing population. There is thus a need for more convenient diagnosis methodology. This thesis investigates whether it is possible to classify age-related hearing loss using speech biomarkers and Automatic Speech Recognition (ASR) or not. The methodology was a binary classification problem based on analysing conversational data between a participant with normal hearing and a participant with age-related hearing loss. These conversations took place in nine different conditions with varying background noise, and levels of hearing aid support for the participant with hearing loss. The ASR model used was Whisper by OpenAi™. The extracted features from the data included frequency characteristics, spectral... (More); Hearing loss is increasing with an ageing population. There is thus a need for more convenient diagnosis methodology. This thesis investigates whether it is possible to classify age-related hearing loss using speech biomarkers and Automatic Speech Recognition (ASR) or not. The methodology was a binary classification problem based on analysing conversational data between a participant with normal hearing and a participant with age-related hearing loss. These conversations took place in nine different conditions with varying background noise, and levels of hearing aid support for the participant with hearing loss. The ASR model used was Whisper by OpenAi™. The extracted features from the data included frequency characteristics, spectral attributes, power measures, pauses, floor-transfer offsets, Mel Frequency Cepstral Coefficients, and principal components derived from the first and final encoder block outputs of the Whisper model. The impact of the conditions and the features on the classification accuracies were analysed in conjunction to evaluate an optimal future method to detect hearing loss from speech data. The results suggest that it is possible to classify age related hearing loss with speech data, the top accuracy achieved was 78.4%. The background noise and the level of hearing aid support affected the outcome of the classification accuracy, as well as the selected features. The most influential features were related to power, frequency, and the Whisper encoder block outputs. This thesis does not incorporate control group conversations between participants with normal hearing, and the data set is limited to 24 participant pairs. To achieve more reliable and generalisable results, the inclusion of a control group and a larger data set is essential. (Less)

- Open Access
- |
- PDF

Links

Document download statistics

Related Materials

Related object is popular science:
Popular Science summary

Please use this url to cite or link to this publication: http://lup.lub.lu.se/student-papers/record/9183947

author

Bergström, Sanna and Ekre, Fremja

supervisor

organization

Department of Automatic Control

alternative title

An Implementation of Feature Engineering and ASR in a Machine Learning Pipeline

year

2025

type

H3 - Professional qualifications (4 Years - )

subject

Technology and Engineering

report number

TFRT-6266

other publication id

0280-5316

language

English

id

9183947

date added to LUP

2025-02-04 14:06:32

date last changed

2025-02-04 14:06:32

@misc{9183947,
  abstract     = {{Hearing loss is increasing with an ageing population. There is thus a need for more convenient diagnosis methodology. This thesis investigates whether it is possible to classify age-related hearing loss using speech biomarkers and Automatic Speech Recognition (ASR) or not. The methodology was a binary classification problem based on analysing conversational data between a participant with normal hearing and a participant with age-related hearing loss. These conversations took place in nine different conditions with varying background noise, and levels of hearing aid support for the participant with hearing loss. The ASR model used was Whisper by OpenAi™. The extracted features from the data included frequency characteristics, spectral attributes, power measures, pauses, floor-transfer offsets, Mel Frequency Cepstral Coefficients, and principal components derived from the first and final encoder block outputs of the Whisper model. The impact of the conditions and the features on the classification accuracies were analysed in conjunction to evaluate an optimal future method to detect hearing loss from speech data. The results suggest that it is possible to classify age related hearing loss with speech data, the top accuracy achieved was 78.4%. The background noise and the level of hearing aid support affected the outcome of the classification accuracy, as well as the selected features. The most influential features were related to power, frequency, and the Whisper encoder block outputs. This thesis does not incorporate control group conversations between participants with normal hearing, and the data set is limited to 24 participant pairs. To achieve more reliable and generalisable results, the inclusion of a control group and a larger data set is essential.}},
  author       = {{Bergström, Sanna and Ekre, Fremja}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{Classification of Age-Related Hearing Loss using Conversational Speech Biomarkers}},
  year         = {{2025}},
}

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Classification of Age-Related Hearing Loss using Conversational Speech Biomarkers