Detection of Acromegaly From Facial Images Using Machine Learning : A Comparison With Clinical Experts

Vouzouneraki, Konstantina; Ylipää, Erik; Olsson, Tommy; Berinder, Katarina; Höybye, Charlotte; Petersson, Maria; Bensing, Sophie; Åkerman, Anna Karin; Borg, Henrik; Ekman, Bertil; Robért, Jonas; Engström, Britt Edén; Ragnarsson, Oskar; Burman, Pia; Dahlqvist, Per

Detection of Acromegaly From Facial Images Using Machine Learning : A Comparison With Clinical Experts

Mark

Vouzouneraki, Konstantina ; Ylipää, Erik ; Olsson, Tommy ; Berinder, Katarina ; Höybye, Charlotte ; Petersson, Maria ; Bensing, Sophie ; Åkerman, Anna Karin ; Borg, Henrik ^LU and Ekman, Bertil , et al. (2026) In Journal of the Endocrine Society 10(2).

Abstract: Context Substantial diagnostic delay in acromegaly contributes to increased morbidity and mortality. Screening attempts in high-risk groups have yielded few positive cases, underscoring the need for simple and precise prescreening methods. Objective Machine-learning analysis of facial images shows promise for acromegaly detection but requires validation in larger, well-characterized cohorts using robust machine-learning frameworks as performed in this study. Methods Facial images from different angles were collected via smartphone from 155 acromegaly patients (79% biochemically controlled) and 153 matched controls at all Swedish university hospitals. Six machine-learning models were trained to distinguish acromegaly from control images,... (More); Context Substantial diagnostic delay in acromegaly contributes to increased morbidity and mortality. Screening attempts in high-risk groups have yielded few positive cases, underscoring the need for simple and precise prescreening methods. Objective Machine-learning analysis of facial images shows promise for acromegaly detection but requires validation in larger, well-characterized cohorts using robust machine-learning frameworks as performed in this study. Methods Facial images from different angles were collected via smartphone from 155 acromegaly patients (79% biochemically controlled) and 153 matched controls at all Swedish university hospitals. Six machine-learning models were trained to distinguish acromegaly from control images, including 3 deep neural networks pretrained on diverse image datasets (ImageNet models: ResNet50, InceptionV2, and DenseNet121) and 1 network pretrained specifically on facial images (FaRL). Model performance was compared to assessment by 12 experienced endocrinologists. Results The diagnostic accuracy of the FaRL-based model was superior to all ImageNet models and matched the accuracy of human experts (area under the receiver operating characteristic curve 0.89 for both) with similar specificity (0.87 vs 0.93) but higher sensitivity (0.82 vs 0.66). Classification agreement between the best machine-learning model (FaRL) and human experts was 86% for true negatives and 60% for true positives. Machine-learning models and human experts both showed greater sensitivity in identifying acromegaly in male patients. Conclusion A deep learning model pretrained on facial features (FaRL) can detect acromegaly from standard photographs with accuracy comparable to that of expert endocrinologists. This supports the feasibility of face analysis as a screening tool for acromegaly.
(Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/268ff258-4cf3-4229-931e-c374769d4b79

author

Vouzouneraki, Konstantina ; Ylipää, Erik ; Olsson, Tommy ; Berinder, Katarina ; Höybye, Charlotte ; Petersson, Maria ; Bensing, Sophie ; Åkerman, Anna Karin ; Borg, Henrik ^LU and Ekman, Bertil , et al. (More)

Vouzouneraki, Konstantina ; Ylipää, Erik ; Olsson, Tommy ; Berinder, Katarina ; Höybye, Charlotte ; Petersson, Maria ; Bensing, Sophie ; Åkerman, Anna Karin ; Borg, Henrik ^LU ; Ekman, Bertil ; Robért, Jonas ; Engström, Britt Edén ; Ragnarsson, Oskar ; Burman, Pia ^LU and Dahlqvist, Per (Less)

organization

publishing date

2026

type

Contribution to journal

publication status

published

subject

Endocrinology and Diabetes

keywords

acromegaly, deep learning, diagnostic delay, face classification, face photographs, machine learning, screening

in

Journal of the Endocrine Society

volume

10

issue

2

article number

bvaf203

publisher

Oxford University Press

external identifiers

scopus:105028642835
pmid:41608201

ISSN

2472-1972

DOI

10.1210/jendso/bvaf203

language

English

LU publication?

yes

id

268ff258-4cf3-4229-931e-c374769d4b79

date added to LUP

2026-02-19 10:31:48

date last changed

2026-03-19 17:25:39

@article{268ff258-4cf3-4229-931e-c374769d4b79,
  abstract     = {{<p>Context Substantial diagnostic delay in acromegaly contributes to increased morbidity and mortality. Screening attempts in high-risk groups have yielded few positive cases, underscoring the need for simple and precise prescreening methods. Objective Machine-learning analysis of facial images shows promise for acromegaly detection but requires validation in larger, well-characterized cohorts using robust machine-learning frameworks as performed in this study. Methods Facial images from different angles were collected via smartphone from 155 acromegaly patients (79% biochemically controlled) and 153 matched controls at all Swedish university hospitals. Six machine-learning models were trained to distinguish acromegaly from control images, including 3 deep neural networks pretrained on diverse image datasets (ImageNet models: ResNet50, InceptionV2, and DenseNet121) and 1 network pretrained specifically on facial images (FaRL). Model performance was compared to assessment by 12 experienced endocrinologists. Results The diagnostic accuracy of the FaRL-based model was superior to all ImageNet models and matched the accuracy of human experts (area under the receiver operating characteristic curve 0.89 for both) with similar specificity (0.87 vs 0.93) but higher sensitivity (0.82 vs 0.66). Classification agreement between the best machine-learning model (FaRL) and human experts was 86% for true negatives and 60% for true positives. Machine-learning models and human experts both showed greater sensitivity in identifying acromegaly in male patients. Conclusion A deep learning model pretrained on facial features (FaRL) can detect acromegaly from standard photographs with accuracy comparable to that of expert endocrinologists. This supports the feasibility of face analysis as a screening tool for acromegaly.</p>}},
  author       = {{Vouzouneraki, Konstantina and Ylipää, Erik and Olsson, Tommy and Berinder, Katarina and Höybye, Charlotte and Petersson, Maria and Bensing, Sophie and Åkerman, Anna Karin and Borg, Henrik and Ekman, Bertil and Robért, Jonas and Engström, Britt Edén and Ragnarsson, Oskar and Burman, Pia and Dahlqvist, Per}},
  issn         = {{2472-1972}},
  keywords     = {{acromegaly; deep learning; diagnostic delay; face classification; face photographs; machine learning; screening}},
  language     = {{eng}},
  number       = {{2}},
  publisher    = {{Oxford University Press}},
  series       = {{Journal of the Endocrine Society}},
  title        = {{Detection of Acromegaly From Facial Images Using Machine Learning : A Comparison With Clinical Experts}},
  url          = {{http://dx.doi.org/10.1210/jendso/bvaf203}},
  doi          = {{10.1210/jendso/bvaf203}},
  volume       = {{10}},
  year         = {{2026}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Detection of Acromegaly From Facial Images Using Machine Learning : A Comparison With Clinical Experts