Detection of Acromegaly From Facial Images Using Machine Learning : A Comparison With Clinical Experts
(2026) In Journal of the Endocrine Society 10(2).- Abstract
Context Substantial diagnostic delay in acromegaly contributes to increased morbidity and mortality. Screening attempts in high-risk groups have yielded few positive cases, underscoring the need for simple and precise prescreening methods. Objective Machine-learning analysis of facial images shows promise for acromegaly detection but requires validation in larger, well-characterized cohorts using robust machine-learning frameworks as performed in this study. Methods Facial images from different angles were collected via smartphone from 155 acromegaly patients (79% biochemically controlled) and 153 matched controls at all Swedish university hospitals. Six machine-learning models were trained to distinguish acromegaly from control images,... (More)
Context Substantial diagnostic delay in acromegaly contributes to increased morbidity and mortality. Screening attempts in high-risk groups have yielded few positive cases, underscoring the need for simple and precise prescreening methods. Objective Machine-learning analysis of facial images shows promise for acromegaly detection but requires validation in larger, well-characterized cohorts using robust machine-learning frameworks as performed in this study. Methods Facial images from different angles were collected via smartphone from 155 acromegaly patients (79% biochemically controlled) and 153 matched controls at all Swedish university hospitals. Six machine-learning models were trained to distinguish acromegaly from control images, including 3 deep neural networks pretrained on diverse image datasets (ImageNet models: ResNet50, InceptionV2, and DenseNet121) and 1 network pretrained specifically on facial images (FaRL). Model performance was compared to assessment by 12 experienced endocrinologists. Results The diagnostic accuracy of the FaRL-based model was superior to all ImageNet models and matched the accuracy of human experts (area under the receiver operating characteristic curve 0.89 for both) with similar specificity (0.87 vs 0.93) but higher sensitivity (0.82 vs 0.66). Classification agreement between the best machine-learning model (FaRL) and human experts was 86% for true negatives and 60% for true positives. Machine-learning models and human experts both showed greater sensitivity in identifying acromegaly in male patients. Conclusion A deep learning model pretrained on facial features (FaRL) can detect acromegaly from standard photographs with accuracy comparable to that of expert endocrinologists. This supports the feasibility of face analysis as a screening tool for acromegaly.
(Less)
- author
- organization
- publishing date
- 2026
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- acromegaly, deep learning, diagnostic delay, face classification, face photographs, machine learning, screening
- in
- Journal of the Endocrine Society
- volume
- 10
- issue
- 2
- article number
- bvaf203
- publisher
- Oxford University Press
- external identifiers
-
- scopus:105028642835
- pmid:41608201
- ISSN
- 2472-1972
- DOI
- 10.1210/jendso/bvaf203
- language
- English
- LU publication?
- yes
- id
- 268ff258-4cf3-4229-931e-c374769d4b79
- date added to LUP
- 2026-02-19 10:31:48
- date last changed
- 2026-03-19 17:25:39
@article{268ff258-4cf3-4229-931e-c374769d4b79,
abstract = {{<p>Context Substantial diagnostic delay in acromegaly contributes to increased morbidity and mortality. Screening attempts in high-risk groups have yielded few positive cases, underscoring the need for simple and precise prescreening methods. Objective Machine-learning analysis of facial images shows promise for acromegaly detection but requires validation in larger, well-characterized cohorts using robust machine-learning frameworks as performed in this study. Methods Facial images from different angles were collected via smartphone from 155 acromegaly patients (79% biochemically controlled) and 153 matched controls at all Swedish university hospitals. Six machine-learning models were trained to distinguish acromegaly from control images, including 3 deep neural networks pretrained on diverse image datasets (ImageNet models: ResNet50, InceptionV2, and DenseNet121) and 1 network pretrained specifically on facial images (FaRL). Model performance was compared to assessment by 12 experienced endocrinologists. Results The diagnostic accuracy of the FaRL-based model was superior to all ImageNet models and matched the accuracy of human experts (area under the receiver operating characteristic curve 0.89 for both) with similar specificity (0.87 vs 0.93) but higher sensitivity (0.82 vs 0.66). Classification agreement between the best machine-learning model (FaRL) and human experts was 86% for true negatives and 60% for true positives. Machine-learning models and human experts both showed greater sensitivity in identifying acromegaly in male patients. Conclusion A deep learning model pretrained on facial features (FaRL) can detect acromegaly from standard photographs with accuracy comparable to that of expert endocrinologists. This supports the feasibility of face analysis as a screening tool for acromegaly.</p>}},
author = {{Vouzouneraki, Konstantina and Ylipää, Erik and Olsson, Tommy and Berinder, Katarina and Höybye, Charlotte and Petersson, Maria and Bensing, Sophie and Åkerman, Anna Karin and Borg, Henrik and Ekman, Bertil and Robért, Jonas and Engström, Britt Edén and Ragnarsson, Oskar and Burman, Pia and Dahlqvist, Per}},
issn = {{2472-1972}},
keywords = {{acromegaly; deep learning; diagnostic delay; face classification; face photographs; machine learning; screening}},
language = {{eng}},
number = {{2}},
publisher = {{Oxford University Press}},
series = {{Journal of the Endocrine Society}},
title = {{Detection of Acromegaly From Facial Images Using Machine Learning : A Comparison With Clinical Experts}},
url = {{http://dx.doi.org/10.1210/jendso/bvaf203}},
doi = {{10.1210/jendso/bvaf203}},
volume = {{10}},
year = {{2026}},
}