Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

A deep learning approach to prediction of blood group antigens from genomic data

Moslemi, Camous ; Sækmose, Susanne ; Larsen, Rune ; Brodersen, Thorsten ; Bay, Jakob T. ; Didriksen, Maria ; Nielsen, Kaspar R. ; Bruun, Mie T. ; Dowsett, Joseph and Dinh, Khoa M. , et al. (2024) In Transfusion 64(11). p.2179-2195
Abstract

Background: Deep learning methods are revolutionizing natural science. In this study, we aim to apply such techniques to develop blood type prediction models based on cheap to analyze and easily scalable screening array genotyping platforms. Methods: Combining existing blood types from blood banks and imputed screening array genotypes for ~111,000 Danish and 1168 Finnish blood donors, we used deep learning techniques to train and validate blood type prediction models for 36 antigens in 15 blood group systems. To account for missing genotypes a denoising autoencoder initial step was utilized, followed by a convolutional neural network blood type classifier. Results: Two thirds of the trained blood type prediction models demonstrated an... (More)

Background: Deep learning methods are revolutionizing natural science. In this study, we aim to apply such techniques to develop blood type prediction models based on cheap to analyze and easily scalable screening array genotyping platforms. Methods: Combining existing blood types from blood banks and imputed screening array genotypes for ~111,000 Danish and 1168 Finnish blood donors, we used deep learning techniques to train and validate blood type prediction models for 36 antigens in 15 blood group systems. To account for missing genotypes a denoising autoencoder initial step was utilized, followed by a convolutional neural network blood type classifier. Results: Two thirds of the trained blood type prediction models demonstrated an F1-accuracy above 99%. Models for antigens with low or high frequencies like, for example, Cw, low training cohorts like, for example, Cob, or very complicated genetic underpinning like, for example, RhD, proved to be more challenging for high accuracy (>99%) DL modeling. However, in the Danish cohort only 4 out of 36 models (Cob, Cw, D-weak, Kpa) failed to achieve a prediction F1-accuracy above 97%. This high predictive performance was replicated in the Finnish cohort. Discussion: High accuracy in a variety of blood groups proves viability of deep learning-based blood type prediction using array chip genotypes, even in blood groups with nontrivial genetic underpinnings. These techniques are suitable for aiding in identifying blood donors with rare blood types by greatly narrowing down the potential pool of candidate donors before clinical grade confirmation.

(Less)
Please use this url to cite or link to this publication:
author
; ; ; ; ; ; ; ; and , et al. (More)
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; and (Less)
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
AI, blood antigen, blood types, convolutional neural network, deep learning, denoising autoencoder, genetic prediction, Illumina GSA
in
Transfusion
volume
64
issue
11
pages
17 pages
publisher
Wiley-Blackwell
external identifiers
  • scopus:85203695657
  • pmid:39268576
ISSN
0041-1132
DOI
10.1111/trf.18013
language
English
LU publication?
yes
additional info
Publisher Copyright: © 2024 The Author(s). Transfusion published by Wiley Periodicals LLC on behalf of AABB.
id
f7e09d57-de1f-4372-8dbd-6de3d3a39f56
date added to LUP
2024-12-04 09:16:10
date last changed
2025-07-17 03:37:26
@article{f7e09d57-de1f-4372-8dbd-6de3d3a39f56,
  abstract     = {{<p>Background: Deep learning methods are revolutionizing natural science. In this study, we aim to apply such techniques to develop blood type prediction models based on cheap to analyze and easily scalable screening array genotyping platforms. Methods: Combining existing blood types from blood banks and imputed screening array genotypes for ~111,000 Danish and 1168 Finnish blood donors, we used deep learning techniques to train and validate blood type prediction models for 36 antigens in 15 blood group systems. To account for missing genotypes a denoising autoencoder initial step was utilized, followed by a convolutional neural network blood type classifier. Results: Two thirds of the trained blood type prediction models demonstrated an F1-accuracy above 99%. Models for antigens with low or high frequencies like, for example, C<sup>w</sup>, low training cohorts like, for example, Co<sup>b</sup>, or very complicated genetic underpinning like, for example, RhD, proved to be more challenging for high accuracy (&gt;99%) DL modeling. However, in the Danish cohort only 4 out of 36 models (Co<sup>b</sup>, C<sup>w</sup>, D-weak, Kp<sup>a</sup>) failed to achieve a prediction F1-accuracy above 97%. This high predictive performance was replicated in the Finnish cohort. Discussion: High accuracy in a variety of blood groups proves viability of deep learning-based blood type prediction using array chip genotypes, even in blood groups with nontrivial genetic underpinnings. These techniques are suitable for aiding in identifying blood donors with rare blood types by greatly narrowing down the potential pool of candidate donors before clinical grade confirmation.</p>}},
  author       = {{Moslemi, Camous and Sækmose, Susanne and Larsen, Rune and Brodersen, Thorsten and Bay, Jakob T. and Didriksen, Maria and Nielsen, Kaspar R. and Bruun, Mie T. and Dowsett, Joseph and Dinh, Khoa M. and Mikkelsen, Christina and Hyvärinen, Kati and Ritari, Jarmo and Partanen, Jukka and Ullum, Henrik and Erikstrup, Christian and Ostrowski, Sisse R. and Olsson, Martin L. and Pedersen, Ole B.}},
  issn         = {{0041-1132}},
  keywords     = {{AI; blood antigen; blood types; convolutional neural network; deep learning; denoising autoencoder; genetic prediction; Illumina GSA}},
  language     = {{eng}},
  number       = {{11}},
  pages        = {{2179--2195}},
  publisher    = {{Wiley-Blackwell}},
  series       = {{Transfusion}},
  title        = {{A deep learning approach to prediction of blood group antigens from genomic data}},
  url          = {{http://dx.doi.org/10.1111/trf.18013}},
  doi          = {{10.1111/trf.18013}},
  volume       = {{64}},
  year         = {{2024}},
}