A deep learning approach to prediction of blood group antigens from genomic data

Moslemi, Camous; Sækmose, Susanne; Larsen, Rune; Brodersen, Thorsten; Bay, Jakob T.; Didriksen, Maria; Nielsen, Kaspar R.; Bruun, Mie T.; Dowsett, Joseph; Dinh, Khoa M.; Mikkelsen, Christina; Hyvärinen, Kati; Ritari, Jarmo; Partanen, Jukka; Ullum, Henrik; Erikstrup, Christian; Ostrowski, Sisse R.; Olsson, Martin L.; Pedersen, Ole B.

A deep learning approach to prediction of blood group antigens from genomic data

Mark

Moslemi, Camous ; Sækmose, Susanne ; Larsen, Rune ; Brodersen, Thorsten ; Bay, Jakob T. ; Didriksen, Maria ; Nielsen, Kaspar R. ; Bruun, Mie T. ; Dowsett, Joseph and Dinh, Khoa M. , et al. (2024) In Transfusion 64(11). p.2179-2195

Abstract: Background: Deep learning methods are revolutionizing natural science. In this study, we aim to apply such techniques to develop blood type prediction models based on cheap to analyze and easily scalable screening array genotyping platforms. Methods: Combining existing blood types from blood banks and imputed screening array genotypes for ~111,000 Danish and 1168 Finnish blood donors, we used deep learning techniques to train and validate blood type prediction models for 36 antigens in 15 blood group systems. To account for missing genotypes a denoising autoencoder initial step was utilized, followed by a convolutional neural network blood type classifier. Results: Two thirds of the trained blood type prediction models demonstrated an... (More); Background: Deep learning methods are revolutionizing natural science. In this study, we aim to apply such techniques to develop blood type prediction models based on cheap to analyze and easily scalable screening array genotyping platforms. Methods: Combining existing blood types from blood banks and imputed screening array genotypes for ~111,000 Danish and 1168 Finnish blood donors, we used deep learning techniques to train and validate blood type prediction models for 36 antigens in 15 blood group systems. To account for missing genotypes a denoising autoencoder initial step was utilized, followed by a convolutional neural network blood type classifier. Results: Two thirds of the trained blood type prediction models demonstrated an F1-accuracy above 99%. Models for antigens with low or high frequencies like, for example, C^w, low training cohorts like, for example, Co^b, or very complicated genetic underpinning like, for example, RhD, proved to be more challenging for high accuracy (>99%) DL modeling. However, in the Danish cohort only 4 out of 36 models (Co^b, C^w, D-weak, Kp^a) failed to achieve a prediction F1-accuracy above 97%. This high predictive performance was replicated in the Finnish cohort. Discussion: High accuracy in a variety of blood groups proves viability of deep learning-based blood type prediction using array chip genotypes, even in blood groups with nontrivial genetic underpinnings. These techniques are suitable for aiding in identifying blood donors with rare blood types by greatly narrowing down the potential pool of candidate donors before clinical grade confirmation.
(Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/f7e09d57-de1f-4372-8dbd-6de3d3a39f56

author

Moslemi, Camous ; Sækmose, Susanne ; Larsen, Rune ; Brodersen, Thorsten ; Bay, Jakob T. ; Didriksen, Maria ; Nielsen, Kaspar R. ; Bruun, Mie T. ; Dowsett, Joseph and Dinh, Khoa M. , et al. (More)

Moslemi, Camous ; Sækmose, Susanne ; Larsen, Rune ; Brodersen, Thorsten ; Bay, Jakob T. ; Didriksen, Maria ; Nielsen, Kaspar R. ; Bruun, Mie T. ; Dowsett, Joseph ; Dinh, Khoa M. ; Mikkelsen, Christina ; Hyvärinen, Kati ; Ritari, Jarmo ; Partanen, Jukka ; Ullum, Henrik ; Erikstrup, Christian ; Ostrowski, Sisse R. ; Olsson, Martin L. ^LU

and Pedersen, Ole B. (Less)

organization

publishing date

2024-11

type

Contribution to journal

publication status

published

subject

Hematology

keywords

AI, blood antigen, blood types, convolutional neural network, deep learning, denoising autoencoder, genetic prediction, Illumina GSA

in

Transfusion

volume

64

issue

11

pages

17 pages

publisher

Wiley-Blackwell

external identifiers

scopus:85203695657
pmid:39268576

ISSN

0041-1132

DOI

10.1111/trf.18013

language

English

LU publication?

yes

additional info

id

f7e09d57-de1f-4372-8dbd-6de3d3a39f56

date added to LUP

2024-12-04 09:16:10

date last changed

2026-02-12 22:49:08

@article{f7e09d57-de1f-4372-8dbd-6de3d3a39f56,
  abstract     = {{<p>Background: Deep learning methods are revolutionizing natural science. In this study, we aim to apply such techniques to develop blood type prediction models based on cheap to analyze and easily scalable screening array genotyping platforms. Methods: Combining existing blood types from blood banks and imputed screening array genotypes for ~111,000 Danish and 1168 Finnish blood donors, we used deep learning techniques to train and validate blood type prediction models for 36 antigens in 15 blood group systems. To account for missing genotypes a denoising autoencoder initial step was utilized, followed by a convolutional neural network blood type classifier. Results: Two thirds of the trained blood type prediction models demonstrated an F1-accuracy above 99%. Models for antigens with low or high frequencies like, for example, C<sup>w</sup>, low training cohorts like, for example, Co<sup>b</sup>, or very complicated genetic underpinning like, for example, RhD, proved to be more challenging for high accuracy (&gt;99%) DL modeling. However, in the Danish cohort only 4 out of 36 models (Co<sup>b</sup>, C<sup>w</sup>, D-weak, Kp<sup>a</sup>) failed to achieve a prediction F1-accuracy above 97%. This high predictive performance was replicated in the Finnish cohort. Discussion: High accuracy in a variety of blood groups proves viability of deep learning-based blood type prediction using array chip genotypes, even in blood groups with nontrivial genetic underpinnings. These techniques are suitable for aiding in identifying blood donors with rare blood types by greatly narrowing down the potential pool of candidate donors before clinical grade confirmation.</p>}},
  author       = {{Moslemi, Camous and Sækmose, Susanne and Larsen, Rune and Brodersen, Thorsten and Bay, Jakob T. and Didriksen, Maria and Nielsen, Kaspar R. and Bruun, Mie T. and Dowsett, Joseph and Dinh, Khoa M. and Mikkelsen, Christina and Hyvärinen, Kati and Ritari, Jarmo and Partanen, Jukka and Ullum, Henrik and Erikstrup, Christian and Ostrowski, Sisse R. and Olsson, Martin L. and Pedersen, Ole B.}},
  issn         = {{0041-1132}},
  keywords     = {{AI; blood antigen; blood types; convolutional neural network; deep learning; denoising autoencoder; genetic prediction; Illumina GSA}},
  language     = {{eng}},
  number       = {{11}},
  pages        = {{2179--2195}},
  publisher    = {{Wiley-Blackwell}},
  series       = {{Transfusion}},
  title        = {{A deep learning approach to prediction of blood group antigens from genomic data}},
  url          = {{http://dx.doi.org/10.1111/trf.18013}},
  doi          = {{10.1111/trf.18013}},
  volume       = {{64}},
  year         = {{2024}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

A deep learning approach to prediction of blood group antigens from genomic data