A deep learning approach to prediction of blood group antigens from genomic data
(2024) In Transfusion 64(11). p.2179-2195- Abstract
Background: Deep learning methods are revolutionizing natural science. In this study, we aim to apply such techniques to develop blood type prediction models based on cheap to analyze and easily scalable screening array genotyping platforms. Methods: Combining existing blood types from blood banks and imputed screening array genotypes for ~111,000 Danish and 1168 Finnish blood donors, we used deep learning techniques to train and validate blood type prediction models for 36 antigens in 15 blood group systems. To account for missing genotypes a denoising autoencoder initial step was utilized, followed by a convolutional neural network blood type classifier. Results: Two thirds of the trained blood type prediction models demonstrated an... (More)
Background: Deep learning methods are revolutionizing natural science. In this study, we aim to apply such techniques to develop blood type prediction models based on cheap to analyze and easily scalable screening array genotyping platforms. Methods: Combining existing blood types from blood banks and imputed screening array genotypes for ~111,000 Danish and 1168 Finnish blood donors, we used deep learning techniques to train and validate blood type prediction models for 36 antigens in 15 blood group systems. To account for missing genotypes a denoising autoencoder initial step was utilized, followed by a convolutional neural network blood type classifier. Results: Two thirds of the trained blood type prediction models demonstrated an F1-accuracy above 99%. Models for antigens with low or high frequencies like, for example, Cw, low training cohorts like, for example, Cob, or very complicated genetic underpinning like, for example, RhD, proved to be more challenging for high accuracy (>99%) DL modeling. However, in the Danish cohort only 4 out of 36 models (Cob, Cw, D-weak, Kpa) failed to achieve a prediction F1-accuracy above 97%. This high predictive performance was replicated in the Finnish cohort. Discussion: High accuracy in a variety of blood groups proves viability of deep learning-based blood type prediction using array chip genotypes, even in blood groups with nontrivial genetic underpinnings. These techniques are suitable for aiding in identifying blood donors with rare blood types by greatly narrowing down the potential pool of candidate donors before clinical grade confirmation.
(Less)
- author
- organization
- publishing date
- 2024-11
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- AI, blood antigen, blood types, convolutional neural network, deep learning, denoising autoencoder, genetic prediction, Illumina GSA
- in
- Transfusion
- volume
- 64
- issue
- 11
- pages
- 17 pages
- publisher
- Wiley-Blackwell
- external identifiers
-
- scopus:85203695657
- pmid:39268576
- ISSN
- 0041-1132
- DOI
- 10.1111/trf.18013
- language
- English
- LU publication?
- yes
- additional info
- Publisher Copyright: © 2024 The Author(s). Transfusion published by Wiley Periodicals LLC on behalf of AABB.
- id
- f7e09d57-de1f-4372-8dbd-6de3d3a39f56
- date added to LUP
- 2024-12-04 09:16:10
- date last changed
- 2025-07-17 03:37:26
@article{f7e09d57-de1f-4372-8dbd-6de3d3a39f56, abstract = {{<p>Background: Deep learning methods are revolutionizing natural science. In this study, we aim to apply such techniques to develop blood type prediction models based on cheap to analyze and easily scalable screening array genotyping platforms. Methods: Combining existing blood types from blood banks and imputed screening array genotypes for ~111,000 Danish and 1168 Finnish blood donors, we used deep learning techniques to train and validate blood type prediction models for 36 antigens in 15 blood group systems. To account for missing genotypes a denoising autoencoder initial step was utilized, followed by a convolutional neural network blood type classifier. Results: Two thirds of the trained blood type prediction models demonstrated an F1-accuracy above 99%. Models for antigens with low or high frequencies like, for example, C<sup>w</sup>, low training cohorts like, for example, Co<sup>b</sup>, or very complicated genetic underpinning like, for example, RhD, proved to be more challenging for high accuracy (>99%) DL modeling. However, in the Danish cohort only 4 out of 36 models (Co<sup>b</sup>, C<sup>w</sup>, D-weak, Kp<sup>a</sup>) failed to achieve a prediction F1-accuracy above 97%. This high predictive performance was replicated in the Finnish cohort. Discussion: High accuracy in a variety of blood groups proves viability of deep learning-based blood type prediction using array chip genotypes, even in blood groups with nontrivial genetic underpinnings. These techniques are suitable for aiding in identifying blood donors with rare blood types by greatly narrowing down the potential pool of candidate donors before clinical grade confirmation.</p>}}, author = {{Moslemi, Camous and Sækmose, Susanne and Larsen, Rune and Brodersen, Thorsten and Bay, Jakob T. and Didriksen, Maria and Nielsen, Kaspar R. and Bruun, Mie T. and Dowsett, Joseph and Dinh, Khoa M. and Mikkelsen, Christina and Hyvärinen, Kati and Ritari, Jarmo and Partanen, Jukka and Ullum, Henrik and Erikstrup, Christian and Ostrowski, Sisse R. and Olsson, Martin L. and Pedersen, Ole B.}}, issn = {{0041-1132}}, keywords = {{AI; blood antigen; blood types; convolutional neural network; deep learning; denoising autoencoder; genetic prediction; Illumina GSA}}, language = {{eng}}, number = {{11}}, pages = {{2179--2195}}, publisher = {{Wiley-Blackwell}}, series = {{Transfusion}}, title = {{A deep learning approach to prediction of blood group antigens from genomic data}}, url = {{http://dx.doi.org/10.1111/trf.18013}}, doi = {{10.1111/trf.18013}}, volume = {{64}}, year = {{2024}}, }