Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Predicting type 2 diabetes via machine learning integration of multiple omics from human pancreatic islets

Rönn, Tina LU ; Perfilyev, Alexander LU orcid ; Oskolkov, Nikolay LU and Ling, Charlotte LU orcid (2024) In Scientific Reports 14(1).
Abstract

Type 2 diabetes (T2D) is the fastest growing non-infectious disease worldwide. Impaired insulin secretion from pancreatic beta-cells is a hallmark of T2D, but the mechanisms behind this defect are insufficiently characterized. Integrating multiple layers of biomedical information, such as different Omics, may allow more accurate understanding of complex diseases such as T2D. Our aim was to explore and use Machine Learning to integrate multiple sources of biological/molecular information (multiOmics), in our case RNA-sequening, DNA methylation, SNP and phenotypic data from islet donors with T2D and non-diabetic controls. We exploited Machine Learning to perform multiOmics integration of DNA methylation, expression, SNPs, and phenotypes... (More)

Type 2 diabetes (T2D) is the fastest growing non-infectious disease worldwide. Impaired insulin secretion from pancreatic beta-cells is a hallmark of T2D, but the mechanisms behind this defect are insufficiently characterized. Integrating multiple layers of biomedical information, such as different Omics, may allow more accurate understanding of complex diseases such as T2D. Our aim was to explore and use Machine Learning to integrate multiple sources of biological/molecular information (multiOmics), in our case RNA-sequening, DNA methylation, SNP and phenotypic data from islet donors with T2D and non-diabetic controls. We exploited Machine Learning to perform multiOmics integration of DNA methylation, expression, SNPs, and phenotypes from pancreatic islets of 110 individuals, with ~ 30% being T2D cases. DNA methylation was analyzed using Infinium MethylationEPIC array, expression was analyzed using RNA-sequencing, and SNPs were analyzed using HumanOmniExpress arrays. Supervised linear multiOmics integration via DIABLO based on Partial Least Squares (PLS) achieved an accuracy of 91 ± 15% of T2D prediction with an area under the curve of 0.96 ± 0.08 on the test dataset after cross-validation. Biomarkers identified by this multiOmics integration, including SACS and TXNIP DNA methylation, OPRD1 and RHOT1 expression and a SNP annotated to ANO1, provide novel insights into the interplay between different biological mechanisms contributing to T2D. This Machine Learning approach of multiOmics cross-sectional data from human pancreatic islets achieved a promising accuracy of T2D prediction, which may potentially find broad applications in clinical diagnostics. In addition, it delivered novel candidate biomarkers for T2D and links between them across the different Omics.

(Less)
Please use this url to cite or link to this publication:
author
; ; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
Beta-cell, DNA methylation, Epigenetics, EWAS, Genetic variation, GWAS, Insulin secretion, Machine learning, Metabolic disease, MultiOmics analysis, Omics integration, RNA-sequencing
in
Scientific Reports
volume
14
issue
1
article number
14637
publisher
Nature Publishing Group
external identifiers
  • pmid:38918439
  • scopus:85196868318
ISSN
2045-2322
DOI
10.1038/s41598-024-64846-3
language
English
LU publication?
yes
id
00d82dca-b26e-4ca4-8987-71182d409534
date added to LUP
2024-07-02 13:44:48
date last changed
2024-07-03 03:00:02
@article{00d82dca-b26e-4ca4-8987-71182d409534,
  abstract     = {{<p>Type 2 diabetes (T2D) is the fastest growing non-infectious disease worldwide. Impaired insulin secretion from pancreatic beta-cells is a hallmark of T2D, but the mechanisms behind this defect are insufficiently characterized. Integrating multiple layers of biomedical information, such as different Omics, may allow more accurate understanding of complex diseases such as T2D. Our aim was to explore and use Machine Learning to integrate multiple sources of biological/molecular information (multiOmics), in our case RNA-sequening, DNA methylation, SNP and phenotypic data from islet donors with T2D and non-diabetic controls. We exploited Machine Learning to perform multiOmics integration of DNA methylation, expression, SNPs, and phenotypes from pancreatic islets of 110 individuals, with ~ 30% being T2D cases. DNA methylation was analyzed using Infinium MethylationEPIC array, expression was analyzed using RNA-sequencing, and SNPs were analyzed using HumanOmniExpress arrays. Supervised linear multiOmics integration via DIABLO based on Partial Least Squares (PLS) achieved an accuracy of 91 ± 15% of T2D prediction with an area under the curve of 0.96 ± 0.08 on the test dataset after cross-validation. Biomarkers identified by this multiOmics integration, including SACS and TXNIP DNA methylation, OPRD1 and RHOT1 expression and a SNP annotated to ANO1, provide novel insights into the interplay between different biological mechanisms contributing to T2D. This Machine Learning approach of multiOmics cross-sectional data from human pancreatic islets achieved a promising accuracy of T2D prediction, which may potentially find broad applications in clinical diagnostics. In addition, it delivered novel candidate biomarkers for T2D and links between them across the different Omics.</p>}},
  author       = {{Rönn, Tina and Perfilyev, Alexander and Oskolkov, Nikolay and Ling, Charlotte}},
  issn         = {{2045-2322}},
  keywords     = {{Beta-cell; DNA methylation; Epigenetics; EWAS; Genetic variation; GWAS; Insulin secretion; Machine learning; Metabolic disease; MultiOmics analysis; Omics integration; RNA-sequencing}},
  language     = {{eng}},
  number       = {{1}},
  publisher    = {{Nature Publishing Group}},
  series       = {{Scientific Reports}},
  title        = {{Predicting type 2 diabetes via machine learning integration of multiple omics from human pancreatic islets}},
  url          = {{http://dx.doi.org/10.1038/s41598-024-64846-3}},
  doi          = {{10.1038/s41598-024-64846-3}},
  volume       = {{14}},
  year         = {{2024}},
}