Skip to main content

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Predicting Single Cell Expression from Multiplex-Immunofloresent Imaging and Bulk-Count Data

Heidrich, Markus (2024) BINP52 20232
Degree Projects in Bioinformatics
Abstract
Spatial omics allows for the investigation of the Tumor Immune Microenvironment (TIME). Stratifying patients by their TIMEs contributes with insights on tumor immune escape, and help in tumor drug targeting. Tools to investigate such spatial omics approaches suffer from high costs and limited direct clinical applicability. We address this issue by proposing a Machine Learning model which learns to deconvolute bulk expressions based on immunofluorescence (IF) imaging. By utilizing selected, specific celltype stainings, we aim to improve predictive power over Hematoxylin and Eosin (H&E) imaging. Explicitly, our model learns visual representations of cells in a contrastive manner, utilizing Graph Neural Networks (GNN) to predict expressions... (More)
Spatial omics allows for the investigation of the Tumor Immune Microenvironment (TIME). Stratifying patients by their TIMEs contributes with insights on tumor immune escape, and help in tumor drug targeting. Tools to investigate such spatial omics approaches suffer from high costs and limited direct clinical applicability. We address this issue by proposing a Machine Learning model which learns to deconvolute bulk expressions based on immunofluorescence (IF) imaging. By utilizing selected, specific celltype stainings, we aim to improve predictive power over Hematoxylin and Eosin (H&E) imaging. Explicitly, our model learns visual representations of cells in a contrastive manner, utilizing Graph Neural Networks (GNN) to predict expressions from a cell graph of learned cell representations, enabling the trained model to predict high-plex single cell (sc) expression from just four marker images. We validate our model performance in depth using our in-house ovarian cancer (OC) GeoMx digital spatial profiling (DSP) dataset, as well as the CycIF (Cyclic IF) colorectal cancer (CRC) 2022 Atlas up to the single cell level. While the model struggles with accurate sc expressions, accuracy is dramatically improved when considering groupings of cells, enabling future TIME stratification based on just low-plex IF images, while allowing for standard sc analysis. We also highlight potential avenues for future improvements. (Less)
Popular Abstract
Decoding Tumor Microenvironments using AI and Imaging

Understanding the complex interactions between immune and tumor cells is crucial for advancing cancer treatments. Our project leverages cutting-edge machine learning to decode these intricate environments using cost-effective imaging techniques.

Cancer’s ability to evade the immune system, known as tumor immune escape, complicates the selection of effective immunotherapy treatments. Understanding the Tumor Immune Microenvironment (TIME) is vital for identifying drug targets and tailoring treatments to individual patients. Traditional spatial omics methods, which involve high-resolution imaging and detailed molecular profiling of tissue samples, offer valuable insights but are... (More)
Decoding Tumor Microenvironments using AI and Imaging

Understanding the complex interactions between immune and tumor cells is crucial for advancing cancer treatments. Our project leverages cutting-edge machine learning to decode these intricate environments using cost-effective imaging techniques.

Cancer’s ability to evade the immune system, known as tumor immune escape, complicates the selection of effective immunotherapy treatments. Understanding the Tumor Immune Microenvironment (TIME) is vital for identifying drug targets and tailoring treatments to individual patients. Traditional spatial omics methods, which involve high-resolution imaging and detailed molecular profiling of tissue samples, offer valuable insights but are often prohibitively expensive and challenging to implement in clinical settings.

To address these limitations, we developed a machine learning model designed to predict detailed molecular profiles from more accessible imaging data. Specifically, our model uses immunofluorescence (IF) imaging to estimate the complex gene and protein expression patterns typically obtained through more costly techniques. By focusing on key visual cell-type markers, our approach aims to maintain accuracy while reducing the need for extensive resources. Our model employs advanced techniques such as contrastive learning and Graph Neural Networks (GNN) to analyze IF images. We train the model to predict molecular expressions of single cells. We validated our approach using a unique dataset from ovarian cancer (OC) and colorectal cancer (CRC), achieving promising results in group-level analysis.

Methods and Findings
We used a combination of machine learning techniques to achieve our objectives. First, we trained a Convolutional Neural Network model in a contrastive manner to learn visual representations of individual cells. These representations served as nodes in a cell graph, where a GNN aggregated information of neighboring cells to predict cell expressions. Our validation process involved extensive testing with our in-house OC dataset and the CRC 2022 Atlas. While the model faced challenges in predicting individual single cell expressions, it performed significantly better when analyzing groups of cells. This group-level accuracy is crucial to analyze or predict patient responses to treatments and could potentially guide immunotherapy decisions based on readily available IF images.

Applications and Future Directions
Our research demonstrates that it is possible to glean detailed cell expression insights from IF imaging techniques, potentially aiding how we study and treat cancer. By enabling patient stratification with just low-plex IF images, our model offers a more practical and cost-effective tool for clinical settings. Future improvements could focus on enhancing single-cell prediction accuracy and expanding the model to other types of cancers and higher plex imaging. Additionally, integrating this approach with other diagnostic tools could further refine patient-specific treatment strategies.

Master’s Degree Project Bioinformatics 60 credits 2024
Department of Biology, Lund University

Advisor: Anna Gerdtsson
Department of Lund Immunotechnology (Less)
Please use this url to cite or link to this publication:
author
Heidrich, Markus
supervisor
organization
course
BINP52 20232
year
type
H2 - Master's Degree (Two Years)
subject
language
English
id
9176797
date added to LUP
2024-10-18 14:40:01
date last changed
2024-10-18 14:40:01
@misc{9176797,
  abstract     = {{Spatial omics allows for the investigation of the Tumor Immune Microenvironment (TIME). Stratifying patients by their TIMEs contributes with insights on tumor immune escape, and help in tumor drug targeting. Tools to investigate such spatial omics approaches suffer from high costs and limited direct clinical applicability. We address this issue by proposing a Machine Learning model which learns to deconvolute bulk expressions based on immunofluorescence (IF) imaging. By utilizing selected, specific celltype stainings, we aim to improve predictive power over Hematoxylin and Eosin (H&E) imaging. Explicitly, our model learns visual representations of cells in a contrastive manner, utilizing Graph Neural Networks (GNN) to predict expressions from a cell graph of learned cell representations, enabling the trained model to predict high-plex single cell (sc) expression from just four marker images. We validate our model performance in depth using our in-house ovarian cancer (OC) GeoMx digital spatial profiling (DSP) dataset, as well as the CycIF (Cyclic IF) colorectal cancer (CRC) 2022 Atlas up to the single cell level. While the model struggles with accurate sc expressions, accuracy is dramatically improved when considering groupings of cells, enabling future TIME stratification based on just low-plex IF images, while allowing for standard sc analysis. We also highlight potential avenues for future improvements.}},
  author       = {{Heidrich, Markus}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{Predicting Single Cell Expression from Multiplex-Immunofloresent Imaging and Bulk-Count Data}},
  year         = {{2024}},
}