Skip to main content

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Enhanced Disease Classification with HybridCNNViT model and Synthetic Oversampling (SMOTE)

Boehm, Kristina LU and Singh, Raghwendra LU (2025) STAN40 20251
Department of Statistics
Abstract
Early and accurate classification of skin diseases plays a crucial role in improving diagnostic efficiency and patient outcomes. This thesis introduces HybridCNNViT, a deep learning algorithm that combines a setup of Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to capture both local texture and global structure from skin lesion images. To address the common issue of class imbalance, where rare but critical skin conditions are underrepresented, the model applies the Synthetic Minority Oversampling Technique (SMOTE) during data preprocessing. Further enhancements include: attention mechanisms to help the model focus on relevant regions and regularization techniques to prevent overfitting. The training process also... (More)
Early and accurate classification of skin diseases plays a crucial role in improving diagnostic efficiency and patient outcomes. This thesis introduces HybridCNNViT, a deep learning algorithm that combines a setup of Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to capture both local texture and global structure from skin lesion images. To address the common issue of class imbalance, where rare but critical skin conditions are underrepresented, the model applies the Synthetic Minority Oversampling Technique (SMOTE) during data preprocessing. Further enhancements include: attention mechanisms to help the model focus on relevant regions and regularization techniques to prevent overfitting. The training process also benefits from dynamic learning rate scheduling and data augmentation strategies. Evaluated on a challenging dermatology dataset, the model achieves a test precision of 88.42% and a weighted F1 score of 0.8802, demonstrating competitive performance compared to existing methods and a strong potential for clinical application. (Less)
Popular Abstract
Early and accurate classification of skin diseases plays a crucial role in improving diagnostic efficiency and patient outcomes. This thesis introduces HybridCNNViT, a deep learning algorithm that combines a setup of Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to capture both local texture and global structure from skin lesion images. To address the common issue of class imbalance, where rare but critical skin conditions are underrepresented, the model applies the Synthetic Minority Oversampling Technique (SMOTE) during data preprocessing. Further enhancements include: attention mechanisms to help the model focus on relevant regions and regularization techniques to prevent overfitting. The training process also... (More)
Early and accurate classification of skin diseases plays a crucial role in improving diagnostic efficiency and patient outcomes. This thesis introduces HybridCNNViT, a deep learning algorithm that combines a setup of Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to capture both local texture and global structure from skin lesion images. To address the common issue of class imbalance, where rare but critical skin conditions are underrepresented, the model applies the Synthetic Minority Oversampling Technique (SMOTE) during data preprocessing. Further enhancements include: attention mechanisms to help the model focus on relevant regions and regularization techniques to prevent overfitting. The training process also benefits from dynamic learning rate scheduling and data augmentation strategies. Evaluated on a challenging dermatology dataset, the model achieves a test precision of 88.42% and a weighted F1 score of 0.8802, demonstrating competitive performance compared to existing methods and a strong potential for clinical application. (Less)
Please use this url to cite or link to this publication:
author
Boehm, Kristina LU and Singh, Raghwendra LU
supervisor
organization
course
STAN40 20251
year
type
H1 - Master's Degree (One Year)
subject
keywords
ViT (Vision Transformer), CNN (Convolutional Neural Network), SMOTE (Synthetic Minority Over-sampling Technique), CBAM (Convolutional Block Attention Module)
language
English
id
9204692
date added to LUP
2025-07-09 11:43:46
date last changed
2025-07-09 11:43:46
@misc{9204692,
  abstract     = {{Early and accurate classification of skin diseases plays a crucial role in improving diagnostic efficiency and patient outcomes. This thesis introduces HybridCNNViT, a deep learning algorithm that combines a setup of Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to capture both local texture and global structure from skin lesion images. To address the common issue of class imbalance, where rare but critical skin conditions are underrepresented, the model applies the Synthetic Minority Oversampling Technique (SMOTE) during data preprocessing. Further enhancements include: attention mechanisms to help the model focus on relevant regions and regularization techniques to prevent overfitting. The training process also benefits from dynamic learning rate scheduling and data augmentation strategies. Evaluated on a challenging dermatology dataset, the model achieves a test precision of 88.42% and a weighted F1 score of 0.8802, demonstrating competitive performance compared to existing methods and a strong potential for clinical application.}},
  author       = {{Boehm, Kristina and Singh, Raghwendra}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{Enhanced Disease Classification with HybridCNNViT model and Synthetic Oversampling (SMOTE)}},
  year         = {{2025}},
}