Abstract
Skin cancer is a significant global health threat, and its incidence has risen in recent years. Traditional diagnostic methods, such as dermoscopy and biopsy, are effective but have limitations in subjectivity, invasiveness, and accessibility. To address these challenges, this study explores deep learning models for skin disease classification using the PAD-UFES-20 dataset, which contains smartphone-captured images and clinical patient information, with the goal of providing a scalable, non-invasive screening tool that complements conventional methods. The proposed approach uses intermediate fusion to combine the dataset's modalities: vision transformers (ViT or DINOv2) extract image features, which are fused with the clinical data and classified with a gradient-boosted model (XGBoost). Data preprocessing and augmentation techniques were applied to handle class imbalance and low-quality images. Results show that the proposed approach achieves high accuracy, outperforming benchmarks from existing studies. Grad-CAM visualizations confirmed that the models primarily attend to relevant skin lesion features, while also revealing areas for improvement, such as reducing sensitivity to irrelevant regions. This research demonstrates the promise of deep learning in dermatology for remote skin disease screening, potentially improving access to dermatological care in underserved populations and healthcare deserts. Future work will focus on further improving the approach's performance and generalization, and on refining the explanations that accompany the model's classifications.
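To make the intermediate-fusion step concrete, the sketch below shows the core operation: per-sample image embeddings and encoded clinical features are concatenated into a single vector before classification. This is a minimal illustration, not the paper's implementation; the 768-dimensional embedding matches a DINOv2 ViT-B backbone, while the 21-feature clinical encoding, the sample count, and the random stand-in data are assumptions for demonstration.

```python
import numpy as np

n_samples = 4
# Stand-in for ViT/DINOv2 image embeddings (768-d for a ViT-B backbone)
img_features = np.random.rand(n_samples, 768)
# Stand-in for one-hot/numeric encoding of PAD-UFES-20 clinical metadata
# (21 features is a hypothetical choice for this sketch)
clinical_features = np.random.rand(n_samples, 21)

# Intermediate fusion: concatenate the two modalities per sample,
# producing one fused feature vector per image for the classifier
fused = np.concatenate([img_features, clinical_features], axis=1)
assert fused.shape == (n_samples, 768 + 21)
# `fused` would then be fed to the classifier, e.g.
# xgboost.XGBClassifier().fit(fused, labels)
```

The design choice here is that fusion happens at the feature level (after image encoding, before classification), rather than late fusion of per-modality predictions.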