DAP391_HAM_ENG
DAP391_HAM_ENG
Abstract
This paper presents the application of Machine Learning in skin cancer recogni-
tion from medical images using the HAM10000 dataset. We apply Explainable AI
(XAI) methods based on counterfactual explanations, as well as Lime and SHAP
techniques to interpret model decisions. Experimental results show that the model
achieves 98.20% accuracy on the test set, with a loss function of 0.0898, and XAI
methods help clarify the basis of predictions.
1 Introduction
Skin cancer is one of the most common types of cancer today. Early detection through
skin images can improve treatment success rates. The HAM10000 dataset contains
10,015 labeled skin lesion images, supporting deep learning research for automatic skin
cancer recognition. Additionally, the use of Explainable AI (XAI) methods such as
Counterfactual-based, Lime, and SHAP helps doctors better understand model decisions.
2 Research Methods
2.1 Dataset
The HAM10000 dataset was collected from different patients with seven types of skin
lesions, including malignant and benign tumors. The data was balanced and preprocessed
by normalizing images, augmenting data, and converting them into a suitable format.
1
2.2 Algorithms Used
2.2.2 Lime
2.2.3 SHAP
SHAP (SHapley Additive exPlanations) is based on game theory, calculating the con-
tribution of each data feature to the prediction outcome. SHAP uses Shapley values to
measure the impact of each variable on the model’s decision, providing a general and
precise interpretation of the factors affecting predictions.
2
3.1 Model Performance Comparison
The results show that the proposed model outperforms traditional models such as
ResNet50, VGG16, and EfficientNetB0, confirming the superiority of the applied method.
4 Conclusion
Experimental results demonstrate that the model achieves high performance in skin can-
cer recognition on the HAM10000 dataset. The application of XAI methods such as
Counterfactual-based, Lime, and SHAP provides clearer information on the model’s pre-
diction reasoning, thus better supporting clinical diagnosis.
References
[1] P. Tschandl, C. Rosendahl, H. Kittler, ”The HAM10000 Dataset: A Large Collection of
Multi-Source Dermatoscopic Images of Common Pigmented Skin Lesions,” Scientific
Data, 2018.
[3] M. Ribeiro, S. Singh, C. Guestrin, ”Why Should I Trust You? Explaining the Predic-
tions of Any Classifier,” Proceedings of the 22nd ACM SIGKDD International Con-
ference on Knowledge Discovery and Data Mining, 2016.
3
[4] S. Lundberg, S. Lee, ”A Unified Approach to Interpretable Model Predictions,” Ad-
vances in Neural Information Processing Systems, 2017.
[6] Z. Jha, ”DoubleU-Net: A Deep Convolutional Neural Network,” Papers with Code,
2021.
[7] X. Wang, ”DiffMic: Dual Guidance Diffusion Network for Microscopy Image Analy-
sis,” Papers with Code, 2023.