Machine Learning Approaches for Predicting Breast Cancer Recurrence: A Comparative Analysis

Noor Razzaq  Abbas; Hussein  Alkattan; Isam Bahaa  Aldallal

doi:10.58496/MJAIH/2025/020

PDF

Published: 2025-07-30

DOI: https://doi.org/10.58496/MJAIH/2025/020

Keywords:

Breast cancer recurrence, Machine Learning, Multi-Layer Perceptron, Feature Importance, ROC curve

Noor Razzaq Abbas

Al-Furat Al-Awsat Technical University, Technical Institute of Najaf, Najaf, Iraq

https://orcid.org/0009-0002-6093-5898

Hussein Alkattan

Department of System Programming, South Ural State University, Chelyabinsk, Russia

https://orcid.org/0000-0002-0281-082X

Isam Bahaa Aldallal

Department of Electrical and Computer Engineering, Altinbas University, Istanbul, Turkey

https://orcid.org/0009-0000-0652-5877

Abstract

This paper reports a comparative analysis of four supervised machine learning algorithms: RF, SVM (using radial and linear kernels), Logistic Regression, and Multi-Layer Perceptron, for breast cancer recurrence prediction on a carefully curated clinical dataset. The data set, first collected by Royston and Altman and subsequently released on Kaggle, has patient age, menopausal status, tumor size, histological grade, lymph node status, estrogen and progesterone receptor levels, hormone therapy for treatment, recurrence-free survival time, and a binary recurrence outcome. The data set was then divided after the elimination of identifiers and z-score normalization in an 80:20 ratio using stratified sampling. Models were compared based on accuracy, precision, recall, F1-score, and area under the ROC curve, with RF and Logistic Regression having the highest test-set accuracy of 0.703. Feature significance analysis Gini impurity in R F, linear model absolute coefficients, and permutation importance in neural networks all showed lymph node count, survival time, and hormone receptor levels to be significant predictors. Visualized confusion matrices, ROC curves, and correlation heatmaps enhanced interpretability. The results illustrate the potential of explainable machine learning to enhance individualized surveillance and treatment planning in breast cancer care.

Issue

Vol. 2025 (2025)

Section

Articles

This work is licensed under a Creative Commons Attribution 4.0 International License.

How to Cite

Machine Learning Approaches for Predicting Breast Cancer Recurrence: A Comparative Analysis (N. R. . Abbas, H. . Alkattan, & I. B. . Aldallal , Trans.). (2025). Mesopotamian Journal of Artificial Intelligence in Healthcare, 2025, 208-218. https://doi.org/10.58496/MJAIH/2025/020

Article Sidebar

Main Article Content

Abstract

Article Details

Issue

Section

How to Cite

Similar Articles