Empirical Comparison of Data Mining Techniques for Breast Cancer Diagnosis in WEKA Environment
Author(s): Athira VP
Authors Affiliations:
Assistant Professor, Department of Computer Science, Pavanatma College Murickassery, Idukki, India
DOIs:10.2015/IJIRMF/202511017     |     Paper ID: IJIRMF202511017Abstract: Accurate and early recognition of breast cancer significantly recovers survival rates and treatment outcomes. This research paper presents a comprehensive comparative study of machine learning algorithms implemented in the WEKA (Waikato Environment for Knowledge Analysis) platform for breast cancer prediction. The Breast Cancer Wisconsin (Diagnostic) dataset from the UCI Machine Learning Repository is used as the experimental dataset. Various classification algorithms — Logistic Regression, Naive Bayes, Decision Tree (J48), Random Forest, k-Nearest Neighbors (IBk), Support Vector Machine (SMO), and Multilayer Perceptron (MLP) — were evaluated based on accuracy, precision, recall, F1-score, and ROC area using 10-fold cross-validation. Results indicate that Random Forest and SMO achieved the highest prediction accuracy above 97%, demonstrating excellent generalization capability. The study concludes that WEKA provides an effective environment for estimating and comparing machine learning models in healthcare diagnostics.
Athira VP (2025); Empirical Comparison of Data Mining Techniques for Breast Cancer Diagnosis in WEKA Environment, International Journal for Innovative Research in Multidisciplinary Field, ISSN(O): 2455-0620, Vol-11, Issue-11, Pp.100-104. Available on – https://www.ijirmf.com/

