29, November 2025

Empirical Comparison of Data Mining Techniques for Breast Cancer Diagnosis in WEKA Environment

Author(s): Athira VP

Authors Affiliations:

Assistant Professor, Department of Computer Science, Pavanatma College Murickassery, Idukki, India

DOIs:10.2015/IJIRMF/202511017     |     Paper ID: IJIRMF202511017


Abstract
Keywords
Cite this Article/Paper as
References

Abstract:  Accurate and early recognition of breast cancer significantly recovers survival rates and treatment outcomes. This research paper presents a comprehensive comparative study of machine learning algorithms implemented in the WEKA (Waikato Environment for Knowledge Analysis) platform for breast cancer prediction. The Breast Cancer Wisconsin (Diagnostic) dataset from the UCI Machine Learning Repository is used as the experimental dataset. Various classification algorithms — Logistic Regression, Naive Bayes, Decision Tree (J48), Random Forest, k-Nearest Neighbors (IBk), Support Vector Machine (SMO), and Multilayer Perceptron (MLP) — were evaluated based on accuracy, precision, recall, F1-score, and ROC area using 10-fold cross-validation. Results indicate that Random Forest and SMO achieved the highest prediction accuracy above 97%, demonstrating excellent generalization capability. The study concludes that WEKA provides an effective environment for estimating and comparing machine learning models in healthcare diagnostics.

   
Key Words:  Breast Cancer Prediction, WEKA, Machine Learning, Classification Algorithms, Model Comparison, Performance Evaluation, Diagnostic Data Mining.,

Athira VP  (2025); Empirical Comparison of Data Mining Techniques for Breast Cancer Diagnosis in WEKA Environment,  International Journal for Innovative Research in Multidisciplinary Field, ISSN(O): 2455-0620, Vol-11, Issue-11, Pp.100-104.          Available on –   https://www.ijirmf.com/


Download Full Paper

Download PDF No. of Downloads:7 | No. of Views: 85