학술논문

A Logistic Regression Model using Pearson Correlation for Breast Cancer Classification
Document Type
Conference
Source
2023 6th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI) Research of Information Technology and Intelligent Systems (ISRITI), 2023 6th International Seminar on. :284-288 Dec, 2023
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Training
Logistic regression
Correlation
Machine learning algorithms
Training data
Prediction algorithms
Breast cancer
Breast Cancer
Classification
Logistic Regression
Pearson correlation
Language
ISSN
2832-1456
Abstract
We conducted research to diagnose breast cancer by classifying malignant and benign tumors. Several machine learning algorithms are used for classification, including Logistic Regression, Decision Trees, and ANN. The aim of this research is to determine the comparison of the three algorithms that have the highest level of accuracy, as well as the parameters and methods for optimizing the training data process in breast cancer classification. This research uses a dataset from Breast Cancer Wisconsin with the Pearson correlation method as feature selection to optimize the dataset for preprocessing data with drop features that have a correlation of more than 0.95. The research results show that the Logistic Regression algorithm is the algorithm that achieves the highest level of accuracy of 97.368421%.