PREDICTING THE RISK OF MYOCARDIAL INFARCTION USING DIFFERENT CLASSIFICATION ALGORITHMS
Keywords:
KNN, Decision trees, C5, Neural network, Bayesian network, Myocardial infarction, Data mining.Abstract
Abstract
Introduction: According to WHO, cardiovascular diseases (CVDs) are the leading cause of death globally. Although significant progress has been made in the diagnosis of CVDs, more investigation can be helpful. Considering the significant role of data mining techniques in discovering hidden trends in health data, the present study aimed to predict the risk of myocardial infarction (MI) using k-nearest neighbor (KNN), neural network, decision tree (C5), and Bayesian network algorithms.
Methods: The present study is an analytical study. Considering CRISP methodology, data-mining analysis was carried out using KNN, neural network, C5, and Bayesian network algorithms by IBM SPSS modeler (Clementine 14.2) software. The applied data were related to 350 patients admitted to the Shahid Rajaei specialized cardiovascular hospital. However, after cleaning the data, only 274 records were entered in final analysis.
Results: This study revealed that the highest percentages of correct classifications achieved were 67.42%, 64.04%, 60.67%, and 60.67% for the KNN, C5, neural network, and Bayesian network algorithms, respectively. The results also revealed that smoking was the leading risk factor for MI.
Conclusion: In terms of accuracy, KNN was the most effective algorithm in predicting MI, followed by C5, neural network, and Bayesian network.