Yıl: 2023 Cilt: 30 Sayı: 9 Sayfa Aralığı: 1112 - 1118 Metin Dili: İngilizce DOI: 10.5455/annalsmedres.2023.08.204 İndeks Tarihi: 01-11-2023

Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights

Öz:
Aim: As people’s quality of life and habits have changed, Coronary Artery Disease (CAD) has become the leading cause of death globally. It is a complicated cardiac disease with various risk factors and a wide range of symptoms. An early and accurate diagnosis of CAD allows for the quick administration of appropriate treatment, which contributes to a decreased mortality rate. Machine learning (ML) algorithms for CAD prediction and treatment decisions are quickly being developed and implemented in clinical practice. Predictive models based on machine learning algorithms may aid health personnel in the early diagnosis of CAD, lowering mortality. Thus, this study goal is to forecast the elements that may be connected with CAD using tree-based approaches, which are one of the machine learning methods, and to discover which factor is more effective on CAD. Materials and Methods: The open-access heart disease dataset was used within the scope of the study to investigate the risk factors related with CAD. The data set used contains the values of 333 patients, as well as 20 input and 1 target variables. The 10-fold cross validation approach was employed in the modeling, and the data set was divided as 80%: 20% as training and test datasets. For model assessment, the measures of accuracy (ACC), balanced accuracy (b-ACC), sensitivity (SE), specificity (SP), positive predictive value (ppv), negative predictive value (npv), and F1-score were utilized. Results: The values of ACC, b-ACC, SE, SP, ppv, npv, and F1-score performance metrics were 9 98.5%, 98.8%, 97.7%, 100%, 100%, 95.8% and 98.8%, respectively, as a consequence of the estimate model results created with the XGBoost approach, which has the best performance among tree-based models. When the groups with or without CAD were compared, a statistically significant difference was found in terms of the age variable. There is also a significant relationship between the active, lifestyle, ihd, dm, ecgpatt, qwave variables and the presence/absence of the CAD variable. When the variable significance values obtained as a result of modeling with the highest performing XGBoost are examined, it is seen that the variables that most associated with CAD are ekgpatt: normal, ekgpatt: ST-depression, ekgpatt: T-inversion, qwave: yes, age, bpdias, height, LDL, HR, IVSD: with LVH, bpsyDM. Conclusion: According to the performance criteria of the forecasting models used, CAD gave distinctively successful results in forecasting. By identifying risk factors associated with CAD, the proposed machine learning models can provide clinicians with practical, cost-effective and beneficial assistance in making accurate predictive decisions.
Anahtar Kelime: Coronary artery disease Classification Tree-based machine learning Risk factor

Belge Türü: Makale Makale Türü: Araştırma Makalesi Erişim Türü: Erişime Açık
  • 1. Heron M. Deaths: leading causes for 2008. National Vital Statistics Reports: From the Centers for Disease Control and Prevention, National Center for Health Statistics, National Vital Statistics System. 2012;60(6):1-94.
  • 2. Miao KH, Miao JH, Miao GJ. Diagnosing coronary heart disease using ensemble machine learning. International Journal of Advanced Computer Science and Applications. 2016;7(10).
  • 3. Malakar AK, Choudhury D, Halder B, Paul P, Uddin A, Chakraborty S. A review on coronary artery disease, its risk factors, and therapeutics. Journal of cellular physiology. 2019;234(10):16812-23.
  • 4. Dipto IC, Islam T, Rahman HM, Rahman MA. Comparison of Different Machine Learning Algorithms for the Prediction of Coronary Artery Disease. Journal of Data Analysis and Information Processing. 2020;8(2):41-68.
  • 5. Virani SS, Alonso A, Benjamin EJ, Bittencourt MS, Callaway CW, Carson AP, et al. Heart disease and stroke statistics—2020 update: a report from the American Heart Association. Circulation. 2020;141(9):e139-e596.
  • 6. Alizadehsani R, Roshanzamir M, Abdar M, Beykikhoshk A, Khosravi A, Panahiazar M, et al. A database for using machine learning and data mining techniques for coronary artery disease diagnosis. Scientific data. 2019;6(1):227.
  • 7. Hajar R. Risk factors for coronary artery disease: historical perspectives. Heart views: the official journal of the Gulf Heart Association. 2017;18(3):109.
  • 8. Alizadehsani R, Zangooei MH, Hosseini MJ, Habibi J, Khosravi A, Roshanzamir M, et al. Coronary artery disease detection using computational intelligence methods. Knowledge-Based Systems. 2016;109:187-97.
  • 9. Mahesh T, Dhilip Kumar V, Vinoth Kumar V, Asghar J, Geman O, Arulkumaran G, et al. AdaBoost ensemble methods using Kfold cross validation for survivability with the early detection of heart disease. Computational Intelligence and Neuroscience. 2022;2022.
  • 10. Muhammad LJ, Algehyne EA, Usman SS. Predictive supervised machine learning models for diabetes mellitus. SN Computer Science. 2020;1(5):240.
  • 11. Muhammad L, Algehyne EA, Usman SS, Ahmad A, Chakraborty C, Mohammed IA. Supervised machine learning models for prediction of COVID-19 infection using epidemiology dataset. SN computer science. 2021;2:1-13.
  • 12. [cited 2023 2022]. Available from: https://doi.org/10.34740/KAGGLE/DSV/3989065.
  • 13. Lawrence R, Bunn A, Powell S, Zambon M. Classification of remotely sensed imagery using stochastic gradient boosting as a refinement of classification tree analysis. Remote sensing of environment. 2004;90(3):331-6.
  • 14. Moisen GG, Freeman EA, Blackard JA, Frescino TS, Zimmermann NE, Edwards Jr TC. Predicting tree species presence and basal area in Utah: a comparison of stochastic gradient boosting, generalized additive models, and tree-based methods. Ecological modelling. 2006;199(2):176-87.
  • 15. Wang J, Li P, Ran R, Che Y, Zhou Y. A short-term photovoltaic power prediction model based on the gradient boost decision tree. Applied Sciences. 2018;8(5):689.
  • 16. Salam Patrous Z. Evaluating XGBoost for user classification by using behavioral features extracted from smartphone sensors. 2018.
  • 17. Timofeev R. Classification and regression trees (CART) theory and applications. Humboldt University, Berlin. 2004;54.
  • 18. Choubin B, Abdolshahnejad M, Moradi E, Querol X, Mosavi A, Shamshirband S, et al. Spatial hazard assessment of the PM10 using machine learning models in Barcelona, Spain. Science of The Total Environment. 2020;701:134474.
  • 19. Sonkar P. Application of supervised machine learning to predict the mortality risk in elderly using biomarkers. 2017.
  • 20. Taşçı ME, Şamlı R. Veri madenciliği ile kalp hastalığı teşhisi. Avrupa Bilim ve Teknoloji Dergisi. 2020:88-95.
  • 21. Finegold JA, Asaria P, Francis DP. Mortality from ischaemic heart disease by country, region, and age: statistics from World Health Organisation and United Nations. International journal of cardiology. 2013;168(2):934-45.
  • 22. Cassar A, Holmes Jr DR, Rihal CS, Gersh BJ, editors. Chronic coronary artery disease: diagnosis and management. Mayo Clinic Proceedings; 2009: Elsevier.
  • 23. Mirbabaie M, Stieglitz S, Frick NR. Artificial intelligence in disease diagnostics: A critical review and classification on the current state of research guiding future direction. Health and Technology. 2021;11(4):693-731.
  • 24. Hamet P, Tremblay J. Artificial intelligence in medicine. Metabolism. 2017;69:S36-S40.
  • 25. Kononenko I. Machine learning for medical diagnosis: history, state of the art and perspective. Artificial Intelligence in medicine. 2001;23(1):89-109.
  • 26. Kumar Y, Koul A, Sisodia PS, Shafi J, Kavita V, Gheisari M, et al. Heart failure detection using quantum-enhanced machine learning and traditional machine learning techniques for internet of artificially intelligent medical things. Wireless Communications and Mobile Computing. 2021;2021:1-16.
  • 27. Kurt I, Ture M, Kurum AT. Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary artery disease. Expert systems with applications. 2008;34(1):366-74.
  • 28. Alizadehsani R, Habibi J, Hosseini MJ, Mashayekhi H, Boghrati R, Ghandeharioun A, et al. A data mining approach for diagnosis of coronary artery disease. Computer methods and programs in biomedicine. 2013;111(1):52-61.
  • 29. Akila S, Chandramathi S. A hybrid method for coronary heart disease risk prediction using decision tree and multi layer perceptron. Indian Journal of Science and Technology. 2015;8(34):1-7.
  • 30. Nassif AB, Mahdi O, Nasir Q, Talib MA, Azzeh M, editors. Machine learning classifications of coronary artery disease. 2018 International Joint Symposium on artificial intelligence and natural language processing (iSAI-NLP); 2018: IEEE.
APA Dogan Z, BALIKCI CICEK I (2023). Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights. , 1112 - 1118. 10.5455/annalsmedres.2023.08.204
Chicago Dogan Zekeriya,BALIKCI CICEK IPEK Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights. (2023): 1112 - 1118. 10.5455/annalsmedres.2023.08.204
MLA Dogan Zekeriya,BALIKCI CICEK IPEK Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights. , 2023, ss.1112 - 1118. 10.5455/annalsmedres.2023.08.204
AMA Dogan Z,BALIKCI CICEK I Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights. . 2023; 1112 - 1118. 10.5455/annalsmedres.2023.08.204
Vancouver Dogan Z,BALIKCI CICEK I Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights. . 2023; 1112 - 1118. 10.5455/annalsmedres.2023.08.204
IEEE Dogan Z,BALIKCI CICEK I "Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights." , ss.1112 - 1118, 2023. 10.5455/annalsmedres.2023.08.204
ISNAD Dogan, Zekeriya - BALIKCI CICEK, IPEK. "Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights". (2023), 1112-1118. https://doi.org/10.5455/annalsmedres.2023.08.204
APA Dogan Z, BALIKCI CICEK I (2023). Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights. Annals of Medical Research, 30(9), 1112 - 1118. 10.5455/annalsmedres.2023.08.204
Chicago Dogan Zekeriya,BALIKCI CICEK IPEK Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights. Annals of Medical Research 30, no.9 (2023): 1112 - 1118. 10.5455/annalsmedres.2023.08.204
MLA Dogan Zekeriya,BALIKCI CICEK IPEK Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights. Annals of Medical Research, vol.30, no.9, 2023, ss.1112 - 1118. 10.5455/annalsmedres.2023.08.204
AMA Dogan Z,BALIKCI CICEK I Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights. Annals of Medical Research. 2023; 30(9): 1112 - 1118. 10.5455/annalsmedres.2023.08.204
Vancouver Dogan Z,BALIKCI CICEK I Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights. Annals of Medical Research. 2023; 30(9): 1112 - 1118. 10.5455/annalsmedres.2023.08.204
IEEE Dogan Z,BALIKCI CICEK I "Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights." Annals of Medical Research, 30, ss.1112 - 1118, 2023. 10.5455/annalsmedres.2023.08.204
ISNAD Dogan, Zekeriya - BALIKCI CICEK, IPEK. "Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights". Annals of Medical Research 30/9 (2023), 1112-1118. https://doi.org/10.5455/annalsmedres.2023.08.204