Yıl: 2023 Cilt: 30 Sayı: 2 Sayfa Aralığı: 167 - 174 Metin Dili: İngilizce DOI: 10.5455/annalsmedres.2022.09.276 İndeks Tarihi: 04-04-2023

A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus

Öz:
Aim: The goal of this study is to compare the performances of Logistic Regression (LR), Artificial Neural Networks (ANN) and Decision Tree models, which are machine learning classification methods, in the diagnosis of type 2 Diabetes Mellitus (DM) and to determine the most successful method. It is also the examination of risk factors affecting type 2 DM using these models. Materials and Methods: The study’s data was collected from patients who visited the Diabetes and Thyroid polyclinic at the Inonu University Faculty of Medicine Turgut Ozal Medical Center, Department of Internal Medicine. The k-Nearest Neighbor algorithm, which is one of the missing value assignment methods, was used to eliminate the prob- lems related to missing values. Sensitivity, accuracy, precision, specificity, AUC F1-score, and classification error were used as performance evaluation criteria. Evolutionary algo- rithm parameter optimization method was used to optimize the parameters of the ANN model. Missing value assignment, modeling and parameter optimization were done with Rapidminer Studio Free version 8.1. Results: Among the three methods applied in the diagnosis of type 2 DM, the ANN gave the best classification performance. The accuracy, sensitivity, selectivity, precision, F1-score, AUC and classification error values obtained from this method are respectively; 98.94%, 100%, 97.73%, 98.04%, 99.01%, 0.978 and 1.06. For the ANN method, the im- portance values of the gender, long-term drug use, family history, concomitant disease, cortisone use, stress factor, high blood pressure, smoking, high cholesterol, heart dis- ease, exercise status, carbohydrate use, alcohol consumption, vegetable use, meat use, age, weight, height, starting age, daily bread consumption, LDL, HDL, Total Cholesterol, Triglyceride, Fasting blood sugar the importance values of independent variables are re- spectively; 0.017, 0.009, 0.013, 0.017, 0.008, 0.016, 0.008, 0.006, 0.053, 0.024, 0.023, 0.040, 0.007, 0.020, 0.007, 0.046, 0.083, 0.049, 0.024, 0.066, 0.084, 0.083, 0.020, 0.031, 0.244. Conclusion: According to the performance criteria obtained from the three classifica- tion models used to predict type 2 DM; it has been found that the best classification performance belongs to the ANN model. According to the ANN method, the three most important risk factors that may cause type 2 DM were found to be fasting blood glucose, LDL, and HDL, respectively.
Anahtar Kelime: Artificial neural networks Logistic regression analysis Decision trees Type 2 diabetes mellitus Risk factors

Belge Türü: Makale Makale Türü: Araştırma Makalesi Erişim Türü: Erişime Açık
  • 1. Başer BÖ, Yangın M, Sarıdaş ES. Makine öğrenmesi teknikleriyle diyabet hastalığının sınıflandırılması. Süley- man Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi. 2021;25(1):112-20.
  • 2. Ozougwu J, Obimba K, Belonwu C, Unakalamba C. The patho- genesis and pathophysiology of type 1 and type 2 diabetes mel- litus. J Physiol Pathophysiol. 2013;4(4):46-57.
  • 3. Harding JL, Pavkov ME, Magliano DJ, Shaw JE, Gregg EW. Global trends in diabetes complications: a review of current evidence. Diabetologia. 2019;62(1):3-16.
  • 4. Cihan P, Coşkun H. Diyabet Tahmini için Makine Öğrenmesi Modellerinin Performans Karşılaştırılması Performance Compar- ison of Machine Learning Models for Diabetes Prediction.
  • 5. Neslihan İ, Aşır G. Lojistik Regresyon Analizi Yardımıyla Denekte Menopoz Evresine Geçişe İlişkin Bir Sınıflandırma Mod- elinin Elde Edilmesi. Selçuk Üniversitesi Fen Fakültesi Fen Der- gisi. 2005;1(25):19-28.
  • 6. Bircan H. Lojistik regresyon analizi: Tıp verileri üzerine bir uygulama. Kocaeli Üniversitesi Sosyal Bilimler Dergisi. 2004(8):185-208.
  • 7. Öztemel E. Yapay Sinir Ağları, Papatya Yayıncılık Eğitim Bil- gisayar Sis. San ve Tic AŞ, İstanbul. 2006.
  • 8. Haykin S. Neural Networks, a comprehensive founda- tion, Prentice-Hall Inc. Upper Saddle River, New Jersey. 1999;7458:161-75.
  • 9. Altaş D, Gülpınar V. Karar Ağaçları ve Yapay Sinir Ağlarının Sınıflandırma Performanslarının Karşılaştırılması. Trakya Üniversitesi Sosyal Bilimler Dergisi. 2012.
  • 10. Murthy SK. Automatic construction of decision trees from data: A multi-disciplinary survey. Data mining and knowledge discov- ery. 1998;2(4):345-89.
  • 11. Maimon OZ, Rokach L. Data mining with decision trees: theory and applications: World scientific; 2014.
  • 12. Hosmer D. Lemeshow S. Applied logistic regression. USA: John Wiley and Sons; 2000.
  • 13. Sanz J, Paternain D, Galar M, Fernandez J, Reyero D, Belzunegui T. A new survival status prediction system for severe trauma patients based on a multiple classifier system. Computer methods and programs in biomedicine. 2017;142:1-8.
  • 14. Nabiyev VV, Zeka Y. Problemler. Yöntemler, Algoritmalar, Seçkin Yayıncılık. 2005:83-6.
  • 15. Haykin S. Neural networks and learning machines, 3/E: Pearson Education India; 2009.
  • 16. Azar AT, El-Metwally SM. Decision tree classifiers for auto- mated medical diagnosis. Neural Computing and Applications. 2013;23(7):2387-403.
  • 17. Sathyadevan S, Nair RR. Comparative analysis of decision tree algorithms: ID3, C4. 5 and random forest. Computational intel- ligence in data mining-volume 1: Springer; 2015. p. 549-62.
  • 18. Tayefi M, Tajfard M, Saffar S, Hanachi P, Amirabadizadeh AR, Esmaeily H, et al. hs-CRP is strongly associated with coronary heart disease (CHD): A data mining approach using decision tree algorithm. Computer methods and programs in biomedicine. 2017;141:105-9.
  • 19. Rapidminer DA. RapidMiner 4.1 User Guide. Dortmund; 2008.
  • 20. Bilgin G. Makine öğrenmesi algoritmaları kullanarak erken dönemde diyabet hastalığı riskinin araştırılması. Journal of In- telligent Systems: Theory and Applications. 2021;4(1):55-64.
  • 21. Asif M. The prevention and control the type-2 diabetes by chang- ing lifestyle and dietary pattern. Journal of education and health promotion. 2014;3.
  • 22. Sapon MA, Ismail K, Zainudin S, editors. Prediction of diabetes by using artificial neural network. Proceedings of the 2011 Inter- national Conference on Circuits, System and Simulation, Singa- pore; 2011.
  • 23. Ahmed TM. Developing a predicted model for diabetes type 2 treatment plans by using data mining. Journal of Theoretical and Applied Information Technology. 2016;90(2):181.
  • 24. Kavakiotis I, Tsave O, Salifoglou A, Maglaveras N, Vlahavas I, Chouvarda I. Machine learning and data mining methods in diabetes research. Computational and structural biotechnology journal. 2017;15:104-16.
  • 25. Joshi TN, Chawan P. Diabetes prediction using machine learning techniques. Ijera. 2018;8(1):9-13.
  • 26. Al Helal M, Chowdhury AI, Islam A, Ahmed E, Mahmud MS, Hossain S, editors. An optimization approach to improve clas- sification performance in cancer and diabetes prediction. 2019 International Conference on Electrical, Computer and Commu- nication Engineering (ECCE); 2019: IEEE.
  • 27. Mujumdar A, Vaidehi V. Diabetes prediction using machine learning algorithms. Procedia Computer Science. 2019;165:292- 9.
  • 28. Thaiyalnayaki K. Classification of diabetes using deep learning and svm techniques. International Journal of Current Research and Review. 2021;13(01):146.
  • 29. Eroğlu N. Diyabetin Komplikasyonlarından Korunmak için Tanı, Tedavi ve İzlem. İzmir Katip Çelebi Üniversitesi Sağlık Bilimleri Fakültesi Dergisi.4(1):31-3.
  • 30. Erg ̇ın E, Akın S, Kazan S, Erdem M, Tekçe M, Aliustaoğlu M. Diyabetik Hastalarda Lipid Profili: Farkındalık ve Tedavideki Başarı Oranlarımız. Kartal Eğitim ve Araştırma Hastanesi Tıp Dergisi. 2013;24(3).
  • 31. Karslıoğlu DH. Obezite, Tip 2 Diyabet ve Beslenme. Klinik Tıp Bilimleri.7(3):36-43.
APA BALIKCI CICEK I, Yologlu S, Sahin i (2023). A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus. , 167 - 174. 10.5455/annalsmedres.2022.09.276
Chicago BALIKCI CICEK IPEK,Yologlu Saim,Sahin ibrahim A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus. (2023): 167 - 174. 10.5455/annalsmedres.2022.09.276
MLA BALIKCI CICEK IPEK,Yologlu Saim,Sahin ibrahim A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus. , 2023, ss.167 - 174. 10.5455/annalsmedres.2022.09.276
AMA BALIKCI CICEK I,Yologlu S,Sahin i A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus. . 2023; 167 - 174. 10.5455/annalsmedres.2022.09.276
Vancouver BALIKCI CICEK I,Yologlu S,Sahin i A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus. . 2023; 167 - 174. 10.5455/annalsmedres.2022.09.276
IEEE BALIKCI CICEK I,Yologlu S,Sahin i "A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus." , ss.167 - 174, 2023. 10.5455/annalsmedres.2022.09.276
ISNAD BALIKCI CICEK, IPEK vd. "A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus". (2023), 167-174. https://doi.org/10.5455/annalsmedres.2022.09.276
APA BALIKCI CICEK I, Yologlu S, Sahin i (2023). A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus. Annals of Medical Research, 30(2), 167 - 174. 10.5455/annalsmedres.2022.09.276
Chicago BALIKCI CICEK IPEK,Yologlu Saim,Sahin ibrahim A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus. Annals of Medical Research 30, no.2 (2023): 167 - 174. 10.5455/annalsmedres.2022.09.276
MLA BALIKCI CICEK IPEK,Yologlu Saim,Sahin ibrahim A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus. Annals of Medical Research, vol.30, no.2, 2023, ss.167 - 174. 10.5455/annalsmedres.2022.09.276
AMA BALIKCI CICEK I,Yologlu S,Sahin i A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus. Annals of Medical Research. 2023; 30(2): 167 - 174. 10.5455/annalsmedres.2022.09.276
Vancouver BALIKCI CICEK I,Yologlu S,Sahin i A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus. Annals of Medical Research. 2023; 30(2): 167 - 174. 10.5455/annalsmedres.2022.09.276
IEEE BALIKCI CICEK I,Yologlu S,Sahin i "A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus." Annals of Medical Research, 30, ss.167 - 174, 2023. 10.5455/annalsmedres.2022.09.276
ISNAD BALIKCI CICEK, IPEK vd. "A comparison of multivariate statistical methods to detect risk factors for type 2 diabetes mellitus". Annals of Medical Research 30/2 (2023), 167-174. https://doi.org/10.5455/annalsmedres.2022.09.276