Yıl: 2020 Cilt: 0 Sayı: Ejosat Özel Sayı 2020 (ICCEES) Sayfa Aralığı: 268 - 272 Metin Dili: İngilizce DOI: 10.31590/ejosat.803504 İndeks Tarihi: 31-10-2022

Prediction of Diabetes Mellitus by using Gradient Boosting Classification

Öz:
Diabetes has become a pervasive and endemic health problem worldwide. It is a chronic disease and also life-threatening. It can cause health problems in many organs such as the heart, kidneys, eyes, nerves, and blood vessels. To reduce the fatality rate from diabetes, early prevention techniques are needed. Nowadays, machine learning techniques are used to predict or detect different life-threatening diseases like cancer, diabetes, heart diseases, thyroid, etc. In this study, a prediction model of diabetes mellitus was presented using the Pima Indian dataset. Three different machine learning techniques that Decision Tree (DT), Random Forest (RF) and, Gradient Boosting (GB) algorithm were used to predict diabetes mellitus and the performance analysis was performed. Confusion matrix, accuracy, F1 score, precision, recall, Cohen’s kappa were evaluated and also a ROC curve was plotted. Out of the three techniques, the best results have been achieved with GB.
Anahtar Kelime: Diabetes Gradient Boosting Machine Learning

Gradient Boosting Classification kullanarak Diabetes Mellitus Tahmini

Öz:
Diyabet, dünya çapında yaygın ve endemik bir sağlık sorunu haline gelmiştir. Bu hastalık, kronik ve ayrıca yaşamı tehdit eden bir hastalıktır. Kalp, böbrekler, gözler, sinirler ve kan damarları gibi birçok organda sağlık sorununa yol açabilir. Diyabet kaynaklı ölüm oranını azaltmak için erken önleme tekniklerine ihtiyaç duyulmaktadır. Günümüzde makine öğrenmesi teknikleri kanser, diyabet, kalp hastalıkları, tiroid vb. gibi hayatı tehdit eden farklı hastalıkları tahmin etmek veya tespit etmek için kullanılmaktadır. Bu çalışmada Pima Indian veri setini kullanarak bir şeker hastalığı tahmin modeli sunulmuştur. Çalışmada şeker hastalığını tahmin etmek için Karar Ağacı (KA), Rastgele Orman (RO) ve Gradyan Artırma (GA) algoritmaları olmak üzere üç farklı makine öğrenmesi tekniği uygulanmış ve performans analizi yapılmıştır. Karmaşıklık matrisi, doğruluk, F1 skoru, kesinlik, geri çağırma, Cohen'in kappa'sı değerlendirilmiş ve ayrıca ROC eğrisi çizdirilmiştir. Üç teknikten, GA ile en iyi sonuçlar elde edilmiştir.
Anahtar Kelime: Diyabet Gradyan Arttırma Makina Öğrenmesi

Belge Türü: Makale Makale Türü: Araştırma Makalesi Erişim Türü: Erişime Açık
  • [1] Kerner, W., & Brückel, J. (2014). Definition, classification and diagnosis of diabetes mellitus. Experimental and clinical endocrinology & diabetes, 122(07), 384-386.
  • [2] Mellitus, D. (2005). Diagnosis and classification of diabetes mellitus. Diabetes care, 28(S37), S5-S10.
  • [3] Priyadi, Akhmad, et al. (2019). An economic evaluation of diabetes mellitus management in South East Asia. Journal of Advanced Pharmacy Education & Research| Apr-Jun 9.2
  • [4] Chan, J. C., Malik, V., Jia, W., Kadowaki, T., Yajnik, C. S., Yoon, K. H., & Hu, F. B. (2009). Diabetes in Asia: epidemiology, risk factors, and pathophysiology. Jama, 301(20), 2129-2140.
  • [5] Latif, Z. A., Ashrafuzzaman, S. M., Amin, M. F., Gadekar, A. V., Sobhan, M. J., & Haider, T. (2017). A Cross-sectional Study to Evaluate Diabetes Management, Control and Complications in Patients with type 2 Diabetes in Bangladesh. BIRDEM Medical Journal, 7(1), 17-27.
  • [6] Wild, S., Roglic, G., Green, A., Sicree, R., & King, H. (2004). Global prevalence of diabetes: estimates for the year 2000 and projections for 2030. Diabetes care, 27(5), 1047-1053.
  • [7] kumar Dewangan, A., & Agrawal, P. (2015). Classification of diabetes mellitus using machine learning techniques. International Journal of Engineering and Applied Sciences, 2(5).
  • [8] Karthikeyani, V., & Begum, I. P. (2013). Comparison a performance of data mining algorithms (CPDMA) in prediction of diabetes disease. International journal on computer science and engineering, 5(3), 205.
  • [9] Parashar, A., Burse, K., & Rawat, K. (2014). A Comparative approach for Pima Indians diabetes diagnosis using lda-support vector machine and feed forward neural network. International Journal of Advanced Research in Computer Science and Software Engineering, 4(11), 378-383.
  • [10] Al Helal, M., Chowdhury, A. I., Islam, A., Ahmed, E., Mahmud, M. S., & Hossain, S. (2019, February). An optimization approach to improve classification performance in cancer and diabetes prediction. In 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE) (pp. 1-5). IEEE.
  • [11] Dataset, P. I. D. UCI Machine Learning Repository, diambil dari http://archive. ics. uci. edu/ml/datasets. Pima+ Indians+ Diabetes.
  • [12] Song, Y. Y., & Ying, L. U. (2015). Decision tree methods: applications for classification and prediction. Shanghai archives of psychiatry, 27(2), 130.
  • [13] Fawagreh, K., Gaber, M. M., & Elyan, E. (2014). Random forests: from early developments to recent advancements. Systems Science & Control Engineering: An Open Access Journal, 2(1), 602-609.
  • [14] Breiman, L. (June 1997). Arcing The Edge (PDF). Technical Report 486. Statistics Department, University of California, Berkeley.
  • [15] Sokolova, M., & Lapalme, G. (2009). A systematic analysis of performance measures for classification tasks. Information processing & management, 45(4), 427-437.
  • [16] https://towardsdatascience.com/cohens-kappa-9786ceceab58
APA Nusrat F, Uzbaş B, BAYKAN O (2020). Prediction of Diabetes Mellitus by using Gradient Boosting Classification. , 268 - 272. 10.31590/ejosat.803504
Chicago Nusrat Fatema,Uzbaş Betül,BAYKAN OMER KAAN Prediction of Diabetes Mellitus by using Gradient Boosting Classification. (2020): 268 - 272. 10.31590/ejosat.803504
MLA Nusrat Fatema,Uzbaş Betül,BAYKAN OMER KAAN Prediction of Diabetes Mellitus by using Gradient Boosting Classification. , 2020, ss.268 - 272. 10.31590/ejosat.803504
AMA Nusrat F,Uzbaş B,BAYKAN O Prediction of Diabetes Mellitus by using Gradient Boosting Classification. . 2020; 268 - 272. 10.31590/ejosat.803504
Vancouver Nusrat F,Uzbaş B,BAYKAN O Prediction of Diabetes Mellitus by using Gradient Boosting Classification. . 2020; 268 - 272. 10.31590/ejosat.803504
IEEE Nusrat F,Uzbaş B,BAYKAN O "Prediction of Diabetes Mellitus by using Gradient Boosting Classification." , ss.268 - 272, 2020. 10.31590/ejosat.803504
ISNAD Nusrat, Fatema vd. "Prediction of Diabetes Mellitus by using Gradient Boosting Classification". (2020), 268-272. https://doi.org/10.31590/ejosat.803504
APA Nusrat F, Uzbaş B, BAYKAN O (2020). Prediction of Diabetes Mellitus by using Gradient Boosting Classification. Avrupa Bilim ve Teknoloji Dergisi, 0(Ejosat Özel Sayı 2020 (ICCEES)), 268 - 272. 10.31590/ejosat.803504
Chicago Nusrat Fatema,Uzbaş Betül,BAYKAN OMER KAAN Prediction of Diabetes Mellitus by using Gradient Boosting Classification. Avrupa Bilim ve Teknoloji Dergisi 0, no.Ejosat Özel Sayı 2020 (ICCEES) (2020): 268 - 272. 10.31590/ejosat.803504
MLA Nusrat Fatema,Uzbaş Betül,BAYKAN OMER KAAN Prediction of Diabetes Mellitus by using Gradient Boosting Classification. Avrupa Bilim ve Teknoloji Dergisi, vol.0, no.Ejosat Özel Sayı 2020 (ICCEES), 2020, ss.268 - 272. 10.31590/ejosat.803504
AMA Nusrat F,Uzbaş B,BAYKAN O Prediction of Diabetes Mellitus by using Gradient Boosting Classification. Avrupa Bilim ve Teknoloji Dergisi. 2020; 0(Ejosat Özel Sayı 2020 (ICCEES)): 268 - 272. 10.31590/ejosat.803504
Vancouver Nusrat F,Uzbaş B,BAYKAN O Prediction of Diabetes Mellitus by using Gradient Boosting Classification. Avrupa Bilim ve Teknoloji Dergisi. 2020; 0(Ejosat Özel Sayı 2020 (ICCEES)): 268 - 272. 10.31590/ejosat.803504
IEEE Nusrat F,Uzbaş B,BAYKAN O "Prediction of Diabetes Mellitus by using Gradient Boosting Classification." Avrupa Bilim ve Teknoloji Dergisi, 0, ss.268 - 272, 2020. 10.31590/ejosat.803504
ISNAD Nusrat, Fatema vd. "Prediction of Diabetes Mellitus by using Gradient Boosting Classification". Avrupa Bilim ve Teknoloji Dergisi Ejosat Özel Sayı 2020 (ICCEES) (2020), 268-272. https://doi.org/10.31590/ejosat.803504