ENSEMBLE MACHINE LEARNING ALGORITHM FOR DIABETES PREDICTION IN MAIDUGURI, BORNO STATE

  • Emmanuel Gbenga Dada Department of Mathematics and Computer Science, Faculty of Science, Borno State University, Maiduguri.
  • Aishatu Ibrahim Birma Department of Mathematics and Computer Science, Faculty of Science, Borno State University, Maiduguri.
  • Abdulkarim Abbas Gora Department of Computer Science, Faculty of Science, Borno State University, Maiduguri.

Abstract

Diabetes mellitus (DM) is a metabolic disease characterised by high levels of glucose in the blood, known as hyperglycemia, that can result in multiple problems within the body. The World Health Organisation (WHO) data for 2021 reveals a substantial increase in the prevalence of diabetes mellitus (DM), with the number of cases rising from 108 million in 1980 to 422 million in 2014. Between 2000 and 2019, there was a 3% increase in mortality rates associated with diabetes, categorised by age. In 2019, DM caused the deaths of more than 2 million people. These concerning figures clearly necessitate an immediate response. An alarming incidence of diabetes among the population of Maiduguri and Borno State inspired this investigation. This research proposed stacking ensemble learning approach to predict the rate of occurrence of diabetes cases in Maiduguri. The paper used different types of regression models to predict the occurrences of diabetes cases in Maiduguri over time. These models included adaptive boosting regression (Adaboost), gradient boosting regression (GBOOST), random forest regression (RFR), ordinary least square regression (OLS), least absolute shrinkage selection operator regression (LASSO), and ridge regression (RIDGE). The performance indicators studied in this work are root mean square (RMSE), mean absolute error (MAE), and mean square error (MSE). These metrics were used to assess the effectiveness of both the machine learning and proposed Stacking Ensemble Learning (SEL) approaches. Performance metrics considered in this study are root mean square (RMSE), mean absolute error (MAE), and mean square error (MSE), which were used to evaluate the performance of the machine learning and the proposed Stacking Ensemble Learning (SEL) technique. Experimental results revealed that SEL is a better predictor compared to other machine learning approaches considered in this work with an RMSE of 0.0493; a MSE of 0.0024; and a MAE of 0.0349. It is hoped that this research will help government officials understand the threat of diabetes and take the necessary mitigation actions.

Published
2024-07-01