A Machine Learning Model for Multi-Level Classification of Diabetic Peripheral Neuropathy Using Clinical, Lifestyle, and Familial Factors
Abstract
This study aims to develop a robust Machine Learning (ML) framework for multi-level classification of Diabetic Peripheral Neuropathy (DPN) severity by integrating clinical indicators, lifestyle factors, and familial history in patients with type 2 diabetes. The dataset, collected from the Diabetes Research and Treatment Center, Shahid Sadoughi University of Medical Sciences, Yazd, Iran, underwent comprehensive preprocessing including normalization via MinMaxScaler and class rebalancing using the Synthetic Minority Over-sampling Technique (SMOTE). Several ML algorithms — Random Forest (RF), eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Support Vector Machine (SVM), Multilayer Perceptron (MLP), and Voting Classifier — were implemented and systematically compared. Among these, the RF model achieved the best performance with an accuracy of 86.1%, demonstrating superior stability and interpretability, closely followed by XGBoost. Feature engineering and the incorporation of clinically meaningful composite indices significantly enhanced model performance by capturing complex relationships among physiological and lifestyle variables. Model evaluation based on accuracy, sensitivity, specificity, F1-score, and ROC-AUC confirmed both predictive reliability and clinical applicability. To further enhance interpretability, SHapley Additive exPlanations (SHAP) analysis was conducted using the XGBoost framework due to its higher compatibility with gradient-based explanation methods. The SHAP results confirmed the consistency of feature importance observed in RF, revealing that lower Mean Reflex values, reduced vibration sensitivity (Tuning Fork Test), and higher Body Mass Index (BMI) were strongly associated with severe neuropathy levels. These findings highlight that combining predictive modeling with explainable Artificial Intelligence (AI) approaches can provide transparent, clinically interpretable insights — paving the way for intelligent, explainable decision-support systems in diabetic care.
Keywords:
Diabetes, Diabetic peripheral neuropathy, Machine learning, Clinical indicators, Lifestyle factors, Family historyReferences
- [1] Liu, L., Bi, B., Gui, M., Zhang, L., Ju, F., Wang, X., & Cao, L. (2025). Development and internal validation of an interpretable risk prediction model for diabetic peripheral neuropathy in type 2 diabetes: A single-centre retrospective cohort study in China. British medical journal open, 15(4), e092463. https://doi.org/10.1136/bmjopen-2024-092463
- [2] Sun, M., Sun, X., Wang, F., & Liu, L. (2025). Machine learning-based prediction of diabetic peripheral neuropathy: Model development and clinical validation. Frontiers in endocrinology, 16, 1614657. https://doi.org/10.3389/fendo.2025.1614657
- [3] Lian, X., Qi, J., Yuan, M., Li, X., Wang, M., Li, G., ... & Zhong, J. (2023). Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning. BMC medical informatics and decision making, 23(1), 146. https://doi.org/10.1186/s12911-023-02232-1
- [4] Basebaa, A. S., Musiaan, N. S., & Mahross, A. H. (2024). Prevalence and risk factors of diabetic peripheral neuropathy: A cross-sectional study from Yemen. Ain shams medical journal, 75(1), 201-213. https://doi.org/10.21608/asmj.2024.237487.1171
- [5] Wu, R. L., Chen, N., Chen, Y., Wu, X., Ko, C. Y., & Chen, X. Y. (2024). Visceral adiposity as an independent risk factor for diabetic peripheral neuropathy in type 2 diabetes mellitus: A retrospective study. Journal of diabetes research, 2024(1), 9912907. https://doi.org/10.1155/2024/9912907
- [6] Wei, Z., Wang, X., Lu, L., Li, S., Long, W., Zhang, L., & Shen, S. (2024). Construction of an early risk prediction model for type 2 diabetic peripheral neuropathy based on random forest. CIN: Computers, informatics, nursing, 42(9), 665-674. https://doi.org/10.1097/cin.0000000000001157
- [7] Ma, Y., Wang, Z., Yao, Z., Lu, B., & He, Y. (2025). Machine learning in the prediction of diabetic peripheral neuropathy: A systematic review. BMC medical informatics and decision making, 25(1), 344. https://doi.org/10.1186/s12911-025-03201-6
- [8] Jian, Y., Pasquier, M., Sagahyroon, A., & Aloul, F. (2021). A machine learning approach to predicting diabetes complications. Healthcare, 9(12), 1712. https://doi.org/10.3390/healthcare9121712
- [9] Yu, X., Wu, Z., & Zhang, N. (2024). Machine learning-driven discovery of novel therapeutic targets in diabetic foot ulcers. Molecular medicine, 30(1), 215. https://doi.org/10.1186/s10020-024-00955-z
- [10] Haque, F., Bin Ibne Reaz, M., Chowdhury, M. E. H., Srivastava, G., Hamid Md Ali, S., Bakar, A. A. A., & Bhuiyan, M. A. S. (2021). Performance analysis of conventional machine learning algorithms for diabetic sensorimotor polyneuropathy severity classification. Diagnostics, 11(5), 801. https://doi.org/10.3390/diagnostics11050801
- [11] Sheikh, M. M., Balachandra, M., VG, N., & Maiya, A. G. (2025). Predicting diabetic peripheral neuropathy through advanced plantar pressure analysis: A machine learning approach. Scientific reports, 15(1), 20962. https://doi.org/10.1038/s41598-025-07774-0
- [12] Gao, L., Liu, Z., Han, S., & Wang, J. (2025). A machine-learning-based clinical decision model for predicting amputation risk in patients with diabetic foot ulcers: Diagnostic performance and practical implications. Diagnostics, 15(24), 3142. https://doi.org/10.3390/diagnostics15243142
- [13] Almutairi, E., Abbod, M., & Hunaiti, Z. (2025). Prediction of diabetes using statistical and machine learning modelling techniques. Algorithms, 18(3), 145. https://doi.org/10.3390/a18030145
- [14] Shin, D. Y., Lee, B., Yoo, W. S., Park, J. W., & Hyun, J. K. (2021). Prediction of diabetic sensorimotor polyneuropathy using machine learning techniques. Journal of clinical medicine, 10(19), 4576. https://doi.org/10.3390/jcm10194576

