Early diagnosis of chronic kidney disease in children using machine learning algorithms
https://doi.org/10.25881/18110193_2024_3_72
Abstract
Background. Diagnosis of early-stage chronic kidney disease (CKD) is a global challenge, as late-stage disease is more commonly diagnosed. The development of modeling methods for making management decisions aimed at improving the efficiency of early diagnosis of CKD is an important scientific and practical task, which can be greatly supported by the use of machine learning algorithms (MLA).
Aim. To improve the accuracy of diagnosis of CKD using data from history, clinical-instrumental, genetic examination and machine learning algorithms (MLA).
Methods. Data were obtained from a single-center retrospective catamnestic cohort study (2011–2022) of children with CKD stage 1–4 aged 1 to 17 years. The main group included 128 children with CKD, and the comparison group included 30 children without any kidney disease. Two groups were comparable by sex and age. The data of anamnesis, clinical-instrumental and genetic examination were used to build a model for CKD diagnosis. The model was built using the MLA multivariate logistic regression (MLR). Three variables were used in the model: erythrocyte sedimentation rate in blood (β = 0,392; p<0,001).
Results. A diagnostics model was obtained allowing prediction of CKD on a test sample with accuracy of 90,3% [80,6; 96,8], sensitivity of 92,0% [81,5; 100,0], specificity of 83,3% [50,0; 100,0], ROC-AUC = 90,0% [77,2; 100,0]. The resulting model is of excellent quality (>90%) as the ROC-AUC is 0,90 on the test sample. The cut-off point value of the probability of CKD is 0,25.
Conclusions. We developed and tested the model that diagnoses early-stage CKD in children with high accuracy.
About the Authors
O. A. SedashkinaRussian Federation
PhD
Samara
A. V. Kolsanov
Russian Federation
DSc, Prof., Prof. of the RAS
Samara
References
1. Bodrin KA, Krasnoperova AA. Machine learning technologies in medicine. Theory and practice ofmodern science. 2018; 10(40): 52-56. (In Russ.)
2. Schena FP, Magistroni R, Narducci F, et al. Artificial intelligence in glomerular diseases. Pediatr Nephrol. 2022; 37(11): 2533-2545. doi: 10.1007/s00467-021-05419-8.
3. Sanmarchi F, Fanconi C, Golinelli D, et al. Predict, diagnose, and treat chronic kidney disease with machine learning: a systematic literature review. J Nephrol. 2023; 36(4): 1101-1117. doi: 10.1007/s40620-023-01573-4.
4. Chen F, Kantagowit P, Nopsopon T, et al. Prediction and diagnosis of chronic kidney disease development and progression using machine-learning: Protocol for a systematic review and meta-analysis of reporting standards and model performance. PLoS One. 2023; 18(2): e0278729. doi: 10.1371/journal.pone.0278729.
5. Scikit-learn. Decision tree, regressor. [cited 2023 Oct 10]. Available from: https://scikit-learn.org/stable/modules/generated/sklearn.tree.DecisionTreeRegressor.html#sklearn.tree.DecisionTreeRegressor.
6. Scikit-learn. Common errors in the interpretation of linear model coefficients. [cited 2023 Oct 10]. Available from: https://scikit-learn.org/stable/auto_examples/inspection/plot_linear_model_coefficient_interpretation.html#sphx-glr-auto-examples-inspection-plot-linear-model-coefficientinterpretation-py.
7. Scikit-learn. Robust scaling. [cited 2023 Oct 10]. Available from:https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.RobustScaler.html.
8. Scikit-learn. Lasso regression. [cited 2023 Oct 10]. Available at: scikit-learn.org/stable/modules/generated/sklearn.linear.model.Lasso.html.Scikit-learn.Cross-validation. Available from: https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.cross_validate.html.
9. Nosova GS, Abdullin AH. Machine learning based on nonparametric and nonlinear Random Forest (RF) algorithm. Innovation. The science. Education. 2021; 35: 33-39. (In Russ.)
10. Scikit-learn. Random forest, regressor. [cited 2023 Oct 10] Available from: https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.Random.Forest/Regressor.html.
11. GBD Chronic Kidney Disease Collaboration. Global, regional, and national burden of chronic kidney disease, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2020; 395(10225): 709-733. doi: 10.1016/S0140-6736(20)30045-3.
12. Rees L, Mak RH. Nutrition and growth in children with chronic kidney disease. Nat Rev Nephrol. 2011; 7(11): 615–23. doi: 10.1038/nrneph.2011.137.
13. Dima A, Opris D, Jurcut C, et al. Is there still a place for erythrocyte sedimentation rate and C-reactive protein in systemic lupus erythematosus? Lupus. 2016; 25(11): 1173–1179. doi: 10.1177/0961203316651742.
14. Steyerberg EW, Harrell FE, Borsboom GJ, Eijkemans MJ, Vergouwe Y, Habbema JD. Internal validation of predictive models: efficiency of some procedures for logistic regression analysis. J Clin Epidemiol. 2001; 54(8): 774-81.
15. Nogueira PCK, Konstantyner T, Carvalho MFC, et al. Development of a risk score for earlier diagnosis of chronic kidney disease in children. PLoS One. 2019; 14(4): e0215100. doi: 10.1371/journal.pone.0215100.
Review
For citations:
Sedashkina O.A., Kolsanov A.V. Early diagnosis of chronic kidney disease in children using machine learning algorithms. Medical Doctor and Information Technologies. 2024;(3):72-85. (In Russ.) https://doi.org/10.25881/18110193_2024_3_72