- Research
- Open access
- Published:
Identification and optimization of relevant factors for chronic kidney disease in abdominal obesity patients by machine learning methods: insights from NHANES 2005–2018
Lipids in Health and Disease volume 23, Article number: 390 (2024)
Abstract
Background
The intake of dietary antioxidants and glycolipid metabolism are closely related to chronic kidney disease (CKD), particularly among individuals with abdominal obesity. Nevertheless, the cumulative effect of multiple comorbid risk factors on the progression and complications of CKD remains inadequately characterized.
Methods
This study analyzed data from the National Health and Nutrition Examination Survey (NHANES) dat abase (2005–2018), to examine potential factors related to CKD, including glycolipid metabolism, dietary antioxidant intake, and pertinent medical history. To explore the associations between these variables and CKD, the present study used a multivariable-adjusted least absolute shrinkage and selection operator (LASSO) regression model, along with a restricted cubic spline (RCS) model. Furthermore, an optimal predictive model was developed for CKD using ten machine learning algorithms and enhanced model interpretability with the Shapley Additive Explanations (SHAP) method.
Results
A cohort comprising 8,764 eligible individuals (52% male, including 1,839 CKD patients) with abdominal obesity aged 20–85 years were included. The findings revealed significant positive correlations in patients with abdominal obesity between the presence of CKD and age, a history of heart failure, hypertension, diabetes, elevated lipid accumulation product (LAP) and triglyceride glucose-waist circumference (TyG-WC) levels. Conversely, negative correlations were identified between CKD and variables such as sex, high-density lipoprotein cholesterol (HDL-C) levels, and the composite dietary antioxidant index (CDAI). In parallel, RCS regression analysis revealed significant nonlinear associations between the CDAI, HDL-C, TyG-WC, and CKD among patients with abdominal obesity aged 60–80 years. The development of predictive models demonstrated that the CatBoost model surpassed other models, achieving an accuracy of 86.74% on the validation set. The model’s area under the receiver operator curve (AUC) and F1 score were 0.938 and 0.889, respectively. The SHAP values revealed that age was the most significant predictor, followed by diabetes history, hypertension, HDL-C levels, CDAI index, TyG-WC, and LAP.
Conclusion
CatBoost models, along with glycolipid metabolism indexes and dietary antioxidant intake, are effective for early CKD detection in patients with abdominal obesity.
Introduction
Chronic kidney disease (CKD) has garnered significant attention as the sole non-communicable disease with a consistently rising age-adjusted mortality rate, increasing from 813,000 in 2000 to 1.3 million in 2019 [1]. The consequences of CKD include premature mortality, disability, diminished quality of life, and various psychosocial impacts, resulting in substantial financial burdens on governments, healthcare systems, and families [2]. As a result, efforts to identify and mitigate avoidable risk factors for the onset and progression of CKD have been ongoing.
A high body mass index (BMI) is a well-established and significant risk factor for kidney impairment in the general population [3, 4]. However, individuals with CKD constitute a unique group for studying the interplay between obesity and kidney function. Merely defining obesity based on BMI alone no longer accurately captures the connection between obesity and CKD [5]. Recent studies have highlighted that abdominal obesity in middle-aged and young adults is recognized as one of the most influential risk factors for CKD, with evidence suggesting that it may also lead to higher mortality rates among CKD patients [6, 7]. Additionally, anthropometric indices that assess obesity and metabolism have been developed to help predict CKD, such as the visceral adiposity index (VAI), lipid accumulation product (LAP), and triglyceride glucose (TyG) [8,9,10]. These indices provide a more precise assessment of an unhealthy metabolic phenotype. Whereas, existing research on the TyG index in relation to CKD has primarily focused on the general population rather than specifically on individuals with abdominal obesity.
However, obesity can directly predispose individuals to CKD through its association with obesity-related glomerulopathy, as well as indirectly through its well-established complications including hypertension, and type 2 diabetes. Additionally, measures of abdominal fat, like waist circumference, are often associated with higher morbidity and mortality in people on maintenance hemodialysis [3]. Thus, addressing obesity is vital for preventing and slowing the progression of CKD.
Studies have demonstrated that modifications in lifestyle habits, such as poor dietary choices and sedentary behaviors, have varying impacts on improvements in obesity, blood pressure, and renal function in individuals with CKD [11,12,13,14]. Previous studies have shown that an antioxidant-rich diet is beneficial for various health outcomes, including lung function, cardiovascular function, and mental health [15,16,17], and can offer protection against both CKD and obesity, with the composite dietary antioxidant index (CDAI) demonstrating a positive correlation with a reduced prevalence of CKD in the adult population in the United States [18]. In other words, reducing lifestyle-related risk factors can assist in the prevention of CKD and its complications.
Multiple contributory factors require the consideration of comprehensive models, which is why machine learning (ML) for predicting chronic diseases has received widespread attention in recent years [19, 20]. Hence, this study was to find the factors linked to CKD in individuals with abdominal obesity through the application of ML techniques, considering obesity indices, overall pro- and antioxidant exposure status, and other relevant factors. Such a comprehensive model is relatively uncommon in existing literature, highlighting the distinctiveness of this study. This study aimed to improve understanding of CKD risk factors in individuals with abdominal obesity by developing predictive ML models using cross-sectional data from the National Health and Nutrition Examination Survey (NHANES) database.
Methods
Study design and subjects
The study analyzed data from the NHANES database spanning 2005–2018 (https://www.cdc.gov/nchs/nhanes/). All methods adhered to relevant guidelines and regulations, ensuring no use of personally identifiable information. Publicly accessible interview questionnaires and test response rates are documented in sources [21, 22]. The exclusion criteria include: (1) subjects < 18 years of age, with missing waist circumference and BMI data; (2) incomplete renal function indexes, triglyceride-glucose index data, CDAI information, and medical history details (such as asthma, hypertension, diabetes, cancer, stroke, heart failure, etc.); and (3) incomplete information on covariates (race, marital status, education, etc.), and body weight data. A total of 8,764 patients with abdominal obesity were included. The study procedure is illustrated in Fig. 1. All participants provided written informed consent in accordance with the NHANES study protocol, which received approval from the Research Ethics Review Board of the National Center for Health Statistics (NCHS).
Definitions of CKD, Abdominal Obesity
The maximum urinary albumin-to-creatinine ratio (ACR) of 30 mg/g was classified as albuminuria. The estimated glomerular filtration rate (eGFR) was determined using the 2021 Chronic Kidney Disease Epidemiology Collaboration equation based on standardized serum creatinine levels across survey cycles [23]. A decreased eGFR was considered to be ≤ 60 mL/min/1.73 m2 [24]. CKD was defined as the presence of albuminuria or decreased eGFR. Abdominal obesity was defined by an absolute waist circumference (WC) of more than 102 cm in males and 88 cm in females, respectively [25].
Measurement of TyG, TyG-WC, VAI, LAP, CDAI
The TyG index is a measure of insulin resistance. Serum fasting plasma glucose (FPG), high-density lipoprotein cholesterol (HDL-C) levels, and triglyceride (TG) were assessed at baseline. TyG = ln [TG(mg/dl) × FPG(mg/dl)/2], TyG-WC = TyG × WC [26], VAI (male) = WC/(39.68 + 1.88 × BMI) × TG/1.03 × 1.31/HDL-C, VAI (female) = WC/ (36.58 + 1.89 × BMI) × TG/0.81 × 1.52/HDL-C [27], LAP (male) = (WC-65) × TG, LAP (female) = (WC-58) × TG [28].
To measure dietary antioxidant intake, the present study utilized a modified CDAI based on the method of Wright et al. [29]. In this approach, each individual’s antioxidant intake was normalized by determining how far it differed from the average intake of different antioxidants, including β-carotene, vitamin A, selenium, vitamin C, vitamin E, zinc, copper, and iron. These standardized values were then combined to create the CDAI score [30]. Importantly, antioxidants from supplements, medications, or water alone were not considered in this calculation. The formula is presented below:
Covariates
At the commencement of the study, standardized questionnaires were employed to gather demographic data (including age, sex, race/ethnicity, educational attainment, and marital status), lifestyle factors, and self-reported medical history (encompassing conditions such as stroke, heart failure, hypertension, diabetes, insulin usage, liver disorders, and cancer). The baseline laboratory test results included glycosylated hemoglobin (HbAlc), total cholesterol (TC), TG, HDL-C, serum creatinine (Scr), and urine creatinine levels. Further details on the variable assessment and categories can be found in Supplementary Table 1.
Statistical analysis
Continuous variables are presented as medians (Q1‒Q3) and were compared using either the independent samples t-test or the Kruskal–Wallis rank sum test. Categorical variables are represented as percentages, and intergroup differences were evaluated using the chi-square test or Fisher’s exact test. All values were adjusted using the sampling weights provided by the NHANES to ensure representativeness for the general U.S. population. A two-sided P-value of less than 0.05 was deemed statistically significant. To address potential issues related to high dimensionality impacting ML algorithms, a multivariable-adjusted least absolute shrinkage and selection operator (LASSO) regression model [31] was utilized to identify potential factors that were most strongly associated with CKD risk. Moreover, a subgroup analysis based on multiple logistic regression was used to explore the impact of other variables on different stages of CKD. GFR categories (G1–G5) and albuminuria categories (A1–A3) are used to classify CKD [32], as shown in Supplementary Table 2. A restricted cubic spline (RCS) model was conducted to find the dose–response relationship between CKD risk and other variables. The synthetic minority over-sampling technique (SMOTE) is an oversampling method that is widely used in medicine to handle class imbalances in datasets [33, 34]. In the present study, the SMOTE combined with the edited nearest neighbors (SMOTE-ENN) technique was utilized to increase the sensitivity of classifiers to minority groups.
Construction of the ML model
Ten ML algorithms were then applied to predict CKD in patients with abdominal obesity using the training set: AdaBoost, CatBoost, decision tree (DT), gradient boosting decision tree (GBDT), light gradient boosting machine (LightGBM), logistic regression (LR), naive bayes (NB), random forest (RF), support vector machine (SVM), and eXtreme gradient boosting (XG). These models were implemented using the DecisionLinnc1.0 software [35]. Furthermore, Supplementary Table 3 provides details on hyperparameter configurations.
Evaluation of ML model
Different evaluation metrics were used to evaluate the model, including accuracy, receiver operating characteristic (ROC) curves, AUC, sensitivity/recall, specificity, the Matthews correlation coefficient (MCC) and the F1 score. To evaluate the robustness of the predictive model’s performance, a sensitivity analysis utilizing k-fold cross-validation was performed.
SHAP method
To enhance the comprehensibility of our models, the SHAP method was used to assess the relative significance of predictor variables [36]. In the model, a predicted value is produced for each anticipated sample. The SHAP Summary plot graphically displays the SHAP value associated with each feature across all samples, with purple and yellow dots indicating higher or lower feature values separately.
Results
Baseline characteristics
In this study, 8,764 individuals with abdominal obesity aged 20–85 years were included, comprising 4,532 (52%) males and 4,232 (48%) females. The patients were divided into CKD patients (n = 1839) and non-CKD patients (n = 6925) as shown in Table 1. Table 1 shows a significant association between abdominal obesity in patients with CKD and older age in females compared with those of the non-CKD group, as evidenced by higher values of various anthropometric and metabolic markers such as WC, HbAlc, FPG, TG, Scr, TyG, TyG-WC, VAI, and LAP (P < 0.05). Additionally, a greater proportion of participants in the CKD group reported a history of smoking, stroke, heart failure, hypertension, liver conditions, diabetes, and insulin use (P < 0.05).
Association of candidate predictive variables with CKD in patients with abdominal obesity.
Supplementary Fig. 1A illustrates the link between cross-validation error variability and the log-transformed penalty parameter (λ) in LASSO regression for identifying key variables related to CKD in patients with abdominal obesity. Within the optimal LASSO regression model, a total of nine variables (age, sex, TyG-WC, HDL-C, LAP, CDAI, and a history of heart failure, hypertension, and diabetes) with non-zero coefficients were determined to be more significant in relation to CKD, as illustrated in Supplementary Fig. 1B. The study revealed significant positive correlations between age (Coeff se = 0.0435), history of heart failure (Coeff se = 0.8327), hypertension (Coeff se = 0.3326), diabetes (Coeff se = 0.2440), LAP (Coeff se = 0.0001), and TyG-WC (Coeff se = 0.0008) with CKD in patients with abdominal obesity. Conversely, a negative correlation was observed between sex (Coeff se = −0.3341), HDL-C level (Coeff se = −0.0157), and CDAI score (Coeff se = −0.0012) and CKD in the study participants.
Subgroup analysis of the influence of various variables on different stages of CKD
The subgroup analysis revealed that age significantly influences CKD risk levels. Specifically, individuals aged > 60 years are markedly more likely to develop CKD than those aged 40–60. For individuals over 60 years, the odds ratio (OR) for a moderately increased risk of CKD was 3.83 (95% CI: 3.16–4.64, P < 0.001), whereas in the 40–60 years age group, the OR was 1.55 (95% CI: 1.28–1.87, P < 0.001). Furthermore, the OR for a high risk of CKD was 5.59 (95% CI: 3.59–8.70, P < 0.001) versus 1.49 (95% CI: 0.93–2.36, P = 0.094), and for a very high risk of CKD, the OR was 13.30 (95% CI: 5.66–31.29, P < 0.001) compared with 1.57 (95% CI: 0.62–3.97, P = 0.341) in the younger cohort. The severity of abdominal obesity is closely associated with the risk level of CKD. Similarly, a history of diabetes, hypertension, and heart failure significantly elevates the risk of CKD. For comprehensive results, please consult Supplementary Table 2.
Association of the CDAI, LAP, TyG-WC and HDL-C with CKD in patients with abdominal obesity
RCS regression analysis was employed to investigate the relationship between CDAI, LAP, TyG-WC, and HDL-C and CKD in patients with abdominal obesity. A significant nonlinear association was observed between the CDAI score, HDL-C level, and CKD in obese abdominal patients of both genders (Fig. 2 A-D). However, sex differences had been found in the nonlinear relationships between LAP, TyG-WC and CKD, with females exhibiting a significant association with LAP and males with TyG-WC (Fig. 2 E–H). The present study investigated potential age-related variations in the impact of various factors on CKD in individuals with abdominal obesity. These findings indicate a nonlinear correlation between LAP and CKD in the 18–40 years age group (Fig. 2 I-L). Interestingly, no correlations were detected between each of the four factors and CKD in obese abdominal patients aged 40–60 years (Fig. 2 M-P). However, significant nonlinear correlations between the CDAI, HDL-C, and TyG-WC and CKD in patients with abdominal obesity were maintained in the 60–80 years age group (Fig. 2 Q-T).
The study examines the Restricted Cubic Spline (RCS) modeling of the relationship between composite dietary antioxidant index (CDAI), HDL-C (mmol/L), Lipid accumulation product (LAP), triglyceride glucose-waist circumference (TyG-WC) and the prevalence of CKD in abdominal obesity patients. A The RCS curve of the association between CDAI (male) and CKD with abdominal obesity. B The RCS curve of the association between CDAI (female) and CKD with abdominal obesity. C The RCS curve of the association between HDL-C (male) and CKD with abdominal obesity. D The RCS curve of the association between HDL-C (female) and CKD with abdominal obesity. E The RCS curve of the association between LAP (male) and CKD with abdominal obesity. F The RCS curve of the association between LAP (female) and CKD with abdominal obesity. G The RCS curve of the association between TyG-WC (male) and CKD with abdominal obesity. H The RCS curve of the association between TyG-WC (female) and CKD with abdominal obesity. I The RCS curve of the association between CDAI and CKD with abdominal obesity at age 18–40. J The RCS curve of the association between HDL-C and CKD with abdominal obesity at age 18–40. K The RCS curve of the association between LAP and CKD with abdominal obesity at age 18–40. L The RCS curve of the association between TyG-WC and CKD with abdominal obesity at age 18–40. M The RCS curve of the association between CDAI and CKD with abdominal obesity at age 40–60. N The RCS curve of the association between HDL-C and CKD with abdominal obesity at age 40–60. O The RCS curve of the association between LAP and CKD with abdominal obesity at age 40–60. P The RCS curve of the association between TyG-WC and CKD with abdominal obesity at age 40–60. Q The RCS curve of the association between CDAI and CKD with abdominal obesity at age 60–80. R The RCS curve of the association between HDL-C and CKD with abdominal obesity at age 60–80. S The RCS curve of the association between LAP and CKD with abdominal obesity at age 60–80. T The RCS curve of the association between TyG-WC and CKD with abdominal obesity at age 60–80
Evaluation of the model
Figure 3 illustrates the ROC curves for the ten models analyzed in this study. Specifically, AdaBoost (Fig. 3 A), CatBoost (Fig. 3 B), DT (Fig. 3 C), GBDT (Fig. 3 D), LightGBM (Fig. 3 E), LR (Fig. 3 F), NB (Fig. 3 G), RF (Fig. 3 H), SVM (Fig. 3 I) and XG (Fig. 3 J) models achieved AUC values of 0.838, 0.938, 0.863, 0.869, 0.783, 0.886, 0.823, 0.891, 0.882, and 0.923, respectively. The SMOTE-ENN technique balanced the classes in the training dataset to avoid model bias, as shown in Table 2 with a detailed summary of the performance metrics for each model. Notably, the CatBoost algorithm achieved superior prediction performance, with an accuracy of 86.74% and an F1 score of 0.889. Therefore, the CatBoost-based prediction model was ultimately selected for the subsequent analysis.
The ROC of the ten machine learning models. Machine learning model was utilized to fit the prediction between composite dietary antioxidant index (CDAI), HDL-C (mmol/L), Lipid accumulation product (LAP), triglyceride glucose-waist circumference (TyG-WC) and the prevalence of CKD in abdominal obesity patients. The plots show the ROC of (A) AdaBoost; (B) CatBoost; (C) Decision Tree; (D) Gradient Boosting Decision Tree; (E) LightGBM; (F) Logistic regression; (G) Naive Bayes; (H) Random Forest; (I) Support Vector machine; (J) XGBoost Machine. These curves depict the performance of the models on both the training set and the testing set
The outcomes of the tenfold cross-validation indicated that the CatBoost-based prediction model exhibited robust performance, achieving an average AUC of 0.938, a mean accuracy of 86.41%, and an F1 score of 0.885. The ROC curve corresponding to the cross-validation is presented in Supplementary Fig. 2.
Model decision of SHAP
The SHAP plot of the CatBoost model, as depicted in Fig. 4, illustrates the profiles of patients at varying levels of risk for developing CKD. This visualization offers insight into individualized care planning strategies informed by the predictive capabilities of the CatBoost model. In Fig. 4, the impact of each feature on the model output is quantified by calculating the average of the absolute SHAP values across all samples. Age emerged as the most influential predictor, followed by a history of diabetes, hypertension, sex, HDL-C levels, CDAI, TyG-WC, and LAP. The SHAP summary plot illustrates that the color gradient of the horizontal bars corresponds to the variable magnitude, ranging from purple (high) to yellow (low). The width of the horizontal bars indicates the impact on model predictions, with wider bars suggesting a greater effect.
Importance ranking of SHAP variables based on the CatBoost model. The SHAP summary plot illustrates that the color gradient of the horizontal bars corresponds to the variable magnitude, ranging from purple (high) to yellow (low). The width of the horizontal bars indicates the impact on model predictions, with wider bars suggesting a greater effect
Discussion
This study provides a comprehensive analysis of ML models related to various factors in abdominal obesity individuals combined CKD. ML, a widely used statistical method, utilizes clinical data to create predictive models and evaluate their accuracy. The current research utilizes ML in combination with the NHANES database to explore potential factors related to CKD in individuals with abdominal obesity and their predictive capabilities. In particular, this study examined LAP and TyG-WC as significant risk factors, as well as HDL-C and CDAI as protective factors associated with CKD.
The clinical diagnosis of CKD relies predominantly on the assessment of GFR or albuminuria, necessitating repeated measurements over a period exceeding three months [37]. Diagnosing CKD can be challenging, particularly in its initial stages, as certain renal functions may be compromised prior to the onset of symptoms [38]. Timely identification of CKD is imperative for predicting its prognosis. Diabetes, hypertension, and obesity continue to be significant contributors to the prevalence of CKD risk factors [2, 39].
The concept of LAP as a predictor of cardiovascular risk was introduced by Kahn et al. as a mathematical model incorporating WC and TG levels [40]. LAP has since been recognized as a valuable non-imaging indicator of visceral obesity and metabolic syndrome [41,42,43]. According to this study, LAP may also lend insight into the presence of abdominal obesity in patients with CKD.
The relationship between LAP and CKD in individuals with abdominal obesity may be attributed to the interaction between adipokines and CKD, suggesting a potential mechanism of action. Initially, dysregulation of adipokine levels has the potential to disrupt the glomerular filtration barrier, resulting in a decline in GFR [44]. Subsequently, adipokines may contribute to renal injury through the mediation of endothelial dysfunction, promotion of oxidative stress, development of atherosclerosis, and induction of inflammation [45,46,47].
The equilibrium between glucose metabolism and lipid metabolism plays a critical role in preserving optimal kidney function. In instances of pathology, disruptions in glucose-lipid metabolism can result in compromised cellular structure and inflammatory reactions within the kidneys [48,49,50]. The excessive buildup of visceral fat can hinder insulin clearance and signaling pathways within the pancreatic islets, thereby impacting renal energy provision and potentially causing kidney damage [51, 52]. Both HDL-C and TyG-WC are acknowledged as emerging and reliable markers for evaluating insulin resistance [53, 54]. Triglyceride glucose-waist circumference has garnered increased attention as a potential predictor of fatty liver disease and cardiovascular disease in recent research [55,56,57]. This study suggests that HDL-C and TyG-WC may serve as equally effective markers in individuals with CKD and abdominal obesity.
Insulin resistance, a common feature in disorders of fat metabolism, can also contribute to kidney injury by inducing hyperglycemia, metabolic acidosis, and the overproduction of endothelin-1 and aldosterone, ultimately leading to renal tubulointerstitial injury. A further consequence of insulin resistance is the excessive production of reactive oxidative stress, which results in fibrosis of the renal tissue [35]. Additionally, hyperinsulinemia may promote renal vasodilation, causing glomerular hyperfiltration and compromised endothelial function, thereby exacerbating proteinuria excretion [58,59,60].
The CDAI score is derived by normalizing and combining the intake levels of various antioxidants, including beta-carotene, selenium, vitamins A, C, and E, as well as zinc, copper, and iron [61]. In this study, a nonlinear relationship was found between the CDAI and CKD in patients with abdominal obesity, which is consistent with the findings of Wang et al. [18]. The underlying mechanism between the CDAI and CKD is unclear, but it may be that oxidative stress and inflammation pathways cause kidney injury, which further contributes to the development of CKD. The CDAI score is strongly correlated with the level of oxidative stress. Oxidative stress leads to the infiltration of neutrophils, an increase in protease secretion, and the generation of substantial quantities of oxidative intermediates, contributing to inflammatory processes. First, the overproduction of reactive oxygen species (ROS) impairs cellular structure and function, leading to tissue damage. CKD is accompanied by significant protein, carbohydrate, and lipid oxidation, and these processes lead to lipid peroxidation and the accumulation of advanced glycosylation end products, further exacerbating tissue damage. Oxidative stress can affect a variety of renal functions, including glomerular filtration rate, fluid electrolyte disturbances, elevated blood pressure, and renal sympathetic nerve activity. Second, the activation of the renin–angiotensin–aldosterone system (RAAS) is linked to oxidative stress. RAAS blockers may be renoprotective through mechanisms that attenuate the inflammatory response and resist renal fibrosis [62]. The CDAI functions as an extrinsic modulator of plasma redox balance, inhibiting the generation of ROS and reactive nitrogen species. This inhibition consequently mitigates oxidative stress and decelerates the onset and progression of CKD [63, 64].
Diets with a high dietary antioxidant index tend to be rich in these antioxidants and are therefore able to reduce the level of oxidative stress in the body, which in turn reduces the release of inflammatory factors. Pathogenesis of CKD is heavily influenced by inflammatory factors. When the kidneys are damaged, inflammatory cells infiltrate renal tissues and release inflammatory factors, leading to renal tubular atrophy, tubulointerstitial fibrosis and glomerulosclerosis. These pathological changes gradually destroy the normal structure and function of the kidneys and eventually lead to renal failure. In addition, inflammatory factors are involved in the pathological processes of renal arteriolar sclerosis and ischemia, glomerular hyperperfusion and increased intracapsular pressure, which further exacerbate renal injury [29, 65].
According to the findings of the RCS study and subgroup analysis, the predictive signature had a pronounced impact on the demographics of individuals aged 60–80 years. This predictive signature holds promise in assisting healthcare professionals in tailoring personalized care plans for individuals with chronic kidney disease and abdominal obesity, particularly among elderly patients, by considering dietary and endocrine factors.
The current research underscores the advantages of combining ML methodologies with SHAP techniques to analyze the diverse effects of glucose and lipid metabolism factors on CKD outcomes in individuals with abdominal obesity. Machine learning techniques offer a significant advantage in managing intricate, multidimensional data and identifying nonlinear connections among various variables. This capability is particularly advantageous in the context of CKD, where the progression of the disease is influenced by a multitude of factors, such as age, comorbidities, metabolic disorders, and environmental influences. Conventional risk prediction models frequently encounter difficulties in accommodating these complex interactions, resulting in suboptimal performance [66]. Moreover, the predictive model established in this research may offer effective management strategies for CKD in elderly patients with abdominal obesity.
Study strengths and limitations
This study represents a highly representative and innovative cross-sectional analysis encompassing the entire United States, focusing on the prevalence and severity of CKD in individuals with abdominal obesity. The research utilizes a sophisticated predictive model that incorporates various factors and applies ten different ML algorithms. The ultimate CatBoost model indicates that for males > 60 years of age with diabetes, hypertension, and a history of cardiovascular disease, it is advisable to promote a reduction in waist circumference, decrease fat intake, and enhance the intake of antioxidant-rich diets, alongside effective management of blood glucose levels. This strategy is advantageous for preventing CKD and decelerating its progression. Nevertheless, this study is subject to certain limitations. First, the predictive model utilized in this research was developed using cross-sectional data, implying that the model’s predictions do not establish causal relationships and necessitate further investigation through prospective studies or external validation. Second, all data utilized in this study were sourced from the NHANES database, which is representative of the U.S. population and may not be generalizable to populations in other countries.
Conclusion
This nationally representative study investigated the association of TyG-WC, LAP, HDL-C, and CADI with CKD risk in patients with abdominal obesity, particularly those over the age of 60 years. The study utilized CatBoost models to develop a predictive tool for identifying CKD risk in this population. These findings may facilitate early detection of CKD in patients with abdominal obesity and support informed decision-making by healthcare providers.
Data availability
No datasets were generated or analysed during the current study.
Abbreviations
- CKD:
-
Chronic kidney disease
- NHANES:
-
National Health and Nutrition Examination Survey
- TyG-WC:
-
Triglyceride glucose-waist circumference
- CDAI:
-
Composite dietary antioxidant index
- LAP:
-
Lipid accumulation product
- HDL-C:
-
High-density lipoprotein cholesterol
- BMI:
-
Body mass index
- VAI:
-
Visceral adiposity index
- GFR:
-
Glomerular filtration rate
- ACR:
-
Albumin-to-creatinine ratio
- FPG:
-
Fasting plasm glucose
- TG:
-
Triglycerides
- TC:
-
Total cholesterol
- LASSO:
-
Least absolute shrinkage and selection operator
- RCS:
-
Restricted cubic spline
- SHAP:
-
Shapley additive explanations
- WC:
-
Waist circumference
- ROC:
-
Receiver operating curve
- AUC:
-
Area under curve
- MCC:
-
Matthews correlation coefficient
- OR:
-
Odds ratio
- CI:
-
Confidential interval
- ML:
-
Machine learning
- SMOTE:
-
Synthetic minority over-sampling technique
- DT:
-
Decision tree
- GBDT:
-
Gradient boosting decision tree
- LightGBM:
-
Light gradient-boosting machine
- LR:
-
Logistic regression
- NB:
-
Naive bayes
- RF:
-
Random forest
- SVM:
-
Support vector machine
- XG:
-
EXtreme gradient boosting
References
Xie Y, Bowe B, Mokdad AH, Xian H, Yan Y, Li T, Maddukuri G, Tsai CY, Floyd T, Al-Aly Z. Analysis of the Global Burden of Disease study highlights the global, regional, and national trends of chronic kidney disease epidemiology from 1990 to 2016. Kidney Int. 2018;94:567–81.
Francis A, Harhay MN, Ong ACM, Tummalapalli SL, Ortiz A, Fogo AB, Fliser D, Roy-Chaudhury P, Fontana M, Nangaku M, et al. Chronic kidney disease and the global public health agenda: an international consensus. Nat Rev Nephrol. 2024;20(7):473–85.
Nawaz S, Chinnadurai R, Al-Chalabi S, Evans P, Kalra PA, Syed AA, Sinha S. Obesity and chronic kidney disease: a current review. Obes Sci Pract. 2023;9:61–74.
Pommer W. Preventive nephrology: the role of obesity in different stages of chronic kidney disease. Kidney Diseases. 2018;4:199–204.
Seong JM, Lee JH, Gi MY, Son YH, Moon AE, Park CE, Sung HH, Yoon H. Gender difference in the association of chronic kidney disease with visceral adiposity index and lipid accumulation product index in Korean adults: Korean National Health and Nutrition Examination Survey. Int Urol Nephrol. 2021;53:1417–25.
Kuma A, Uchino B, Ochiai Y, Kawashima M, Enta K, Tamura M, Otsuji Y, Kato A. Relationship between abdominal adiposity and incident chronic kidney disease in young- to middle-aged working men: a retrospective cohort study. Clin Exp Nephrol. 2019;23:76–84.
Sarathy H, Henriquez G, Abramowitz MK, Kramer H, Rosas SE, Johns T, Kumar J, Skversky A, Kaskel F, Melamed ML. Abdominal Obesity, Race and Chronic Kidney Disease in Young Adults: Results from NHANES 1999–2010. PLoS ONE. 2016;11:e0153588.
Mousapour P, Barzin M, Valizadeh M, Mahdavi M, Azizi F, Hosseinpanah F. Predictive performance of lipid accumulation product and visceral adiposity index for renal function decline in non-diabetic adults, an 8.6-year follow-up. Clin Exp Nephrol. 2020;24(3):225–34.
Xiao H, Xiong C, Shao X, Gao P, Chen H, Ning J, Chen Y, Zou Z, Hong G, Li X, et al. Visceral Adiposity Index and Chronic Kidney Disease in a Non-Diabetic Population: A Cross-Sectional Study. Diabetes Metab Syndr Obes. 2020;13:257–65.
Hou Q, Zhang H, Zhang R, Li B, Li L, Li D, Wang X, Liu Y, Wan Z, Zhang J, Shuai P. Relationship between the longitudinal trajectory of the triglyceride-glucose index and the development of CKD: an 8-year retrospective longitudinal cohort study. Front Endocrinol (Lausanne). 2024;15:1376166.
Evangelidis N, Craig J, Bauman A, Manera K, Saglimbene V, Tong A. Lifestyle behaviour change for preventing the progression of chronic kidney disease: a systematic review. BMJ Open. 2019;9:e031625.
Gutierrez OM. Contextual poverty, nutrition, and chronic kidney disease. Adv Chronic Kidney Dis. 2015;22:31–8.
Luyckx VA, Cherney DZI, Bello AK. Preventing CKD in Developed Countries. Kidney Int Rep. 2020;5:263–77.
Rebholz CM, Anderson CA, Grams ME, Bazzano LA, Crews DC, Chang AR, Coresh J, Appel LJ. Relationship of the American Heart Association’s Impact Goals (Life’s Simple 7) With Risk of Chronic Kidney Disease: Results From the Atherosclerosis Risk in Communities (ARIC) Cohort Study. J Am Heart Assoc. 2016;5:e003192.
Huang AA, Huang SY. Quantification of the Relationship of Pyridoxine and Spirometry Measurements in the United States Population. Current Developments in Nutrition. 2023;7:100078.
Huang A, Huang SY. Increasing potassium intake up to 2300mg is associated with decreased depressive symptoms in United States adults: Analysis of the National Health and Nutrition Examination Survey (NHANES) 2017–2020. 2022.
Lee C-H, Chan RS, Wan HY, Woo Y-C, Cheung CY, Fong CH, Cheung BM, Lam T-H, Janus E, Woo J. Dietary intake of anti-oxidant vitamins A, C, and E is inversely associated with adverse cardiovascular outcomes in chinese—A 22-years population-based prospective study. Nutrients. 2018;10:1664.
Wang M, Huang ZH, Zhu YH, He P, Fan QL. Association between the composite dietary antioxidant index and chronic kidney disease: evidence from NHANES 2011–2018. Food Funct. 2023;14:9279–86.
Huang AA, Huang SY. Computation of the distribution of model accuracy statistics in machine learning: comparison between analytically derived distributions and simulation-based methods. Health science reports. 2023;6:e1214.
Huang AA, Huang SY. Dendrogram of transparent feature importance machine learning statistics to classify associations for heart failure: A reanalysis of a retrospective cohort study of the Medical Information Mart for Intensive Care III (MIMIC-III) database. PLoS ONE. 2023;18:e0288819.
National Center for Health Statistics (NCHS).National Health and Nutrition Examination Survey Data. Hyattsville, MD: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention [https://wwwn.cdc.gov/nchs/nhanes/default.aspx].
Questionnaires, datasets, and related documentation, National Health and Nutrition Examination Survey [ https://wwwn.cdc.gov/nchs/nhanes/default.aspx].
Levey AS, Stevens LA, Schmid CH, Zhang YL, Castro AF 3rd, Feldman HI, Kusek JW, Eggers P, Van Lente F, Greene T, Coresh J. A new equation to estimate glomerular filtration rate. Ann Intern Med. 2009;150:604–12.
Inker LA, Astor BC, Fox CH, Isakova T, Lash JP, Peralta CA, Kurella Tamura M, Feldman HI. KDOQI US commentary on the 2012 KDIGO clinical practice guideline for the evaluation and management of CKD. Am J Kidney Dis. 2014;63:713–35.
Ross R, Neeland IJ, Yamashita S, Shai I, Seidell J, Magni P, Santos RD, Arsenault B, Cuevas A, Hu FB, et al. Waist circumference as a vital sign in clinical practice: a Consensus Statement from the IAS and ICCR Working Group on Visceral Obesity. Nat Rev Endocrinol. 2020;16:177–89.
Song K, Park G, Lee HS, Choi Y, Oh JS, Choi HS, Suh J, Kwon A, Kim HS, Chae HW. Prediction of Insulin Resistance by Modified Triglyceride Glucose Indices in Youth. Life (Basel). 2021;11(4):286.
Amato MC, Giordano C, Galia M, Criscimanna A, Vitabile S, Midiri M, Galluzzo A, AlkaMeSy Study G. Visceral Adiposity Index: a reliable indicator of visceral fat function associated with cardiometabolic risk. Diabetes Care. 2010;33:920–2.
Biyik Z, Guney I. Lipid accumulation product and visceral adiposity ındex: two new indices to predict metabolic syndrome in chronic kidney disease. Eur Rev Med Pharmacol Sci. 2019;23:2167–73.
Wright ME, Mayne ST, Stolzenberg-Solomon RZ, Li Z, Pietinen P, Taylor PR, Virtamo J, Albanes D. Development of a comprehensive dietary antioxidant index and application to lung cancer risk in a cohort of male smokers. Am J Epidemiol. 2004;160:68–76.
Liu C, Lai W, Zhao M, Zhang Y, Hu Y. Association between the Composite Dietary Antioxidant Index and Atherosclerotic Cardiovascular Disease in Postmenopausal Women: A Cross-Sectional Study of NHANES Data, 2013–2018. Antioxidants (Basel). 2023;12(9):1740.
Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Stat Methodol. 1996;58:267–88.
Levin A, Ahmed SB, Carrero JJ, Foster B, Francis A, Hall RK, Herrington WG, Hill G, Inker LA, Kazancioglu R, et al. Executive summary of the KDIGO 2024 Clinical Practice Guideline for the Evaluation and Management of Chronic Kidney Disease: known knowns and known unknowns. Kidney Int. 2024;105:684–701.
Li J, Fong S, Sung Y, Cho K, Wong R, Wong KKL. Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification. BioData Min. 2016;9:37.
El-Sofany H, Bouallegue B, El-Latif YMA. A proposed technique for predicting heart disease using machine learning algorithms and an explainable AI method. Sci Rep. 2024;14:23277.
Ren X, Jiang M, Han L, Zheng X. Association between triglyceride-glucose index and chronic kidney disease: A cohort study and meta-analysis. Nutr Metab Cardiovasc Dis. 2023;33:1121–8.
Lundberg SM, Lee SI. A unified approach to interpreting model predictions. In: NIPS, vol. 30. Curran Associates, Inc; 2017. p. 4765–74.
Kdoqi. KDOQI Clinical Practice Guidelines and Clinical Practice Recommendations for Diabetes and Chronic Kidney Disease. Am J Kidney Dis. 2007;49:S12-154.
Sherwood M, McCullough PA. Chronic kidney disease from screening, detection, and awareness, to prevention. Lancet Glob Health. 2016;4:e288-289.
Stanifer JW, Muiru A, Jafar TH, Patel UD. Chronic kidney disease in low- and middle-income countries. Nephrol Dial Transplant. 2016;31:868–74.
Kahn HS. The “lipid accumulation product” performs better than the body mass index for recognizing cardiovascular risk: a population-based comparison. BMC Cardiovasc Disord. 2005;5:26.
Tellechea ML, Aranguren F, Martinez-Larrad MT, Serrano-Rios M, Taverna MJ, Frechtel GD. Ability of lipid accumulation product to identify metabolic syndrome in healthy men from Buenos Aires. Diabetes Care. 2009;32:e85.
Taverna MJ, Martinez-Larrad MT, Frechtel GD, Serrano-Rios M. Lipid accumulation product: a powerful marker of metabolic syndrome in healthy population. Eur J Endocrinol. 2011;164:559–67.
Ray L, Ravichandran K, Nanda SK. Comparison of Lipid Accumulation Product Index with Body Mass Index and Waist Circumference as a Predictor of Metabolic Syndrome in Indian Population. Metab Syndr Relat Disord. 2018;16:240–5.
Martinez-Garcia C, Izquierdo-Lahuerta A, Vivas Y, Velasco I, Yeo TK, Chen S, Medina-Gomez G. Renal Lipotoxicity-Associated Inflammation and Insulin Resistance Affects Actin Cytoskeleton Organization in Podocytes. PLoS ONE. 2015;10:e0142291.
Zhu Y, Chen YL, Li C, Ding XY, Xu GY, Hu LL, Hou FF, Zhou QG. The effect of inhibition of endoplasmic reticulum stress on lipolysis in white adipose tissue in a rat model of chronic kidney disease. Acta Pharmacol Sin. 2014;35:356–62.
Kershaw EE, Flier JS. Adipose tissue as an endocrine organ. J Clin Endocrinol Metab. 2004;89:2548–56.
Fang T, Zhang Q, Wang Y, Zha H. Diagnostic value of visceral adiposity index in chronic kidney disease: a meta-analysis. Acta Diabetol. 2023;60:739–48.
Jao TM, Nangaku M, Wu CH, Sugahara M, Saito H, Maekawa H, Ishimoto Y, Aoe M, Inoue T, Tanaka T, et al. ATF6alpha downregulation of PPARalpha promotes lipotoxicity-induced tubulointerstitial fibrosis. Kidney Int. 2019;95:577–89.
Tufro A. Cholesterol accumulation in podocytes: a potential novel targetable pathway in diabetic nephropathy. Diabetes. 2013;62:3661–2.
Chen N, Mu L, Yang Z, Du C, Wu M, Song S, Yuan C, Shi Y. Carbohydrate response element-binding protein regulates lipid metabolism via mTOR complex1 in diabetic nephropathy. J Cell Physiol. 2021;236:625–40.
Lu H, Bogdanovic E, Yu Z, Cho C, Liu L, Ho K, Guo J, Yeung LSN, Lehmann R, Hundal HS, et al. Combined Hyperglycemia- and Hyperinsulinemia-Induced Insulin Resistance in Adipocytes Is Associated With Dual Signaling Defects Mediated by PKC-zeta. Endocrinology. 2018;159:1658–77.
Fu Z, Wu Q, Guo W, Gu J, Zheng X, Gong Y, Lu C, Ye J, Ye X, Jiang W, et al. Impaired Insulin Clearance as the Initial Regulator of Obesity-Associated Hyperinsulinemia: Novel Insight Into the Underlying Mechanism Based on Serum Bile Acid Profiles. Diabetes Care. 2022;45:425–35.
Li S, Feng L, Ding J, Zhou W, Yuan T, Mao J. Triglyceride glucose-waist circumference: the optimum index to screen nonalcoholic fatty liver disease in non-obese adults. BMC Gastroenterol. 2023;23:376.
Sanchez-Garcia A, Rodriguez-Gutierrez R, Mancillas-Adame L, Gonzalez-Nava V, Diaz Gonzalez-Colmenero A, Solis RC, Alvarez-Villalobos NA, Gonzalez-Gonzalez JG. Diagnostic Accuracy of the Triglyceride and Glucose Index for Insulin Resistance: A Systematic Review. Int J Endocrinol. 2020;2020:4678526.
Song S, Son DH, Baik SJ, Cho WJ, Lee YJ. Triglyceride Glucose-Waist Circumference (TyG-WC) Is a Reliable Marker to Predict Non-Alcoholic Fatty Liver Disease. Biomedicines. 2022;10(9):2251.
Kim HS, Cho YK, Kim EH, Lee MJ, Jung CH, Park JY, Kim HK, Lee WJ. Triglyceride Glucose-Waist Circumference Is Superior to the Homeostasis Model Assessment of Insulin Resistance in Identifying Nonalcoholic Fatty Liver Disease in Healthy Subjects. J Clin Med. 2021;11(1):41.
He X, Huang X, Qian Y, Sun T. A non-linear relationship between triglyceride glucose waist circumference and nonalcoholic fatty liver disease in a Japanese population: a secondary analysis. Front Endocrinol (Lausanne). 2023;14:1188214.
Groop PH, Forsblom C, Thomas MC. Mechanisms of disease: Pathway-selective insulin resistance and microvascular complications of diabetes. Nat Clin Pract Endocrinol Metab. 2005;1:100–10.
Tucker BJ, Anderson CM, Thies RS, Collins RC, Blantz RC. Glomerular hemodynamic alterations during acute hyperinsulinemia in normal and diabetic rats. Kidney Int. 1992;42:1160–8.
Esteghamati A, Ashraf H, Nakhjavani M, Najafian B, Hamidi S, Abbasi M. Insulin resistance is an independent correlate of increased urine albumin excretion: a cross-sectional study in Iranian Type 2 diabetic patients. Diabet Med. 2009;26:177–81.
Liu N, Liu C, Qu Z, Tan J. Association between the triglyceride-glucose index and chronic kidney disease in adults. Int Urol Nephrol. 2023;55:1279–89.
Roumeliotis S, Roumeliotis A, Dounousi E, Eleftheriadis T, Liakopoulos V. Dietary antioxidant supplements and uric acid in chronic kidney disease: a review. Nutrients. 2019;11(8):1911.
Demmig-Adams B, Adams WW 3rd. Antioxidants in photosynthesis and human nutrition. Science. 2002;298:2149–53.
Liakopoulos V, Roumeliotis S, Bozikas A, Eleftheriadis T, Dounousi E. Antioxidant Supplementation in Renal Replacement Therapy Patients: Is There Evidence? Oxid Med Cell Longev. 2019;2019:9109473.
Wang W, Wang X, Cao S, Duan Y, Xu C, Gan D, He W. Dietary Antioxidant Indices in Relation to All-Cause and Cause-Specific Mortality Among Adults With Diabetes: A Prospective Cohort Study. Front Nutr. 2022;9:849727.
Khalid F, Alsadoun L, Khilji F, Mushtaq M, Eze-Odurukwe A, Mushtaq MM, Ali H, Farman RO, Ali SM, Fatima R, Bokhari SFH. Predicting the Progression of Chronic Kidney Disease: A Systematic Review of Artificial Intelligence and Machine Learning Approaches. Cureus. 2024;16:e60145
Acknowledgements
We are grateful to the National Health and Nutrition Examination Survey for the data provided and to all participants for their selfless dedication.
Funding
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Author information
Authors and Affiliations
Contributions
XL.D, LF.M, P.L and MY.H wrote the main manuscript text. RY.J, YD.T and HL.C prepared figures. HY.G, WQ.Z, and K.L prepared Tables. XY.C prepared Supplementary Materials. WC.L and HX.Z designed the manuscript. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
All contributing authors were consent for publication.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
12944_2024_2384_MOESM1_ESM.docx
Supplementary Material 1: Table 1. The resource and definition of Baseline characteristics. Table 2. Subgroup analysis of CKD at different risk levels based on various influencing factors. Table 3. Details of hyperparameter for ML models. Figure 1. Lasso regression of CKD with abdominal obesity related variables. Figure 2. The ROC curve corresponding to the cross-validation.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Deng, X., Ma, L., Li, P. et al. Identification and optimization of relevant factors for chronic kidney disease in abdominal obesity patients by machine learning methods: insights from NHANES 2005–2018. Lipids Health Dis 23, 390 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12944-024-02384-7
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12944-024-02384-7