Development and validation of an insulin resistance prediction model in children and adolescents using machine learning algorithms

Xiu Huang; Kun Yi; Lin Jia; Yinmei Li; Hui He; Can Ma; Xiao Fang

doi:10.21037/tp-2024-502

Original Article

Development and validation of an insulin resistance prediction model in children and adolescents using machine learning algorithms

Xiu Huang¹, Kun Yi¹, Lin Jia¹, Yinmei Li¹, Hui He¹, Can Ma², Xiao Fang³

¹Department of Paediatrics, Nanchong City Jialing District People’s Hospital (Jialing Branch of Nanchong Central Hospital), Nanchong, China; ²School of Public Health, Suzhou Medical College of Soochow University, Suzhou, China; ³National Science Library (Chengdu), Chinese Academy of Sciences, Chengdu, China

Contributions: (I) Conception and design: X Huang, X Fang; (II) Administrative support: K Yi, X Fang; (III) Provision of study materials or patients: L Jia, Y Li, H He; (IV) Collection and assembly of data: All authors; (V) Data analysis and interpretation: C Ma, X Huang; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors.

Correspondence to: Xiao Fang, PhD. National Science Library (Chengdu), Chinese Academy of Sciences, No. 289 Qunxian South Street, Chengdu 610000, China. Email: fangx@clas.ac.cn.

Background: Insulin resistance (IR) is a precursor to metabolic disorders like type 2 diabetes and hypertension in children and adolescents. Early detection of IR is critical to prevent severe metabolic complications. IR is influenced by factors such as diet, inflammation, and genetics. However, existing studies often focus on limited populations and overlook dietary factors. This study aimed to evaluate the use of machine learning (ML) models for early IR prediction in children and adolescents, emphasizing accuracy.

Methods: We used physical examination data of children and adolescents aged 6–17 years from the China Health and Nutrition Survey (CHNS) database as the training set and collected routine physical examination data from children and adolescents aged 6–17 years admitted to Nanchong Central Hospital and the Nanchong City Jialing District People’s Hospital in Sichuan Province from January 2019 to October 2024 for validation. IR was assessed using the Homeostatic Model Assessment for IR (HOMA-IR) score, with a cutoff of >3.0 indicating IR. Potential predictors included demographic details, lifestyle habits, and blood test results. We conducted univariate logistic regression (LR) analysis to select variables with statistical significance and then constructed and compared the back propagation neural network (BPNN), exhaustive Chi-squared automatic interaction detector (E-CHAID), support vector machine (SVM), and LR models.

Results: The training sample included 827 children and adolescents (281 with IR and 546 without IR), while the test sample included 207 participants. The SVM model demonstrated superior predictive accuracy (91.90% in training and 90.34% in test set) compared to the E-CHAID (77.75% in training and 72.95% in test set), BPNN (75.94% in training and 70.05% in test set), and LR models (76.18% in training and 71.01% in test set). Sensitivity, specificity, Youden’s index, and area under the curve (AUC) values also favored the SVM model in both training and test samples.

Conclusions: Compared with the E-CHAID, BPNN, and LR models, the SVM model exhibited superior predictive ability for IR in children and adolescents based on physical examination data that include dietary factors. These findings suggest that the SVM model could serve as a valuable tool for early clinical prediction of IR, potentially aiding in the prevention of type 2 diabetes mellitus (T2DM) and associated metabolic complications. Further research is needed to validate these results in larger and more diverse populations.

Keywords: Insulin resistance (IR); machine learning (ML); child; adolescent; physical examination

Submitted Nov 13, 2024. Accepted for publication Mar 04, 2025. Published online Mar 26, 2025.

doi: 10.21037/tp-2024-502

Highlight box

Key findings

• This study compared four machine learning (ML) models to predict insulin resistance (IR) in children and adolescents using routine health and dietary data. The support vector machine (SVM) model demonstrated superior performance with 91.90% accuracy in training and 90.34% in external testing, achieving high sensitivity (0.870), specificity (0.930), Youden’s index (0.800), and area under the curve (0.958). Key predictors included glucose, waist circumference, body mass index, triglyceride, and dietary factors like protein and energy intake.

What is known and what is new?

• IR is a precursor to metabolic disorders like type 2 diabetes mellitus and hypertension in children and adolescents. Previous studies focused on limited populations or specific predictors.

• This study innovates by integrating dietary factors and comparing multiple ML algorithms, highlighting the SVM model’s effectiveness and identifying novel predictors like protein and energy intake.

What is the implication, and what should change now?

• The findings suggest that ML, especially SVM, can effectively predict IR using accessible clinical and dietary data, allowing early identification of high-risk individuals without direct insulin measurement. This approach supports targeted interventions and emphasizes the importance of balanced nutrition in preventing IR. Future work should expand datasets and validate models across diverse populations to enhance generalizability and clinical applicability.

Introduction

Insulin resistance (IR) is a critical precursor to several metabolic disorders, including type 2 diabetes mellitus (T2DM), hypertension, and polycystic ovary syndrome (PCOS), particularly among children and adolescents (1,2). The prevalence of IR in pediatric populations has increased alongside rising obesity rates, with strong associations between obesity, IR, and metabolic syndrome components (3). Early detection and intervention for IR are essential to prevent its progression to more severe metabolic complications (4). Therefore, developing accurate and practical predictive models for IR in children and adolescents is of great clinical significance.

A variety of elements contribute to IR, including dietary habits, disruptions in lipid metabolism, oxidative stress, issues with the function of mitochondria, inflammation, and genetic predispositions (1). Research has consistently shown strong associations between hematological parameters and IR across different populations. Studies have revealed positive correlations between IR and red blood cell (RBC) count, hemoglobin (HGB), hematocrit, and white blood cell (WBC) count (5-7). Lifestyle factors, including physical inactivity and stress, were found to be significantly correlated with waist circumference, an indicator of IR, among female medical students in Saudi Arabia. Additionally, anthropometric measurements, such as body mass index (BMI), waist circumference, blood pressure, and postprandial blood sugar levels, are significantly correlated with IR (8). These factors have been incorporated into various prediction models, including logistic regression (LR) models and machine learning (ML) models, which have shown promising results in predicting IR risk. For instance, Hall et al. developed an ML model for children with 78% overall accuracy using anthropometric features (9), while Zhang & Wan focused on children aged 6–12 years, with their extreme gradient boosting (XGBoost) model achieving an area under the curve (AUC) of 0.85 and identifying glucose, waist circumference, and age as key predictors (10). However, these studies often focus on limited populations (e.g., children with obesity or diabetes) or specific age groups and may overlook the importance of dietary factors in predicting IR (11-13). Additionally, few studies have compared the performance of different ML algorithms in predicting IR among children and adolescents. This gap limits our understanding of which models may be most effective in clinical settings.

ML offers a powerful tool for predicting complex conditions like IR by identifying patterns and relationships in large datasets. In this study, we selected four widely used ML algorithms—the exhaustive Chi-squared automatic interaction detector (E-CHAID), back propagation neural network (BPNN), support vector machine (SVM), and LR—each with distinct advantages. E-CHAID is effective in handling categorical data and identifying interactions between variables (14). BPNN has strong nonlinear mapping capabilities and adaptability, making it suitable for complex datasets (15). SVM is renowned for its robustness in high-dimensional spaces and ability to handle nonlinear relationships through kernel functions (16). LR, on the other hand, provides clear and interpretable models, which are crucial for clinical decision-making (17). Given the diverse characteristics of these algorithms, comparing their accuracy in predicting IR is essential to identify the most effective model for clinical use. By evaluating these models using a comprehensive dataset that includes both basic health information and dietary factors, we aim to provide a more accurate and practical tool for early IR prediction in children and adolescents. We present this article in accordance with the TRIPOD reporting checklist (available at https://tp.amegroups.com/article/view/10.21037/tp-2024-502/rc).

Methods

Study participants

In this research, we collected information on individuals aged 6–17 years from the China Health and Nutrition Survey (CHNS) database, employing this dataset as our training sample. The CHNS is a collaborative project between the Carolina Population Center at the University of North Carolina and the National Institute for Nutrition and Health at the Chinese Center for Disease Control and Prevention. It was established to track the primary care nutritional and health conditions of individuals across 15 Chinese provinces from 1989 to 2015. Notably, the 2009 CHNS dataset includes biomarker information for children and adolescents, which is crucial for our study; hence, we utilized this specific dataset for our analysis. Finally, a total of 827 qualified participants aged 6–17 years from nine provinces in China were included in the training dataset.

Given our research scope—developing and validating an early IR prediction model using routine physical examination data and dietary factors—a cross-sectional approach allowed us to meet our objectives efficiently with the available resources. To mitigate limitations of cross-sectional data, we validated the model using an external dataset from two hospitals. We collected data on children and adolescents aged 6–17 years from the physical examination Department and the Department of Pediatrics at the Nanchong Central Hospital and the Nanchong City Jialing District People’s Hospital (a tertiary hospital and a secondary hospital, respectively) from January 2019 to October 2024. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The study was approved by the Institutional Review Boards of the Nanchong Central Hospital (No. 2024-033) and the Nanchong City Jialing District People’s Hospital (No. 2024-002). The participants’ legal guardians provided written informed consent to participate in this study. This compilation included fundamental demographic details, lifestyle practices, and blood sample analyses. In conclusion, a dataset encompassing 207 individuals within the age bracket of 6–17 years was created. Due to CHNS not collecting data on children and adolescents in Sichuan province, and all external test sets come from two hospitals in Sichuan province, China, there may be some regional differences in the population of the training and test sets.

Outcome

The Homeostatic Model Assessment for IR (HOMA-IR) calculation method is simple, has a good correlation with the hyperinsulinemic-euglycemic clamp technique, and is more widely used in clinical applications for the diagnosis of IR. Children and adolescents with a HOMA-IR score >3.0 were defined as having IR in this study. The HOMA-IR score was calculated as [fasting insulin (mU/L)] × [fasting glucose (mmol/L)]/22.5. While HOMA-IR calculation is straightforward, it requires measuring fasting insulin and glucose levels (18). In resource-limited areas or primary care settings, insulin testing may be impractical or costly. Our study aims to identify high-risk individuals for IR using early warning indicators and ML models that integrate clinical factors and biomarkers from routine physical exams. This allows preliminary IR risk assessment without direct insulin measurement, prioritizing those needing further testing in resource-constrained environments.

Data preprocessing and feature selection

The dataset obtained from the CHNS contained missing values. Initially, any row or column with more than 50% missing data was removed; subsequently, a multivariate imputation technique was applied to address the remaining gaps in the data. We strive to ensure that the selected variables can be obtained during routine physical examinations in most primary healthcare hospitals, which is child-friendly. Combining the data provided by the CHNS database, we selected 41 features for the study, including 17 concerning basic information [age, gender, nationality, province, stratum, urban, height, weight, BMI, systolic blood pressure (SBP), diastolic blood pressure (DBP), upper arm circumference, hip circumference, waist circumference, waist-height ratio, waist-to-hip ratio, and arm-height ratio (AHtR)], 12 about daily habits (sleep duration, currently in school, physical exercise, physical exercise per week, physical exercise in school, physical exercise per week, doing homework time, energy intake, carbohydrate intake, fat intake, protein intake), and 12 laboratory tests {total cholesterol (TC), triglyceride (TG), high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), apolipoprotein A (APO_A), apolipoprotein B (APO_B), lipoprotein(a) [Lp(a)], WBC, RBC, HGB, platelet (PLT), and glucose}. The outcome variable IR is assigned a value of 1, while non-IR is assigned a value of 0. Among the other independent variables, categorical variables were assigned values according to the rule that 1 represents positive and 0 represents negative. The continuity variable remains unchanged.

Model construction

All available data present in the CHNS database were utilized to enhance the statistical power and generalizability of the outcomes. Using IBM SPSS Modeler 18.0 software, four ML algorithms that are widely used in clinics, namely, E-CHAID, BPNN, SVM, and LR, were trained for model building.

The E-CHAID algorithm model is configured as follows. Maximum tree depth: 5. Importance values were adjusted via the Bonferroni method. Pearson’s χ² test was used as the test statistic. Maximum number of iterations: 100. Termination rule: splitting stops if the minimum record count in the parent branch is less than 2% of the total. Splitting stops if the minimum record count in the child node after splitting is less than 1% of the total. For the BPNN model, a multilayer perceptron is used for modeling. The hidden layer is set to automatically calculate, with a maximum training time of 15 minutes, a maximum number of training epochs of 250, and a minimum accuracy of 90%. The optimal network structure for the BPNN model is 11 → 6 → 1. For the SVM model, all probabilities were selected for inclusion, and the termination criterion is 1.0E−5, the regularization parameter is 10, the regression accuracy is 1.6, the kernel type is the recombinant fusion (RBF), and the RBF gamma is 1.0. Additionally, the calculation of the importance of the predictive variables is selected. For LR, the binary process uses an enter method, with a scale set to user-defined (value of 3.0), all probabilities were selected for inclusion, and the outlier tolerance is 1.0E−10.

After filling in missing data, we employed univariate LR analysis to identify variables that were statistically significant (α=0.05). Following this, we developed four distinct ML algorithm models. To further assess the accuracy and applicability of these models, they underwent external validation using datasets from two different hospitals.

Statistical analysis

SPSS version 26.0 (IBM Corporation, Armonk, NY, USA) was used for the Chi-squared test, t-test, univariate LR, and stepwise linear regression analysis. An independent t-test was used to compare continuous variables, and the Chi-squared test or Fisher’s exact test was used to compare categorical variables. All tests were two-tailed, and P<0.05 was considered to indicate statistical significance. To evaluate the performance of each IR risk prediction model for children and adolescents, we used the HOMA-IR calculator to assess the risk probability, with the calculation formula as described earlier. Four performance metrics, namely, sensitivity, specificity, Youden’s index, and AUC, were calculated to better understand the differences between the models.

Results

Subject characteristics

Data from a total of 827 children and adolescents from the CHNS database were gathered. Among them, 281 were identified with IR, defined by a HOMA-IR >3.0, and 546 were identified without IR. Accordingly, 207 children and adolescents were included in the external test set. We initially considered 41 variables, including demographic details, lifestyle habits, and blood test results. Data on weekly physical activity and homework duration were excluded because >50% of participants were missing. Ultimately, 39 variables were advanced to the feature selection phase.

A comparison of feature information between the training and test cohorts is presented in Tables 1,2. No significant disparities were observed between the two sets regarding sex, 3-day average intake of carbohydrates, 3-day average intake of fat, waist-to-hip ratio, sleep duration, number of sports activities currently in school, number of sports activities in school, number of sports activities, number of sports activities in school per week, homework, TG, TC, HDL-C, LDL-C, APO_A, Lp(a), WBC, RBC, PLT, or glucose (P>0.05). Conversely, significant differences were noted between the two sets in terms of the prevalence of IR, stratum, urban area, nationality, province, 3-day average intake of energy, 3-day average intake of protein, SBP, DBP, height, weight, BMI, upper arm circumference, hip circumference, waist circumference, AHtR, APO_B, and HGB, all of which were statistically significant (P<0.05).

Table 1

Feature comparison between the training set and test set

Variables	Training set	Test set	t	P
Age (years)	11.56±2.93	11.8±2.94	−1.053	0.29
3-day average
Energy (kcal)	1,759.16±579.11	1,856.91±561.58	−2.185	0.03
Carbohydrate (g)	245.53±83.62	255.41±87.85	−1.504	0.13
Fat (g)	61.57±37.41	64.92±27.96	−1.207	0.23
Protein (g)	55.47±21.41	62.55±24.81	−3.765	<0.001
SBP (mmHg)	99.9±13.22	102.35±13.99	−2.360	0.02
DBP (mmHg)	66.44±9.73	68.27±10.59	−2.383	0.02
Height (cm)	146.47±15.99	151.53±15.81	−4.084	<0.001
Weight (kg)	38.84±13.41	43.45±14.35	−4.364	<0.001
BMI (kg/m²)	17.58±3.54	18.42±3.71	−3.044	0.002
Upper arm circumference (cm)	20.7±4.24	22.07±4.55	−4.091	<0.001
Hip circumference (cm)	76.61±11.61	79.86±12.04	−3.579	<0.001
Waist circumference (cm)	62.91±9.81	65.63±10.62	−3.504	<0.001
Waist-height ratio	0.43±0.05	0.43±0.06	−0.870	0.38
Waist-to-hip ratio	0.82±0.08	0.82±0.07	0.160	0.87
AHtR	0.14±0.02	0.15±0.03	−2.329	0.02
Sleep duration (hours/day)	8.47±2.57	8.26±2.74	1.057	0.29
Sports activities in school per week (times/week)	5.29±7.8	5.02±8.65	0.433	0.67
TG (mmol/L)	1.01±0.75	0.92±0.64	1.440	0.15
TC (mmol/L)	3.88±0.7	3.9±0.71	−0.456	0.65
HDL-C (mmol/L)	1.44±0.54	1.49±0.8	−1.137	0.26
LDL-C (mmol/L)	2.19±0.89	2.22±0.68	−0.424	0.67
APO_A (g/L)	1.01±0.23	1.02±0.25	−0.436	0.66
APO_B (g/L)	0.64±0.18	0.66±0.2	−1.658	0.10
Lp(a) (mg/L)	148.7±209.94	129.88±178.03	1.187	0.24
HGB (g/L)	137.25±16.24	140.53±15.49	−2.622	0.009
WBC (×10⁹/L)	6.7±1.84	6.7±1.71	−0.005	>0.99
RBC (×10⁹/L)	4.82±0.6	4.9±0.53	−1.929	0.054
PLT (×10⁹/L)	264.35±75.73	262.82±71.59	0.262	0.79
Glucose (mmol/L)	4.88±0.81	4.84±0.8	0.509	0.61

Data are presented as mean ± SD. P values <0.05 were considered significant. AHtR, arm-height ratio; APO_A, apolipoprotein A; APO_B, apolipoprotein B; BMI, body mass index; DBP, diastolic blood pressure; HDL-C, high-density lipoprotein cholesterol; HGB, hemoglobin; LDL-C, low-density lipoprotein cholesterol; Lp(a), lipoprotein(a); PLT, platelet; RBC, red blood cell; SBP, systolic blood pressure; SD, standard deviation; TC, total cholesterol; TG, triglyceride; WBC, white blood cell.

Table 2

Feature comparison between the training set and test set

Variables	Training set	Test set	χ²	P
Gender			0.128	0.72
Male	456 (55.1)	117 (56.5)
Female	371 (44.9)	90 (43.5)
Stratum			11.092	0.01
Urban neighborhood	75 (9.1)	29 (14.0)
Suburban village	148 (17.9)	45 (21.7)
County town neighborhood	127 (15.4)	39 (18.8)
Rural village	477 (57.7)	94 (45.4)
Urban			6.09	0.01
Rural	625 (75.6)	139 (67.1)
Urban	202 (24.4)	68 (32.9)
Nationality			41.063	<0.001
Han	693 (83.8)	191 (92.3)
Buyi	32 (3.9)	0 (0.0)
Tujia	32 (3.9)	0 (0.0)
Miao	24 (2.9)	0 (0.0)
Man	17 (2.1)	14 (6.8)
Zhuang	12 (1.5)	0 (0.0)
Hui	3 (0.4)	0 (0.0)
Korean	2 (0.2)	1 (0.5)
Missing	1 (0.1)	0 (0.0)
Vaguer	1 (0.1)	0 (0.0)
Other	10 (1.2)	1 (0.5)
Province			235.908	<0.001
Guangxi	170 (20.6)	0 (0.0)
Guizhou	108 (13.1)	0 (0.0)
Henan	104 (12.6)	0 (0.0)
Jiangsu	93 (11.2)	0 (0.0)
Hunan	92 (11.1)	0 (0.0)
Heilongjiang	75 (9.1)	0 (0.0)
Hubei	73 (8.8)	0 (0.0)
Shandong	68 (8.2)	0 (0.0)
Liaoning	44 (5.3)	0 (0.0)
Sichuan	0 (0.0)	207 (100.0)
Sports activities	304 (36.8)	74 (35.7)	0.174	0.92
Currently in school	774 (93.6)	190 (91.8)	0.854	0.36
Sports activities in school	696 (84.2)	170 (82.1)	0.511	0.78
Homework	666 (80.5)	165 (79.7)	0.195	0.91
Children and adolescents with IR	281 (34.0)	92 (44.4)	7.864	0.005

Data are presented as n (%). P values <0.05 were considered significant. IR, insulin resistance.

Univariate LR

Through univariate LR analysis, the results revealed that age, province, 3-day average intake of carbohydrates, 3-day average intake of fat, 3-day average intake of energy, 3-day average intake of protein, SBP, DBP, height, weight, BMI, upper arm circumference, hip circumference, waist circumference, waist-height ratio, AHtR, currently in school, TG, TC, APO_B, RBC, HGB, and glucose were statistically significant variables in predicting IR, with a total of 23 variables.

E-CHAID model

The first optimal grouping variable in the decision tree is hip circumference, which forms a four-way branching tree (≤69.50, 69.51–76.00, 76.01–87.00, >87.00 cm). The stratified categorical variable nodes in the E-CHAID model include hip circumference, TG, glucose, height, and waist-height ratio, with a total of 20 nodes established, resulting in 14 terminal nodes. Among the five independent variables selected, the importance of IR clinical early warning indicators was ranked as follows: glucose (0.46), hip circumference (0.29), height (0.13), waist-height ratio (0.06), and TG (0.06). The model’s prediction accuracy can be found in Table 3, and the confusion matrix can be found in Table 4.

Table 3

The overall accuracy and error rates of the four models in the training and validation sets

Model	E-CHAID		BPNN		SVM		LR
Model	Training set	Test set	Training set	Test set	Training set	Test set	Training set	Test set
Correct	64,377.75	15,172.95	62,875.94	14,570.05	76,091.90	18,790.34	63,076.18	14,771.01
False	18,422.25	5,627.05	19,924.06	6,229.95	678.10	209.66	19,723.82	6,028.99
Total	827,100	207,100	827,100	207,100	827,100	207,100	827,100	207,100

BPNN, back propagation neural network; E-CHAID, exhaustive Chi-squared automatic interaction detector; LR, logistic regression; SVM, support vector machine.

Table 4

Confusion matrix of the four models in the training and testing sets (actual behavior values)

Model	E-CHAID				BPNN				SVM				LR
	Training set		Test set		Training set		Test set		Training set		Test set		Training set		Test set
	Non-IR	IR	Non-IR	IR	Non-IR	IR	Non-IR	IR	Non-IR	IR	Non-IR	IR	Non-IR	IR	Non-IR	IR
Non-IR	508	38	84	31	472	74	93	22	521	25	107	8	486	60	86	29
IR	146	135	25	67	125	156	40	52	42	239	12	80	137	144	31	61

BPNN, back propagation neural network; E-CHAID, exhaustive Chi-squared automatic interaction detector; IR, insulin resistance; LR, logistic regression; SVM, support vector machine.

BPNN model

The importance of the IR clinical early warning indicators is as follows: glucose (0.14), TG (0.07), waist circumference (0.07), APO_B (0.06), BMI (0.06), weight (0.06), 3-day average intake of protein (0.05), DBP (0.05), height (0.05), and waist-height ratio (0.04). The model’s prediction accuracy can be found in Table 3, and the confusion matrix can be found in Table 4.

SVM model

The importance of the IR clinical early warning indicators is as follows: glucose (0.10), SBP (0.09), age (0.09), TG (0.06), APO_B (0.06), height (0.06), upper arm circumference (0.05), province (0.05), currently in school (0.05), and 3-day average carbohydrate fat (0.04). The model’s prediction accuracy can be found in Table 3, and the confusion matrix can be found in Table 4.

LR model

The importance of the IR clinical early warning indicators is as follows: waist circumference (0.18), glucose (0.13), 3-day average intake of energy (0.09), BMI (0.08), 3-day average intake of carbohydrates (0.07), hip circumference (0.06), age (0.06), AHtR (0.05), height (0.05), and DBP (0.04). The model’s prediction accuracy can be found in Table 3, and the confusion matrix can be found in Table 4.

Comparison of the predictive ability of the four prediction models

Table 3 shows that for both the training and testing samples, the accuracy of the SVM model (91.90%, 90.34%) is greater than that of the E-CHAID model (77.75%, 72.95%), BPNN model (75.94%, 70.05%), and LR model (76.18%, 71.01%). Table 5 shows that in the training sample, the sensitivity (0.851), specificity (0.954), Youden’s index (0.805), and AUC (0.967) of the SVM model were all greater than those of the E-CHAID model (0.480, 0.930, 0.410, 0.814), the BPNN model (0.555, 0.864, 0.419, 0.803), and the LR model (0.512, 0.890, 0.402, 0.808). In the external testing sample, the sensitivity (0.870), specificity (0.930), Youden’s index (0.800), and AUC (0.958) of the SVM model were all higher than those of the E-CHAID model (0.728, 0.730, 0.458, 0.803), the BPNN model (0.565, 0.809, 0.374, 0.810), and the LR model (0.663, 0.748, 0.411, 0.804). In summary, the predictive ability of the SVM model is superior to that of the E-CHAID model, the BPNN model, and the LR regression model, whereas the IR predictive abilities of the E-CHAID model, BPNN model, and LR model were essentially comparable.

Table 5

Comparison of the evaluation index systems for the four prediction models of IR

Model	E-CHAID		BPNN		SVM		LR
Model	Training set	Test set	Training set	Test set	Training set	Test set	Training set	Test set
Sensitivity	0.480	0.728	0.555	0.565	0.851	0.870	0.512	0.663
Specificity	0.930	0.730	0.864	0.809	0.954	0.930	0.890	0.748
Youden’s index	0.410	0.458	0.419	0.374	0.805	0.800	0.402	0.411
AUC	0.814	0.803	0.803	0.810	0.967	0.958	0.808	0.804

AUC, area under the curve; BPNN, back propagation neural network; E-CHAID, exhaustive Chi-squared automatic interaction detector; IR, insulin resistance; LR, logistic regression; SVM, support vector machine.

Discussion

Although IR might not cause any signs or discomfort in children or teens, it is a significant risk factor for numerous metabolic disorders. If left unchecked, IR can impact the growth and development of these young individuals. IR in young individuals can lead to numerous health complications, including cardiovascular risk factors, T2DM, and nonalcoholic fatty liver disease (19). Early recognition and intervention are crucial for preventing the progression of IR and its associated complications. Treatment strategies focus on lifestyle modifications, pharmacotherapy, and, in some cases, surgical interventions (3). Even small improvements in diet and lifestyle can positively impact measured IR in children and adolescents (19). Research on IR has focused mostly on the predictive ability of specific variables, such as waist circumference and BMI (20). However, only a few single variables showed some correlation with IR, and this approach is not comprehensive enough in predicting IR, making it difficult to develop accurate evaluation criteria. Stawiski et al. applied neural network techniques to construct a predictive model for IR in children and achieved good predictive performance (21). A more recent study developed multivariate regression models using demographic and clinical variables, which showed good discriminative ability (AUC: 0.834–0.868) for IR prediction in children and adolescents with obesity (22). However, in terms of the population to which it was adapted, their study focused mainly on children with type 1 diabetes mellitus (T1DM) or obesity, whereas our study focused on a broader population covering all children and adolescents aged 6–17 years. Multiple studies have been able to accurately predict IR in children, but the indicators used, such as fasting blood glucose and genetic test results, are difficult to obtain from physical examination results, resulting in poor practical usability (23,24). The results of this study confirm the previous hypothesis that ML techniques can indeed accurately predict IR at a very early stage in school-aged children and adolescents via routine physical examination results and dietary questionnaires.

Through comparative analysis, this study revealed that the SVM model outperforms several other models in terms of predictive performance, including the E-CHAID model, the BPNN model, and the LR model. This result may be closely related to the unique advantages of the SVM model. The SVM model excels at distinguishing different categories by finding the optimal hyperplane, effectively identifying complex patterns and relationships when dealing with high-dimensional data and nonlinear problems (25). Moreover, the SVM model possesses good generalizability, enabling accurate predictions for unknown data with limited training data (26). These characteristics have led the SVM model to demonstrate high predictive accuracy in this study, providing a powerful tool for research in related fields.

The IR warning indicators selected by the four ML models are basically consistent with those used in previous studies and include mainly glucose, waist circumference, age, BMI, waist-height ratio, TG, blood pressure, etc. (10). Notably, several physical examination indicators and dietary factors discovered in this study, including hip circumference, upper arm circumference, protein intake, energy intake, and carbohydrate intake, have received little attention in previous studies. Previous studies have shown that a high-carbohydrate diet in Western countries is associated with an increased risk of IR in children and adolescents (27,28). Our research further revealed that increased protein and energy intake are also associated with an increased risk of IR. This suggests that we should pay more attention to the balanced and moderate composition of children’s and adolescents’ diets, avoiding excessive intake of carbohydrates, proteins, and energy.

There are certain limitations in this research. First, the sizes of both the training and testing datasets were limited, which could potentially lead to overfitting. Nonetheless, the utilization of an external test set demonstrated that the extent of this issue was within acceptable bounds. To further address this concern, we have applied appropriate regularization parameters for models that support regularization. Specifically, for LR, we set the regularization scale to a user-defined value of 3.0; for SVM, we set the regularization parameter to 10 and the RBF gamma to 1.0. These settings help to balance the trade-off between bias and variance, ensuring that the model performs well on unseen data. In future work, we plan to expand our dataset by incorporating additional data sources and conducting multi-center studies. Secondly, regional differences exist between the CHNS dataset (excluding Sichuan province) and our Sichuan-based test set, potentially impacting result generalizability. However, this design allows us to assess the model’s generalization capability. For instance, the SVM model showed strong performance on the Sichuan test set (sensitivity 0.870, specificity 0.930, Youden’s index 0.800, AUC 0.958), comparable to the training set, indicating good generalization. Future work will integrate datasets from other regions to further enhance model robustness and generalizability. Additionally, the models were based on a predefined set of clinically relevant variables, which may not cover all potential factors influencing IR. Unmeasured or unconsidered variables (such as folate, vitamin B12 diets, and rosiglitazone) could also significantly impact IR development and progression (29,30). To enhance model comprehensiveness, we plan to integrate additional data sources and conduct multi-center studies in future work. This will expand our dataset and include a broader range of variables, providing a more holistic understanding of IR contributors in children and adolescents.

Conclusions

This study aims to provide a risk classification prediction method for early clinical warning of IR in children and adolescents. We integrated the basic information, lifestyle habits, blood tests, and dietary questionnaire results of children and adolescents, which are easily obtained during the physical examination process. The results showed that the SVM model performed better in predicting individual risk than did the E-CHAID model, BPNN model, and LR model. In practical applications, their respective advantages should be combined to better leverage their application strengths and serve clinical practice.

Acknowledgments

None.

Footnote

Reporting Checklist: The authors have completed the TRIPOD reporting checklist. Available at https://tp.amegroups.com/article/view/10.21037/tp-2024-502/rc

Data Sharing Statement: Available at https://tp.amegroups.com/article/view/10.21037/tp-2024-502/dss

Peer Review File: Available at https://tp.amegroups.com/article/view/10.21037/tp-2024-502/prf

Funding: This research was funded by the Innovation Fund Youth Program of the National Science Library (Chengdu) of the Chinese Academy of Sciences (No. E3Z0000301).

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://tp.amegroups.com/article/view/10.21037/tp-2024-502/coif). The authors have no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The study was approved by the Institutional Review Boards of the Nanchong Central Hospital (No. 2024-033) and the Nanchong City Jialing District People’s Hospital (No. 2024-002). The participants’ legal guardians provided written informed consent to participate in this study.

Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.

References

Luo Y, Luo D, Li M, et al. Insulin resistance in pediatric obesity: from mechanisms to treatment strategies. Pediatr Diabetes 2024;2024:2298306.
Huo Y, Ji S, Yang H, et al. Differential expression of microRNA in the serum of patients with polycystic ovary syndrome with insulin resistance. Ann Transl Med 2022;10:762. [Crossref] [PubMed]
Al-Beltagi M, Bediwy AS, Saeed NK. Insulin-resistance in paediatric age: Its magnitude and implications. World J Diabetes 2022;13:282-307. [Crossref] [PubMed]
Ziamanesh F, Mohammadi M, Ebrahimpour S, et al. Unraveling the link between insulin resistance and Non-alcoholic fatty liver disease (or metabolic dysfunction-associated steatotic liver disease): A Narrative Review. J Diabetes Metab Disord 2023;22:1083-94. [Crossref] [PubMed]
Koivula T, Lempiäinen S, Laine S, et al. Cross-Sectional Associations of Body Adiposity, Sedentary Behavior, and Physical Activity with Hemoglobin and White Blood Cell Count. Int J Environ Res Public Health 2022;19:14347. [Crossref] [PubMed]
Alvarez-Jimenez L, Morales-Palomo F, Moreno-Cabañas A, et al. Effects of statin therapy on glycemic control and insulin resistance: A systematic review and meta-analysis. Eur J Pharmacol 2023;947:175672. [Crossref] [PubMed]
Chen JY, Chen YH, Lee YC, et al. The Association Between White Blood Cell Count and Insulin Resistance in Community-Dwelling Middle-Aged and Older Populations in Taiwan: A Community-Based Cross-Sectional Study. Front Med (Lausanne) 2022;9:813222. [Crossref] [PubMed]
Badawy Y, Aljohani NH, Salem GA, et al. Predictability of the Development of Insulin Resistance Based on the Risk Factors Among Female Medical Students at a Private College in Saudi Arabia. Cureus 2023;15:e39112. [Crossref] [PubMed]
Hall AJ, Hussain A, Shaikh MG. Predicting insulin resistance in children using a machine-learning-based clinical decision support system. In: Advances in Brain Inspired Cognitive Systems: 8th International Conference, BICS 2016, Beijing, China, November 28-30, 2016, Proceedings 8. Springer International Publishing; 2016:274-83.
Zhang Q, Wan NJ. Simple Method to Predict Insulin Resistance in Children Aged 6-12 Years by Using Machine Learning. Diabetes Metab Syndr Obes 2022;15:2963-75. [Crossref] [PubMed]
Yan W, Wu S, Liu Q, et al. The link between obesity and insulin resistance among children: Effects of key metabolites. J Diabetes 2023;15:1020-8. [Crossref] [PubMed]
El Sehmawy AA, Diab FEA, Hassan DA, et al. Utility of Adipokines and IL-10 in Association with Anthropometry in Prediction of Insulin Resistance in Obese Children. Diabetes Metab Syndr Obes 2022;15:3231-41. [Crossref] [PubMed]
Rodríguez-Rodríguez E, Salas-González MD, Ortega RM, et al. Leukocytes and Neutrophil-Lymphocyte Ratio as Indicators of Insulin Resistance in Overweight/Obese School-Children. Front Nutr 2021;8:811081. [Crossref] [PubMed]
Altay Y. Prediction of the live weight at breeding age from morphological measurements taken at weaning in indigenous Honamli kids using data mining algorithms. Trop Anim Health Prod 2022;54:172. [Crossref] [PubMed]
Jiang S, Li Y, Jiao Y, et al. A back propagation neural network approach to estimate the glomerular filtration rate in an older population. BMC Geriatr 2023;23:322. [Crossref] [PubMed]
Teng S, Zheng N, Al-Huqail AA, et al. Effect of nanoparticle macroalgae in the treatment of fatty liver disease using logistic regression, and support vector machine. Environ Res 2023;224:115426. [Crossref] [PubMed]
Lee AM, Hu J, Xu Y, et al. Using Machine Learning to Identify Metabolomic Signatures of Pediatric Chronic Kidney Disease Etiology. J Am Soc Nephrol 2022;33:375-86. [Crossref] [PubMed]
Khalili D, Khayamzadeh M, Kohansal K, et al. Are HOMA-IR and HOMA-B good predictors for diabetes and pre-diabetes subtypes? BMC Endocr Disord 2023;23:39. [Crossref] [PubMed]
Chen ME, Hannon TS. Clinical manifestations of insulin resistance in youth. In: Zeitler P, Nadeau K. editors. Insulin Resistance: Childhood Precursors of Adult Disease. Cham: Springer International Publishing; 2019:3-17.
Hatami H, Montazeri SA, Hashemi N, et al. Optimal Cutoff Points for Anthropometric Variables to Predict Insulin Resistance in Polycystic Ovary Syndrome. Int J Endocrinol Metab 2017;15:e12353. [Crossref] [PubMed]
Stawiski K, Pietrzak I, Młynarski W, et al. NIRCa: An artificial neural network-based insulin resistance calculator. Pediatr Diabetes 2018;19:231-5. [Crossref] [PubMed]
Araújo D, Morgado C, Correia-Pinto J, et al. Predicting Insulin Resistance in a Pediatric Population With Obesity. J Pediatr Gastroenterol Nutr 2023;77:779-87. [Crossref] [PubMed]
Lin H, Tas E, Børsheim E, et al. Circulating miRNA Signatures Associated with Insulin Resistance in Adolescents with Obesity. Diabetes Metab Syndr Obes 2020;13:4929-39. [Crossref] [PubMed]
Zhang PP, Song JY, Li L, et al. Associations between genetic variants of HSD17B13 and fasting plasma glucose in Chinese children. Nutr Metab Cardiovasc Dis 2023;33:1778-84. [Crossref] [PubMed]
Bharadwaj, Prakash KB, Kanagachidambaresan GR. Pattern recognition and machine learning. In: Prakash KB, Kanagachidambaresan GR. editors. Programming with TensorFlow: Solution for Edge Computing Applications. Cham: Springer; 2021:105-44.
Du KL, Swamy MNS. Support vector machines. In: Neural Networks and Statistical Learning. London: Springer; 2014:469-524.
Romero-Polvo A, Denova-Gutiérrez E, Rivera-Paredez B, et al. Association between dietary patterns and insulin resistance in Mexican children and adolescents. Ann Nutr Metab 2012;61:142-50. [Crossref] [PubMed]
Gow ML, Garnett SP, Baur LA, et al. The Effectiveness of Different Diet Strategies to Reduce Type 2 Diabetes Risk in Youth. Nutrients 2016;8:486. [Crossref] [PubMed]
Qian L, Yin X, Lan T, et al. Peroxisome proliferator-activated receptor gamma preserves intracellular homeostasis of insulin-resistant periodontal ligament stem cells. Ann Transl Med 2022;10:580. [Crossref] [PubMed]
Zhang H, Wang X, Zhang J, et al. Early supplementation of folate and vitamin B12 improves insulin resistance in intrauterine growth retardation rats. Transl Pediatr 2022;11:466-73. [Crossref] [PubMed]

Cite this article as: Huang X, Yi K, Jia L, Li Y, He H, Ma C, Fang X. Development and validation of an insulin resistance prediction model in children and adolescents using machine learning algorithms. Transl Pediatr 2025;14(3):452-462. doi: 10.21037/tp-2024-502

Development and validation of an insulin resistance prediction model in children and adolescents using machine learning algorithms

Highlight box

Introduction

Methods

Study participants

Outcome

Data preprocessing and feature selection

Model construction

Statistical analysis

Results

Subject characteristics

Table 1

Table 2

Univariate LR

E-CHAID model

Table 3

Table 4

BPNN model

SVM model

LR model

Comparison of the predictive ability of the four prediction models

Table 5

Discussion

Conclusions

Acknowledgments

Footnote

References

Article Options

Download Citation

Share