2025 Volume 72 Issue 4 Pages 387-398
Insulin resistance (IR) is the core for type 2 diabetes and metabolic syndrome. The homeostasis assessment model is a straightforward and practical tool for quantifying insulin resistance (HOMA-IR). Multiple adaptive regression spline (MARS) is a machine learning method used in many research fields but has yet to be applied to estimating HOMA-IR. This study uses MARS to build an equation to estimate HOMA-IR in pre-menopausal Chinese women based on a sample of 4,071 healthy women aged 20–50 with no major diseases and no medication use for blood pressure, blood glucose or blood lipids. Thirty variables were applied to build the HOMA-IR model, including demographic, laboratory, and lifestyle factors. MARS results in smaller prediction errors than traditional multiple linear regression (MLR) methods, and is thus more accurate. The model was established based on key impact factors including waist-hip ratio (WHR), C reactive protein (CRP), uric acid (UA), total bilirubin (TBIL), leukocyte (WBC), serum glutamic oxaloacetic transaminase (GOT), high-density lipoprotein cholesterol (HDL-C), systolic blood pressure (SBP), serum glutamic pyruvic transaminase (GPT), and triglycerides (TG). The equation is as following:
HOMA-IR = 6.634 – 1.448MAX(0, 0.833 – WHR) + 10.152MAX(0, WHR – 0.833) – 1.351MAX(0, 0.7 – CRP) – 0.449MAX(0, CRP – 0.7) + 1.062MAX(0, UA – 8.5) + +1.047(MAX(0, 0.83 – TBIL) + 0.681MAX(0, WBC – 11.53) – 0.071MAX(0, 11.53 – WBC) + 0.043MAX(0, 24 – GOT) – 0.017MAX(0, GOT – 24) + 0.021MAX(0, 59 – HDL) – 0.005MAX(0, HDL – 59) – 0.013MAX(0, 141 – SBP) – 0.033MAX(0, 100 – GPT) + 0.013MAX(0, GPT – 100) – 0.004MAX(303 – TG)
Results indicate that MARS is a more precise tool than fasting plasma insulin (FPI) levels, and could be used in the daily practice, and further longitudinal studies are warranted.
The prevalence of type 2 diabetes (T2D) has increased dramatically in recent years, with total global cases increasing from 462 million in 2017 to nearly 700 million in 2023, an increase of over 50% in just six years [1]. With nearly 1 million deaths per year, T2D is currently the world’s 9th leading cause of death. Similar trend is found in Taiwan, where around 9.7% of the population had T2D in 2021 [2]. T2D is the 5th leading cause of death in Taiwan, where T2D and its co-morbidities account for 11.5% of total expenses by the government’s national health insurance scheme [3]. Thus, early diagnosis and appropriate management of T2D is a critical issue both in Taiwan and globally.
The underlying pathophysiologies of T2D are increased insulin resistance (IR) and decreased insulin secretion [4], and IR is simply defined as a “blunting of insulin’s hypoglycemic effects” [5]. There are many different methods to measure IR. The most sophisticated method involves the use of a “euglycemic clamp,” wherein the amount of glucose water infused to maintain plasma glucose level in the normal range for 2 hours would be the IR value, and a higher glucose infused volume is negatively correlated with IR. However, this approach is expensive and labor intensive to perform. Other tests such as insulin suppression test, intravenous glucose tolerance test and oral glucose tolerance test all have their unique characteristics and have been developed and applied in different studies. Of these tests, the hemostasis assessment model for insulin resistance (HOMA-IR) was first reported by Matthews et al. [6], calculating IR through multiple measurements of fasting plasma glucose (FPG) and insulin (FPI) with a coefficient. Because of its simplicity, HOMA-IR has been extensively used.
Many studies have attempted to identify and characterize various impact factors for IR, including inflammation, obesity, fatty liver, metabolic syndrome, and polycystic ovary syndrome. Aging is also considered to be related to IR [7], reporting that severity of IR increases with age.
However, the findings presented by Barbieri et al. suggest that IR and reduced insulin secretion are not necessarily correlated with age [8].
Most of the aforementioned studies used traditional statistical methods. Recently, however, the development of artificial intelligence and related machine learning techniques have been widely applied in medical research. Among these methods, multivariate adaptive regression spline (MARS) could provide a specific equation which captures and makes explicit non-linear relationships between variables, allowing MARS to outperform traditional multiple linear regression (MLR) approaches in predicting the behavior of dependent variables. The present study uses MARS in building an equation based on demographic, biochemistry, and lifestyle data for the prediction of HOMA-IR in healthy Chinese women.
The present study followed our previously published protocol [9]. Data were sourced from the Taiwan MJ cohort, an ongoing prospective cohort of health examinations conducted by the MJ Health Screening Centers in Taiwan [10]. These examinations cover more than 100 important biological indicators, including anthropometric measurements, blood tests, imaging tests, etc. Each participant completed a self-administered questionnaire to collect information related to personal and family medical history, current health status, lifestyle, physical exercise, sleep habits, and dietary habits [11]. It should be noted that this study is a secondary data analysis. The data of our study participants were obtained from health checkups conducted at the MJ Clinic, with general consent obtained for future anonymous studies. All or part of the data used in this research were authorized by and received from MJ Health Research Foundation (Authorization Code: MJHRF2023007A). Any interpretations or conclusions described in this paper do not represent the views of MJ Health Research Foundation. Please refer to the technical report annually [11]. The study protocol was approved by the Institutional Review Board of the Kaohsiung Armed Forces General Hospital (IRB No.: KAFGHIRB 112-006). Since no samples were collected from patients, a short IRB review was approved and no consent was required. The data set initially included 646,728 healthy participants. Filtering for the exclusion criteria listed below, the final sample for analysis included 4,071 subjects (see Fig. 1).
The study exclusion criteria were:
1. Male;
2. Age <20 and >50 years old;
3. Taking medications for metabolic syndrome at the time of the study
4. Missing data of components of metabolic syndrome
On the day of the study, a senior MJ nursing staff member recorded the subject’s medical history, including information on any current medications, and performed a physical examination. Waist circumference was measured horizontally at the level of the natural waist. Body mass index (BMI) was calculated as the subject’s body weight (kg) divided by the square of the subject’s height (m). Both systolic blood pressure and diastolic blood pressure were measured using a standard mercury sphygmomanometer on the right arm while seated.
After fasting for 10 hours, blood samples were drawn for biochemical analysis. Plasma was separated from the blood within 1 hour of collection and stored at 30°C until analysis for FPG and lipid profiles. FPG was measured using a glucose oxidase method (YSI 203 glucose analyzer, Yellow Springs Instruments, Yellow Springs, USA). Total cholesterol and triglyceride (TG) levels were measured using a dry, multilayer analytical slide method with the Fuji Dri-Chem 3000 analyzer (Fuji Photo Film, Tokyo, Japan). Serum high-density lipoprotein cholesterol (HDL-C) and low-density lipoprotein cholesterol (LDL-C) concentrations were analyzed using an enzymatic cholesterol assay, following dextran sulfate precipitation. Beckman Coulter AU 5800 biochemical analyzer determined the urine microalbumin by turbidimetry.
The 30 variables of the present study are shown in Table 1 (dependent variables) which could be grouped into three group categories, demographics, biochemistry, and lifestyle and HOMA-IR, of the independent variable. It should be noted that fasting plasma glucose and insulin levels were not included in the statistical analysis since HOMA-IR was calculated from these two parameters.
Variables | Unit and description |
---|---|
Age | Years |
Systolic blood pressure (SBP) | mmHg |
Diastolic blood pressure (DBP) | mmHg |
Waist-hip ratio (WHR) | waist circumference/hip circumference |
Leukocyte (WBC) | *103/μL |
Hemoglobin (Hb) | *106/μL |
Platelets (Plt) | *103/μL |
Total bilirubin (TBIL) | mg/dL |
Albumin (Alb) | mg/dL |
Globulin (Glo) | g/dL |
Alkaline Phosphatase (ALP) | IU/L |
Serum glutamic oxaloacetic transaminase (GOT) | IU/L |
Serum glutamic pyruvic transaminase (GPT) | IU/L |
Serum γ-glutamyl transpeptidase (γ-GT) | IU/L |
Lactate dehydrogenase (LDH) | mg/dL |
Creatinine (Cr) | mg/dL |
Uric acid (UA) | mg/dL |
Triglycerides (TG) | mg/dL |
High-density lipoprotein cholesterol (HDL-C) | mg/dL |
Low-density lipoprotein cholesterol (LDL-C) | mg/dL |
Plasma calcium concentration (Ca) | mg/dL |
Plasma phosphate concentration (P) | mg/dL |
Thyroid stimulating hormone (TSH) | μIU/mL |
C reactive protein (CRP) | mg/dL |
Education level (Edu.) | (1) Illiterate (2) Elementary school (3) Junior high school (4) High school (vocational) (5) Junior college (6) University (7) Graduate school or above |
Marriage status (MS) | (1) Unmarried (2) Married |
Income level (IL) | NTD/years (1) Below $200,000 (2) $200,001–$400,000 (3) $400,001–$800,000 (4) $800,001–$1,200,000 (5) $1,200,001–$1,600,000 (6) $1,600,001–$2,000,000 (7) More than $2,000,000 |
Drinking area | Alcohol proof × drinking years × frequency |
Sport time | Hours per week × years of exercise × types |
Sleeping hours (SH) | (1) 0~4 hours (2) 4~6 hours (3) 6~8 hours (4) more than 8 hours |
insulin resistance (HOMA-IR) | FPI (μU/mL) × FPG (mg/dL)/405 |
To compare HOMA-IR performance using different demographic, biochemistry, and lifestyle parameters, we used t-tests to compare HOMA-IR for marital status, and analyzed variances to compare the HOMA-IR in terms of ordinal data such as education and income level (see Table 2). A simple correlation was applied to evaluate the relationship between HOMA-IR and other parameters (see Table 3). Considering the potential multicollinearity between WBC and CRP, both of which are inflammatory markers, principal component analysis (PCA) was applied to evaluate the impact of these two factors on HOMA-IR.
MS | IL | Edu. | SH | |
---|---|---|---|---|
HOMA-IR | 0.996 | 0.001*** | 0.000*** | 0.000*** |
Note: MS: marriage status; IL: income level; Edu.: education level; SH: sleep hours; *: p < 0.05; **: p < 0.01; ***: p < 0.001.
Age | SBP | DBP | WHR | WBC | Hb | |
HOMA-IR | 0.006 | 0.340*** | 0.27*** | 0.423*** | 0.335*** | 0.096*** |
Plt | TBIL | Alb | Glo | ALP | GOT | |
HOMA-IR | 0.217*** | –0.204*** | 0.057*** | 0.151*** | 0.245*** | 0.256*** |
GPT | γ-GT | LDH | Cr | UA | TG | |
HOMA-IR | 0.392*** | 0.215*** | 0.206*** | 0.001 | 0.310*** | 0.390*** |
HDL-C | LDL-C | Ca | P | TSH | CRP | |
HOMA-IR | –0.326*** | 0.220*** | 0.067*** | –0.096*** | 0.021* | 0.281*** |
Drinking area | Sport time | |||||
HOMA-IR | –0.045*** | –0.084*** |
Note: SBP: systolic blood pressure; DBP: diastolic blood pressure; WHR: waist-hip ratio; WBC: leukocyte; Hb: hemoglobin; Plt: platelets; TBIL: total bilirubin; Alb: albumin; Glo: globulin; ALP: alkaline phosphatase; GOT: serum glutamic oxaloacetic transaminase; GPT: serum glutamic pyruvic transaminase; γ-GT: serum γ-glutamyl transpeptidase; LDH: lactate dehydrogenase; Cr: creatinine; UA: uric acid; TG: triglycerides; HDL-C: high-density lipoprotein cholesterol; LDL-C: low-density lipoprotein cholesterol; Ca: plasma calcium concentration; P: plasma phosphate concentration; TSH: thyroid stimulating hormone; CRP: C reactive protein;*: p < 0.05; **: p < 0.01; ***: p < 0.001.
The aforementioned tests were performed using SPSS version 19.0 (IBM Inc., Armonk, New York).
Machine learning methodThe present study used the multiple adaptive regression spline (MARS) which is unique in that it provides an equation. For each independent variable (x), there might be a non-linear relationship. MARS can capture these relationships, and thus, is expected to provide more accurate results than the traditional multiple regression.
A PubMed search shows that MARS has been extensively used in other studies that can be classified as follows: 1. Comparison between different AI and MARS [12, 13]. 2. Demonstration of the method of MARS itself [14, 15]. 3. Animal studies [16]. 4. Genetic studies [17]. 5. Medical research [18-20]. The present study is one of the first to apply MARS to the study of IR. MARS is useful for developing adaptable models suited for high-dimensional data. This modeling method uses an expansion structure reliant on product spline basis functions. Remarkably, both the count of fundamental functions and the attributes connected to each one, encompassing product degree and knot placements, are autonomously established through data-driven mechanisms [21]. This strategy draws inspiration from the principles of recursive partitioning, akin to methods like Classification and Regression Trees in terms of its proficiency in capturing intricate higher-order interactions.
In the analysis phase, the dataset was initially partitioned into an 80% training dataset for use in model construction, and a separate 20% testing dataset for model assessment. In the training phase, MARS uses specific hyperparameters that requires tuning to ensure good model performance. To facilitate this, the training dataset was once more divided at random to yield one segment for model formulation using a distinct set of hyperparameters, while the other section was used for validation purposes. A systematic exploration of all conceivable combinations of hyperparameters was performed using a comprehensive grid search approach. Subsequently, the model characterized by the lower root mean square error when applied to the validation dataset was deemed the optimal choice for each compared to MLR.
In the evaluation phase, the testing dataset was used to gauge the predictive efficacy of the MARS model. Given that the target variable in this study is a numerical parameter, the evaluation metrics chosen to compare model performance include symmetric mean absolute percentage error (SMAPE), relative absolute error (RAE), root relative squared error (RRSE), and root mean squared error (RMSE) (see Table 4).
Metric | Description | Calculation | |
---|---|---|---|
SMAPE | Symmetric mean absolute Percentage error |
|
|
RAE | Relative absolute error |
|
|
RRSE | Root relative squared error |
|
|
RMSE | Root mean squared error |
|
To establish a comparative context, the averaged metrics derived from the MARS model were used to juxtapose the model performance with that of the benchmark MLR model. Note that both the MARS models and the MLR model were trained and tested using the same dataset.
In this study, all methods were performed using R software version 4.0.5 and RStudio version 1.1.453 with the required packages installed (http://www.R-project.org, accessed on 11 June 2024; https://www.rstudio.com/products/rstudio/, accessed on 11 June 2024). The implementations of MARS were the “earth” R package version 5.3.3 [22] , the “caret” R package version 6.0–94 [23]. MLR was implemented using the “stats” R package version 4.0.5, using the default settings to construct the models.
A total of 4,071 healthy subjects were enrolled. Demographic data are summarized in Table 5.
Demographics Variables | Mean ± SD | Biochemistry Variables | Mean ± SD |
---|---|---|---|
Age | 36.68 ± 7.75 | Alb | 4.39 ± 0.21 |
Edu. | N (%) | Glo | 3.16 ± 0.33 |
Illiterate | 1 (0.02%) | ALP | 50.98 ± 19.91 |
Elementary school | 8 (0.20%) | GOT | 19.98 ± 9.40 |
Junior high school | 32 (0.79%) | GPT | 19.85 ± 16.43 |
High school (vocational) | 584 (14.35%) | γ-GT | 20.21 ± 37.54 |
Junior college | 620 (15.23%) | LDH | 148.6 ± 24.83 |
University | 2,255 (55.39%) | Cr | 0.81 ± 0.09 |
Graduate school or above | 571 (14.03%) | UA | 4.71 ± 1.03 |
MS | N (%) | TG | 80.46 ± 57.01 |
Single | 1,678 (41.22%) | HDL-C | 63.78 ± 14.29 |
With spouse | 2,393 (58.78%) | LDL-C | 108.16 ± 30.52 |
IL | N (%) | Ca | 9.51 ± 0.37 |
Below $200,000 | 731 (17.96%) | P | 3.84 ± 0.45 |
$200,001–$400,000 | 972 (23.88%) | TSH | 1.79 ± 1.50 |
$400,001–$800,000 | 1,446 (35.52%) | CRP | 0.20 ± 0.29 |
$800,001–$1,200,000 | 629 (15.45%) | HOMA-IR | 1.76 ± 1.23 |
$1,200,001–$1,600,000 | 140 (3.44%) | Lifestyle Variables | Mean ± SD |
$1,600,001–$2,000,000 | 70 (1.72%) | Drinking area | 1.36 ± 6.17 |
More than $2,000,000 | 83 (2.04%) | Sport time | 3.84 ± 6.28 |
Biochemistry Variables | Mean ± SD | SH | N (%) |
SBP | 108.84 ± 13.76 | <4 hours/day | 40 (0.98%) |
DBP | 70.84 ± 9.67 | 4–6 hours/day | 1,085 (26.65%) |
WHR | 0.76 ± 0.05 | 6–7 hours/day | 1,841 (45.22%) |
WBC | 6.04 ± 1.59 | 7–8 hours/day | 884 (21.71%) |
Hb | 13.01 ± 1.35 | 8–9 hours/day | 177 (4.35%) |
Plt | 251.14 ± 59.04 | >9 hours/day | 44 (1.08%) |
TBIL | 0.89 ± 0.34 |
Table 6 shows the PCA result. The respective contributions of WBC and CRP are 0.62 and 0.38. Though CRP has a smaller impact on IR, it was not negligible.
Component | Eigenvalue | Difference | Proportion | Cumulative |
---|---|---|---|---|
WBC | 1.24889 | 0.49772 | 0.6244 | 0.6244 |
CRP | 0.751114 | 0.3756 | 1 |
WBC: leukocyte, CRP: c-reactive protein
Table 7 shows that MARS yielded smaller prediction errors than the MLR method, indicating greater accuracy. Table 8 shows the 16 basis functions (BF) derived from MARS from the following variables: SBP, WHR, WBC, TBIL, GOT, GPT, UA, TG, HDL-C, and CRP. Based on Table 8, the MARS equation is generated as follows:
Methods | SMAPE | RAE | RRSE | RMSE |
---|---|---|---|---|
MARS | 0.501 | 1.225 | 1.150 | 1.694 |
MLR | 0.522 | 1.256 | 1.179 | 1.738 |
Note: MLR: multiple linear regression, MARS: multivariate adaptive regression splines.
Definition | ai | |
---|---|---|
Intercept | — | 6.634 |
B1 | Max(0, 141 – SBP) | –0.013 |
B2 | Max(0, 0.833 – WHR) | –1.448 |
B3 | Max(0, WHR – 0.833) | 10.152 |
B4 | Max(0, 11.53 – WBC) | –0.071 |
B5 | Max(0, WBC – 11.53) | 0.681 |
B6 | Max(0, 0.83 – TBIL) | 1.047 |
B7 | Max(0, 24 – GOT) | 0.043 |
B8 | Max(0, GOT – 24) | –0.017 |
B9 | Max(0, 100 – GPT) | –0.033 |
B10 | Max(0, GPT – 100) | 0.013 |
B11 | Max(0, UA – 8.5) | 1.062 |
B12 | Max(0, 303 – TG) | –0.004 |
B13 | Max(0, 59 – HDL-C) | 0.021 |
B14 | Max(0, HDL-C – 59) | –0.005 |
B15 | Max(0, 0.7 – CRP) | –1.351 |
B16 | Max(0, CRP – 0. 7) | –0.449 |
WHR: waist-hip ratio, SBP: systolic blood pressure, TG: triglyceride, HDL-C: high-density lipoprotein cholesterol, GOT: serum glutamic oxaloacetic transaminase, GPT: serum glutamic pyruvic transaminase, TBIL: total bilirubin, UA: uria acid, CRP: C-reactive protein
HOMA-IR = 6.634 – 1.448MAX(0, 0.833 – WHR) + 10.152MAX(0, WHR – 0.833) – 1.351MAX(0, 0.7 – CRP) – 0.449MAX(0, CRP – 0.7) + 1.062MAX(0, UA – 8.5) + +1.047(MAX(0, 0.83 – TBIL) + 0.681MAX(0, WBC – 11.53) – 0.071MAX(0, 11.53 – WBC) + 0.043MAX(0, 24 – GOT) – 0.017MAX(0, GOT – 24) + 0.021MAX(0, 59 – HDL) – 0.005MAX(0, HDL – 59) – 0.013MAX(0, 141 – SBP) – 0.033MAX(0, 100 – GPT) + 0.013MAX(0, GPT – 100) – 0.004MAX(303 – TG)
These BFs are interpreted as follows. Taking WHR as an example, when the WHR was below 0.833, the first equation was used: HOMA-IR = –1.448 × (0.833 – WHR). When subject WHR exceeded 0.8333, the equation changed to HOMA-IR = 10.152 × (WHR – 0.833333). In short, the results of the aforementioned equations should all exceed 0. For those risk factors with more than one BF, corresponding figures are displayed to capture a clearer understanding of the relationship between these factors and HOMA-IR. An equation built with these BFs is shown in Fig. 2.
A: Systolic blood pressure. B: Waist-hip ratio. C: Leukocyte. D: Total bilirubin. E: Serum glutamic oxaloacetic transaminase. F: Serum glutamic pyruvic transaminase. G: Uric acid. H: Triglycerides. I: High-density lipoprotein cholesterol. J: C reactive protein.
Finally, the overview of the present study is shown in Graphical Abstract. It could be noted that, from how to selecting the participants, to the Mach-L methods we used, to the comparison between these two methods and, the equation built by MARS are all depicted.
The present study constructed an equation to estimate HOMA-IR in healthy Chinese women aged between 20–50. This age range was selected to exclude menopausal women. No subjects were currently using medications known to affect blood pressure (BP), FPG or blood lipids, and had no major diseases. Our results found that there were 10 biochemistry variables related to HOMA-IR. To our knowledge, the present study is the only one using MARS to build the equation. It is of clinical use and, in the same time, could shed light on the true determining factors for IR in premenopausal women.
No lifestyle factors were found to have significant correlation with HOMA-IR, including smoking, drinking, marital status, income, educational attainment, and exercise. This does not necessarily indicate that these factors are not important for insulin resistance, but rather suggest that their impact might be “absorbed” by their end results such as higher BP or WHR. A similar explanation could also be applied to the role of age. As mentioned in the introduction section, the impact of age on IR remains controversial. Our results found no correlation, suggesting that the positive relationship found in other studies might be due to differences in study design, age groupings, and ethnicity. IR might also be indirectly related to age through factors such as inflammation or body weight.
Ten biochemistry variables were found to be significantly related to HOMA-IR, the six most important of which are discussed here. WHR had the most significant impact. Many previous studies examined the relationship between obesity and IR [24]. Obesity can be measured using many methods, including body fat, body mass index, and WHR. Body fat measurement is more expensive and requires specific hardware, raising clinical obstacles. While simpler to use, body mass index cannot determine whether body weight is due to fat, muscle, or bone mass, making it less accurate [25]. Previous studies have shown that WHR provides greater accuracy than body mass index [26]. The results of the present study suggest that, among the ten biochemistry variables examined, WHR has the most significant impact on HOMA-IR. A review article published by Wondmkun suggested that this relationship was caused by multiple mechanisms, such as increased amounts of non-esterified fatty acids, glycerol, hormones, and pro-inflammatory cytokines [27].
The second leading factor was UA. In an empirical study of 687 diabetic patients, Han et al. found that UA is positively related to metabolic score for IR (METS-IR) (r = 0.238, p < 0.01) [28]. While that study focused on diabetics rather than healthy individuals, their results can be seen as supporting the findings of the present study. This raises the possibility of improving IR by reducing UA. Takir et al. found that HOMA-IR decreased after allopurinol treatment (n = 73) when compared to controls (n = 33) in a group of asymptomatic hyperuricemia subjects [29]. UA could increase HOMA-IR through several pathways such as inhibition of Adenosine 5'-monophosphate (AMP)-activated protein kinase, thereby increasing gluconeogenesis, inflammation, and oxidative stress in the islet cells [30-33].
As a systemic inflammatory biomarker, CRP is regarded as a strong and independent predictor for cardiovascular diseases, diabetes, and hypertension [34]. It reflects only local inflammation in atherosclerosis, but is also linked to IR [35]. Our results found CRP to be third leading factor, which is consistent with other mainstream studies. For example, Gelaye et al. showed that women with higher CRP had a 2.18-fold increased risk of HOMA-IR compared to the lowest tertile. In men, the heightened risk increased to 2.54-fold. This relationship could be explained by CRP being synthesized in liver and regulated by interleukin-6 and tumor necrosis factor-alfa [36]. At the same time, FPG and FPI values would increase in the presence of chronic and systemic inflammation [35].
Surprisingly, bilirubin was the next key factor. The few studies that have examined the relationship of bilirubin to IR have found a negative correlation [37, 38]. Working with a small sample, Zhang et al. reported that indirect bilirubin might reduce IR. Takei et al. suggested that this correlation might be related to bilirubin’s unique ability to suppress cytokines [39]. Our findings strongly support this correlation in a large cohort.
The correlation between WBC and IR remains controversial. Mahdiani et al. found no relationship between WBC and HOMA-IR in 283 diabetic patients [40], while Kuo et al. found a positive correlation in 21,112 non-obese men, though they used an equation based on demographic and biochemistry data to estimate IR which differs from HOMA-IR [41]. Mahdiani’s results could be explained by diabetic patients typically having high IR values, which may mask the relationship. Our findings provide additional evidence that, although both WBC and CRP are markers for inflammation, they have independent influences on HOMA-IR.
Several reports have shown that diabetes and metabolic syndrome are related to liver enzymes [42] based on a correlation between liver enzymes and IR. In a study of 261 subjects, Niranjan et al. found that the crude prevalence ratio for higher HOMA-IR (≥3.8) was increased in subjects with elevated GOT and GPT. However, the significance for GOT disappeared in multivariate analysis [43]. Our results are consistent with these findings, but both GOT and GPT had independent effects on HOMA-IR. This discrepancy could be explained by differences in study design, sample ethnicity, and study cohort.
The remaining three factors HDL-C, SBP and TG all correlated with IR [44-46], but given their relatively low importance, discussion is omitted here in consideration of article length.
The present study is subject to certain limitations. First, this is a cross-sectional study and is thus less persuasive than a longitudinal one. Second, only Chinese women were enrolled thus caution should be taken in extrapolating the findings to other ethnic groups.
MARS is used to build an equation to estimate IR in Chinese premenopausal women with WHR, CPR, UA, TBIL, WBC, GOT, HDL, SPB, GPT, TG as following.
HOMA-IR = 6.634 – 1.448MAX(0, 0.833–WHR) + 10.152MAX(0, WHR – 0.833) – 1.351MAX(0, 0.7 – CRP) – 0.449MAX(0, CRP – 0.7) + 1.062MAX(0, UA – 8.5) + +1.047(MAX(0, 0.83 – TBIL) + 0.681MAX(0, WBC – 11.53) – 0.071MAX(0, 11.53 – WBC) + 0.043MAX(0, 24 – GOT) – 0.017MAX(0, GOT – 24) + 0.021MAX(0, 59 – HDL) – 0.005MAX(0, HDL – 59) – 0.013MAX(0, 141 – SBP) – 0.033MAX(0, 100 – GPT) + 0.013MAX(0, GPT – 100) – 0.004MAX(303 – TG)
This provides a precise and easily implemented tool when FPI level measurements are unavailable.
The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of Kaohsiung Armed Forces General Hospital (protocol code KAFGHIRB 112-006 and date of approval 21/06/2023).
Consent for publicationNot applicable. Analysis was based on secondary data sourced from the MJ Health Research Foundation.
Availability of data and materialsData are available on request due to privacy/ethical restrictions.
Competing interestsThe authors declare no conflicts of interest.
FundingThis research was funded by Kaohsiung Armed Forces General Hospital, grant number KAFGH_E_112053.
AcknowledgementsThe authors thank all subjects who participated in the study.
Author ContributionsValidation: Dee Pei and Shih-Peng Mao; Formal analysis: Dee Pei; Investigation: Chen-Yu Wang, Chi-Hao Liu, Chung-Bao Hsieh, and Ta-Wei Chu; Data curation: Shih-Peng, Mao; Writing – original draft: Shih-Peng, Mao; Writing – review & editing: Yao-Jen Liang.
insulin resistance
MARSmultiple adaptive regression spline
MLRmultiple linear regression
FPIfasting plasma insulin
FPGfasting plasma glucose
T2Dtype 2 diabetes
BMIbody mass index
SBPsystolic blood pressure
DBPdiastolic blood pressure
WHRwaist-hip ratio
WBCleukocyte
Hbhemoglobin
Pltplatelets
TBILtotal bilirubin
Albalbumin
Gloglobulin
ALPalkaline phosphatase
GOTserum glutamic oxaloacetic transaminase
GPTserum glutamic pyruvic transaminase
γ-GTserum γ-glutamyl transpeptidase
LDHlactate dehydrogenase
Crcreatinine
UAuric acid
TGtriglycerides
HDL-Chigh-density lipoprotein cholesterol
LDL-Clow-density lipoprotein cholesterol
Caplasma calcium concentration
Pplasma phosphate concentration
TSHthyroid stimulating hormone
CRPC reactive protein
MSmarital status
ILincome level
Edueducation level
SHsleep hours
SMAPEsymmetric mean absolute percentage error
RAErelative absolute error
RRSEroot relative squared error
RMSEroot mean squared error