Endocrine Journal
Online ISSN : 1348-4540
Print ISSN : 0918-8959
ISSN-L : 0918-8959
ORIGINAL
Prediction of insulin resistance using multiple adaptive regression spline in Chinese women
Shih-Peng MaoChen-Yu WangChi-Hao LiuChung-Bao HsiehDee PeiTa-Wei ChuYao-Jen Liang
Author information
JOURNAL OPEN ACCESS FULL-TEXT HTML

2025 Volume 72 Issue 4 Pages 387-398

Details
Abstract

Insulin resistance (IR) is the core for type 2 diabetes and metabolic syndrome. The homeostasis assessment model is a straightforward and practical tool for quantifying insulin resistance (HOMA-IR). Multiple adaptive regression spline (MARS) is a machine learning method used in many research fields but has yet to be applied to estimating HOMA-IR. This study uses MARS to build an equation to estimate HOMA-IR in pre-menopausal Chinese women based on a sample of 4,071 healthy women aged 20–50 with no major diseases and no medication use for blood pressure, blood glucose or blood lipids. Thirty variables were applied to build the HOMA-IR model, including demographic, laboratory, and lifestyle factors. MARS results in smaller prediction errors than traditional multiple linear regression (MLR) methods, and is thus more accurate. The model was established based on key impact factors including waist-hip ratio (WHR), C reactive protein (CRP), uric acid (UA), total bilirubin (TBIL), leukocyte (WBC), serum glutamic oxaloacetic transaminase (GOT), high-density lipoprotein cholesterol (HDL-C), systolic blood pressure (SBP), serum glutamic pyruvic transaminase (GPT), and triglycerides (TG). The equation is as following:

HOMA-IR = 6.634 – 1.448MAX(0, 0.833 – WHR) + 10.152MAX(0, WHR – 0.833) – 1.351MAX(0, 0.7 – CRP) – 0.449MAX(0, CRP – 0.7) + 1.062MAX(0, UA – 8.5) + +1.047(MAX(0, 0.83 – TBIL) + 0.681MAX(0, WBC – 11.53) – 0.071MAX(0, 11.53 – WBC) + 0.043MAX(0, 24 – GOT) – 0.017MAX(0, GOT – 24) + 0.021MAX(0, 59 – HDL) – 0.005MAX(0, HDL – 59) – 0.013MAX(0, 141 – SBP) – 0.033MAX(0, 100 – GPT) + 0.013MAX(0, GPT – 100) – 0.004MAX(303 – TG)

Results indicate that MARS is a more precise tool than fasting plasma insulin (FPI) levels, and could be used in the daily practice, and further longitudinal studies are warranted.

Background

The prevalence of type 2 diabetes (T2D) has increased dramatically in recent years, with total global cases increasing from 462 million in 2017 to nearly 700 million in 2023, an increase of over 50% in just six years [1]. With nearly 1 million deaths per year, T2D is currently the world’s 9th leading cause of death. Similar trend is found in Taiwan, where around 9.7% of the population had T2D in 2021 [2]. T2D is the 5th leading cause of death in Taiwan, where T2D and its co-morbidities account for 11.5% of total expenses by the government’s national health insurance scheme [3]. Thus, early diagnosis and appropriate management of T2D is a critical issue both in Taiwan and globally.

The underlying pathophysiologies of T2D are increased insulin resistance (IR) and decreased insulin secretion [4], and IR is simply defined as a “blunting of insulin’s hypoglycemic effects” [5]. There are many different methods to measure IR. The most sophisticated method involves the use of a “euglycemic clamp,” wherein the amount of glucose water infused to maintain plasma glucose level in the normal range for 2 hours would be the IR value, and a higher glucose infused volume is negatively correlated with IR. However, this approach is expensive and labor intensive to perform. Other tests such as insulin suppression test, intravenous glucose tolerance test and oral glucose tolerance test all have their unique characteristics and have been developed and applied in different studies. Of these tests, the hemostasis assessment model for insulin resistance (HOMA-IR) was first reported by Matthews et al. [6], calculating IR through multiple measurements of fasting plasma glucose (FPG) and insulin (FPI) with a coefficient. Because of its simplicity, HOMA-IR has been extensively used.

Many studies have attempted to identify and characterize various impact factors for IR, including inflammation, obesity, fatty liver, metabolic syndrome, and polycystic ovary syndrome. Aging is also considered to be related to IR [7], reporting that severity of IR increases with age.

However, the findings presented by Barbieri et al. suggest that IR and reduced insulin secretion are not necessarily correlated with age [8].

Most of the aforementioned studies used traditional statistical methods. Recently, however, the development of artificial intelligence and related machine learning techniques have been widely applied in medical research. Among these methods, multivariate adaptive regression spline (MARS) could provide a specific equation which captures and makes explicit non-linear relationships between variables, allowing MARS to outperform traditional multiple linear regression (MLR) approaches in predicting the behavior of dependent variables. The present study uses MARS in building an equation based on demographic, biochemistry, and lifestyle data for the prediction of HOMA-IR in healthy Chinese women.

Methods

Participant and Study Design

The present study followed our previously published protocol [9]. Data were sourced from the Taiwan MJ cohort, an ongoing prospective cohort of health examinations conducted by the MJ Health Screening Centers in Taiwan [10]. These examinations cover more than 100 important biological indicators, including anthropometric measurements, blood tests, imaging tests, etc. Each participant completed a self-administered questionnaire to collect information related to personal and family medical history, current health status, lifestyle, physical exercise, sleep habits, and dietary habits [11]. It should be noted that this study is a secondary data analysis. The data of our study participants were obtained from health checkups conducted at the MJ Clinic, with general consent obtained for future anonymous studies. All or part of the data used in this research were authorized by and received from MJ Health Research Foundation (Authorization Code: MJHRF2023007A). Any interpretations or conclusions described in this paper do not represent the views of MJ Health Research Foundation. Please refer to the technical report annually [11]. The study protocol was approved by the Institutional Review Board of the Kaohsiung Armed Forces General Hospital (IRB No.: KAFGHIRB 112-006). Since no samples were collected from patients, a short IRB review was approved and no consent was required. The data set initially included 646,728 healthy participants. Filtering for the exclusion criteria listed below, the final sample for analysis included 4,071 subjects (see Fig. 1).

Fig. 1  Flowchart of sample selection from the MJ study cohort

The study exclusion criteria were:

1. Male;

2. Age <20 and >50 years old;

3. Taking medications for metabolic syndrome at the time of the study

4. Missing data of components of metabolic syndrome

On the day of the study, a senior MJ nursing staff member recorded the subject’s medical history, including information on any current medications, and performed a physical examination. Waist circumference was measured horizontally at the level of the natural waist. Body mass index (BMI) was calculated as the subject’s body weight (kg) divided by the square of the subject’s height (m). Both systolic blood pressure and diastolic blood pressure were measured using a standard mercury sphygmomanometer on the right arm while seated.

After fasting for 10 hours, blood samples were drawn for biochemical analysis. Plasma was separated from the blood within 1 hour of collection and stored at 30°C until analysis for FPG and lipid profiles. FPG was measured using a glucose oxidase method (YSI 203 glucose analyzer, Yellow Springs Instruments, Yellow Springs, USA). Total cholesterol and triglyceride (TG) levels were measured using a dry, multilayer analytical slide method with the Fuji Dri-Chem 3000 analyzer (Fuji Photo Film, Tokyo, Japan). Serum high-density lipoprotein cholesterol (HDL-C) and low-density lipoprotein cholesterol (LDL-C) concentrations were analyzed using an enzymatic cholesterol assay, following dextran sulfate precipitation. Beckman Coulter AU 5800 biochemical analyzer determined the urine microalbumin by turbidimetry.

The 30 variables of the present study are shown in Table 1 (dependent variables) which could be grouped into three group categories, demographics, biochemistry, and lifestyle and HOMA-IR, of the independent variable. It should be noted that fasting plasma glucose and insulin levels were not included in the statistical analysis since HOMA-IR was calculated from these two parameters.

Table 1 Variable unit and description

Variables Unit and description
Age Years
Systolic blood pressure (SBP) mmHg
Diastolic blood pressure (DBP) mmHg
Waist-hip ratio (WHR) waist circumference/hip circumference
Leukocyte (WBC) *103/μL
Hemoglobin (Hb) *106/μL
Platelets (Plt) *103/μL
Total bilirubin (TBIL) mg/dL
Albumin (Alb) mg/dL
Globulin (Glo) g/dL
Alkaline Phosphatase (ALP) IU/L
Serum glutamic oxaloacetic transaminase (GOT) IU/L
Serum glutamic pyruvic transaminase (GPT) IU/L
Serum γ-glutamyl transpeptidase (γ-GT) IU/L
Lactate dehydrogenase (LDH) mg/dL
Creatinine (Cr) mg/dL
Uric acid (UA) mg/dL
Triglycerides (TG) mg/dL
High-density lipoprotein cholesterol (HDL-C) mg/dL
Low-density lipoprotein cholesterol (LDL-C) mg/dL
Plasma calcium concentration (Ca) mg/dL
Plasma phosphate concentration (P) mg/dL
Thyroid stimulating hormone (TSH) μIU/mL
C reactive protein (CRP) mg/dL
Education level (Edu.) (1) Illiterate (2) Elementary school (3) Junior high school (4) High school (vocational) (5) Junior college (6) University (7) Graduate school or above
Marriage status (MS) (1) Unmarried (2) Married
Income level (IL) NTD/years
(1) Below $200,000 (2) $200,001–$400,000 (3) $400,001–$800,000 (4) $800,001–$1,200,000 (5) $1,200,001–$1,600,000 (6) $1,600,001–$2,000,000 (7) More than $2,000,000
Drinking area Alcohol proof × drinking years × frequency
Sport time Hours per week × years of exercise × types
Sleeping hours (SH) (1) 0~4 hours (2) 4~6 hours (3) 6~8 hours (4) more than 8 hours
insulin resistance (HOMA-IR) FPI (μU/mL) × FPG (mg/dL)/405

Equation used to calculate HOMA-IR

  
HOMA−IR = FPI ( μU/mL ) × FPG ( mg/dL ) 405

Traditional statistics

To compare HOMA-IR performance using different demographic, biochemistry, and lifestyle parameters, we used t-tests to compare HOMA-IR for marital status, and analyzed variances to compare the HOMA-IR in terms of ordinal data such as education and income level (see Table 2). A simple correlation was applied to evaluate the relationship between HOMA-IR and other parameters (see Table 3). Considering the potential multicollinearity between WBC and CRP, both of which are inflammatory markers, principal component analysis (PCA) was applied to evaluate the impact of these two factors on HOMA-IR.

Table 2 T-test between insulin resistance and marital status, analysis of variance between insulin resistance and Income level, education level, and sleep hours

MS IL Edu. SH
HOMA-IR 0.996 0.001*** 0.000*** 0.000***

Note: MS: marriage status; IL: income level; Edu.: education level; SH: sleep hours; *: p < 0.05; **: p < 0.01; ***: p < 0.001.

Table 3 The r values of Pearson’s correlation between insulin resistance and demographic, biochemistry, and lifestyle parameters

Age SBP DBP WHR WBC Hb
HOMA-IR 0.006 0.340*** 0.27*** 0.423*** 0.335*** 0.096***
Plt TBIL Alb Glo ALP GOT
HOMA-IR 0.217*** –0.204*** 0.057*** 0.151*** 0.245*** 0.256***
GPT γ-GT LDH Cr UA TG
HOMA-IR 0.392*** 0.215*** 0.206*** 0.001 0.310*** 0.390***
HDL-C LDL-C Ca P TSH CRP
HOMA-IR –0.326*** 0.220*** 0.067*** –0.096*** 0.021* 0.281***
Drinking area Sport time
HOMA-IR –0.045*** –0.084***

Note: SBP: systolic blood pressure; DBP: diastolic blood pressure; WHR: waist-hip ratio; WBC: leukocyte; Hb: hemoglobin; Plt: platelets; TBIL: total bilirubin; Alb: albumin; Glo: globulin; ALP: alkaline phosphatase; GOT: serum glutamic oxaloacetic transaminase; GPT: serum glutamic pyruvic transaminase; γ-GT: serum γ-glutamyl transpeptidase; LDH: lactate dehydrogenase; Cr: creatinine; UA: uric acid; TG: triglycerides; HDL-C: high-density lipoprotein cholesterol; LDL-C: low-density lipoprotein cholesterol; Ca: plasma calcium concentration; P: plasma phosphate concentration; TSH: thyroid stimulating hormone; CRP: C reactive protein;*: p < 0.05; **: p < 0.01; ***: p < 0.001.

The aforementioned tests were performed using SPSS version 19.0 (IBM Inc., Armonk, New York).

Machine learning method

The present study used the multiple adaptive regression spline (MARS) which is unique in that it provides an equation. For each independent variable (x), there might be a non-linear relationship. MARS can capture these relationships, and thus, is expected to provide more accurate results than the traditional multiple regression.

A PubMed search shows that MARS has been extensively used in other studies that can be classified as follows: 1. Comparison between different AI and MARS [12, 13]. 2. Demonstration of the method of MARS itself [14, 15]. 3. Animal studies [16]. 4. Genetic studies [17]. 5. Medical research [18-20]. The present study is one of the first to apply MARS to the study of IR. MARS is useful for developing adaptable models suited for high-dimensional data. This modeling method uses an expansion structure reliant on product spline basis functions. Remarkably, both the count of fundamental functions and the attributes connected to each one, encompassing product degree and knot placements, are autonomously established through data-driven mechanisms [21]. This strategy draws inspiration from the principles of recursive partitioning, akin to methods like Classification and Regression Trees in terms of its proficiency in capturing intricate higher-order interactions.

In the analysis phase, the dataset was initially partitioned into an 80% training dataset for use in model construction, and a separate 20% testing dataset for model assessment. In the training phase, MARS uses specific hyperparameters that requires tuning to ensure good model performance. To facilitate this, the training dataset was once more divided at random to yield one segment for model formulation using a distinct set of hyperparameters, while the other section was used for validation purposes. A systematic exploration of all conceivable combinations of hyperparameters was performed using a comprehensive grid search approach. Subsequently, the model characterized by the lower root mean square error when applied to the validation dataset was deemed the optimal choice for each compared to MLR.

In the evaluation phase, the testing dataset was used to gauge the predictive efficacy of the MARS model. Given that the target variable in this study is a numerical parameter, the evaluation metrics chosen to compare model performance include symmetric mean absolute percentage error (SMAPE), relative absolute error (RAE), root relative squared error (RRSE), and root mean squared error (RMSE) (see Table 4).

Table 4 Equations for calculating performance metrics

Metric Description Calculation
SMAPE Symmetric mean absolute
Percentage error
  
SMAPE=1ni=1n|yiŷi|(|yi|+|ŷi|)/2
RAE Relative absolute error   
RAE=i=1n(yiŷi)2i=1n(yi)2
RRSE Root relative squared error   
RRSE=i=1n(yiŷi)2i=1n(yiyi̇¯)2
RMSE Root mean squared error   
RMSE=1ni=1n(yiŷi)2

ŷi and yi represent predicted and actual values, respectively; n stands for the number of instances.

To establish a comparative context, the averaged metrics derived from the MARS model were used to juxtapose the model performance with that of the benchmark MLR model. Note that both the MARS models and the MLR model were trained and tested using the same dataset.

In this study, all methods were performed using R software version 4.0.5 and RStudio version 1.1.453 with the required packages installed (http://www.R-project.org, accessed on 11 June 2024; https://www.rstudio.com/products/rstudio/, accessed on 11 June 2024). The implementations of MARS were the “earth” R package version 5.3.3 [22] , the “caret” R package version 6.0–94 [23]. MLR was implemented using the “stats” R package version 4.0.5, using the default settings to construct the models.

Results

A total of 4,071 healthy subjects were enrolled. Demographic data are summarized in Table 5.

Table 5 Participant demographics and the testing conditions for insulin resistance and various subgroup variables

Demographics Variables Mean ± SD Biochemistry Variables Mean ± SD
Age 36.68 ± 7.75 Alb 4.39 ± 0.21
Edu. N (%) Glo 3.16 ± 0.33
 Illiterate 1 (0.02%) ALP 50.98 ± 19.91
 Elementary school 8 (0.20%) GOT 19.98 ± 9.40
 Junior high school 32 (0.79%) GPT 19.85 ± 16.43
 High school (vocational) 584 (14.35%) γ-GT 20.21 ± 37.54
 Junior college 620 (15.23%) LDH 148.6 ± 24.83
 University 2,255 (55.39%) Cr 0.81 ± 0.09
 Graduate school or above 571 (14.03%) UA 4.71 ± 1.03
MS N (%) TG 80.46 ± 57.01
 Single 1,678 (41.22%) HDL-C 63.78 ± 14.29
 With spouse 2,393 (58.78%) LDL-C 108.16 ± 30.52
IL N (%) Ca 9.51 ± 0.37
 Below $200,000 731 (17.96%) P 3.84 ± 0.45
$200,001–$400,000 972 (23.88%) TSH 1.79 ± 1.50
$400,001–$800,000 1,446 (35.52%) CRP 0.20 ± 0.29
$800,001–$1,200,000 629 (15.45%) HOMA-IR 1.76 ± 1.23
$1,200,001–$1,600,000 140 (3.44%) Lifestyle Variables Mean ± SD
$1,600,001–$2,000,000 70 (1.72%) Drinking area 1.36 ± 6.17
 More than $2,000,000 83 (2.04%) Sport time 3.84 ± 6.28
Biochemistry Variables Mean ± SD SH N (%)
SBP 108.84 ± 13.76  <4 hours/day 40 (0.98%)
DBP 70.84 ± 9.67  4–6 hours/day 1,085 (26.65%)
WHR 0.76 ± 0.05  6–7 hours/day 1,841 (45.22%)
WBC 6.04 ± 1.59  7–8 hours/day 884 (21.71%)
Hb 13.01 ± 1.35  8–9 hours/day 177 (4.35%)
Plt 251.14 ± 59.04  >9 hours/day 44 (1.08%)
TBIL 0.89 ± 0.34

Table 6 shows the PCA result. The respective contributions of WBC and CRP are 0.62 and 0.38. Though CRP has a smaller impact on IR, it was not negligible.

Table 6 The results of principal component analysis of white blood cell count and c-reactive protein

Component Eigenvalue Difference Proportion Cumulative
WBC 1.24889 0.49772 0.6244 0.6244
CRP 0.751114 0.3756 1

WBC: leukocyte, CRP: c-reactive protein

Table 7 shows that MARS yielded smaller prediction errors than the MLR method, indicating greater accuracy. Table 8 shows the 16 basis functions (BF) derived from MARS from the following variables: SBP, WHR, WBC, TBIL, GOT, GPT, UA, TG, HDL-C, and CRP. Based on Table 8, the MARS equation is generated as follows:

Table 7 The average performance of multiple linear regression and multivariate adaptive regression splines

Methods SMAPE RAE RRSE RMSE
MARS 0.501 1.225 1.150 1.694
MLR 0.522 1.256 1.179 1.738

Note: MLR: multiple linear regression, MARS: multivariate adaptive regression splines.

Table 8 List of basis function Bi of the MARS model and their coefficients, ai

Definition ai
Intercept 6.634
B1 Max(0, 141 – SBP) –0.013
B2 Max(0, 0.833 – WHR) –1.448
B3 Max(0, WHR – 0.833) 10.152
B4 Max(0, 11.53 – WBC) –0.071
B5 Max(0, WBC – 11.53) 0.681
B6 Max(0, 0.83 – TBIL) 1.047
B7 Max(0, 24 – GOT) 0.043
B8 Max(0, GOT – 24) –0.017
B9 Max(0, 100 – GPT) –0.033
B10 Max(0, GPT – 100) 0.013
B11 Max(0, UA – 8.5) 1.062
B12 Max(0, 303 – TG) –0.004
B13 Max(0, 59 – HDL-C) 0.021
B14 Max(0, HDL-C – 59) –0.005
B15 Max(0, 0.7 – CRP) –1.351
B16 Max(0, CRP – 0. 7) –0.449

WHR: waist-hip ratio, SBP: systolic blood pressure, TG: triglyceride, HDL-C: high-density lipoprotein cholesterol, GOT: serum glutamic oxaloacetic transaminase, GPT: serum glutamic pyruvic transaminase, TBIL: total bilirubin, UA: uria acid, CRP: C-reactive protein

HOMA-IR = 6.634 – 1.448MAX(0, 0.833 – WHR) + 10.152MAX(0, WHR – 0.833) – 1.351MAX(0, 0.7 – CRP) – 0.449MAX(0, CRP – 0.7) + 1.062MAX(0, UA – 8.5) + +1.047(MAX(0, 0.83 – TBIL) + 0.681MAX(0, WBC – 11.53) – 0.071MAX(0, 11.53 – WBC) + 0.043MAX(0, 24 – GOT) – 0.017MAX(0, GOT – 24) + 0.021MAX(0, 59 – HDL) – 0.005MAX(0, HDL – 59) – 0.013MAX(0, 141 – SBP) – 0.033MAX(0, 100 – GPT) + 0.013MAX(0, GPT – 100) – 0.004MAX(303 – TG)

These BFs are interpreted as follows. Taking WHR as an example, when the WHR was below 0.833, the first equation was used: HOMA-IR = –1.448 × (0.833 – WHR). When subject WHR exceeded 0.8333, the equation changed to HOMA-IR = 10.152 × (WHR – 0.833333). In short, the results of the aforementioned equations should all exceed 0. For those risk factors with more than one BF, corresponding figures are displayed to capture a clearer understanding of the relationship between these factors and HOMA-IR. An equation built with these BFs is shown in Fig. 2.

Fig. 2  Influence of important variables on the insulin resistance

A: Systolic blood pressure. B: Waist-hip ratio. C: Leukocyte. D: Total bilirubin. E: Serum glutamic oxaloacetic transaminase. F: Serum glutamic pyruvic transaminase. G: Uric acid. H: Triglycerides. I: High-density lipoprotein cholesterol. J: C reactive protein.

Finally, the overview of the present study is shown in Graphical Abstract. It could be noted that, from how to selecting the participants, to the Mach-L methods we used, to the comparison between these two methods and, the equation built by MARS are all depicted.

Graphical Abstract Overview of the study methodology, including participant selection criteria, the Mach-L methods applied, the comparative analysis between the two methods, and the equation developed using MARS.

Discussion

The present study constructed an equation to estimate HOMA-IR in healthy Chinese women aged between 20–50. This age range was selected to exclude menopausal women. No subjects were currently using medications known to affect blood pressure (BP), FPG or blood lipids, and had no major diseases. Our results found that there were 10 biochemistry variables related to HOMA-IR. To our knowledge, the present study is the only one using MARS to build the equation. It is of clinical use and, in the same time, could shed light on the true determining factors for IR in premenopausal women.

No lifestyle factors were found to have significant correlation with HOMA-IR, including smoking, drinking, marital status, income, educational attainment, and exercise. This does not necessarily indicate that these factors are not important for insulin resistance, but rather suggest that their impact might be “absorbed” by their end results such as higher BP or WHR. A similar explanation could also be applied to the role of age. As mentioned in the introduction section, the impact of age on IR remains controversial. Our results found no correlation, suggesting that the positive relationship found in other studies might be due to differences in study design, age groupings, and ethnicity. IR might also be indirectly related to age through factors such as inflammation or body weight.

Ten biochemistry variables were found to be significantly related to HOMA-IR, the six most important of which are discussed here. WHR had the most significant impact. Many previous studies examined the relationship between obesity and IR [24]. Obesity can be measured using many methods, including body fat, body mass index, and WHR. Body fat measurement is more expensive and requires specific hardware, raising clinical obstacles. While simpler to use, body mass index cannot determine whether body weight is due to fat, muscle, or bone mass, making it less accurate [25]. Previous studies have shown that WHR provides greater accuracy than body mass index [26]. The results of the present study suggest that, among the ten biochemistry variables examined, WHR has the most significant impact on HOMA-IR. A review article published by Wondmkun suggested that this relationship was caused by multiple mechanisms, such as increased amounts of non-esterified fatty acids, glycerol, hormones, and pro-inflammatory cytokines [27].

The second leading factor was UA. In an empirical study of 687 diabetic patients, Han et al. found that UA is positively related to metabolic score for IR (METS-IR) (r = 0.238, p < 0.01) [28]. While that study focused on diabetics rather than healthy individuals, their results can be seen as supporting the findings of the present study. This raises the possibility of improving IR by reducing UA. Takir et al. found that HOMA-IR decreased after allopurinol treatment (n = 73) when compared to controls (n = 33) in a group of asymptomatic hyperuricemia subjects [29]. UA could increase HOMA-IR through several pathways such as inhibition of Adenosine 5'-monophosphate (AMP)-activated protein kinase, thereby increasing gluconeogenesis, inflammation, and oxidative stress in the islet cells [30-33].

As a systemic inflammatory biomarker, CRP is regarded as a strong and independent predictor for cardiovascular diseases, diabetes, and hypertension [34]. It reflects only local inflammation in atherosclerosis, but is also linked to IR [35]. Our results found CRP to be third leading factor, which is consistent with other mainstream studies. For example, Gelaye et al. showed that women with higher CRP had a 2.18-fold increased risk of HOMA-IR compared to the lowest tertile. In men, the heightened risk increased to 2.54-fold. This relationship could be explained by CRP being synthesized in liver and regulated by interleukin-6 and tumor necrosis factor-alfa [36]. At the same time, FPG and FPI values would increase in the presence of chronic and systemic inflammation [35].

Surprisingly, bilirubin was the next key factor. The few studies that have examined the relationship of bilirubin to IR have found a negative correlation [37, 38]. Working with a small sample, Zhang et al. reported that indirect bilirubin might reduce IR. Takei et al. suggested that this correlation might be related to bilirubin’s unique ability to suppress cytokines [39]. Our findings strongly support this correlation in a large cohort.

The correlation between WBC and IR remains controversial. Mahdiani et al. found no relationship between WBC and HOMA-IR in 283 diabetic patients [40], while Kuo et al. found a positive correlation in 21,112 non-obese men, though they used an equation based on demographic and biochemistry data to estimate IR which differs from HOMA-IR [41]. Mahdiani’s results could be explained by diabetic patients typically having high IR values, which may mask the relationship. Our findings provide additional evidence that, although both WBC and CRP are markers for inflammation, they have independent influences on HOMA-IR.

Several reports have shown that diabetes and metabolic syndrome are related to liver enzymes [42] based on a correlation between liver enzymes and IR. In a study of 261 subjects, Niranjan et al. found that the crude prevalence ratio for higher HOMA-IR (≥3.8) was increased in subjects with elevated GOT and GPT. However, the significance for GOT disappeared in multivariate analysis [43]. Our results are consistent with these findings, but both GOT and GPT had independent effects on HOMA-IR. This discrepancy could be explained by differences in study design, sample ethnicity, and study cohort.

The remaining three factors HDL-C, SBP and TG all correlated with IR [44-46], but given their relatively low importance, discussion is omitted here in consideration of article length.

The present study is subject to certain limitations. First, this is a cross-sectional study and is thus less persuasive than a longitudinal one. Second, only Chinese women were enrolled thus caution should be taken in extrapolating the findings to other ethnic groups.

Conclusion

MARS is used to build an equation to estimate IR in Chinese premenopausal women with WHR, CPR, UA, TBIL, WBC, GOT, HDL, SPB, GPT, TG as following.

HOMA-IR = 6.634 – 1.448MAX(0, 0.833–WHR) + 10.152MAX(0, WHR – 0.833) – 1.351MAX(0, 0.7 – CRP) – 0.449MAX(0, CRP – 0.7) + 1.062MAX(0, UA – 8.5) + +1.047(MAX(0, 0.83 – TBIL) + 0.681MAX(0, WBC – 11.53) – 0.071MAX(0, 11.53 – WBC) + 0.043MAX(0, 24 – GOT) – 0.017MAX(0, GOT – 24) + 0.021MAX(0, 59 – HDL) – 0.005MAX(0, HDL – 59) – 0.013MAX(0, 141 – SBP) – 0.033MAX(0, 100 – GPT) + 0.013MAX(0, GPT – 100) – 0.004MAX(303 – TG)

This provides a precise and easily implemented tool when FPI level measurements are unavailable.

Declaration

Ethical approval and consent to participate

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of Kaohsiung Armed Forces General Hospital (protocol code KAFGHIRB 112-006 and date of approval 21/06/2023).

Consent for publication

Not applicable. Analysis was based on secondary data sourced from the MJ Health Research Foundation.

Availability of data and materials

Data are available on request due to privacy/ethical restrictions.

Competing interests

The authors declare no conflicts of interest.

Funding

This research was funded by Kaohsiung Armed Forces General Hospital, grant number KAFGH_E_112053.

Acknowledgements

The authors thank all subjects who participated in the study.

Author Contributions

Validation: Dee Pei and Shih-Peng Mao; Formal analysis: Dee Pei; Investigation: Chen-Yu Wang, Chi-Hao Liu, Chung-Bao Hsieh, and Ta-Wei Chu; Data curation: Shih-Peng, Mao; Writing – original draft: Shih-Peng, Mao; Writing – review & editing: Yao-Jen Liang.

References
Appendix

IR

insulin resistance

MARS

multiple adaptive regression spline

MLR

multiple linear regression

FPI

fasting plasma insulin

FPG

fasting plasma glucose

T2D

type 2 diabetes

BMI

body mass index

SBP

systolic blood pressure

DBP

diastolic blood pressure

WHR

waist-hip ratio

WBC

leukocyte

Hb

hemoglobin

Plt

platelets

TBIL

total bilirubin

Alb

albumin

Glo

globulin

ALP

alkaline phosphatase

GOT

serum glutamic oxaloacetic transaminase

GPT

serum glutamic pyruvic transaminase

γ-GT

serum γ-glutamyl transpeptidase

LDH

lactate dehydrogenase

Cr

creatinine

UA

uric acid

TG

triglycerides

HDL-C

high-density lipoprotein cholesterol

LDL-C

low-density lipoprotein cholesterol

Ca

plasma calcium concentration

P

plasma phosphate concentration

TSH

thyroid stimulating hormone

CRP

C reactive protein

MS

marital status

IL

income level

Edu

education level

SH

sleep hours

SMAPE

symmetric mean absolute percentage error

RAE

relative absolute error

RRSE

root relative squared error

RMSE

root mean squared error

 
© The Japan Endocrine Society

This article is licensed under a Creative Commons [Attribution-NonCommercial-NoDerivatives 4.0 International] license.
https://creativecommons.org/licenses/by-nc-nd/4.0/
feedback
Top