Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.
Advertisement
Scientific Reports volume 13, Article number: 410 (2023)
Metrics details
Findings were inconsistent regarding the superiority of using recently introduced hybrid methods to derive DPs compared to widely used statistical methods like principal component analysis (PCA) in assessing dietary patterns and their association with type 2 diabetes mellitus (T2DM). We aimed to investigate the association between DPs extracted using principal component analysis (PCA), partial least-squares (PLS), and reduced-rank regressions (RRR) in identifying DPs associated with T2DM risk. The study was conducted in the context of two cohort studies accomplished in central Iran. Dietary intake data were collected by food frequency questionnaires (FFQs). DPs were derived by using PCA, PLS, and RRR methods considering. The association between DPs with the risk of T2DM was assessed using log-binomial logistic regression test. A total of 8667 participants aged 20–70 years were included in this study. In the multivariate-adjusted models, RRR-DP3 characterized by high intake of fruits, tomatoes, vegetable oils, and refined grains and low intake of processed meats, organ meats, margarine, and hydrogenated fats was significantly associated with a reduced T2DM risk (Q5 vs Q1: RR 0.540, 95% CI 0.33–0.87, P-trend = 0.020). No significant highest-lowest or trend association was observed between DPs derived using PCA or PLS and T2DM. The findings indicate that RRR method was more promising in identifying DPs that are related to T2DM risk compared to PCA and PLS methods.
Type 2 diabetes mellitus (T2DM) is an increasingly common public health concern that its prevalence remains high on the world health agenda1 and can cause serious damage to body systems such as kidneys, heart, eyes, as well the vascular system2. It is a multifactorial chronic disease emanating from interaction between genetic and lifestyle factors3. Lifestyle-modification studies have established that prevention of T2DM underline the major role of acquired alterations, including an unhealthy diet, sedentary behavior, overweight/obesity, tobacco use, and other environmental factors4,5,6,7,8. Moreover, T2DM is known as the most important chronic disease developed by an unhealthy modern lifestyle9. It has been demonstrated that the quality and quantity of diet are at the heart of T2DM pathogenesis10. Despite the clear effects of nutrition as a fundamental factor in the pathogenesis of T2DM, it remains unclear which dietary aspects have more impacts on its prevention and management.
Recently, the dietary pattern (DP) approach was suggested to investigate the association between diets and chronic diseases with multi-factorial etiology11. It is proposed that dietary patterns (DPs) can provide more information regarding the nutrition and chronic diseases link beyond the effects of foods or single nutrients12. Various methods have been used to derive DPs including theoretical methods (a priori), empirical methods (a posteriori), and hybrid techniques of theoretical and empirical methods11. A priori and a posteriori approaches are traditionally applied in DP analysis, and a frequently used posterior approach is principal component analysis (PCA)13. This method derives DPs by constructing uncorrelated linear combinations of original food intake variables that explain as much variation in food groups intake as possible14. Hence, PCA-derived patterns present actual dietary behaviors in the population; however, PCA may reveal a poor correlation with the risk of diseases because DPs related to individuals’ behavior are not necessarily predictors of the disease of interest14.
Hybrid approaches with the combination of both a priori and a posteriori approaches, such as reduced-rank regression (RRR) and partial least squares (PLS), are proposed by researchers to derive the DP that better predict chronic diseases13,15,16. These methods lead to DPs that are highly correlated with a set of mediator variables between diet and disease association, called response variables. The response variables are determined based on a “priori” knowledge13,17. These two methods mathematically work through creating a linear combination of the predictors and response variables17. The RRR method strives to identify patterns through constructing linear functions of food groups, best explaining the variation in the outcomes; whereas, PLS aims to maximize the variance explained in both food groups and the responses13.
Few studies have assessed the association between DPs and T2DM through RRR method18,19. Batis et al.18 have suggested that using both PCA and RRR provided useful insights when studying the association of DPs with diabetes. On the other hand, no study has evaluated the DPs derived only by PLS method in association with T2DM. Moreover, one recent study found that DPs associated with adverse blood lipids are associated with incidence of T2DM20. Though, it is still not fully clear which approach may better predict the risk of T2DM. Therefore, we aimed to evaluate the association between DPs and T2DM risk through PCA, RRR, and PLS methods with incident T2DM, simultaneously, and also to compare the relative advantages of these methods in Iranian adults.
A total of 8667 study participants (52.5% females) had complete data to be entered to the current analysis of which 245 patients were diagnosed with T2DM after 6 years of follow-up for YaHS-TAMYZ study and 4 years for Shahedieh study. The baseline characteristics of the study population are presented in Table 1. There were significant differences between participant with and without T2DM across age categories, educational status, smoking status (P = 0.003), and total energy intake. Participants with T2DM were in higher age categories than participant without T2DM (P < 0.001). Compared to cases with T2DM, participant without T2DM were more to have high school diploma and BSc or higher academic degree (P = 0.026). In addition, participants with T2DM were more likely to be current and former smokers compared to participants without T2DM (P = 0.003). While, participants without T2DM had higher energy intake than cases with T2DM (P = 0.009). No significant difference was observed between two groups of participants for sex, marital status, BMI categories, and physical activity (P > 0.05).
The 33 dietary food groups and factor loadings for each DPs derived by PCA, PLS, and RRR methods are shown in Table 2. The first DPs derived by PCA method (PCA-DP1) was characterized by high intake of processed meats, organ meats, fish, margarine, fruit juice, pizza, snacks, sweet dessert, and soft drinks and low intake of whole grains. The PCA-DP2 was associated with high intakes of dairy products, fruits, tomatoes, other vegetables, potatoes, refined grains, and vegetable oils.in addition, and PCA-DP3 was characterized by high intake of tea, mayonnaise, nuts, hydrogenated fats, sugars, and soft drinks.
Using PLS method, we also derived three DPs: (1) PLS-DP1: high intake of whole grains and low intake of processed meats, organ meats, poultry, fish, margarine, fruit juice, pizza, snacks, and sweet dessert; (2) PLS-DP2: low intake of tea, potatoes, refined grains, sugars, and vegetable oils; (3) PLS-DP3: higher intake levels of fruits, tomatoes, other vegetables, and yoghurt drink, but low intake of margarine.
The first DPs from RRR method (RRR-DP1) was rich in whole grains and low in processed meats, red meats, poultry, fish, margarine, fruit juice, pizza, snacks, sweet dessert, and soft drinks. The RRR-DP2 was characterized primarily by high intake of poultry, fruits, soft drinks, and yoghurt drink and low intake of potatoes, refined grains, and mayonnaise; and the third DPs (RRR-DP3) was defined by high intake of fruits, fruit juice, refined grains, and vegetable oils, but low intake of processed meats, organ meats, margarine, and hydrogenated fats.
The percentage of variation explained by food groups was higher in DPs derived by PCA method (23.142%) in comparison to 19.252% of PLS-derived DPs and 13.89% for RRR-derived DPs (Table 3).
The three DPs of PCA explained 0.324% of the response variables variation. DPs from PLS method explained 0.831% of the total variation in six response variables and the RRR-derived DPs explained 0.993%. As expected, both RRR and PLS methods explained a greater amount of variation in the response variables (Table 3).
Figure 1 represents the risk of developing T2DM for each quintile of the DPs scores compared to the lowest quintile. No association was observed between three DPs from PCA method and T2DM risk in crude and all adjusted models.
Risk ratios and 95% confidence intervals for the association between dietary patterns (DPs) derived using principal component analysis (PCA, (A–C)), partial least-squares (PLS, (D–F)), and reduced-rank regression (RRR, (G–I)) and type 2 diabetes mellitus (T2DM).
The crude model of second DP derived by PLS method was inversely associated with T2DM risk (PLS-DP2 Q3 vs Q1: risk ratio (RR) 0.609, 95% confidence interval (CI) 0.39–0.94, P-trend = 0.585). In the multivariate-adjusted models, PLS-DP2 method was found to be inversely associated with T2DM risk in participants in the third quintile than in people in the first quintile (Model I Q3 vs Q1: RR = 0.609, 95% CI 0.39–0.94, P-trend = 0.997; Model II Q3 vs Q1: RR = 0.608, 95% CI 0.39–0.94, P-trend = 0.981; and Model III Q3 vs Q1: RR = 0.613, 95% CI 0.39–0.95, P-trend = 0.975).
In the crude models, two DPs derived by RRR method were inversely associated with T2DM risk (RRR-DP2 Q3 vs Q1: RR = 0.602, 95% CI 0.39–0.92, P-trend = 0.661; RRR-DP3 Q4 vs Q1: RR = 0.585, 95% CI 0.37–0.91 P-trend = 0.073). In the multivariate-adjusted models, RRR-DP3 was inversely associated with T2DM risk in all adjusted models (Model I Q5 vs Q1: RR = 0.599, 95% CI 0.37–0.94, P-trend = 0.035; Model II Q5 vs Q1: RR = 0.557, 95% CI 0.34–0.89, P-trend = 0.025; and Model III Q5 vs Q1: RR = 0.540, 95% CI 0.33–0.87, P-trend = 0.020). In addition, the inverse association between RRR-DP2 and T2DM risk was significant only for participants in the third quintile than those who in the lowest adherence to RRR-DP2 (Model I Q3 vs Q1: RR = 0.567, 95% CI 0.36–0.87, P-trend = 0.926; Model II Q3 vs Q1: RR = 0.568, 95% CI 0.36–0.87, P-trend = 0.950; and Model III Q3 vs Q1: RR = 0.564, 95% CI 0.36–0.87, P-trend = 0.786).
The greater adherence to a diet characterized by high intake of fruits, tomatoes, vegetable oils, and refined grains and low intake of processed meats, organ meats, margarine, and hydrogenated fats derived by RRR method was significantly associated with reduced T2DM risk. The present study also showed that the RRR method can provide a better identifying DPs that are related to T2DM risk due to the considering intermediate factors related to diseases for generating DPs.
Several studies have assessed the association between DPs derived only by RRR method and T2DM. Duan et al.20 reported that blood lipids-related DPs using the RRR method, for both men and women were characterized by high consumption of sugary beverages, juice, and added sugar; and low consumption of cereals, fruits, vegetables, nuts or seeds, and tea were significantly linked with an increased risk of T2DM. Liese et al.21 have used the RRR method on plasminogen activator inhibitor-1 (PAI-1) and fibrinogen biomarkers to derive DPs, and they identified a DP that was predictive of T2DM which was characterized by a high intake of red meat, fried potatoes, tomato vegetables, dried beans, low-fiber bread and cereal, eggs, cheese, and low intake of wine. A nested case–control study identified a RRR-derived DPs using inflammatory biomarkers that was characterized by a high intake of processed meats, soft drinks, sugar-sweetened drinks, and refined grains, but a low intake of cruciferous and yellow vegetables, wine, and coffee that was associated with an increased T2DM risk22. However, the differences in the results of our study and the aforementioned study could be influenced by the difference between responses variables.
In the current study, people in second and third quintile of adherence to RRR-DP2 had lower risk of T2DM; In addition, the inverse association between adherence to PLS-DP2 and T2DM risk was observed only in modest quintile. Whereas, no association between highest adherence to PLS-DP2 and RRR-DP2 and T2DM risk was observed.
Our findings revealed that although PCA explains the highest variation in food groups, none of the derived DPs by this method were significantly associated with T2DM risk. This supports the view that PCA generates the diet behavior-related patterns and PCA-derived DPs could not necessarily predict the risk of diseases.
Altogether, in this study, we found more T2DM-associated DPs by using the RRR method than both PLS and PCA. In line with our results, Hoffmann et al.13 compared three methods PCA, RRR, and PLS in association with T2DM and found that the RRR method could extract significant risk factors for T2DM. It should be considered that RRR method focuses on explaining variation in the disease-related response variables, while PLS is a method that mathematically considers both food groups and responses. This fact may explain the significant associations between RRR-derived DPs and T2DM rather than PLS-derived DPs. Moreover, in accordance with our results, some investigations demonstrated that RRR derived DPs had stronger and more statistically significant link with other outcomes than those derived using PCA and PLS13,16,23.
In line with this association, numerous previous studies support the link between consumption of fruits and vegetables and a decreased risk of T2DM. A meta-analysis of prospective studies found that T2DM risk reduced by 10% with increasing intakes of fruits up to 200–300 g/day24. A study by Nguyen et al.25 showed that greater intake of fruits and vegetables are related to a lower risk of T2DM. Furthermore, a review established that the intake of fruit juices can decrease the risk of chronic diseases including T2DM26; whereas, two meta-analyses did not proposed an association between fruit juice intake and T2DM risk27,28. The favorable effects of fruits and vegetables in the prevention of T2DM could be because of their high content of fiber, vitamins, minerals, antioxidants, and phytochemicals29,30. In addition, antioxidant phytochemicals contribute to the reduction of oxidative stress and inflammation30. For instance, it is shown that blueberries reduce blood glucose31,32 and C reactive protein31 and improve the insulin sensitivity33. Blueberries, grapes, and apples are rich in anthocyanins and quercetin34,35,36. Animal studies have shown that anthocyanins with anti-diabetic effects via glucose transporter 4 regulation37. Quercetin also has a protective role in reducing oxidative stress and beta-cell damage38. Moreover, the magnesium content of fruits and vegetables could improve insulin signaling39. It has been also demonstrated that the consumption of fruits and vegetables may reduce T2DM risk by decreasing adipose tissue and weight gain over time29. It is shown that tomatoes are beneficial for diabetic conditions due to reducing oxidative stress, inflammation, and tissue damages40. Tomatoes contain a wide range of antioxidants like lycopene, vitamins, and minerals41; as a dose–response association was observed between serum lycopene levels and T2DM42,43.
It has been consistently shown that processed meats increase the T2DM risk in prospective studies44,45. A meta-analysis by Tian et al.46 also revealed that the intake of processed meats is a risk factors for T2DM. It is conceivable that the high content of nitrates or nitrites in processed meats may increase the risk of T2DM47. Nitrosamine compounds in processed meats are formed during manufacturing or via interactions between nitrates and amino acids in the body48. It has been demonstrated that Nitrosamines have a toxic effect on β cells and can raise the T2DM risk49,50. Additionally, advanced glycation end products from processed meats can induce inflammatory mediators related to T2DM51. A growing body of evidence showed that DPs containing hydrogenated fats were positively associated with T2DM risk52,53. Trans fats are associated with an increased risk of T2DM through increasing TG levels, postprandial insulin and glucose, and reducing glucose uptake in skeletal and cardiac muscles.
There are several limitations in this study that should be considered. Although FFQs are widely used to measure usual dietary exposures and considered as a valid and reproducible nutrition science tool, they are prone to possible misreporting and misclassification of study participants which might lead to weak or null relationships. Moreover, short follow-up period and the limited number of incident T2DM cases were other limitations of our study. In addition, both YaHS-TAMYZ and Shahedieh cohort studies had less than 5-year follow-up, therefore, in the present study, the long-term effects of DPs on T2DM risk might not be revealed. In general, determining the most effective method for deriving dietary patterns related to a specific disease varies according to the study goals such as study population, selected response variables, and outcome of interest. Further studies are required to examine the generalizability of DPs derived by different methods in other populations using the similar response variables.
In conclusion, the higher adherence to a diet characterized by high intake of fruits, tomatoes, vegetable oils, and refined grains and low intake of processed meats, organ meats, margarine, and hydrogenated fats was significantly associated with reduced risk of T2DM. The findings indicate that RRR method was more promising in identifying DPs that are related to T2DM risk than PCA and PLS methods. Though, future investigations are required to approve the relative advantages of the RRR method in association with T2DM and other nutrition-related diseases.
The Yazd Health Study (YaHS) was established in September 2014 in Yazd greater area located in central Iran. In this study 9962 participants aged 20–70 years were entered in the enrollment phase. The dietary intake assessment of participants was separately collected in Taghzieh Mardom Yazd (TAMYZ) study using a validated semi-quantitative food frequency questionnaire (FFQ)54. The Shahedieh cohort study is a part of a large Persian multicentral study (Persian cohort) conducted on 180,000 participants in 18 various geographical areas of Iran55. The Shahedieh study was established in 2014 and 9977 adults aged 35 to 70 years entered to the study at baseline. Participants also filled a semi-quantitative food frequency questionnaire to report their dietary intake. Information on demographic characteristics, smoking status, physical activity, medical history was also collected in both studies. The study protocol for YaHS-TAMYZ56 and Shahedieh cohort55 are completely described elsewhere.
Flow chart of participant’s selection from YaHS-TAMYZ and Shahedieh cohort studies is showed in Fig. 2. Participants who reported an implausible total energy intake or incomplete dietary intakes data (< 800 kcal/day or > 6000 kcal/day, YaHS-TAMYZ study, n = 639, Shahedieh study, n = 1709), those had not provided data on response variables (YaHS-TAMYZ study, n = 6258, Shahedieh study, n = 356), those who had a previous diagnosis of type 1 diabetes or T2DM (YaHS-TAMYZ study, n = 601, Shahedieh study, n = 1685), those who had not provided data on national identifier code (YaHS-TAMYZ study, n = 34, Shahedieh study, n = 0), and people who died (YaHS-TAMYZ study, n = 23, Shahedieh study, n = 28) were excluded, which left 8667 participants (YaHS-TAMYZ: 2468, Shahedieh: 6199) for current analyses.
Flow chart representing the selection process of participants from YaHS-TAMYZ and Shahedieh cohort studies.
All participants gave an informed consent before entering the studies. Both studies were approved by the research Council of Shahid Sadoughi University of Medical Sciences. The current study was also ethically approved by Shahid Sadoughi University’s ethics committee (approval code: IR.SSU.SPH.REC.1399.197). All methods of the present study were carried out according to the relevant guidelines and regulations.
Dietary intakes in the YaHS-TAMYZ study were assessed by a 178-item validated, multiple-choice semi-quantitative FFQ54. For each food item, participants were asked by trained interviewers to report the frequency of food item intake during the past year by answering 10-multiple-choice frequency responses ranging from “never or less than once a month” to “10 or more times per day”. In addition, FFQ had five choices for portion size for estimation of the amount of each consumed food item57. Dietary intake information was collected by a semi-quantitative open-ended FFQ based on 134-items in the Shahedieh study. Participants of the Shahedieh study were asked to report how often on average over the previous year they consumed a typical portion size of each food item with multiple possible responses on a “daily”, “weekly”, or “monthly” basis. The frequency and portion size reported for food items were converted to grams per day using household measures58. The United States Department of Agriculture food composition database was used to estimate daily intake of energy and nutrient for each participant59. Food items were merged into 33 food groups based on food items similarity in their nutrient profiles and are presented in Table 4.
The height and body weight of the study participants were measured in both YaHS-TAMYS and Shahedieh studies. In the Shahedieh study, body weight (kg) and height (cm) were measured using the National Institute of Health protocols by trained staffs. Body weight was measured while the participants were with minimum clothing and without shoes by using a digital scale (SECA, model 755, Germany). Height was measured by using a measure tape attached to a flat wall with the accuracy of 0.5 cm. In the YaHS-TAMYS study, body weight was measured by using an Omron BF511 portable digital scale (Omron Inc. Nagoya, Japan) with the accuracy of 0.1 kg, while standing on the middle of the scale, without assistance and with minimum clothing and height was measured in a standing position using a tape measure on a straight wall to the nearest centimeter. Body mass index (BMI) was calculated as weight (kg)/height squared (m2). Waist circumference (WC) was recorded to the nearest 0.5 cm by using non-stretch tape placed midway between iliac crest and lowest rib while participants were in the standing position. In addition, hip circumference was measured over the largest part of the buttocks, with an accuracy of 0.5 cm.
Data on age, gender, physical activity, education level, smoking status, marital status, and the history of chronic diseases was collected through a similar questionnaire in both cohort studies.
In the Shahedieh cohort study, participants were asked about their usual physical activity levels in the last year and in case they had seasonal jobs60. In the YaHS-TAMYZ cohort study, the short version of the International Physical Activity Questionnaire (IPAQ) was used to measure physical activity level of participants61. Physical activity was expressed as metabolic equivalent hours per week (MET-h/week) for all participants.
Age was classified into five categories (20–30, 30–40, 40–50, 50–60, and ≥ 60 years). Educational level was categorized into four levels (Uneducated, Elementary or guidance school, High school diploma, BSc or higher academic degree). Smoker participants were defined as current smokers, former smokers, and never smokers. Marital status was categorized into three categories (Single, Married, and Widowed or divorced).
Fasting blood glucose (FBG) (mg/dl), triglycerides (TG), low-density lipoprotein-cholesterol (LDL-c), high-density lipoprotein-cholesterol (HDL-c), and total serum cholesterol were measured in the YaHS-TAMYZ cohort study according to the standard laboratory protocol using Pars Azmoon kits and calibrated Ciba Corning (Ciba Corp, Basle, Switzerland) auto-analyzers. In Shahedieh cohort study, blood samples (25 mL) were collected from the participants after an overnight fasting (8–12 h). The blood samples were aliquoted into serum, buffy coat, and whole blood samples. FBG, TG, LDL-c, HDL-c, and total serum cholesterol were determined from the serum samples by an auto-analyzer (Analyzer BT1500) using Pars Azmoon standard kits.
Three complementary data reduction techniques, including PCA, RRR, and PLS, were used to identify DPs out of 33 food groups. In PCA method, the DPs explain as much variation as possible of the food groups. RRR method identifies linear functions of predictors (food groups) that explain as much intermediate responses variation as possible with using a covariance matrix of predictors and responses in calculating the DPs scores. The PLS method combines PCA and RRR methods and calculates DPs scores considering both the predictor and response matrices; therefore, the explained variance of both food groups and intermediate responses is expected to be between the PCA and RRR methods.
The number of DPs initially produced by PCA is constrained by the number of food groups used62; However, we retained just three DPs from PCA for subsequent analysis was according to the scree plot, an eigenvalue (> 1), and the interpretability of the principal DPs63. Varimax rotation was applied to achieve orthogonal DPs and increase the interpretability of principle DPs. Sample adequacy was checked by using the Kaisere Mayere Olkin (KMO) test.
According to previous literature, WC, FBG, TG, LDL-c, HDL-c and total serum cholesterol, were used as the intermediate response variables for PLS and RRR. Response variables were collected at the baseline of both YaHS-TAMYS and Shahedieh studies.
The SAS procedure PLS were used to conduct PLS and RRR analysis, respectively. The number of DPs derived by PLS and RRR is restricted by the number of intermediate response variables used; Therefore, six DPs were specified in each method. For both methods, we calculated the continuous DPs scores (the linear functions of food groups) in the subsequent analyses and interpretations. The first three DPs obtained by PLS and RRR was retained for further analyses because these DPs explained the largest amount of variation among the response variables.
In the YaHS-TAMYZ cohort study, information on death events and T2DM incidence was collected by using data from population-based registries and linked outcome information from the aggregated hospital information system (Samanah Electronici PArvandeh Salamat-SEPAS) which covers 100% of public hospitals and the majority of private hospitals in Yazd province. The data was obtained from SEPAS using the National Identifier number of each participant to link data.
During follow-up time in Shahedieh cohort study, participants received annual phone calls and follow-up questionnaires were completed in terms of the occurrence of death or the incidence of T2DM diagnosis. In case a participant had expired or had been diagnosed with T2DM, investigators followed the phone call with a house or hospital visit to perform a more follow-up and to collect copies of pertinent medical documents for further evaluation and recording. If needed, medical/physical examinations were performed to formulate a T2DM diagnosis. In addition, a verbal autopsy form validated in the Iranian population was completed during the death events. Two trained internists assessed the medical documents to determine the final T2DM diagnosis or cause of death. In case of inconsistency, a third internist conducted a final assessment of the documents to reach a final decision. The same follow-up procedures were followed in the case of self-reported T2DM incidence or death.
Quantitative and qualitative variables were compared between participants who were diagnosed with and without T2DM using independent sample t-test and chi-square tests, respectively. Binomial logistic regression was used to evaluate the association between DPs derived by PCA, PLS, and RRR analyses and risk of T2DM incidence. All analyses were done in crude and three multivariable-adjusted models. The first model was adjusted for age, sex, and energy intake (Model I); Model II was additionally adjusted for education, marital status, smoking status, and physical activity; and in Model III, BMI was additionally controlled.
All statistical analyses were conducted with SAS Version 8.02 (SAS Institute, Cary, NC, USA) and R-4.2.2 (https://cran.r-project.org/bin/windows/base/). P values less than 0.05 were considered as statistically significant.
In this study, PCA, PLS and RRR methods were compared according to the relative factor loading within each DPs and its association with risk of T2DM. Additionally, these methods were evaluated based on the magnitude of variation of each method which explained the food groups and response variables.
YaHS-TAMYZ and Shahedieh cohort studies were approved by the research Council and the ethics committee of Shahid Sadoughi University of Medical Sciences. The present study was approved by the ethics committee of Shahid Sadoughi University of Medical Sciences (Approval Code: IR.SSU.SPH.REC.1399.197). All participants gave an informed consent before entering both studies.
The data of the current study is available from the corresponding author on reasonable request.
International Diabetes Federation. International Diabetes Federation’s Diabetes Epidemiology Guide. https://diabetesatlas.org (2021).
Association, A. D. Diagnosis and classification of diabetes mellitus. Diabetes Care 36(Suppl 1), S67 (2013).
Google Scholar
Ortega, Á. et al. Gene-diet interactions in type 2 diabetes: The chicken and egg debate. Int. J. Mol. Sci. 18(6), 1188 (2017).
Google Scholar
Roden, M. & Shulman, G. I. The integrative biology of type 2 diabetes. Nature 576(7785), 51–60 (2019).
ADS Google Scholar
Lean, M. E. et al. Durability of a primary care-led weight-management intervention for remission of type 2 diabetes: 2-year results of the DiRECT open-label, cluster-randomised trial. Lancet Diabetes Endocrinol. 7(5), 344–355 (2019).
Google Scholar
Bellou, V., Belbasis, L., Tzoulaki, I. & Evangelou, E. Risk factors for type 2 diabetes mellitus: An exposure-wide umbrella review of meta-analyses. PLoS ONE 13(3), e0194127 (2018).
Google Scholar
Wahl, S. et al. Epigenome-wide association study of body mass index, and the adverse outcomes of adiposity. Nature 541(7635), 81–86 (2017).
ADS Google Scholar
Barrès, R. & Zierath, J. R. The role of diet and exercise in the transgenerational epigenetic landscape of T2DM. Nat. Rev. Endocrinol. 12(8), 441–451 (2016).
Google Scholar
Roglic, G. Global Report on Diabetes (World Health Organization, 2016).
Google Scholar
Ley, S. H., Hamdy, O., Mohan, V. & Hu, F. B. Prevention and management of type 2 diabetes: Dietary components and nutritional strategies. The Lancet. 383(9933), 1999–2007 (2014).
Google Scholar
Newby, P. & Tucker, K. L. Empirically derived eating patterns using factor or cluster analysis: A review. Nutr. Rev. 62(5), 177–203 (2004).
Google Scholar
Tapsell, L. C., Neale, E. P., Satija, A. & Hu, F. B. Foods, nutrients, and dietary patterns: Interconnections and implications for dietary guidelines. Adv. Nutr. 7(3), 445–454 (2016).
Google Scholar
Hoffmann, K. et al. Application of a new statistical method to derive dietary patterns in nutritional epidemiology. Am. J. Epidemiol. 159(10), 935–944 (2004).
Google Scholar
Frank, L. K. et al. A dietary pattern derived by reduced rank regression is associated with type 2 diabetes in an urban Ghanaian population. Nutrients 7(7), 5497–5514 (2015).
Google Scholar
Hu, F. B. Dietary pattern analysis: A new direction in nutritional epidemiology. Curr. Opin. Lipidol. 13(1), 3–9 (2002).
Google Scholar
Melaku, Y. A. et al. A comparison of principal component analysis, partial least-squares and reduced-rank regressions in the identification of dietary patterns associated with bone mass in ageing Australians. Eur. J. Nutr. 57(5), 1969–1983 (2018).
Google Scholar
Weikert, C. & Schulze, M. B. Evaluating dietary patterns: The role of reduced rank regression. Curr. Opin. Clin. Nutr. Metab. Care 19(5), 341–346 (2016).
Google Scholar
Batis, C. et al. Using both principal component analysis and reduced rank regression to study dietary patterns and diabetes in Chinese adults. Public Health Nutr. 19(2), 195–203 (2016).
Google Scholar
McNaughton, S. A., Mishra, G. D. & Brunner, E. J. Dietary patterns, insulin resistance, and incidence of type 2 diabetes in the Whitehall II Study. Diabetes Care 31(7), 1343–1348 (2008).
Google Scholar
Duan, M.-J., Dekker, L. H., Carrero, J.-J. & Navis, G. J. Blood lipids-related dietary patterns derived from reduced rank regression are associated with incident type 2 diabetes. Clin. Nutr. 40, 4712 (2021).
Google Scholar
Liese, A. D., Weis, K. E., Schulz, M. & Tooze, J. A. Food intake patterns associated with incident type 2 diabetes: The insulin resistance atherosclerosis study. Diabetes Care 32(2), 263–268 (2009).
Google Scholar
Schulze, M. B. et al. Dietary pattern, inflammation, and incidence of type 2 diabetes in women. Am. J. Clin. Nutr. 82(3), 675–684 (2005).
Google Scholar
Kurniawan, A. L. et al. Comparing two methods for deriving dietary patterns associated with risk of metabolic syndrome among middle-aged and elderly Taiwanese adults with impaired kidney function. BMC Med. Res. Methodol. 20(1), 1–12 (2020).
Google Scholar
Schwingshackl, L. et al. Food groups and risk of type 2 diabetes mellitus: A systematic review and meta-analysis of prospective studies. Eur. J. Epidemiol. 32(5), 363–375 (2017).
Google Scholar
Nguyen, H. D., Oh, H. & Kim, M.-S. Higher intakes of nutrients are linked with a lower risk of cardiovascular diseases, type 2 diabetes mellitus, arthritis, and depression among Korean adults. Nutr. Res. 100, 19–32 (2022).
Google Scholar
Dreher, M. L. Whole fruits and fruit fiber emerging health effects. Nutrients 10(12), 1833 (2018).
Google Scholar
Imamura, F. et al. Consumption of sugar sweetened beverages, artificially sweetened beverages, and fruit juice and incidence of type 2 diabetes: Systematic review, meta-analysis, and estimation of population attributable fraction. BMJ 351, h3576 (2015).
Google Scholar
Xi, B. et al. Intake of fruit juice and incidence of type 2 diabetes: A systematic review and meta-analysis. PLoS ONE 9(3), e93471 (2014).
ADS Google Scholar
Halvorsen, R. E., Elvestad, M., Molin, M. & Aune, D. Fruit and vegetable consumption and the risk of type 2 diabetes: A systematic review and dose–response meta-analysis of prospective studies. BMJ Nutr. Prev. Health 4, 519 (2021).
Google Scholar
Steinberg, G. R. & Schertzer, J. D. AMPK promotes macrophage fatty acid oxidative metabolism to mitigate inflammation: Implications for diabetes and cardiovascular disease. Immunol. Cell Biol. 92(4), 340–345 (2014).
Google Scholar
Basu, A. et al. Dietary blueberry and soluble fiber supplementation reduces risk of gestational diabetes in women with obesity in a randomized controlled trial. J. Nutr. 151(5), 1128–1138 (2021).
Google Scholar
Törrönen, R. et al. Berries modify the postprandial plasma glucose response to sucrose in healthy subjects. Br. J. Nutr. 103(8), 1094–1097 (2010).
Google Scholar
Stull, A. J. et al. Bioactives in blueberries improve insulin sensitivity in obese, insulin-resistant men and women. J. Nutr. 140(10), 1764–1768 (2010).
Google Scholar
Vinayagam, R., Xiao, J. & Xu, B. An insight into anti-diabetic properties of dietary phytochemicals. Phytochem. Rev. 16(3), 535–553 (2017).
Google Scholar
Espley, R. V. et al. Red colouration in apple fruit is due to the activity of the MYB transcription factor, MdMYB10. Plant J. 49(3), 414–427 (2007).
Google Scholar
Zunino, S. J. Type 2 diabetes and glycemic response to grapes or grape products. J. Nutr. 139(9), 1794S-S1800 (2009).
Google Scholar
Nizamutdinova, I. T. et al. The anti-diabetic effect of anthocyanins in streptozotocin-induced diabetic rats through glucose transporter 4 regulation and prevention of insulin resistance and pancreatic apoptosis. Mol. Nutr. Food Res. 53(11), 1419–1429 (2009).
Google Scholar
Coskun, O., Kanter, M., Korkmaz, A. & Oter, S. Quercetin, a flavonoid antioxidant, prevents and protects streptozotocin-induced oxidative stress and β-cell damage in rat pancreas. Pharmacol. Res. 51(2), 117–123 (2005).
Google Scholar
Chiu, T. H., Pan, W.-H., Lin, M.-N. & Lin, C.-L. Vegetarian diet, change in dietary patterns, and diabetes risk: A prospective study. Nutr. Diabetes 8(1), 1–9 (2018).
Google Scholar
Banihani, S. A. Tomato (Solanum lycopersicum L.) and type 2 diabetes. Int. J. Food Proper. 21(1), 99–105 (2018).
Google Scholar
Salehi, B. et al. Beneficial effects and potential risks of tomato consumption for human health: An overview. Nutrition 62, 201–208 (2019).
Google Scholar
Song, B. et al. Lycopene and risk of cardiovascular diseases: A meta-analysis of observational studies. Mol. Nutr. Food Res. 61(9), 1601009 (2017).
Google Scholar
Cheng, H. M. et al. Lycopene and tomato and risk of cardiovascular diseases: A systematic review and meta-analysis of epidemiological evidence. Crit. Rev. Food Sci. Nutr. 59(1), 141–158 (2019).
Google Scholar
Schulze, M., Manson, J., Willett, W. & Hu, F. Processed meat intake and incidence of type 2 diabetes in younger and middle-aged women. Diabetologia 46(11), 1465–1473 (2003).
Google Scholar
Song, Y., Manson, J. E., Buring, J. E. & Liu, S. A prospective study of red meat consumption and type 2 diabetes in middle-aged and elderly women: The women’s health study. Diabetes Care 27(9), 2108–2115 (2004).
Google Scholar
Tian, S. et al. Dietary protein consumption and the risk of type 2 diabetes: A systematic review and meta-analysis of cohort studies. Nutrients 9(9), 982 (2017).
Google Scholar
Maghsoudi, Z., Ghiasvand, R. & Salehi-Abargouei, A. Empirically derived dietary patterns and incident type 2 diabetes mellitus: A systematic review and meta-analysis on prospective observational studies. Public Health Nutr. 19(2), 230–241 (2016).
Google Scholar
Lijinsky, W. N-nitroso compounds in the diet. Mutat. Res./Genet. Toxicol. Environ. Mutagen. 443(1–2), 129–138 (1999).
Google Scholar
Hofmann, S. M. et al. Improved insulin sensitivity is associated with restricted intake of dietary glycoxidation products in the db/db mouse. Diabetes 51(7), 2082–2089 (2002).
Google Scholar
Ito, M., Kondo, Y., Nakatani, A. & Naruse, A. New model of progressive non-insulin-dependent diabetes mellitus in mice induced by streptozotocin. Biol. Pharm. Bull. 22(9), 988–989 (1999).
Google Scholar
Vlassara, H. et al. Inflammatory mediators are induced by dietary glycotoxins, a major risk factor for diabetic angiopathy. Proc. Natl. Acad. Sci. 99(24), 15596–15601 (2002).
ADS Google Scholar
Zaroudi, M. et al. Dietary Patterns are Associated with Risk of Diabetes Type 2: A Population-Based Case-Control Study (2016)
Risérus, U., Willett, W. C. & Hu, F. B. Dietary fats and prevention of type 2 diabetes. Prog. Lipid Res. 48(1), 44–51 (2009).
Google Scholar
Mirmiran, P. et al. Reliability and relative validity of an FFQ for nutrients in the Tehran lipid and glucose study. Public Health Nutr. 13(5), 654–662 (2010).
Google Scholar
Poustchi, H. et al. Prospective epidemiological research studies in Iran (the PERSIAN Cohort Study): Rationale, objectives, and design. Am. J. Epidemiol. 187(4), 647–655 (2018).
Google Scholar
Mirzaei, M., Salehi-Abargouei, A., Mirzaei, M. & Mohsenpour, M. A. Cohort profile: The Yazd Health Study (YaHS): A population-based study of adults aged 20–70 years (study design and baseline population data). Int. J. Epidemiol. 47(3), 697–698 (2018).
Google Scholar
Zimorovat, A. et al. Validity and reproducibility of a semiquantitative multiple-choice food frequency questionnaire in Iranian adults. Food Nutr. Bull. 43(2), 171–188 (2022).
Google Scholar
Ghaffarpour, M., Houshiar-Rad, A. & Kianfar, H. The manual for household measures, cooking yields factors and edible portion of foods. Tehran Nashre Olume Keshavarzy 7(213), 42–58 (1999).
Google Scholar
US Department of Agriculture, Agricultural Research Service, Nutrient Data Laboratory. USDA National Nutrient Database for Standard Reference.
Pate, R. R. et al. Physical activity and public health: A recommendation from the Centers for Disease Control and Prevention and the American College of Sports Medicine. JAMA 273(5), 402–407 (1995).
Google Scholar
Moghaddam, M. B. et al. The Iranian version of international physical activity questionnaire (IPAQ) in Iran: Content and construct validity, factor structure, internal consistency and stability. World Appl. Sci. J. 18(8), 1073–1080 (2012).
Google Scholar
De Coster, J. Overview of Factor Analysis. www.stat-help.com/factor.pdf (Accessed 2 March 2011) (1998).
Fransen, H. P. et al. A posteriori dietary patterns: How many patterns to retain? J. Nutr. 144(8), 1274–1282 (2014).
Google Scholar
Download references
The authors would like to thank those participated in the YaHS-TAMYZ and Shahedieh cohort studies, and authorities of Shahid Sadoughi University of Medical Sciences for their cooperation.
The present study was funded by Shahid Sadoughi University of Medical Sciences.
Nutrition and Food Security Research Center, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
Sara Beigrezaei, Sayyed Saeid Khayyatzadeh & Amin Salehi-Abargouei
Department of Nutrition, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
Sara Beigrezaei, Sayyed Saeid Khayyatzadeh & Amin Salehi-Abargouei
Departments of Biostatistics and Epidemiology, School of Public Health, Center for Healthcare Data Modeling, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
Sara Jambarsang
Yazd Cardiovascular Research Center, Non-Communicable Disease Research Institute, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
Masoud Mirzaei & Amin Salehi-Abargouei
Industrial Diseases Research Center, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
Amir Houshang Mehrparvar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
A.S.A. and S.B. conceived and designed the study. S.J. and S.B. and A.S.A. conducted and interpreted the statistical analyses. S.B. wrote the first draft of the manuscript. A.S.A., S.J., S.K., M.M., and A.H.M. critically reviewed the manuscript. All authors read and approved the final version of the manuscript.
Correspondence to Amin Salehi-Abargouei.
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Reprints and Permissions
Beigrezaei, S., Jambarsang, S., Khayyatzadeh, S.S. et al. The association between dietary patterns derived by three statistical methods and type 2 diabetes risk: YaHS-TAMYZ and Shahedieh cohort studies. Sci Rep 13, 410 (2023). https://doi.org/10.1038/s41598-023-27645-w
Download citation
Received: 09 October 2022
Accepted: 05 January 2023
Published: 09 January 2023
DOI: https://doi.org/10.1038/s41598-023-27645-w
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.
Advertisement
© 2023 Springer Nature Limited
Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.