A Proposed Genetic Risk Score for Dilated Cardiomyopathy Susceptibility in the Chinese Han Population
Dilated cardiomyopathy (DCM) is one of the most common types of cardiomyopathies. The current study constructs a proposed genetic risk score and assesses its potential as a predictive tool for individual DCM susceptibility. A total of 1,000 patients with idiopathic DCM and 828 control participants from the study institution were enrolled. The top 25 up-regulated and down-regulated genes from tandem mass spectrometry analysis of 6 hearts with DCM and 7 control hearts (from a study by Chen et al) were selected for logistic analysis using whole-exome sequencing data. Statistically significant variants were prepared for genetic risk score construction. The predictive power of the genetic risk score and a composite variable for DCM risk were evaluated using receiver operating characteristic curve analysis. A total of 5 variants associated with DCM susceptibility were identified to develop the genetic risk score. A score of at least 6.4 was more strongly associated with increased risk of DCM (odds ratio, 2.4; P < .001) than scores lower than 6.4. Statistical significance remained evident in multivariate analysis after adjusting for traditional risk factors, including age, sex, hypertension, diabetes, and smoking status (odds ratio, 2.54; P < .001). Individuals with a score of at least 6.4 exhibited a decrease in left ventricular ejection fraction and an increase in left ventricular end-diastolic diameter compared with individuals with a score lower than 6.4 (P < .001). Stratification by age, sex, history of hypertension, diabetes, and smoking status did not substantially affect the association between genetic risk score and the risk of DCM. The discriminant power of the genetic risk score is excellent, with a C statistic of 0.72. The genetic risk score, which consists of 5 variants, could effectively identify individuals at high risk of DCM in the study population and aid in the implementation of early prevention strategies in clinical practice.Abstract
Background:
Methods:
Results:
Conclusion:
Key Points
The authors of this study developed a proposed genetic risk score consisting of 5 variants for the risk of DCM.
A genetic risk score of at least 6.4 was strongly associated with increased risk of DCM (OR, 2.40 [95% CI, 1.99-2.90], P < .001) compared with a score lower than 6.4.
The composite variable, which combines the genetic risk score with age, sex, history of hypertension, diabetes, and smoking status, could substantially improve the score's predictive ability for DCM risk.
Abbreviations
- DCM
dilated cardiomyopathy
- LVEDD
left ventricular end-diastolic diameter
- LVEF
left ventricular ejection fraction
- OR
odds ratio
- ROC
receiver operating characteristic
- WES
whole-exome sequencing
Introduction
Dilated cardiomyopathy (DCM) is a condition characterized by left ventricular dilation and impaired contractile function that could not be exclusively attributed to coronary arterial injury or abnormal loading condition, with a prevalence of up to 1 in 2,500 in the general population.1 As the third-most common cause of heart failure, DCM is associated with a poor overall prognosis, with a high risk of mortality.2 Dilated cardiomyopathy can be attributed to genetic and nongenetic causes.3 When it has a genetic cause, DCM is characterized as a genetically heterogeneous disease with a complex etiology.4 Previous reports have suggested that inflammation, autoimmune diseases, toxic and metabolic damage, and heredity might participate in the process of DCM.5-7
Numerous genetic studies on DCM have been conducted. These studies have identified more than 100 genes that are associated with the DCM phenotype, most of which encode proteins related to the sarcomere, cytoskeleton, sarcolemma, nucleus, and nuclear lamina.8 Many common genetic variants in DCM-related genes have been found to be associated with susceptibility to and the prognosis of DCM.4,8,9 For instance, Shen et al4 found that the rs237028 variant in the TAB2 gene was associated with DCM susceptibility and prognosis in the Chinese population. In a cohort study involving 341 patients with DCM and 381 control patients, the variants rs17083838 and rs7992670 in the CDK8 gene were shown to be related to the risk of and prognosis for DCM.9 In addition, the rs243865-C allele in the MMP2 gene was found to be correlated with an increased susceptibility to and mortality risk from DCM.8 These studies, however, primarily focused on individual variants, most of which had modest or small effects on predicting susceptibility to DCM. It is therefore crucial to develop a DCM-related genetic risk score and assess the cumulative effects of multiple genetic loci on susceptibility to DCM.
The cytoskeleton plays a crucial role in maintaining cell structure and integrity.10 A substantial body of evidence has demonstrated the major involvement of cytoskeletal and cytoskeletal-related proteins in the pathogenesis of DCM.11-14 In a study conducted by Chen et al,15 mass spectrometry was used to analyze 34 human hearts, revealing that the up-regulation and stabilization of the cytoskeleton were the prominent features of end-stage heart failure in humans. This study compared the proteomic profiles of DCM and normal hearts; importantly, it showed that the top 25 up-regulated and down-regulated proteins were associated with the cytoskeleton.15
Given the importance of cytoskeletal and cytoskeletal-related proteins in DCM, the current study aimed to construct a genetic risk score for susceptibility to DCM by using the differentially expressed genes from Chen et al15 and the current study's whole-exome sequencing (WES) data.
Patients and Methods
Study Participants
This study enrolled 1,000 patients with idiopathic DCM and 828 control participants from the Cardiology Division of Tongji Hospital in Wuhan, China, between March 2013 and June 2020. These patients were diagnosed with DCM according to the modified version of standardized diagnostic criteria for DCM.16 Individuals with a history of cardiac valve disease, coronary heart disease, hypertension, tachyarrhythmia, congenital heart disease, pericardial disease, acute viral myocarditis, heavy alcohol intake, skeletal myopathies, systemic diseases of a putative autoimmune origin, diabetes, and nutrition disorders were excluded from the study. Control participants had no history of cardiac disease or cardiac dysfunction. Echocardiography was performed on all participants to assess their heart function. This study adhered to the principles outlined in the Declaration of Helsinki and was approved by the Review Board of Tongji Hospital. Informed consent was obtained from all patients.
Whole-Exome Sequencing
Genomic DNA was extracted from the peripheral blood leukocytes using the FastPure Cell/Tissue DNA Isolation Mini Kit (Vazyme International LLC) and the commercially available TIANamp Genomic DNA Kit (Tiangen Biotech [Beijing] Co, Ltd). The experimental workflow, sample preparation, and sequencing were performed according to the protocol. All genomic DNA samples were of high quality and assessed through spectrophotometric and electrophoretic analyses. Initially, genomic DNA was fragmented into 300 base pair fragments using an ultrasonicator (Covaris, LLC). Subsequently, the SureSelectXT Human All Exon V6 kit (Agilent Technologies) was employed to prepare the library, capture the target regions, repair and purify fragment ends, and ligate the adapters. These fragments were then amplified using Herculase II Fusion DNA Polymerases (Agilent Technologies). Paired-end sequencing with a read length of 300 base pairs was conducted on the amplicons using an Illumina HiSeq X Ten Sequencing System (Illumina, Inc), following the standard Illumina protocol.
Gene Selection
Chen et al15 conducted tandem mass spectrometry analysis of 6 hearts with DCM and 7 control hearts. The related data are available in the ProteomeXchange Consortium through the PRIDE partner repository with the dataset identifier PXD008934. In this study, the top 25 up-regulated and down-regulated genes between DCM and controls were found to be associated with the cytoskeleton and were therefore selected for further analysis.
Genetic Risk Score
Common genetic variants with minor allele frequency greater than 0.05 in the selected genes were extracted from the current study's WES data. Logistic regression analysis was conducted to assess the effect of these common variants on DCM risk. Variants in strong linkage disequilibrium with each other (r2 > 0.9) were analyzed using the WES data, and 1 single variant was chosen as the tagged variant for the construction of the genetic risk score. Genotypes with higher susceptibility to DCM were assigned a weighted score of 1 multiplied by the odds ratio (OR), while the rest were given a weighted score of 1. The weighted scores from the selected variants were then summed for each patient and employed to predict their individual risk of developing DCM.
Statistical Analysis
Data analyses were conducted with SPSS, version 26.0, statistical software (IBM Corp). Distribution of normality was tested using the Shapiro-Wilk test. Abnormally distributed quantitative data were presented as median (IQR), normally distributed quantitative data were presented as mean (SD), and categorical variables were presented as numbers (percentages). Differences between groups were assessed using appropriate statistical tests, including the χ2 test, independent-sample t test, or the Mann-Whitney U test. Linkage disequilibrium was calculated using Haploview, version 4.1 (Broad Institute). Logistic regression was used to assess the association of common variants and genetic risk score with the susceptibility to DCM. Adjusted variables included age, sex, hypertension, diabetes, and smoking status. All comparisons were 2 sided, and a significance level of P < .05 was considered statistically significant.
Results
Baseline Characteristics of the DCM and Control Groups
The baseline characteristics of patients with DCM and control participants are presented in Table I. There were no statistically significant differences in age and sex between the patients with DCM and control participants. Patients with DCM had statistically lower systolic blood pressure, diastolic blood pressure, left ventricular ejection fraction (LVEF), total cholesterol, triglycerides, high-density lipoprotein cholesterol, and low-density lipoprotein cholesterol than did the control participants (P < .05). Conversely, the level of left ventricular end-diastolic diameter (LVEDD) and the proportion of individuals with hypertension, diabetes, β-blocker use, and smoking were much higher in the DCM group than in the control group (P < .05).

Associations of Variants in Selected Genes With Susceptibility to DCM
Based on the results of Chen et al,15 the top 25 up-regulated and down-regulated genes between patients with DCM and control participants are displayed in Supplemental Table I. Analysis of WES data revealed a total of 1 and 7 common variants within the selected genes associated with DCM risk in dominant and recessive models, respectively (Supplemental Tables II and III). Because rs10160013, rs3740002, and rs3740003 exhibited strong linkage disequilibrium with each other (r2 > 0.9), rs10160013 was selected as the representative variant for further genetic risk score development. Although rs1799852 showed statistical significance in both the dominant and the recessive models, it was regarded as a dominant model because it had a smaller P value and a higher OR. Finally, 1 variant in the dominant model and 4 variants in the recessive model were identified for the development of the genetic risk score.
Genetic Risk Score Construction
To evaluate the cumulative effects of the 5 variants on a patient's susceptibility to DCM, a genetic risk score for each individual was calculated (Supplemental Table IV). The score ranged from 5.0 to 8.2, with a mean value of 6.6. According to the maximum Youden Index criterion and receiver operating characteristic (ROC) curve analysis, the optimal cutoff value for the genetic risk score to stratify patients into 2 groups was determined to be 6.4 (ie, <6.4 and ≥6.4). As shown in Table II, individuals with a score of at least 6.4 displayed elevated LVEDD and a higher proportion of diabetes, β-blocker use, and DCM compared with individuals with a score below 6.4. Conversely, the group with scores of 6.4 or higher exhibited substantially lower systolic blood pressure, LVEF, total cholesterol, and high-density lipoprotein cholesterol (P < .05). Logistic regression analysis further revealed a strong association between higher genetic risk score and an increased risk of DCM (OR, 2.40 [95% CI, 1.99-2.90]; P < .001) (Table III). This statistical significance remained evident in multivariate analysis after adjusting for traditional risk factors, including age, sex, hypertension, diabetes, and smoking status (OR, 2.54 [95% CI, 2.07-3.11]; P < .001).


The study population was subsequently randomly assigned to a training group (914 individuals) or a validation group (914 individuals). Logistic regression analysis demonstrated that the genetic risk score was strongly associated with the risk of DCM in both the training group (OR, 2.15 [95% CI, 1.65-2.81]; P < .001) and the validation group (OR, 2.68 [95% CI, 2.04-3.50]; P < .001) (Table IV).

In addition, a subgroup analysis was conducted to further evaluate the associations between the genetic risk score and susceptibility to DCM within different clinically relevant subgroups. Figure 1 illustrates that stratification by age, sex, history of hypertension, diabetes, and smoking did not substantially affect the association between a patient's genetic risk score and risk of DCM.


Citation: Texas Heart Institute Journal 52, 1; 10.14503/THIJ-24-8525
Association Between Genetic Risk Score and Echocardiographic Parameters
Echocardiographic parameters were compared between individuals in different genetic risk score groups. As shown in Figure 2, individuals in the group with scores of 6.4 or higher exhibited a decrease in LVEF and an increase in LVEDD compared with individuals in the group with scores below 6.4 (P < .001).


Citation: Texas Heart Institute Journal 52, 1; 10.14503/THIJ-24-8525
ROC Curve Analysis
Receiver operating characteristic curve analysis was conducted to evaluate and compare the predictive abilities of the genetic risk score and the composite variable (score combined with age, sex, history of hypertension, diabetes, and smoking status) for DCM risk. As shown in Figure 3 and Table V, the average area under the ROC curve for the genetic risk score and for the composite variable were 0.72 (95% CI, 0.70-0.74) and 0.83 (95% CI, 0.81-0.85), respectively. The composite variable exhibited superior predictive power compared with the genetic risk score (P < .001). The predictive efficacy of the genetic risk score and the composite variable for DCM were also assessed within the training and validation groups. As illustrated in Table VI, the average area under the ROC curve for the genetic risk score surpassed 0.7 in both groups. The ROC curve for the composite variable exceeded 0.8, as well, indicating a statistically significantly superior predictive capability for DCM compared with the genetic risk score (P < .001).


Citation: Texas Heart Institute Journal 52, 1; 10.14503/THIJ-24-8525


Discussion
The present study continuously recruited 1,000 patients with DCM and 828 control participants. Association analyses were performed to investigate the relationship between the genetic risk score, which consists of 5 variants, and an individual's susceptibility to DCM. Study findings demonstrated that higher genetic risk scores were associated with increased susceptibility to DCM. Individuals with a score of at least 6.4 further exhibited a decrease in LVEF and an increase in LVEDD compared with individuals whose score was less than 6.4. Stratification by age, sex, history of hypertension, diabetes, and smoking status had no statistically significant impact on the association between genetic risk score and an individual's risk of DCM. In addition, the composite variable, which combines the genetic risk score with age, sex, history of hypertension, diabetes, and smoking status, could substantially improve the predictive ability for DCM risk.
Dilated cardiomyopathy is a common cause of heart failure and associated with high mortality.2,3 In recent years, several genetic variants linked to DCM susceptibility have been identified. Most of these risk loci, however, have only modest or small effects on predicting DCM susceptibility, with ORs for DCM risk not exceeding 2.1,4,17,18 For instance, Shen et al4 reported an association between rs237028 and DCM susceptibility. Its effect is limited, however, with the OR ranging from 1.42 to 1.73 in different models. In a study involving 488 patients with DCM and 924 control participants, the OR of the risk locus for DCM was only 1.42.18 A study by Zhang et al17 found that LGALS3 gene polymorphisms may be associated with susceptibility to DCM in a Northern Han Chinese population. Their study, however, included only 279 patients with unrelated clinically diagnosed DCM and 363 apparently control participants, which limits the generalization of their results to other populations. The current study's objective was to therefore comprehensively construct a genetic risk score for assessing an individual's risk of DCM.
First, this study was based on the data from tandem mass spectrometry analysis of 6 hearts with DCM and 7 control hearts as well as on the WES of 1,000 patients with DCM, allowing for a comprehensive assessment of variants associated with the risk for DCM.
Second, this study included a larger population with DCM than did previous investigations.9,19,20 For example, Shah et al20 included only 352 patients with DCM and 352 control participants. Another case-control study enrolled 369 control participants and 373 patients with DCM for analysis.19 The larger sample size in the current study may enhance the reliability of its conclusions and facilitate their generalization to other populations.
Third, the current study's genetic risk score was constructed with a total of 5 variants, which represented the first genetic risk score study of susceptibility to DCM. The genetic risk score in the current study demonstrated excellent risk discrimination, with a C statistic of 0.72. Individuals with a score of at least 6.4 exhibited a 2.4-fold higher risk of DCM compared with individuals with a score below 6.4. The prediction ability of the current study's genetic risk score was independent of traditional risk factors. In addition, a higher score was correlated with reduced LVEF and increased LVEDD, underscoring its critical role in predicting DCM. Notably, the composite variable, combining the genetic risk score with other variables, improved the score's discrimination ability, with a C statistic of up to 0.83. This risk stratification for DCM susceptibility can aid in identifying individuals at high risk of DCM and facilitate early prevention strategies.
Limitations
The current study has several limitations. First, it was a single-center study with 1 cohort, which limits the generalizability of its findings. Although the results were statistically significant, further validation of the genetic risk score through larger, multicenter studies is warranted. Second, the current study was of the Chinese population, which may limit the generalizability of the results to individuals from other racial backgrounds. Verification of the results in diverse populations or among different ancestries is essential. Third, the small sample size used for candidate gene selection may have influenced the accuracy of estimating the impact of individual variants on the risk of DCM. Additional validation, however, can be achieved by filtering notable variants located within or near these genes. Fourth, the study did not incorporate family history indices into the traditional risk assessment. Information about the disease and its direct correlations is often ambiguous, potentially due to variations in the quality of medical care provided at different historical time points. Finally, the study did not take into account multiple testing to ensure sufficient variants for genetic risk score construction.
Conclusion
The proposed genetic risk score consisting of 5 variants exhibited a strong association with susceptibility to DCM within the study population. This innovative risk stratification tool has the potential to effectively identify individuals at high risk of DCM and aid in the implementation of early prevention strategies in clinical practice.

A forest plot for subgroup analysis indicates that stratification by age, sex, history of hypertension, diabetes, and smoking status did not strongly affect the association between an individual's genetic risk score and risk of DCM.
DCM, dilated cardiomyopathy; OR, odds ratio.

Comparison of echocardiographic parameters indicates that individuals in the group with genetic risk scores of 6.4 or higher exhibited (A) a decrease in LVEF and (B) an increase in LVEDD compared with individuals with scores below 6.4 (P < .001).
LVEDD, left ventricular end-diastolic diameter; LVEF, left ventricular ejection fraction.
P < .05 was considered statistically significant.

Receiver operating characteristic curves for dilated cardiomyopathy risk indicate that the composite variable (genetic risk score combined with age, sex, history of hypertension, diabetes, and smoking status) exhibit superior predictive power compared with genetic risk score.
Contributor Notes