Introduction

Transcatheter mitral valve repair (TMVR) with MitraClip is a recommended treatment for selected patients with primary and secondary mitral regurgitation (MR) and high or prohibitive surgical risk [1]. TMVR has shown an excellent safety profile and high technical success [2, 3]. However, outcome after TMVR is impaired due to severe cardiac and non-cardiac morbidities [4]. Risk stratification in these patients is important for informed decision making of physicians, patients and relatives. This might for instance refer to decisions on a conservative or palliative strategy or advanced heart failure interventions such as assist devices in patients with a futile risk profile.

So far, reliable risk stratification in patients undergoing TMVR is lacking. Scores derived from surgical patients predicting peri-operative mortality like logistic Euroscore, Euroscore II and Society of Thoracic Surgeons score, show weak stratification in TMVR patients [5]. Other risk models proposed exclusively for TMVR patients were derived from small patient cohorts [6]. Although several studies have reported individual risk factors for adverse outcome after TMVR, many of these are single-center studies [7, 8], used variables not commonly available in routine [9], or predictive variables were inconsistent across studies [10, 11].

The majority of patients undergoing TMVR have secondary MR with underlying left ventricular (LV) dysfunction and heart failure (HF). Furthermore, also patients undergoing TMVR for primary MR often have a history of symptomatic HF and impaired LV function [12]. We hypothesized that a risk score derived from HF patients might be associated with outcome in patients undergoing TMVR. Aim of this study was to examine the association of the “Get-with-the-Guidelines Heart Failure (GWTG-HF) risk score” with all-cause mortality and validate the score in a large multi-center cohort of patients undergoing TMVR. The GWTG-HF score is so far only validated for prediction of mortality in hospitalized and pre-discharge patients with HF [13, 14].

Methods

Data of 1010 patients who underwent TMVR with the MitraClip system in the Heart Failure Network Rhineland (University hospitals of Bonn, Cologne and Duesseldorf) between 2010 and 2018 were retrospectively analyzed for parameters of the GWTG-HF score. The study received the proper ethical oversight and the data collection has been previously approved by the ethics committee of the University of Bonn, Cologne and Duesseldorf, respectively. The three tertiary care hospitals are high-volume referral centers for valvular heart disease providing the whole spectrum of surgical and catheter based mitral valve therapies. Briefly, all patients were discussed by a Heart Team, including at least one interventional cardiologist, one non-interventional cardiologist and one cardiac surgeon, and a decision on interventional treatment approach was made based on surgical risk, MR etiology, morphological suitability and other relevant patient characteristics. Only patients with deployment of at least one MitraClip device, complete follow-up data on vital status and complete data of the GWTG-HF score were enrolled in this analysis (Fig. 1). Baseline characteristics were assessed before the TMVR procedure and collected from records. Importantly, the Heart Failure Network Rhineland is independent from industry.

Fig. 1
figure 1

Study flow chart

GWTG-HF risk score

The GWTG-HF score [13] is a mortality predictive model for patients hospitalized with HF that has been validated independently [15]. It uses commonly available clinical variables to predict in-hospital mortality as a tool for mortality risk stratification that is applicable to a broad spectrum of HF patients, including those with preserved LV systolic function [16]. The GWTG-HF score can easily be calculated from systolic blood pressure, blood urea nitrogen, blood sodium, age, heart rate, race, and history of chronic obstructive pulmonary disease using an online calculator (www.mdcalc.com/gwtg-heart-failure-risk-score). Scores range from 0 to 100, with scores 0–33 having < 1% probability of death to scores over 79 having > 50% probability of death, with a continuous increase of death risk associated with increases in score.

Endpoint

The clinical course was monitored by institutional follow-up examinations, phone calls with the referring cardiologists and the patients’ primary physicians or the patients themselves. The endpoint was all-cause mortality, given the fact that GTWTG-HF score has been proposed as a mortality prediction model.

Statistical analysis

The GWTG-HF score showed a continuous risk increase with increasing score values in previous studies, without evidence for a threshold effect. Hence, we used quartiles of the GWTG-HF score to examine the shape of association with mortality after TMVR and baseline characteristics. Patient characteristics were compared between quartiles using the chi-square test (or Fisher’s exact test if the expected count was less than five per cell) for categorical variables. All continuous variables were distributed non-normally according to the Kolmogorov–Smirnov test and were examined by Kruskal–Wallis test for analysis of differences in distribution. Percentages were reported to describe categorical variables and median (interquartile range) was reported for continuous variables. The Kaplan–Meier method was used for presenting the event-free survival. The observation time on the Kaplan–Meier plot was truncated at 75% of the total follow-up time. Log-rank test was used to examine differences across groups. Subgroup analysis was performed by etiology of MR (primary or secondary) and LV-EF (≥ 50% or < 50%). Due to low numbers in patients with primary MR, tertiles of the GWTG-HF score were used instead of quartiles. Cox proportional hazard models were used to assess the contribution of the GWTG-HF score, individual components of the GWTG-HF score and other risk factors for mortality. The proportional hazards assumption was tested on the basis of Schoenfeld residuals after fitting a Cox model. Hazard ratio and 95% confidence interval are presented. To assess the continuous association between GWTG-HF score and mortality, we generated restricted cubic splines with knots at the 5th, 25th, 75th and 95th percentiles of the GWTG-HF score (33, 37, 46 and 55 points), centred at the median (42 points), and the plot was truncated at the 2.5th and 97.5th percentiles. Score discrimination was assessed using Harrell’s c-statistic. The calibration of the score’s performance was analyzed using a goodness-of-fit test proposed by Gronnesby and Borgan for Cox proportional hazard models and using Arjas like plots comparing counts of observed and expected events at each event time. For all analyses, a p value of < 0.05 was considered to be statistically significant. Analyses were performed using SPSS Statistics 25 (IBM, Armonk, NY, USA) and Stata/SE 12.1 (StataCorp LP, College Station, TX, USA).

Results

Baseline characteristics

A total of 815 patients who underwent TMVR with MitraClip were included into the analysis. Median age was 78 (73–83) years, 43% were female. Median NTproBNP was 2529 (1392–5499) ng/l and left ventricular ejection fraction (LV-EF) was severely reduced (< 30%) in 185 (22.8%) patients. Median estimated glomerular filtration rate was 48 (35–64) ml/min/1.73 m2. Secondary MR was present in 509 (62.5%) patients. The median Euroscore was 19.2 (10.4–31.3). The study population differed significantly from the 195 patients excluded from the analysis. The study population had more women, more secondary etiology of MR, previous strokes, diabetes mellitus, previous bypass surgery, higher Euroscore levels, and lower rates of transitory ischemic attacks and hypertension compared to the patients excluded from the analysis (Supplementary Table 1).

Table 1 Baseline characteristics according to GWTG-HF score quartiles

GWTG-HF score in patients undergoing TMVR

The median GWTG-HF score was 42 (37–46). Baseline characteristics according to quartiles of the GWTG-HF score (score range of quartiles: 0–37, 38–42, 43–46 and 47 +) are summarized in Table 1. All individual score components were significantly different across quartiles. NYHA functional class significantly increased, as did NTproBNP, the Euroscore, and rate of peripheral artery disease, previous decompensation and mineralocorticoid-receptor antagonist use, whereas renal function significantly decreased with higher quartiles of the score. LV-EF groups did not significantly differ across the GWTG-HF score quartiles. There was no difference in the frequency of MR etiology between the four groups.

GWTG-HF score and clinical outcome

The median follow-up of the study population was 365 (365–546) days. During the follow-up period, 177 (21.7%) patients died. In-hospital mortality across quartiles was observed in 3 (1%), 6 (3%), 14 (7.5%) and 12 (6%) patients, respectively (p = 0.007). Mortality during the first year after the procedure was observed in 12 (6%), 24 (11%), 42 (23%) and 60 (30%) patients, respectively (p < 0.001, Fig. 2).

Fig. 2
figure 2

In-hospital and 1-year mortality according to GWTG-HF score quartiles

Kaplan–Meier analysis and log-rank test showed a significantly reduced survival of patients by increasing quartiles of the GWTG-HF score (p < 0.001, Fig. 3). Compared to patients of the bottom score quartile, patients of the second, third and fourth quartile had a hazard ratio of mortality of 1.78 (95% CI 1.02–3.12%), 3.17 (95% CI 1.86–5.41) and 4.68 (95% CI 2.83–7.73), respectively. Every increase in one GWTG-HF score point was associated with a 9% higher hazard (95% CI 6–11%, p < 0.001) of mortality. Restricted cubic spine analysis showed a continuous increase of mortality risk across the range of the GWTG-HF score from 33 to 57 points (Fig. 4). Risk discrimination of the GWTG-HF score for all-cause mortality was moderate with a c-statistic of 0.68. The calibration of the score was examined by comparing observed vs. predicted numbers of death events by quartiles of the score. As shown in Supplementary Table 2 and Supplementary Fig. 1, there was excellent concordance between observed and model-predicted mortality rates across quartiles of the GWTG-HF score as determined by the Score test for goodness of fit (p = 0.46).

Fig. 3
figure 3

Kaplan–Meier plot for mortality by GWTG-HF score quartiles

Fig. 4
figure 4

Frequency distribution of the GWTG-HF score (left y axis, grey bars) and associated hazard ratio of mortality using cubic spline analysis (right y axis, black line, with 95% confidence interval dotted lines)

The association of the GWTG-HF score with mortality was attenuated but remained significant when adjusting for other measures of heart failure severity such as NYHA class, left-ventricular ejection fraction, NT-pro-BNP and previous hospitalization of heart failure (HR 1.06, 95% CI 1.03–1.09, p < 0.001). When analyzing the prognostic impact of individual GWTG-HF score components, blood urea nitrogen, systolic blood pressure and heart rate showed the strongest association with mortality (Table 2).

Table 2 Analysis of individual GWTG-HF score components

Subgroup analysis stratified by MR etiology and EF impairment revealed a similar association between all-cause mortality and GWTG-HF score as in the total population (Supplementary Figs. 2–5).

When comparing the GWTG-HF score with the Euroscore, the discriminatory performance of the GWTG-HF score for mortality was higher according to ROC analysis (Supplementary Fig. 6). The area under the ROC curve for 30-day mortality was 0.65 in GWTG-HF vs. 0.62 for Euroscore, with a p-value of 0.65 for comparison. The area under the ROC curve for 1-year mortality was 0.71 in GWTG-HF vs. 0.57, with a p-value of < 0.001 for comparison. After performing a bivariate analysis of the two score systems for mortality, the association of GWTG-HF with mortality was virtually the same, resulting in an 8% higher hazard (95% CI 6–10%, p < 0.001) with every increase in one GWTG-HF score point (Supplementary Table 3).

Discussion

In the present study we investigated the performance of the GWTG-HF score for mortality prediction in patients undergoing TMVR with the MitraClip system. First, the GWTG-HF score was significantly associated with all-cause mortality, with a continuously increasing risk of mortality across the range of the score from 33 to 57 points and a 9% risk increase for every increase in one score point. The concordance between score predicted and observed mortality rates was excellent. Second, the association of the GWTG-HF score with mortality was independent of other HF markers such as NYHA class and NTproBNP. Third, the association of the GWTG-HF score with mortality was consistent in patients with primary and secondary MR and in patients with preserved and reduced ejection fraction.

The GWTG-HF score has initially been derived and validated in hospitalized patients with HF to predict in-hospital mortality [13, 15]. In recent studies risk prediction was not only demonstrated for in-hospital mortality but also long-term prognosis in HF patients [14, 17]. Although many patients undergoing TMVR have HF with LV dysfunction and thus might be represented by the derivation and validation cohorts of the GWTG-HF score, it was not clear whether the risk association of the score remains valid after TMVR, which has the potential to improve the disease course and attenuate mortality in these patients. Thus, we extend existing data on the risk prediction of the GWTG-HF score to HF patients undergoing TMVR, and also to patients with primary MR undergoing TMVR. Importantly, almost two thirds of our patients with primary MR had preserved LV ejection fraction.

The components comprising the GWTG-HF score showed a significant association with mortality risk, except for chronic obstructive pulmonary disease. Association with mortality in the setting of TMVR has been described before regarding renal function [18], resting heart rate [19] and higher age [20]. To our best knowledge, blood pressure and baseline sodium have not been evaluated as a risk factor in TMVR patient so far. However, in patients with HF and reduced ejection fraction both low systolic blood pressure and hyponatremia are well-established risk factors which makes our findings plausible [21].

Markers of heart failure severity such as NTproBNP, NYHA class and LV-EF are associated with mortality in TMVR [4, 11] underlining the relevance of HF as an underlying disease in most TMVR patients. Nonetheless, the GWTG-HF score showed a prognostic impact independently to these clinically important heart failure markers which is an important finding of this study and supports the additive prognostic value of the GWTG-HF score.

Major advantages of the GWTG-HF score with respect to other risk scores used in HF patients [22, 23] are the familiarity for most cardiologists, the ease of calculation using an online tool, the parsimony of variables included, and the availability of these variables in literally every patient admitted for TMVR. These are crucial reasons for a broad use of the score in clinical routine [17, 24]. The GWTG-HF score permits a continuous characterization of individual mortality risk. Still, for clinical decision-making cut-offs are usually inevitable. Such cut-offs guiding treatment decisions need to be defined in further studies. For instance, patients in the top decile of the GWTG-HF score (52 points or more) had a 1-year mortality of 50% and patients with primary MR even 59%. For these patients TMVR might be regarded futile. Of note, exclusion of patients with very advanced HF is one point under discussion to explain the divergent results of the trials COAPT and MITRA-FR in patients with secondary MR undergoing TMVR. The criterion used in COAPT to classify and exclude advanced HF was “stage D” according to the American Heart Association, which is not well defined. A score-based risk prediction might be a more objective measure to define patients without benefit from TMVR in the future.

Strengths and limitations

A strength of this study is the large sample size and the multi-center approach which increases generalizability of our findings. However, we had only one black patient so that no conclusion can be drawn on the prognostic role of race in TMVR. Furthermore, only about 80% of patients treated at our centers with MitraClip were included in the study due to missing covariates, with evidence for the inclusion of more diseased patients. When compared to other large MitraClip registries our population showed very similar frequency of comorbidities and 1-year mortality, indicating yet good representativeness [11, 12, 25]. We had only all-cause mortality available as outcome measure. Non-fatal events such as hospital admission for HF are also important from the patient`s and physician’s perspective and might be associated with the GWTG-HF score. Indeed, in 309 patients, where data on hospitalization for HF were available, a significant 6% hazard increase was observed for each increase of a GWTG-HF score point (data not shown). The risk association with mortality is only valid for the GWTG-HF score range of 33–57 points, which was represented in our study population. This might seem a narrow range given the total score range from 0 to 100 points. However, even in classical heart failure populations up to 80% of patients have score values within the range of 33–57 points [17, 24], indicating that the GWTG-HF score will provide valid estimates for the majority of TMVR patients.

Conclusion

The GWTG-HF score showed a strong association with short- and long-term mortality in patients undergoing TMVR, with good concordance between observed and expected mortality rates and additive information beyond traditional heart failure markers. Given the routinely available variables included in this score and the ease of calculation, a broad application is possible and might improve risk stratification in these patients.

Clinical perspective

What is known?

The GWTG-HF score has been developed to predict mortality in patients hospitalized due to heart failure. It has been validated independently for in-hospital mortality and long-term prognosis.

What is new?

In this study we stratified 815 patients who underwent TMVR with MitraClip according to their GWTG-HF score into quartiles and evaluated the association between GWTG-HF score and clinical outcomes. The score was associated with a continuously increasing risk of mortality across the range of the score from 33 to 57 points and a good concordance between observed and expected mortality rates.

What is next?

The GWTG-HF score can be used for routine risk prediction in patients undergoing TMVR with MitraClip.