Introduction

Atrial fibrillation after cardiac surgery (AFACS) is frequent, occurring in 30–50% of cases [1]. Development of AFACS is associated with a number of perioperative pathophysiological and clinical factors [2]. Besides leading to longer hospital stays, AFACS has a prognostic impact, and has been associated with a higher risk of stroke, thromboembolism, and heart failure [3,4,5].

Interventions for preventing AFACS have shown to decrease AF incidence, length of hospital stay, and stroke [6]. Unfortunately, these treatments are not devoid of side effects. The Society of Cardiovascular Anesthesiologists and the European Association of Cardiothoracic Anaesthetists recommend different perioperative management depending on AFACS risk [7, 8]. However, there is no effective or validated way of risk stratifying these patients.

Strategies to identify patients at risk of developing AFACS would therefore be of interest and would allow the implementation of preventive strategies and more intensive monitoring to high-risk individuals. Some attempts have been made to develop prognostic schemes, but the currently available models [9,10,11] do not meet TRIPOD quality criteria [12].

A validated multivariable prognostic model allowing targeted prophylaxis is the first step to improving outcomes, informing patients, and resource planning. The objective of the study is to develop and validate a clinical prognostic model to determine the risk of a patient developing AFACS at the time of cardiac surgery.

Methods

This investigation is part of a project addressing a National Institute of Health Research (NIHR) call on AFACS—HTA no 19/132: “Predicting AF after Cardiac Surgerythe PARADISE Score. A Clinical Prediction Rule for Post-operative Atrial Fibrillation in Patients Undergoing Cardiac Surgery (PARADISE)”; NIHR131227. The PARADISE project aims to develop validated clinical prediction models to determine the risk of a patient developing AFACS: in the pre-operative assessment clinic or on admission for surgery (PARADISE-1) and on arrival in the post-operative care unit (PARADISE-2). In this paper, we will identify variables from a linked electronic health record dataset associated with AFCAS. In the near future, we will subsequently test these variables in British and American datasets and trials, with the aim of developing the PARADISE-1 score. Both PARADISE 1 and 2 will subsequently be validated in British, US-American and German real-world patient cohorts.

Study design, source of data, and population

We used a prospective cohort study to assess AF incidence among individuals with cardiac surgery. The Clinical Practice Research Datalink (CPRD) was established in 1987 and as of 2018 includes 7,998,501 patients in the UK with linked data of primary care consultation, hospital data (Hospital Episodes Statistics), national cancer registry (National Cancer Intelligence Network) and death registry data (Office for National Statistics—ONS) [13]. The data are generally representative of the age, gender, and geographic distribution of the UK population [13]. Previous validation studies of the UK nationwide EHR showed high quality and completeness of clinical information recorded in the data [14, 15]. The data used for the present study were approved by the MHRA (UK) Independent Scientific Advisory Committee [17_205], under Section 251 of the National Health Service (NHS) Social Care Act 2006. CALIBER has research ethics approval (09/H0810/16) and ECC approval (ECC 2-06(b)/2009 CALIBER dataset). The study followed the Transparent Reporting of a multivariable prognostic model for Individual Prognosis or Diagnosis (TRIPOD) recommendations [12].

We identified individuals aged 18 years or older that had been registered in the current primary care practice for at least 1 year. The study period was between January 1, 1998, and May 31, 2016, and there were 401 practices included in the data (Fig. 1).

Fig. 1
figure 1

Study population

Participants

Individuals who underwent cardiac surgery were included in the study. The definition of cardiac surgery was based on the Office of Population Censuses and Surveys Classification of Surgical Operations and Procedures (4th revision, OPCS-4) [16] in hospitalisation and their corresponding primary care READ coding. The complete definition was summarised in supplementary table S1. Individuals were excluded if they had a prior history of AF or cardiac surgery before study entry. Follow-up ceased for the following reasons: death, the end date of registration with the practice, last day of the general practice data collection or the end of the study period (31 May 2016).

Outcome

The primary outcome was new-onset AF occurring after the date of cardiac surgery (AFACS). AF was defined from the International Classification of Diseases, tenth revision as I48 from HES and Read codes G573400, G573500, 3272.00, G573000, G573300, G573.00, G573z00 from CPRD. Based on the definition, AF cases included a minor proportion of atrial flutter [17]. The definition was developed and tested in cardiovascular disease research using linked bespoke studies and the electronic health records (CALIBER) platform (https://caliberresearch.org/portal) [18]. Previous research has shown high validity and completeness of the disease definition in AF and other conditions [19, 20]. We identified death, date of death and causes of death from the Office of National Statistics (ONS) records. New-onset AF included individuals with first AF diagnoses as the primary cause of death.

Predictors

We used the Index of Multiple Deprivation (IMD) 2015 quintile to describe socioeconomic status, with a higher quintile representing the more deprived areas [21]. For eligible participants, we identified 45 predictors reported in the literature [22], reported in a Delphi process [23] or with high prevalence observed in the study cohort: age at cardiac surgery, sex, ethnicity categories, smoking status, history of diabetes, hypertension, stable angina, unstable angina, myocardial infarction, stroke, dementia, heart failure, chronic obstructive pulmonary disease, chronic kidney disease, cancer, asthma, valvular disease, deep vein thrombosis, pulmonary embolism, mitral valve disease, supraventricular tachycardia, cardiogenic shock, ventricular tachycardia, transient ischemic attack, hypothyroidism, dyslipidaemia, systolic blood pressure, diastolic blood pressure, calendar year at study entry [24], the use of anticoagulants, antiplatelet drugs, anti-arrhythmic drugs, beta-adrenoceptor blocking drugs, diuretics, warfarin, hypertension and heart failure drugs, statin, NSAIDS, cardiac glycosides, immunosuppressants, inotropic drugs. The CHA2DS2-VASc score was calculated [25], as a measure of stroke risk although prior studies have examined the role of this score in predicting AFCAS [26]. CHA2DS2-VASc Score and categorised into elevated CHA2DS2-VASc Score (≥ 1 for men and ≥ 2 for women). We reported the proportion of individuals with a diagnosis recorded in their primary care or hospital admissions, before their initial documentation of cardiac surgery. Diagnosis code lists for each condition were adapted from the CALIBER code repository (Supplementary table S2).

Statistical analyses

Baseline characteristics were presented among derivation and validation groups. We reported frequencies (%) for categorical data and means with standard deviation for continuous data, and chi-square and t tests were used to examine the difference between sex and socioeconomic categories. The extent of missing values was assessed and then managed by recoding for categorical variables or multiple imputations for continuous variables.

Model construction

Univariable models were used to examine non-linear trends. We used multivariable logistic regression with the least absolute shrinkage and selection operator (LASSO) for model building [27]. Variance inflation factors were calculated to detect evidence of multicollinearity problems in the model selection process. The minimum sample size for developing the was estimated [28]. Assuming a 0.21 prevalence of AF post cardiac surgery, 50 parameters (including dummy variables of predictors), and a Cox-Snell R2 of 0.0208. We estimated the minimum sample size required for new model development was 21,384.

Model validation and performance

Model validation was implemented by internal bootstrap validation [29]. Model performance was examined by assessing discrimination and calibration in the study cohort. Discrimination was quantified with the c-statistic. Model calibration was assessed by calibration plots [30] and Hosmer–Lemeshow goodness-of-fit tests.

Additional analyses were performed using random forests for variable selection in constructing the prognostic model, with the 100 trees built from the randomly split training (75% of the full cohort) and validation (25% of the full cohort) populations. We performed the analyses in the secured Data Safe Haven, meeting the data safety and information governance requirements by University College London, NHS Digital and ONS. Analyses were performed in SAS (version 9.4), R (version 3.6.1) and Stata (version 16.1). The funders did not have any role in the study design, data collection, data analysis, interpretation, and writing of the report.

Public and patient involvement

Patients were involved in the Delphi process for the selection of potential candidate variables, and will be involved in the dissemination of the study results.

Results

The mean age of the 33,464 patients who received cardiac surgery was 57.3 years (standard deviation 13.5 years). 29.2% of the patients were women. Frequent comorbidities included hypertension (prevalence at baseline: 41.1%), ischaemic heart disease (38.7%), angina (37.3%), myocardial infarction (30.4%), and valvular disease (23.2%). (Table 1).

Table 1 Patient characteristics at study entry

There were 6902 (20.6%) individuals with AFACS reported. The incidence of AF was 20.8% in patients receiving bypass surgery and 29.4% among patients receiving valve surgery.

The C-statistic of the full model with 49 covariates was 0.69. LASSO selection of variables concluded in a final model with 13 predictors at baseline: including age (centred at 60 years), elevated CHA2DS2-VASc score, congestive heart failure, hypertension, acute coronary syndromes, mitral valve disease, ventricular tachycardia, statin use, valve surgery, receiving two combined procedures (e.g., heart valve repair + coronary artery bypass grafting), or three combined procedures (e.g., dual heart valve replacement + coronary artery bypass grafting) in the index cardiac surgery, ethnicity other than white or black, and year of the index surgery (Fig. 2). Statin use and ethnicity other than white or black were associated with a protective effect. Valve surgery as a type of cardiac surgical procedure was the strongest predictor of AFACS.

Fig. 2
figure 2

Summary of the adjusted odds ratio of prognostic factors

This model had a C-statistic of 0.68 in the estimation cohort. The C index for the prognostic model in the bootstrapping validation was 0.68, confirming the model has a moderate discrimination ability to identify patients at risk of AFACS. The model was well calibrated (Fig. 3). Sensitivity analyses with random forest model selection reported similar performance to the prognostic model (with the area under the curve: 0.64) (Supplementary Figs. 1, 2).

Fig. 3
figure 3

Calibration plots of the prognostic model

Analysis of the performance of the model showed that three quarters of the 7750 patients with valvular heart disease were classified as having a high or highest risk for developing new-onset AFACS (Table S3). Most cases (nearly 90%) of the 2214 new-onset AFACS events in patients with valvular heart disease were detected in the groups with high or highest AF risk as defined by our model (Table S4).

The AFACS risk profile of patients without valvular heart disease was lower, with more than half being assigned to the low or lowest risk groups. However, most of the new-onset AF cases in this group was also detected in the highest AFACS risk classes as defined by our model.

Finally, we estimated subsequent stroke occurrences in the different Predicted AFACS risk groups, as defined by our model, and observed that of the 2514 strokes occurring during follow-up ¾ were observed in the high or highest AFACS risk groups (Table S5).

Discussion

In this multi-center cohort study, we successfully developed and validated an AF incidence prediction model for patients receiving cardiac surgery. The model showed adequate validity with a C-index of 0.68 and had good calibration for predicting AFACS.

From the predictors of AFACS in the model, we found that older age, elevated CHA2DS2-VASc score, history of congestive heart failure, hypertension, acute coronary syndromes, mitral valve disease, ventricular tachycardia at the time of cardiac surgery and earlier year of surgery performed, receiving valve surgery or multiple procedures performed during the index cardiac surgery were significantly associated with the subsequent AFACS, as documented in previous literature. We also found that statin use and ethnicity other than white or black were more likely to have a better prognosis (i.e., protective factors). These results have been consistent with previous reports.

Individuals with a higher risk of AFACS were more frequently diagnosed with AF and sustained more strokes during follow-up. This suggests that individuals with a higher risk of AFACS, as identified through our model, may represent a sub-population in whom mid to long-term arrhythmia monitoring (e.g., with implantable loop recorders) may be of interest, and measures like risk factor management, rhythm control and/or anticoagulation should be tested in future trials.

This study has several strengths. First, the large sample size of a total of 33,464 patients provides sufficient power in its results. Second, by including all patients receiving cardiac surgery the outcome can be representative of the national population of patients. Third, the use of a wide range of predictor variables, including socio-demographic data and data on clinical factors, and fourth the reproducibility of the model has been confirmed by bootstrapping validation and the performance was better than the modelling method using random forest. Our model also has several limitations. Detailed biomarker information, and information on clinical types of AF (paroxysmal or persistent), is not available for all patients. However, even though it is important to understand whether AFACS has a different prognosis or clinical impact depending on whether it is self-limited, behaving like paroxysmal AF (the most frequent occurrence according to Lin et al. [31]), or if it progresses to a more persistent form, the aim of our investigation was to develop a predictive model for AFACS, new-onset AF after cardiac surgery, irrespectively of its subsequent behaviour or burden. It is possible that the inclusion of additional variables could have improved the discrimination of the model and can be investigated in future research. The C-index of 0.68 is indicative of the multifactorial nature of risk for patients receiving cardiac surgery who subsequently developed AF. However, since strong statistically significant predictors were found, we can still draw important conclusions about how changes in the predictor values are associated with changes in the outcome, consistent with the literature.

Implications and future research

Clinicians can utilize information from electronic health records to identify patients with a greater AFACS risk to aid the prioritisation of preventive care. This study will feed into the PARADISE-1 risk score, which aims at identifying patients at risk of AFACS prior to cardiac surgery. Combining some of the identified variables with biomarkers and imaging parameters may lead to better discrimination. The performance of any future scores can also be improved by adding intra-operatory variables or measures (as planned for PARADISE-2). Our research also supports future studies on other clinical outcomes, such as cerebrovascular ischemic events, following cardiac surgery, and interventions aimed at preventing them. An important aspect to clarify in future studies, as prevention treatment in the AFACS population starts to be considered, is the impact of AF burden or AF clinical type (paroxysmal vs. persistent) on subsequent clinical outcomes. This knowledge gap was highlighted in a systematic review by Lin et al. [31] assessing the impact of AFACS on mortality and stroke. The authors highlighted that none of the 35 included studies separately reported events for paroxysmal and persistent AF.

Conclusion

We established and validated a prognostic model for new-onset atrial fibrillation among patients who received cardiac surgery. This model, or a future iteration utilizing some of its variables, can be useful to identify a group of high-risk individuals who could benefit from mid to long-term arrhythmia monitoring, risk factor management, rhythm control and/or thromboprophylaxis.