Automated and Interpretable Patient ECG Profiles for Disease Detection, Tracking, and Discovery

Circ Cardiovasc Qual Outcomes. 2019 Sep;12(9):e005289. doi: 10.1161/CIRCOUTCOMES.118.005289. Epub 2019 Sep 5.

Abstract

Background: The ECG remains the most widely used diagnostic test for characterization of cardiac structure and electrical activity. We hypothesized that parallel advances in computing power, machine learning algorithms, and availability of large-scale data could substantially expand the clinical inferences derived from the ECG while at the same time preserving interpretability for medical decision-making.

Methods and results: We identified 36 186 ECGs from the University of California, San Francisco database that would enable training of models for estimation of cardiac structure or function or detection of disease. We segmented the ECG into standard component waveforms and intervals using a novel combination of convolutional neural networks and hidden Markov models and evaluated this segmentation by comparing resulting electrical intervals against 141 864 measurements produced during the clinical workflow. We then built a patient-level ECG profile, a 725-element feature vector and used this profile to train and interpret machine learning models for examples of cardiac structure (left ventricular mass, left atrial volume, and mitral annulus e-prime) and disease (pulmonary arterial hypertension, hypertrophic cardiomyopathy, cardiac amyloid, and mitral valve prolapse). ECG measurements derived from the convolutional neural network-hidden Markov model segmentation agreed with clinical estimates, with median absolute deviations as a fraction of observed value of 0.6% for heart rate and 4% for QT interval. Models trained using patient-level ECG profiles enabled surprising quantitative estimates of left ventricular mass and mitral annulus e' velocity (median absolute deviation of 16% and 19%, respectively) with good discrimination for left ventricular hypertrophy and diastolic dysfunction as binary traits. Model performance using our approach for disease detection demonstrated areas under the receiver operating characteristic curve of 0.94 for pulmonary arterial hypertension, 0.91 for hypertrophic cardiomyopathy, 0.86 for cardiac amyloid, and 0.77 for mitral valve prolapse.

Conclusions: Modern machine learning methods can extend the 12-lead ECG to quantitative applications well beyond its current uses while preserving the transparency that is so fundamental to clinical care.

Keywords: heart rate; hypertension; machine learning; mitral valve prolapse; work flow.

Publication types

  • Research Support, N.I.H., Extramural
  • Validation Study

MeSH terms

  • Action Potentials*
  • Cardiovascular Diseases / diagnosis*
  • Cardiovascular Diseases / physiopathology
  • Cardiovascular Diseases / therapy
  • Databases, Factual
  • Diagnosis, Computer-Assisted*
  • Electrocardiography*
  • Heart Rate*
  • Humans
  • Machine Learning*
  • Markov Chains
  • Neural Networks, Computer*
  • Pattern Recognition, Automated*
  • Predictive Value of Tests
  • Prognosis
  • Reproducibility of Results
  • Signal Processing, Computer-Assisted
  • Time Factors
  • Workflow