DOI: 10.1093/europace/euag105.1224 ISSN: 1099-5129

Deep learning from 1 million clinical ecgs using transform domain representations for arrhythmia classification and ejection fraction estimation

I Nenadic Wood, I Shahrabani, S Daries, M Fudim, J Piccini, Z Loring

Abstract

Background

Clinical ECGs contain complex temporal and spectral patterns that extend beyond human visual interpretation. Deep learning can capture these multidimensional features to detect both rhythm disturbances and ventricular dysfunction. Combining complementary time- and frequency-based representations may enhance diagnostic accuracy and generalizability across ECG sources.

Purpose

To develop and validate deep learning models trained on clinical ECGs for (1) multiclass arrhythmia classification and (2) estimation of reduced left ventricular ejection fraction (LVEF < 40%), using real-time, Fourier, and Stockwell transform representations.

Methods

A total of 1 000 000 clinical ECGs from a large healthcare system were analyzed. Of these, 725 000 ECGs were used for arrhythmia classification across 18 rhythm and conduction categories, and 275 000 ECG–LVEF pairs for ejection fraction estimation. Each ECG was represented in three complementary domains: (1) time domain (TD) preserving waveform morphology and conduction timing, (2) Fourier domain (FD) capturing global spectral features, and (3) Stockwell domain (TFD) encoding both temporal and spectral localization. Separate convolutional deep learning models were trained for each representation using four convolutional and two dense layers (ReLU activation, Adam optimizer, learning rate = 0.0001). Internal validation used held-out clinical ECGs; external validation employed open-access databases and additional ECGs in PDF format from independent institutions.

Results

For arrhythmia classification, mean AUC/F1-scores on internal validation were 0.933/0.883 (TD), 0.949/0.887 (FD), and 0.963/0.903 (TFD). External validation achieved 0.894/0.808, 0.920/0.828, and 0.933/0.852, respectively.

For LVEF classification, AUC ranged 0.85–0.92, with the TD model achieving AUC = 0.858 (accuracy 0.776). Validation on an independent clinical cohort of atrial fibrillation ECGs achieved AUC = 0.997, confirming strong generalizability. Across both tasks, the Stockwell (time–frequency) model consistently performed best, highlighting the benefit of joint temporal–spectral learning.

Conclusions

Deep learning models trained on 1 million clinical ECGs can accurately classify arrhythmias and detect reduced LVEF. Incorporating time-, frequency-, and time–frequency-domain representations improve feature extraction and model robustness across datasets, supporting the role of domain-diverse deep learning in next-generation AI-enabled ECG diagnostics.Deep Learning and Data PipelineResult Summary

More from our Archive