The Harvard-Emory ECG Database
- #ECG
- #medical-database
- #health-data
- The Harvard-Emory ECG Database (HEEDB) is a large collection of 12-lead ECG recordings from Mass General Brigham and Emory University.
- Version 4.0 includes 10,471,531 ECGs from 1,818,247 unique patients at institution I0001 and 968,680 ECGs from 349,548 patients at institution I0006.
- Data are stored in WFDB and MATLAB formats and include metadata, 12SL diagnostic codes, and ICD-9/10 codes.
- The database is part of the Human Sleep Project (HSP), funded by NHLBI grant R01HL161253.
- ECG recordings are 10 seconds long, sampled at 250 or 500 Hz, and de-identified using the Safe Harbor method.
- The directory structure has separate folders for diagnoses, ICD codes, metadata, and waveform files for each institution.
- The 12SL_diagnoses folder contains diagnostic outputs from GE Healthcare's 12SL software, with mappings to human-readable labels.
- The ICD_codes folder contains ICD-9 and ICD-10 codes from EHRs, with descriptions and shifted dates.
- Metadata includes demographic and temporal information such as ECG acquisition time, date of birth, and age-related fields.
- Access is restricted to credentialed users who sign a Data Use Agreement and complete CITI training.
- The study was approved by IRBs at Massachusetts General Hospital and Beth Israel Deaconess Medical Center, with informed consent waived because of the study's retrospective design.
- Dr. Westover has a conflict of interest as a co-founder of Beacon Biosignals; other authors declare no conflicts.
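
The figures quoted above imply a few useful derived numbers: how many samples a 10-second recording holds at each sampling rate, and how many ECGs each site has per patient on average. The sketch below works these out from the summary's own numbers. The function and constant names are illustrative and are not part of any HEEDB tooling, and the per-record total assumes all 12 leads are stored at the full sampling rate, which the summary does not state.

```python
# Figures quoted in the summary above (HEEDB version 4.0).
SITES = {
    "I0001": {"ecgs": 10_471_531, "patients": 1_818_247},
    "I0006": {"ecgs": 968_680, "patients": 349_548},
}
DURATION_S = 10                 # each recording is 10 seconds long
SAMPLING_RATES_HZ = (250, 500)  # the two sampling rates mentioned
N_LEADS = 12                    # assumption: all 12 leads stored at full rate


def samples_per_lead(rate_hz: int, duration_s: int = DURATION_S) -> int:
    """Number of samples in a single lead of one recording."""
    return rate_hz * duration_s


def mean_ecgs_per_patient(site: str) -> float:
    """Average number of ECGs per unique patient at one institution."""
    counts = SITES[site]
    return counts["ecgs"] / counts["patients"]


def total_ecgs() -> int:
    """Total ECG count summed over both institutions."""
    return sum(counts["ecgs"] for counts in SITES.values())


if __name__ == "__main__":
    for rate in SAMPLING_RATES_HZ:
        n = samples_per_lead(rate)
        print(f"{rate} Hz: {n} samples per lead, {n * N_LEADS} per 12-lead record")
    for site in SITES:
        print(f"{site}: {mean_ecgs_per_patient(site):.2f} ECGs per patient on average")
    print(f"Total ECGs across both sites: {total_ecgs():,}")
```

Patient counts are deliberately not summed across sites, since the summary does not say whether any patient appears at both institutions.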