Abstract 15133: Topic Models Using Notes From Electronic Medical Records Can Classify Non ST Elevation Myocardial Infarction Patients Based on the Presence of an Occluded Culprit Artery
Dillon J Dzikowicz, Mehmed Aktas, Linwei Wang, Wojciech Zareba- Physiology (medical)
- Cardiology and Cardiovascular Medicine
Introduction: Approximately 33% of patients with a Non-ST-Elevation Myocardial Infarction (NSTEMI) have an occluded culprit artery which doubles their odds of adverse outcomes. Earlier identification of NSTEMI patients with occluded culprit artery may reduce their odds of adverse outcomes. Notes from the electronic health record (EHR) are rich sources of text data about signs and symptoms. Organizing text data into topics may help providers identify patients with NSTEMI and an occluded culprit artery sooner. Latent Dirichlet Allocation uses a deep-learning network to generate topics that can classify patients.
Methods: Notes transcribed during the first 72 hours of hospitalization were extracted from the EHR among NSTEMI patients who presented to a medical center between 2015-2020. The data was preprocessed by removing punctuation, numbers, and stop words. We decided 5 topics based on computed coherence value. Two authors determined whether the first 100 terms were associated with an occluded artery. Analyses were completed in R using packages topicmodels and tidyverse .
Results: We examined 45,025 notes from 2,223 patients (age=69
Conclusions: Topic 1 is most associated with an occluded culprit artery. Topic 5 is the least associated with an occluded culprit artery, instead associated with heart failure. Our results demonstrate it is feasible to develop topics to differentiate NSTEMI with and without an occluded culprit artery. Future research aims to identify patterns between signs and symptoms within topics.