Racial Differences in Accuracy of Predictive Models for High-Flow Nasal Cannula Failure in COVID-19
Philip Yang, Ismail A. Gregory, Chad Robichaux, Andre L. Holder, Greg S. Martin, Annette M. Esper, Rishikesan Kamaleswaran, Judy W. Gichoya, Sivasubramanium V. Bhavani- Critical Care and Intensive Care Medicine
OBJECTIVES:
To develop and validate machine learning (ML) models to predict high-flow nasal cannula (HFNC) failure in COVID-19, compare their performance to the respiratory rate-oxygenation (ROX) index, and evaluate model accuracy by self-reported race.
DESIGN:
Retrospective cohort study.
SETTING:
Four Emory University Hospitals in Atlanta, GA.
PATIENTS:
Adult patients hospitalized with COVID-19 between March 2020 and April 2022 who received HFNC therapy within 24 hours of ICU admission were included.
INTERVENTIONS:
None.
MEASUREMENTS AND MAIN RESULTS:
Four types of supervised ML models were developed for predicting HFNC failure (defined as intubation or death within 7 d of HFNC initiation), using routine clinical variables from the first 24 hours of ICU admission. Models were trained on the first 60% (
CONCLUSIONS:
Our XGB model had better discrimination for predicting HFNC failure in COVID-19 than the ROX index, but had racial differences in accuracy of predictions. Further studies are needed to understand and mitigate potential sources of biases in clinical ML models and to improve their equitability.