DOI: 10.1111/jdi.70364 ISSN: 2040-1116

Machine learning‐based prediction model for adverse pregnancy outcomes in women with gestational diabetes mellitus

Jooyeop Lee, Seoung‐Ho Choi, Hyeon Ji Kim, Jee Yoon Park, Tae Jung Oh, Sung Hee Choi, Hak Chul Jang, Joon Ho Moon

ABSTRACT

Aims/Introduction

Gestational diabetes mellitus (GDM) is one of the most frequent pregnancy complications. Investigating clinical risk factors for adverse pregnancy outcomes in women with GDM would help predict and prevent neonatal complications. We developed a machine learning model to discover risk factors for adverse pregnancy outcomes.

Materials and Methods

Women with GDM from tertiary hospitals in Korea were included (n = 305, discovery cohort; n  = 911, validation cohort). Supervised machine learning classification models, including ExtraTree, RandomForest, GradientBoosting, AdaBoost, Bagging, XGBoost, and Light Gradient Boosting Machine (LGBM), were developed to predict adverse pregnancy outcomes. Outcomes included large for gestational age (LGA), small for gestational age (SGA), low Apgar score, and preterm delivery. The top‐ranked risk factors identified through feature importance were further validated using binary logistic regression analysis.

Results

In predicting LGA, the RandomForest model achieved the highest AUROC of 0.726 on the validation cohort. For SGA, the RandomForest model achieved the highest AUROC of 0.628. For low Apgar score and preterm delivery, the ExtraTree model showed the best performance with AUROCs of 0.689 and 0.616, respectively.

Conclusions

This study presents machine learning models as a foundational tool for identifying factors associated with adverse pregnancy outcomes in women with GDM. These models may serve as a foundation for future development of clinical decision support tools.

More from our Archive