TY - JOUR
T1 - Early-Onset Versus Late-Onset Preeclampsia in Bogotá, Colombia: Differential Risk Factor Identification and Evaluation Using Traditional Statistics and Machine Learning
AU - Paola, Ayala-Ramírez
AU - Daniela, Mennickent
AU - Carlos, Farkas
AU - Enrique, Guzmán-Gutiérrez
AU - Eduardo, Retamal-Fredes
AU - Nancy, Segura-Guzmán
AU - Diego, Roca
AU - Manuel, Venegas
AU - Matias, Carrillo-Muñoz
AU - Yanitza, Gutierrez-Monsalve
AU - Doris, Sanabria
AU - Catalina, Ospina
AU - Jaime, Silva
AU - Mercedes, Olaya-C
AU - Reggie, García-Robles
PY - 2025/8/12
Y1 - 2025/8/12
N2 - Background/Objectives: Preeclampsia (PE) is a major cause of maternal and perinatal morbidity and mortality, particularly in low- and middle-income countries. Early-onset PE (EOP) and late-onset PE (LOP) are distinct clinical entities with differing pathophysiological mechanisms and prognoses. However, few studies have explored differential risk factors for EOP and LOP in Latin American populations. This study aimed to identify and assess clinical risk factors for predicting EOP and LOP in a cohort of pregnant women from Bogotá, Colombia, using traditional statistics and machine learning (ML). Methods: A cross-sectional observational study was conducted on 190 pregnant women diagnosed with PE (EOP = 80, LOP = 110) at a tertiary hospital in Bogotá between 2017 and 2018. Risk factors and perinatal outcomes were collected via structured interviews and clinical records. Traditional statistical analyses were performed to compare the study groups and identify associations between risk factors and outcomes. Eleven ML techniques were used to train and externally validate predictive models for PE subtype and secondary outcomes, incorporating permutation-based feature importance to enhance interpretability. Results: EOP was significantly associated with higher maternal education and history of hypertension, while LOP was linked to a higher prevalence of allergic history. The best-performing ML model for predicting PE subtype was linear discriminant analysis (recall = 0.71), with top predictors including education level, family history of perinatal death, number of sexual partners, primipaternity, and family history of hypertension. Conclusions: EOP and LOP exhibit distinct clinical profiles in this cohort. The combination of traditional statistics with ML may improve early risk stratification and support context-specific prenatal care strategies in similar settings.
AB - Background/Objectives: Preeclampsia (PE) is a major cause of maternal and perinatal morbidity and mortality, particularly in low- and middle-income countries. Early-onset PE (EOP) and late-onset PE (LOP) are distinct clinical entities with differing pathophysiological mechanisms and prognoses. However, few studies have explored differential risk factors for EOP and LOP in Latin American populations. This study aimed to identify and assess clinical risk factors for predicting EOP and LOP in a cohort of pregnant women from Bogotá, Colombia, using traditional statistics and machine learning (ML). Methods: A cross-sectional observational study was conducted on 190 pregnant women diagnosed with PE (EOP = 80, LOP = 110) at a tertiary hospital in Bogotá between 2017 and 2018. Risk factors and perinatal outcomes were collected via structured interviews and clinical records. Traditional statistical analyses were performed to compare the study groups and identify associations between risk factors and outcomes. Eleven ML techniques were used to train and externally validate predictive models for PE subtype and secondary outcomes, incorporating permutation-based feature importance to enhance interpretability. Results: EOP was significantly associated with higher maternal education and history of hypertension, while LOP was linked to a higher prevalence of allergic history. The best-performing ML model for predicting PE subtype was linear discriminant analysis (recall = 0.71), with top predictors including education level, family history of perinatal death, number of sexual partners, primipaternity, and family history of hypertension. Conclusions: EOP and LOP exhibit distinct clinical profiles in this cohort. The combination of traditional statistics with ML may improve early risk stratification and support context-specific prenatal care strategies in similar settings.
KW - late-onset preeclampsia
KW - early-onset preeclampsia
KW - hypertension in pregnancy
KW - Latin America
KW - risk factors
KW - traditional statistics
KW - machine learning
KW - artificial intelligence
UR - https://doi.org/10.3390/biomedicines13081958
U2 - 10.3390/biomedicines13081958
DO - 10.3390/biomedicines13081958
M3 - Article
SN - 2227-9059
VL - 13
JO - Biomedicines
JF - Biomedicines
IS - 8
ER -