TY - JOUR
T1 - Assessing the predictive ability of the Suicide Crisis Inventory for near-term suicidal behavior using machine learning approaches
AU - Parghi, Neelang
AU - Chennapragada, Lakshmi
AU - Barzilay, Shira
AU - Newkirk, Saskia
AU - Ahmedani, Brian
AU - Lok, Benjamin
AU - Galynker, Igor
N1 - Publisher Copyright: © 2020 The Authors. International Journal of Methods in Psychiatric Research published by John Wiley & Sons Ltd.
PY - 2021/3
Y1 - 2021/3
N2 - Objective: This study explores the prediction of near-term suicidal behavior using machine learning (ML) analyses of the Suicide Crisis Inventory (SCI), which measures the Suicide Crisis Syndrome, a presuicidal mental state. Methods: SCI data were collected from high-risk psychiatric inpatients (N = 591) grouped based on their short-term suicidal behavior, that is, those who attempted suicide between intake and 1-month follow-up dates (N = 20) and those who did not (N = 571). Data were analyzed using three predictive algorithms (logistic regression, random forest, and gradient boosting) and three sampling approaches (split sample, Synthetic minority oversampling technique, and enhanced bootstrap). Results: The enhanced bootstrap approach considerably outperformed the other sampling approaches, with random forest (98.0% precision; 33.9% recall; 71.0% Area under the precision-recall curve [AUPRC]; and 87.8% Area under the receiver operating characteristic [AUROC]) and gradient boosting (94.0% precision; 48.9% recall; 70.5% AUPRC; and 89.4% AUROC) algorithms performing best in predicting positive cases of near-term suicidal behavior using this dataset. Conclusions: ML can be useful in analyzing data from psychometric scales, such as the SCI, and for predicting near-term suicidal behavior. However, in cases such as the current analysis where the data are highly imbalanced, the optimal method of measuring performance must be carefully considered and selected.
AB - Objective: This study explores the prediction of near-term suicidal behavior using machine learning (ML) analyses of the Suicide Crisis Inventory (SCI), which measures the Suicide Crisis Syndrome, a presuicidal mental state. Methods: SCI data were collected from high-risk psychiatric inpatients (N = 591) grouped based on their short-term suicidal behavior, that is, those who attempted suicide between intake and 1-month follow-up dates (N = 20) and those who did not (N = 571). Data were analyzed using three predictive algorithms (logistic regression, random forest, and gradient boosting) and three sampling approaches (split sample, Synthetic minority oversampling technique, and enhanced bootstrap). Results: The enhanced bootstrap approach considerably outperformed the other sampling approaches, with random forest (98.0% precision; 33.9% recall; 71.0% Area under the precision-recall curve [AUPRC]; and 87.8% Area under the receiver operating characteristic [AUROC]) and gradient boosting (94.0% precision; 48.9% recall; 70.5% AUPRC; and 89.4% AUROC) algorithms performing best in predicting positive cases of near-term suicidal behavior using this dataset. Conclusions: ML can be useful in analyzing data from psychometric scales, such as the SCI, and for predicting near-term suicidal behavior. However, in cases such as the current analysis where the data are highly imbalanced, the optimal method of measuring performance must be carefully considered and selected.
KW - Imminent Risk
KW - machine learning
KW - risk assessment
KW - suicide
KW - suicide crisis syndrome
UR - http://www.scopus.com/inward/record.url?scp=85096703956&partnerID=8YFLogxK
U2 - https://doi.org/10.1002/mpr.1863
DO - https://doi.org/10.1002/mpr.1863
M3 - مقالة
C2 - 33166430
SN - 1049-8931
VL - 30
JO - International Journal of Methods in Psychiatric Research
JF - International Journal of Methods in Psychiatric Research
IS - 1
M1 - e1863
ER -