TY - JOUR
T1 - Machine learning-based prediction of 1-year mortality for acute coronary syndrome✰
AU - Hadanny, Amir
AU - Shouval, Roni
AU - Wu, Jianhua
AU - Gale, Chris P.
AU - Unger, Ron
AU - Zahger, Doron
AU - Gottlieb, Shmuel
AU - Matetzky, Shlomi
AU - Goldenberg, Ilan
AU - Beigel, Roy
AU - Iakobishvili, Zaza
N1 - Publisher Copyright:
© 2021
PY - 2022/3
Y1 - 2022/3
N2 - Background: Clinical risk assessment with quantitative formal risk scores may add to intuitive physician risk assessment and are advised by the international guidelines for the management of acute coronary syndrome (ACS) patients. Most previous studies have used the binary regression/classification approach (dead/alive) for long-term mortality post-ACS, without considering the time-to-event as in survival analysis. The use of machine learning (ML)-based survival models has yet to be validated. The primary objective was to compare survival prediction performance of 1-year mortality following ACS of two newly developed ML-based models [random survival forest (RSF) and deep learning (DeepSurv)] with the traditional Cox-proportional hazard (CPH) model. The secondary objective was external validation of the findings. Methods: This was a retrospective, supervised learning data mining study based on the Acute Coronary Syndrome Israeli Survey (ACSIS) and the Myocardial Ischemia National Audit Project (MINAP). The ACSIS data were divided to train/test in a 70/30 fashion. Next, the models were externally validated on the MINAP data. Harrell's C-index, inverse probability of censoring weighting (IPCW), and the Brier-score were used for models’ performance comparison. Results: RSF performed best among the three models, with Harrell's C-index on training and testing sets reaching 0.953 and 0.924 respectively, followed by CPH multivariate selected model (0.805/0.849), CPH Univariate selected model (0.828/0.806), DeepSurv model (0.801/0.804), and the traditional CPH model (0.826/0.738). The RSF model also had the highest performance on the validation data set with 0.811 for Harrell's C-index, 0.844 for IPCW, and 0.093 for Brier score. The CPH model performance on the validation set had C-index range between 0.689 to 0.790, 0.713 to 0.826 for IPCW, and 0.094 to 0.103 Brier score. Conclusions: RSF survival predictions for long-term mortality post-ACS show improved model performance compared with the classic statistical method. This may benefit patients by allowing better risk stratification and tailored therapy, however further prospective evaluations are required.
AB - Background: Clinical risk assessment with quantitative formal risk scores may add to intuitive physician risk assessment and are advised by the international guidelines for the management of acute coronary syndrome (ACS) patients. Most previous studies have used the binary regression/classification approach (dead/alive) for long-term mortality post-ACS, without considering the time-to-event as in survival analysis. The use of machine learning (ML)-based survival models has yet to be validated. The primary objective was to compare survival prediction performance of 1-year mortality following ACS of two newly developed ML-based models [random survival forest (RSF) and deep learning (DeepSurv)] with the traditional Cox-proportional hazard (CPH) model. The secondary objective was external validation of the findings. Methods: This was a retrospective, supervised learning data mining study based on the Acute Coronary Syndrome Israeli Survey (ACSIS) and the Myocardial Ischemia National Audit Project (MINAP). The ACSIS data were divided to train/test in a 70/30 fashion. Next, the models were externally validated on the MINAP data. Harrell's C-index, inverse probability of censoring weighting (IPCW), and the Brier-score were used for models’ performance comparison. Results: RSF performed best among the three models, with Harrell's C-index on training and testing sets reaching 0.953 and 0.924 respectively, followed by CPH multivariate selected model (0.805/0.849), CPH Univariate selected model (0.828/0.806), DeepSurv model (0.801/0.804), and the traditional CPH model (0.826/0.738). The RSF model also had the highest performance on the validation data set with 0.811 for Harrell's C-index, 0.844 for IPCW, and 0.093 for Brier score. The CPH model performance on the validation set had C-index range between 0.689 to 0.790, 0.713 to 0.826 for IPCW, and 0.094 to 0.103 Brier score. Conclusions: RSF survival predictions for long-term mortality post-ACS show improved model performance compared with the classic statistical method. This may benefit patients by allowing better risk stratification and tailored therapy, however further prospective evaluations are required.
KW - Acute coronary syndrome
KW - Machine learning
KW - Mortality
KW - Outcome
KW - Survival
UR - http://www.scopus.com/inward/record.url?scp=85120343286&partnerID=8YFLogxK
U2 - 10.1016/j.jjcc.2021.11.006
DO - 10.1016/j.jjcc.2021.11.006
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 34857429
AN - SCOPUS:85120343286
SN - 0914-5087
VL - 79
SP - 342
EP - 351
JO - Journal of Cardiology
JF - Journal of Cardiology
IS - 3
ER -