Skip to main navigation Skip to search Skip to main content

Transformers for cardiac patient mortality risk prediction from heterogeneous electronic health records

Research output: Contribution to journalArticleScientificpeer-review

36 Citations (Scopus)
52 Downloads (Pure)

Abstract

With over 17 million annual deaths, cardiovascular diseases (CVDs) dominate the cause of death statistics. CVDs can deteriorate the quality of life drastically and even cause sudden death, all the while inducing massive healthcare costs. This work studied state-of-the-art deep learning techniques to predict increased risk of death in CVD patients, building on the electronic health records (EHR) of over 23,000 cardiac patients. Taking into account the usefulness of the prediction for chronic disease patients, a prediction period of six months was selected. Two major transformer models that rely on learning bidirectional dependencies in sequential data, BERT and XLNet, were trained and compared. To our knowledge, the presented work is the first to apply XLNet on EHR data to predict mortality. The patient histories were formulated as time series consisting of varying types of clinical events, thus enabling the model to learn increasingly complex temporal dependencies. BERT and XLNet achieved an average area under the receiver operating characteristic curve (AUC) of 75.5% and 76.0%, respectively. XLNet surpassed BERT in recall by 9.8%, suggesting that it captures more positive cases than BERT, which is the main focus of recent research on EHRs and transformers.
Original languageEnglish
Article number3517
JournalScientific Reports
Volume13
Issue number1
DOIs
Publication statusPublished - 2 Mar 2023
Publication typeA1 Journal article-refereed

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Publication forum classification

  • Publication forum level 1

Fingerprint

Dive into the research topics of 'Transformers for cardiac patient mortality risk prediction from heterogeneous electronic health records'. Together they form a unique fingerprint.

Cite this