Current State of AI
Current State of AI
https://doi.org/10.1093/ehjdh/ztac035
Received 5 January 2022; revised 20 May 2022; accepted 31 May 2022; online publish-ahead-of-print 24 June 2022
Aims Patients with congestive heart failure (HF) are prone to clinical deterioration leading to hospital admissions,
burdening both patients and the healthcare system. Predicting hospital admission in this patient group could enable
timely intervention, with subsequent reduction of these admissions. To date, hospital admission prediction remains
challenging. Increasing amounts of acquired data and development of artificial intelligence (AI) technology allow for
the creation of reliable hospital prediction algorithms for HF patients. This scoping review describes the current
literature on strategies and performance of AI-based algorithms for prediction of hospital admission in patients
with HF.
.........................................................................................................................................................................................
Methods and PubMed, EMBASE, and the Web of Science were used to search for articles using machine learning (ML) and deep learn-
results ing methods to predict hospitalization in patients with HF. After eligibility screening, 23 articles were included. Sixteen
articles predicted 30-day hospital (re-)admission resulting in an area under the curve (AUC) ranging from 0.61 to 0.79. Six
studies predicted hospital admission over longer time periods ranging from 6 months to 3 years, with AUC’s ranging
from 0.65 to 0.78. One study prospectively evaluated performance of a disposable sensory patch at home after hospi-
talization which resulted in an AUC of 0.89 for unplanned hospital admission prediction.
.........................................................................................................................................................................................
Conclusion AI has the potential to enable prediction of hospital admission in HF patients. Improvement of data management, adding
new data sources such as telemonitoring data and ML models and prospective and external validation of current models
must be performed before clinical applicability is possible.
------------------------------------------------------------------------------------------------------------------------------------------------------------
Graphical Abstract
EHR Data
Results are impossible to
accurately compare due to
Machine learning Sixteen articles reported heterogeneous methods in
30 day (re)admission terms of data and
(AUC 0.61−0.79) algorithms.
Administrative data
AI methods using
administrative or EHR data
Deep learning
Telemonitoring Data
EHR data generally
One prospective
outperformed administrative
telemonitoring study using
data in terms of AUC.
a multi sensory patch
(AUC 0.89)
.........................................................................................................................................................................................
Keywords Hospital admission prediction • Artificial intelligence • Machine learning • Heart failure • eHealth • Preventive
health
Search and selection method algorithm was extracted together with LR for comparison. All results
Continued
418 P.M. Croon et al.
Table 2 Continued
Year Authors Design No. Outcome Data source Data used Model used AUC Performance metrics
patients (30-day reported
readmission)
..................................................................................................................................................................................................................................................
AI-based algorithms
2016 Mortazavi, B. J., Retrospective 977 HF EHR + 472 features including clinical data, Random forest 0.68 (CI: PPV 0.22, Sens 61%, Spec
Downing, N. S., Telemonitoring physical examination, laboratory, 0.670– 61%, F-score 0.32
et al.32 demographics, socio-economics 0.678)
2016 Turgeman, L., May, Retrospective 4840 All-cause EHR Demographics, vitals, laboratory, C5 SVM ensemble 0.7 TH 0.7 Sens 26%, spec
J. H.41 comorbidities 91%, PPV 0.260, NPV
0.911, F1 0.261
2017 Frizzell, J. D., Liang, L., Retrospective 56477 HF Insurance Demographics, socio-economic status, Tree augmented 0.62
et al.29 medical history, medication, vital signs, naive bayesion
laboratory, interventions network
2017 Shameer, K., Johnson, Retrospective 1068 All-cause EHR ICD-codes, medication, laboratory, vital Naive Bayes 0.78 Accuracy 83,19%
K. W., et al.31 signs, procedures
2018 Xiao, C., Ma, T., et al.30 Retrospective 5393 All-cause EHR Comorbidity, laboratory, medication, Gated recurrent 0.61 (SD PR-AUC 0.39, accuracy
unit RNN 0.0153) 0.6934
2018 Golas, S. B., Shibahara, Retrospective 11510 All-cause EHR Demographic and clinical data Deep Unified 0.71 Accuracy 0.646, Precision
T., et al.27 (comorbidity, loboratory, medication, Network 0.360, recall 0.652, f1
procedures), notes 0.464
2018 Mahajan, S. M., Retrospective 1778 All-cause EHR Laboratory, vitals, demographics Boosted trees 0.719
Mahajan, A. S.,
et al.34
2019 Allam, A., Nagy, M., Retrospective 272.778 All-cause National database socio-demographics (age/gender), RNN with 0.642 (CI TPV 0.57, FPV 0.37
et al.23 Pay-source, hospital adission events, conditional 0.640–
diagnosis, procedures random fields 0.645)
2019 Awan, S. E., Retrospective 10757 HF Nation-wide demographics, admission characteristics, Multi-layer 0.66 Sens 56%, spec 66%
Bennamoun, M., system(EHR, GP medical history, socio-economics, perception
et al. etc) medication history
2019 Mahajan, S. M., Ghani, retrospective 1629 All-cause EHR Laboratory, vitals, demographics, clinical NLP for 0.65(CI
R. notes unstructured 0.59-0.71)
data, LR for
prediction
2019 Awan, S. E., Retrospective 10757 HF Nation-wide system demographics, admission characteristics, Multi-layer 0.63 Sens 48%, spec 70%
Bennamoun, M., (EHR, GP, and so medical history, socio-economics, perception
et al. on) medication history
2019 Mahajan, S. M., Ghani, Retrospective 36245 All-cause EHR Demographics, vitals, laboratory, ExtraTrees 0.70 (CI F1 score 0.58
R. comorbidities 0.69–
0.71)
Continued
419
Table 3. Ben-Assuli et al.42 reported the highest AUC with 0.79, using a
Accuracy 0.89
F1 score 0.51 cluding a comprehensive electronic health record (EHR), laboratory
FNR 0.250
information management system and picture archiving and communica-
tion system (PACS) for imaging results. The algorithm’s performance
enhanced when data from more clinical visits were present, resulting
in an AUC of 0.79 after four visits for all patients. Patients were divided
AUC
0.77 (SD:
worst in the most vulnerable group and best in the medium risk group
0.79 with an AUC after four visits of 0.65 and 0.77, respectively.42 Shameer
et al. reported an algorithm with similar performance, using retrospect-
ive EHR data of 1068 HF patients, in whom more than 4000 features
Hidden Markov
were extracted from the EHR including diagnosis codes, medication, la-
Model
gest patient group was by Allam et al. who included 273 000 patients
type, medication, all procedures,
and diagnosis. The authors best performing deep learning (DL) algo-
imaging, comorbidity,
EHR
EHR
longer than 30 days (Table 5). Five studies used retrospective data
and one used prospective data. Outcomes ranged from 6-month re-
readmission)
Outcome
from 0.65 to 0.78. The best performing two algorithms both re-
All-cause
All-cause
132
7655
4661
Retrospective
cation and comorbidity, and eight variables from the EMR laboratory
and left ventricular EF. The authors utilized multiple different models
Table 3 Continued
Authors
AUC of 0.78 (compared with 0.75 when only claims data was
A., et al.
T., et al.
et al.
Telemonitoring for hospitalization Although some AI-models seem promising, only the AUC was re-
ported in all studies and additional performance metrics were only
prediction
sometimes reported. A more detailed evaluation is needed to pro-
Three studies used AI algorithms on telemonitoring data for hospital
vide comprehensive insight in the performance of a model.
admission prediction (Table 6). Stehlik et al.44 monitored one hun-
Machine learning algorithms are prone to overfitting, meaning the al-
dred subjects prospectively at home for 3 months using a disposable
gorithm is trained extensively within a population. As a consequence,
multisensory patch monitoring vital parameters including ECG (ar-
this can result in a model that performs well within this population,
rhythmia burden, heart rate variability), skin impedance, tempera-
but not outside of this population, making generalizability of the re-
ture, and so on. The data acquired by the device was continuously
sults questionable. Moreover, many articles proposed a novel meth-
uploaded through a smartphone. Patients were instructed to wear
od and compared this with existing methods. We chose to only
the device 24 h a day. After initial placement of the device, baseline
report the best performing algorithm which always is the proposed
patterns in the data are identified. When this baseline is finalized,
method. However, it is questionable if the same amount of feature
the device switched to surveillance mode that compared temporal
engineering and/or hyperparameter optimization was performed
data to the baseline patterns using similarity-based modelling. This
on the compared method as to the proposed method, making the
approach resulted in an AUC of 0.89 for HF readmission and 0.84
best performing method more prone to overfitting. Overfitting
for unplanned non-trauma hospital admission. Sensitivity for HF hos-
can be minimized by using appropriate validation methods such as
pitalization was 86% at a specificity of 87.5%. The remaining two
nested cross validation and tested with external or prospective val-
studies used retrospective data both derived from clinical trials in
idation.46 The risk of overfitting is a major limitation of all included
which eHealth was used,32,45 which resulted in and AUC of 0.68
articles.
and 0.67 utilizing a random forest and Naïve Bayes, respectively.
Hospital (re-)admission of HF patients is a major problem re-
sulting in high patient disease burden and health care expenses.2,3
Discussion One proven effective strategy for reducing hospital admission in
this patient group is through signalling pre-clinical deterioration
This scoping review evaluated the current performance of AI-based using invasive pulmonary artery pressure home monitoring, which
hospital (re-)admission prediction algorithms based on EHR, admin- yielded a 37% reduction in hospitalization.4,16 This indicates that
istrative and telemonitoring data, and resulted in the following main detection of early, pre-clinical, deterioration can indeed result in
findings. (i) Multiple different classifiers were proposed all using dif- a reduction of hospital admissions. Randomized controlled trials
ferent sets of data to predict hospital admission for patients with on the effect of well performing, prospectively validated,
HF making it challenging to accurately compare results. (ii) AI meth- AI-based hospital admission prediction reduction are needed to
ods using EHR or administrative data achieved moderate perform- further evaluate if and to what extend early deterioration predic-
ance, one AI method using telemonitoring data achieved good tion leads to prevention of hospital (re-)admission and subse-
performance; (iii) EHR data generally performed better in compari- quent disease burden and cost.
son to administrative data or research data which is consistent Similar moderate performance of AI-hospitalization-prediction al-
with clinical expectation since EHR data yields much more compre- gorithms were achieved in non-HF patients. For example, Rajkomar
hensive clinical information; and (iv) almost all studies were per- et al.47 utilized a deep neural network (DNN) in a cohort of 216,221
formed on retrospective data without external or prospective adult patients hospitalized for more than 24 h for any cause and re-
validation. ported an AUC of 0.75–0.76 for 30-day hospital readmission using
Table 5 Results for long-term hospital admission prediction
422
Year Authors n. patients Design Outcome Data source Data used Model used AUC
..................................................................................................................................................................................................................................................
2016 Mortazavi, B. J., 977 HF patients Retrospective 180-day HF readmission EHR + 472 features including clinical data, Random forest in to SVM 0.657 (CI: 0.652– PPV 0.51, Sens 0.95, spec
Downing, N. S., et al. Telemonitoring physical examination, 0.661) 0.15, F-score 0.66
laboratory, demographics,
socio-economics
2020 Chen, P., Dong, W., 736 HF patients Retrospective 1-year all-cause readmission EHR 105 features including Attention-based model 0.691 (SD: 0.047) Acc 0.667, Precision
et al. demographics, vital signs, 0.710, Recall 0.795, F1
laboratory, echocardiography, 0.749
comorbidities, medication
2020 Angraal, S., Mortazavi, 1767 HFpEF patients Retrospective 3-year HF admission EHR Demographics, clinical data, ECG, Random forest 0.76 (CI: 0.71–0.85)
B. J., et al. laboratory, questionnaires
2020 Desai, R. J., Wang, S. V., 9502 HF patients Prospective 1-year HF admission EHR + Insurance 54 variables including ICD score, GBM 0.78 (CI: 0.753–
et al. ICD/CRT-D, HF related 0.802)
medication, frailty index,
laboratory, BMI, ejection
fraction
2021 Lewis, M., Elad, G., 92260 HF patients Retrospective 6-month all-cause admission, Insurance demographics, episode counts and Sequential deep learning 0.78 (CI: 0.784–0.79)
Beladev, M., et al. 6-month ED admission trends, hospital length of stay,
readmission rates, costs,
comorbidity, procedures,
medication, machine derived
features using NLP.
2000 Atienza, F., Martinez- 132 HF patients Retrospective 1-year admission EHR Demographic, clinical, non-invasive Neural Network Accuracy 93.2% sens 81%, Spec 94%
Alzamora, N., et al. laboratory (AUC not
reported)
2020 Chu, J., Dong, W., et al. 25402HF patiënts retrospective HF readmission EHR demographics, medication, RNN for data processing, LR 0.644 (SD: 0.005)
procedures, diagnosis, for endpoint prediction
laboratory, vital signs, physical
examination, etc.,
P.M. Croon et al.
EHR data including notes. Hilton et al.48 reported an AUC of 0.76 for
..................................................................................................................................................................................................................................................
prediction of 30-day readmission using a DNN trained, validated and
tested on 708 089 any cause hospitalized patients using EHR data.
PPV 0.22, Sens 0.61, Spec 0.61,
admission for all patients aged older than 18 years in the EHR includ-
HF 0.86,
0.80
modelling
(demographics, comorbitity,
questionnaires), baseline
Future perspective
In the future, connecting multiple conventional, readily available (e.g.
Telemonitoring
Telemonitoring
Data source
Wearable sensor
clinical variables, imaging data etc.) and newer data sources (health
trackers, social media behaviour, genomic sequencing) could add to
patch
peutic advantages. For example, prediction models may not only pre-
Table 6 Results for hospitalization prediction using telemonitoring
HF hospitalization
outpatient clinic visits, making it possible to scale down given care un-
til suggested differently. In summary, prediction models could en-
hance scalability and cost-effectiveness of our healthcare system.
In order to get to the point of AI-driven clinical support systems
advising doctors and patients daily and thus transforming current re-
active ‘disease care’ towards proactive personalized and predictive
Retrospective
Retrospective
design
Prospective
study
patients
patients
patients
100 HF
241 HF
C., et al.44
A., et al.45
2018
2016
2020
risk of 30-day readmissions in patients with heart failure: a retrospective analysis of with electronic medical records to predict heart failure outcomes. JAMA Network
electronic medical records data. BMC Med Inform Decis Mak 2018;18:44. Open 2020;3:e1918962.
28. Awan SE, Bennamoun M, Sohel F, Sanfilippo FM, Dwivedi G. Machine learning-based 40. Lewis M, Elad G, Beladev M, Maor G, Radinsky K, Hermann D, Litani Y, Geller T,
prediction of heart failure readmission or death: implications of choosing the right Pines JM, Shapiro Nl, Figueroa JF. Comparison of deep learning with traditional mod-
model and the right metrics. ESC Heart Fail 2019;6:428–435. els to predict preventable acute care use and spending among heart failure patients.
29. Frizzell JD, Liang L, Schulte PJ, Yancy CW, Heidenreich PA, Hernandez AF, Bhatt DL, Sci Rep 2021;11:1164.
Fonarow GC, Laskey WK. Prediction of 30-day all-cause readmissions in patients 41. Turgeman L, May JH. A mixed-ensemble model for hospital readmission. Artif Intell
hospitalized for heart failure: comparison of machine learning and other statistical Med 2016;72:72–82.
approaches. JAMA Cardiol 2017;2:204–209. 42. Ben-Assuli O, Heart T, Vest JR, Ramon-Gonen R, Shlomo N, Klempfner R. Profiling
30. Xiao C, Ma T, Dieng AB, Blei DM, Wang F. Readmission prediction via deep context- Readmissions Using Hidden Markov Model-the Case of Congestive Heart Failure.
ual embedding of clinical concepts. PLoS One 2018;13:e0195024-e. Inform Sys Manag 2021;38:237–249.
31. Shameer K, Johnson KW, Yahi A, Miotto R, Li LI, Ricks D, Jebakaran J, Kovatch P, 43. Atienza F, Martinez-Alzamora N, De Velasco JA, Dreiseitl S, Ohno-Machado L.
Sengupta PP, Gelijns S, Moskovitz A, Darrow B, David DL, Kasarskis A, Tatonetti Risk stratification in heart failure using artificial neural networks. Proc AMIA Symp
NP, Pinney S, Dudley JT. Predictive modeling of hospital readmission rates using elec- 2000;32:6.
tronic medical record-wide machine learning: a case-study using mount sinai heart 44. Stehlik J, Schmalfuss C, Bozkurt B, Nativi-Nicolau J, Wohlfahrt P, Wegerich S, Rose K
failure cohort. Pac Symp Biocomput 2017;22:276–287. Ray R, Schofield R, Deswal A, Sekaric J, Anand S, Richards D, Hanson H, Pipke M,
32. Mortazavi BJ, Downing NS, Bucholz EM, Dharmarajan K, Manhapra A, Li SX, Pham M. Continuous wearable monitoring analytics predict heart failure hospitaliza-