Back to Search Start Over

Deep embedded clustering generalisability and adaptation for integrating mixed datatypes: two critical care cohorts.

Authors :
de Kok JWTM
van Rosmalen F
Koeze J
Keus F
van Kuijk SMJ
Castela Forte J
Schnabel RM
Driessen RGH
van Herpt TTW
Sels JEM
Bergmans DCJJ
Lexis CPH
van Doorn WPTM
Meex SJR
Xu M
Borrat X
Cavill R
van der Horst ICC
van Bussel BCT
Source :
Scientific reports [Sci Rep] 2024 Jan 10; Vol. 14 (1), pp. 1045. Date of Electronic Publication: 2024 Jan 10.
Publication Year :
2024

Abstract

We validated a Deep Embedded Clustering (DEC) model and its adaptation for integrating mixed datatypes (in this study, numerical and categorical variables). Deep Embedded Clustering (DEC) is a promising technique capable of managing extensive sets of variables and non-linear relationships. Nevertheless, DEC cannot adequately handle mixed datatypes. Therefore, we adapted DEC by replacing the autoencoder with an X-shaped variational autoencoder (XVAE) and optimising hyperparameters for cluster stability. We call this model "X-DEC". We compared DEC and X-DEC by reproducing a previous study that used DEC to identify clusters in a population of intensive care patients. We assessed internal validity based on cluster stability on the development dataset. Since generalisability of clustering models has insufficiently been validated on external populations, we assessed external validity by investigating cluster generalisability onto an external validation dataset. We concluded that both DEC and X-DEC resulted in clinically recognisable and generalisable clusters, but X-DEC produced much more stable clusters.<br /> (© 2024. The Author(s).)

Details

Language :
English
ISSN :
2045-2322
Volume :
14
Issue :
1
Database :
MEDLINE
Journal :
Scientific reports
Publication Type :
Academic Journal
Accession number :
38200252
Full Text :
https://doi.org/10.1038/s41598-024-51699-z