Back to Search
Start Over
Distributed representations of diseases based on co-occurrence relationship.
- Source :
-
Expert Systems with Applications . Nov2021, Vol. 183, pN.PAG-N.PAG. 1p. - Publication Year :
- 2021
-
Abstract
- • Co-occurrence among diseases is crucial to knowledge discovery in medicine. • Existing researches are mainly based on clinical experience rather than data driven. • The distance in the disease embedding space reflects the co-occurrence relationship. • Existing medical concept embedding models focus on prediction tasks. • The first work to use disease embedding to study co-occurrence relationships. The co-occurrence relationship among diseases facilitates the knowledge discovery in the medical field. However, due to limited data, previous researches are mainly based on clinician experience and simple statistics which make it difficult to discover deep associations among diseases. Treating the diagnoses in an electronic medical record (EMR) as interrelated random variables, we use Markov random fields to model the co-occurrence relationship among diseases and propose Di2Vec to learn distributed representations of diseases. The diseases having high co-occurrence frequency will be very close to each other in the embedding space. Considering the hierarchical structure in each diagnosis code, we introduce the subword embedding and explore its impact on the quality of embeddings, where the embedding of each diagnosis is expressed as the sum of its subword embedding. Qualitative and Quantitative experiments show that our Di2Vec can make the embeddings of diseases with high co-occurrence frequency close to each other, and can also outperform Skip-gram and CBOW when use these embeddings as the feature representations for medical expense prediction. Using subword embedding will make the disease embeddings to have better clustering property, but to a certain extent, it loss the co-occurrence information contained in the disease embeddings. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 09574174
- Volume :
- 183
- Database :
- Academic Search Index
- Journal :
- Expert Systems with Applications
- Publication Type :
- Academic Journal
- Accession number :
- 152187576
- Full Text :
- https://doi.org/10.1016/j.eswa.2021.115418