Back to Search Start Over

Multi-view heterogeneous fusion and embedding for categorical attributes on mixed data.

Authors :
Li, Qiude
Xiong, Qingyu
Ji, Shengfen
Gao, Min
Yu, Yang
Wu, Chao
Source :
Soft Computing - A Fusion of Foundations, Methodologies & Applications. Jul2020, Vol. 24 Issue 14, p10843-10863. 21p.
Publication Year :
2020

Abstract

Categorical attributes are ubiquitous in real-world collected data. However, such attributes lack a well-defined distance metric and cannot be directly manipulated per algebraic operations, so many data mining algorithms are unable to work directly on them. Learning an appropriate metric or an effective numerical embedding is very vital yet challenging, for categorical attributes with multi-view heterogeneous data characteristics. This paper proposes a novel multi-view heterogeneous fusion model (MVHF), which first captures basic coupling information for each view and then fuses these heterogeneous information from different views by multi-kernel metric learning, to measure the intrinsic distances between this type of categorical attributes; based on these measured distances, further, we use the manifold learning method to learn a high-quality numerical embedding for each categorical value. Experiments on 33 mixed data sets demonstrate that MVHF-enabled classification significantly enhances the performance, compared with state-of-the-art distance metrics or embedding competitors. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14327643
Volume :
24
Issue :
14
Database :
Academic Search Index
Journal :
Soft Computing - A Fusion of Foundations, Methodologies & Applications
Publication Type :
Academic Journal
Accession number :
143802667
Full Text :
https://doi.org/10.1007/s00500-019-04586-z