Back to Search
Start Over
Unsupervised extremely randomized trees
- Source :
- PAKDD 2018-The 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2018-The 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining, May 2018, Melbourne, Australia, Advances in Knowledge Discovery and Data Mining ISBN: 9783319930398, PAKDD (3)
- Publication Year :
- 2018
- Publisher :
- HAL CCSD, 2018.
-
Abstract
- International audience; In this paper we present a method to compute dissimilarities on unlabeled data, based on extremely randomized trees. This method, Unsupervised Extremely Randomized Trees, is used jointly with a novel randomized labeling scheme we describe here, and that we call AddCl3. Unlike existing methods such as AddCl1 and AddCl2, no synthetic instances are generated, thus avoiding an increase in the size of the dataset. The empirical study of this method shows that Unsupervised Extremely Randomized Trees with AddCl3 provides competitive results regarding the quality of resulting clusterings, while clearly outperforming previous similar methods in terms of running time.
- Subjects :
- Scheme (programming language)
Computer science
unsupervised classification
media_common.quotation_subject
Decision tree
02 engineering and technology
Similarity measure
Machine learning
computer.software_genre
01 natural sciences
Clustering
010104 statistics & probability
Empirical research
[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]
decision tree
0202 electrical engineering, electronic engineering, information engineering
Quality (business)
0101 mathematics
Cluster analysis
distance
computer.programming_language
media_common
business.industry
ACM: I.: Computing Methodologies/I.5: PATTERN RECOGNITION/I.5.3: Clustering
Running time
similarity measure
ComputingMethodologies_PATTERNRECOGNITION
extremely randomized trees
020201 artificial intelligence & image processing
Artificial intelligence
business
computer
ACM: I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE/I.2.6: Learning
Subjects
Details
- Language :
- English
- ISBN :
- 978-3-319-93039-8
- ISBNs :
- 9783319930398
- Database :
- OpenAIRE
- Journal :
- PAKDD 2018-The 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2018-The 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining, May 2018, Melbourne, Australia, Advances in Knowledge Discovery and Data Mining ISBN: 9783319930398, PAKDD (3)
- Accession number :
- edsair.doi.dedup.....a379d68056b2aab68002658b54223163