Back to Search
Start Over
R Libraries {dendextend} and {magrittr} and Clustering Package scipy.cluster of Python For Modelling Diagrams of Dendrogram Trees
- Source :
- Carpathian Journal of Electronic and Computer Engineering, Carpathian Journal of Electronic and Computer Engineering, 2020, 13 (1), pp.5-12. ⟨10.2478/cjece-2020-0002⟩
- Publication Year :
- 2020
- Publisher :
- figshare, 2020.
-
Abstract
- The paper presents a comparison of the two languages Python and R related to the classification tools and demonstrates the differences in their syntax and graphical output. It indicates the functionality of R and Python packages {dendextend} and scipy.cluster as effective tools for the dendrogram modelling by the algorithms of sorting and ranking datasets. R and Python programming languages have been tested on a sample dataset including marine geological measurements. The work aims to detect how bathymetric data change along the 25 bathymetric profiles digitized across the Mariana Trench. The methodology includes performed hierarchical cluster analysis with dendrograms and plotted clustermap with marginal dendrograms. The statistical libraries include Matplotlib, SciPy, NumPy, Pandas by Python and {dendextend}, {pvclust}, {magrittr} by R. The dendrograms were compared by the model-simulated clusters of the bathymetric ranges. The results show three distinct groups of the profiles sorted by the elevation ranges with maximal depths detected in a group of profiles 19-21. The dendrogram visualization in a cluster analysis demonstrates the effective representation of the data sorting, grouping and classifying by the machine learning algorithms. The programming codes presented in this study enable to sort a dataset in a similar research aimed to group data based on the similarity of attributes. Effective visualization by dendrograms is a useful modelling tool for the geospatial management where data ranking is required. Plotting dendrograms by R, comparing to Python, presented functional and sophisticated algorithms, refined design control and fine graphical data output. The interdisciplinary nature of this work consists in application of the coding algorithms for spatial data analysis.
- Subjects :
- Computer science
data analysis
programming language
02 engineering and technology
computer.software_genre
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
data ranking
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
0202 electrical engineering, electronic engineering, information engineering
[INFO.INFO-RB]Computer Science [cs]/Robotics [cs.RO]
sort
computer.programming_language
ACM: I.: Computing Methodologies/I.5: PATTERN RECOGNITION
0303 health sciences
ACM: I.: Computing Methodologies/I.5: PATTERN RECOGNITION/I.5.3: Clustering/I.5.3.0: Algorithms
dendrogram
Dendrogram
[INFO.INFO-GR]Computer Science [cs]/Graphics [cs.GR]
ACM: I.: Computing Methodologies/I.5: PATTERN RECOGNITION/I.5.1: Models
machine learning
020201 artificial intelligence & image processing
Data mining
ACM: I.: Computing Methodologies
clustering
[INFO.INFO-CC]Computer Science [cs]/Computational Complexity [cs.CC]
ACM: I.: Computing Methodologies/I.3: COMPUTER GRAPHICS
03 medical and health sciences
[INFO.INFO-CY]Computer Science [cs]/Computers and Society [cs.CY]
[INFO]Computer Science [cs]
data sorting
[INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]
Cluster analysis
ACM: I.: Computing Methodologies/I.5: PATTERN RECOGNITION/I.5.4: Applications
Spatial analysis
030304 developmental biology
[INFO.INFO-PL]Computer Science [cs]/Programming Languages [cs.PL]
NumPy
[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]
ACM: I.: Computing Methodologies/I.5: PATTERN RECOGNITION/I.5.3: Clustering
ACM: I.: Computing Methodologies/I.5: PATTERN RECOGNITION/I.5.3: Clustering/I.5.3.1: Similarity measures
[INFO.INFO-IA]Computer Science [cs]/Computer Aided Engineering
[INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA]
Python (programming language)
[INFO.INFO-MO]Computer Science [cs]/Modeling and Simulation
Hierarchical clustering
Visualization
ACM: I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE
[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]
computer
Python
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Carpathian Journal of Electronic and Computer Engineering, Carpathian Journal of Electronic and Computer Engineering, 2020, 13 (1), pp.5-12. ⟨10.2478/cjece-2020-0002⟩
- Accession number :
- edsair.doi.dedup.....1ff937cef9ed63906b191228ad709b25
- Full Text :
- https://doi.org/10.6084/m9.figshare.12752525