Back to Search Start Over

Similarity-Based Malware Classification Using Graph Neural Networks

Authors :
Yu-Hung Chen
Jiann-Liang Chen
Ren-Feng Deng
Source :
Applied Sciences, Vol 12, Iss 21, p 10837 (2022)
Publication Year :
2022
Publisher :
MDPI AG, 2022.

Abstract

This work proposes a novel malware identification model that is based on a graph neural network (GNN). The function call relationship and function assembly content obtained by analyzing the malware are used to generate a graph that represents the functional structure of a malware sample. In addition to establishing a multi-classification model for predicting malware family, this work implements a similarity model that is based on Siamese networks, measuring the distance between two samples in the feature space to determine whether they belong to the same malware family. The distance between the samples is gradually adjusted during the training of the model to improve the performance. A Malware Bazaar dataset analysis reveals that the proposed classification model has an accuracy and area under the curve (AUC) of 0.934 and 0.997, respectively. The proposed similarity model has an accuracy and AUC of 0.92 and 0.92, respectively. Further, the proposed similarity model identifies the unseen malware family with approximately 70% accuracy. Hence, the proposed similarity model exhibits better performance and scalability than the pure classification model and previous studies.

Details

Language :
English
ISSN :
20763417
Volume :
12
Issue :
21
Database :
Directory of Open Access Journals
Journal :
Applied Sciences
Publication Type :
Academic Journal
Accession number :
edsdoj.99051c24fe614390ac477e1df0a345d5
Document Type :
article
Full Text :
https://doi.org/10.3390/app122110837