Back to Search Start Over

Programmers' de-anonymization using a hybrid approach of abstract syntax tree and deep learning.

Authors :
Ullah, Farhan
Jabbar, Sohail
Al-Turjman, Fadi
Source :
Technological Forecasting & Social Change; Oct2020, Vol. 159, pN.PAG-N.PAG, 1p
Publication Year :
2020

Abstract

• Extracting the abstract view features from different programming codes using abstract syntax tree. • SCAA-AST approach is proposed for programmers' classification. • TFIDF method is proposed to analyze the significance of each feature in terms of local and global weights. • Implementing the deep learning approach to classify programmers from different source codes. Source Code Authorship Attribution (SCAA) is a direct challenge to the privacy and anonymity of developers. However, it is important to recognize the malicious authors and the origin of the attack. In this paper, we proposed Source Code Authorship Attribution using Abstract Syntax Tree (SCAA-AST) for efficient classification of programmers. First, the AST hierarchal features are generated from different programming codes. Second, preprocessing techniques are used to obtain useful features without sound data. Third, the Term Frequency Inverse Document Frequency (TFIDF) weighting technique is used to zoom in on the significance of each feature. Fourth, the Adaptive Synthetic (ADASYN) oversampling method is used to solve the imbalanced class problem. Finally, a deep learning algorithm is designed with the TensorFlow framework, and the Keras API is used to classify programming authors. A deep learning algorithm is further configured with a dropout layer, learning error rate, loss and activation function, and dense layers to enhance the classification results. The results are appreciable in outperforming the existing techniques from the perspective of classification accuracy. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00401625
Volume :
159
Database :
Supplemental Index
Journal :
Technological Forecasting & Social Change
Publication Type :
Academic Journal
Accession number :
145413411
Full Text :
https://doi.org/10.1016/j.techfore.2020.120186