Back to Search Start Over

MPT‐embedding: An unsupervised representation learning of code for software defect prediction.

Authors :
Shi, Ke
Lu, Yang
Liu, Guangliang
Wei, Zhenchun
Chang, Jingfei
Source :
Journal of Software: Evolution & Process. Apr2021, Vol. 33 Issue 4, p1-20. 20p.
Publication Year :
2021

Abstract

Software project defect prediction can help developers allocate debugging resources. Existing software defect prediction models are usually based on machine learning methods, especially deep learning. Deep learning‐based methods tend to build end‐to‐end models that directly use source code‐based abstract syntax trees (ASTs) as input. They do not pay enough attention to the front‐end data representation. In this paper, we propose a new framework to represent source code called multiperspective tree embedding (MPT‐embedding), which is an unsupervised representation learning method. MPT‐embedding parses the nodes of ASTs from multiple perspectives and encodes the structural information of a tree into a vector sequence. Experiments on both cross‐project defect prediction (CPDP) and within‐project defect prediction (WPDP) show that, on average, MPT‐embedding provides improvements over the state‐of‐the‐art method. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
20477473
Volume :
33
Issue :
4
Database :
Academic Search Index
Journal :
Journal of Software: Evolution & Process
Publication Type :
Academic Journal
Accession number :
149618402
Full Text :
https://doi.org/10.1002/smr.2330