MPT‐embedding: An unsupervised representation learning of code for software defect prediction.
- Source :
- Journal of Software: Evolution & Process. Apr2021, Vol. 33 Issue 4, p1-20. 20p.
- Publication Year :
- 2021
Abstract
- Software project defect prediction can help developers allocate debugging resources. Existing software defect prediction models are usually based on machine learning methods, especially deep learning. Deep learning‐based methods tend to build end‐to‐end models that directly use source code‐based abstract syntax trees (ASTs) as input. They do not pay enough attention to the front‐end data representation. In this paper, we propose a new framework to represent source code called multiperspective tree embedding (MPT‐embedding), which is an unsupervised representation learning method. MPT‐embedding parses the nodes of ASTs from multiple perspectives and encodes the structural information of a tree into a vector sequence. Experiments on both cross‐project defect prediction (CPDP) and within‐project defect prediction (WPDP) show that, on average, MPT‐embedding provides improvements over the state‐of‐the‐art method. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 20477473
- Volume :
- 33
- Issue :
- 4
- Database :
- Academic Search Index
- Journal :
- Journal of Software: Evolution & Process
- Publication Type :
- Academic Journal
- Accession number :
- 149618402
- Full Text :
- https://doi.org/10.1002/smr.2330