Sorry, I don't understand your search. ×
Back to Search Start Over

ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction

Authors :
Keshavarz, Hossein
Nagappan, Meiyappan
Publication Year :
2022

Abstract

In this paper, we present ApacheJIT, a large dataset for Just-In-Time defect prediction. ApacheJIT consists of clean and bug-inducing software changes in popular Apache projects. ApacheJIT has a total of 106,674 commits (28,239 bug-inducing and 78,435 clean commits). Having a large number of commits makes ApacheJIT a suitable dataset for machine learning models, especially deep learning models that require large training sets to effectively generalize the patterns present in the historical data to future data.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2203.00101
Document Type :
Working Paper