Back to Search
Start Over
ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction
- Publication Year :
- 2022
-
Abstract
- In this paper, we present ApacheJIT, a large dataset for Just-In-Time defect prediction. ApacheJIT consists of clean and bug-inducing software changes in popular Apache projects. ApacheJIT has a total of 106,674 commits (28,239 bug-inducing and 78,435 clean commits). Having a large number of commits makes ApacheJIT a suitable dataset for machine learning models, especially deep learning models that require large training sets to effectively generalize the patterns present in the historical data to future data.
Details
- Database :
- arXiv
- Publication Type :
- Report
- Accession number :
- edsarx.2203.00101
- Document Type :
- Working Paper