7 results on '"Aleti, Aldeida"'
Search Results
2. A3Test: Assertion-Augmented Automated Test Case Generation
- Author
-
Alagarsamy, Saranya, Tantithamthavorn, Chakkrit, and Aleti, Aldeida
- Subjects
Software Engineering (cs.SE) ,FOS: Computer and information sciences ,Computer Science - Software Engineering - Abstract
Test case generation is an important activity, yet a time-consuming and laborious task. Recently, AthenaTest -- a deep learning approach for generating unit test cases -- is proposed. However, AthenaTest can generate less than one-fifth of the test cases correctly, due to a lack of assertion knowledge and test signature verification. In this paper, we propose A3Test, a DL-based test case generation approach that is augmented by assertion knowledge with a mechanism to verify naming consistency and test signatures. A3Test leverages the domain adaptation principles where the goal is to adapt the existing knowledge from an assertion generation task to the test case generation task. We also introduce a verification approach to verify naming consistency and test signatures. Through an evaluation of 5,278 focal methods from the Defects4j dataset, we find that our A3Test (1) achieves 147% more correct test cases and 15% more method coverage, with a lower number of generated test cases than AthenaTest; (2) still outperforms the existing pre-trained models for the test case generation task; (3) contributes substantially to performance improvement via our own proposed assertion pre-training and the verification components; (4) is 97.2% much faster while being more accurate than AthenaTest., Comment: Under Review at ACM Transactions on Software Engineering and Methodology
- Published
- 2023
- Full Text
- View/download PDF
3. Identifying Safety-critical Scenarios for Autonomous Vehicles via Key Features
- Author
-
Neelofar and Aleti, Aldeida
- Subjects
Software Engineering (cs.SE) ,FOS: Computer and information sciences ,Computer Science - Software Engineering ,D.2.5 - Abstract
A test scenario for an autonomous vehicle (AV) is considered safety-critical if it identifies any malfunctioning of the AV. Safety-critical test scenarios are rare under usual traffic conditions, hence simulations are often used to generate such scenarios. The representation of driving scenarios is complex, containing a multitude of static and dynamic features related to the AV, road users, such as other vehicles and pedestrians, and weather and road conditions. This makes the generation of an exhaustive test suite to identify critical test scenarios impractical, even virtually. In this paper, we present a systematic technique for the identification of significant features of test scenarios that impact their effectiveness, based on Instance Space Analysis (ISA). ISA identifies a combination of features that best differentiates safety-critical scenarios from normal driving scenarios and visualises the distribution of these features with respect to test scenario outcome. The visualisation helps identify the untested areas of the instance space where test scenarios are empirically possible, however, missing from the current test suite, which is an indicator of the quality of the test suite. We train four Machine Learning approaches to classify test scenarios as safety-critical or not. The high precision, recall, and F1-scores indicate that the proposed approach is effective in predicting the outcome of a test scenario before simulating it, thus aiding with test scenario prioritization., 12 pages, 6 figures
- Published
- 2022
4. Test-based Patch Clustering for Automatically-Generated Patches Assessment
- Author
-
Martinez, Matias, Kechagia, Maria, Perera, Anjana, Petke, Justyna, Sarro, Federica, and Aleti, Aldeida
- Subjects
Software Engineering (cs.SE) ,FOS: Computer and information sciences ,Computer Science - Software Engineering - Abstract
Previous studies have shown that Automated Program Repair (APR) techniques suffer from the overfitting problem. Overfitting happens when a patch is run and the test suite does not reveal any error, but the patch actually does not fix the underlying bug or it introduces a new defect that is not covered by the test suite. Therefore, the patches generated by APR tools need to be validated by human programmers, which can be very costly, and prevents APR tools adoption in practice.Our work aims at increasing developer trust in automated patch generation by minimizing the number of plausible patches that they have to review, thereby reducing the time required to find a correct patch. We introduce a novel light-weight test-based patch clustering approach called xTestCluster, which clusters patches based on their dynamic behavior. xTestCluster is applied after the patch generation phase in order to analyze the generated patches from one or more repair tools. The novelty of xTestCluster lies in using information from execution of newly generated test cases to cluster patches generated by multiple APR approaches. A cluster is formed with patches that fail on the same generated test cases. The output from xTestCluster gives developers a) a way of reducing the number of patches to analyze, as they can focus on analyzing a sample of patches from each cluster, b) additional information attached to each patch. After analyzing 1910 plausible patches from 25 Java APR tools, our results show that xTestCluster is able to reduce the number of patches to review and analyze with a median of 50%. xTestCluster can save a significant amount of time for developers that have to review the multitude of patches generated by APR tools, and provides them with new test cases that show the differences in behavior between generated patches.
- Published
- 2022
5. How good does a Defect Predictor need to be to guide Search-Based Software Testing?
- Author
-
Perera, Anjana, Turhan, Burak, Aleti, Aldeida, and Böhme, Marcel
- Subjects
Software Engineering (cs.SE) ,FOS: Computer and information sciences ,Computer Science - Software Engineering ,D.2.5 ,Hardware_PERFORMANCEANDRELIABILITY - Abstract
Defect predictors, static bug detectors and humans inspecting the code can locate the parts of the program that are buggy before they are discovered through testing. Automated test generators such as search-based software testing (SBST) techniques can use this information to direct their search for test cases to likely buggy code, thus speeding up the process of detecting existing bugs. However, often the predictions given by these tools or humans are imprecise, which can misguide the SBST technique and may deteriorate its performance. In this paper, we study the impact of imprecision in defect prediction on the bug detection effectiveness of SBST. Our study finds that the recall of the defect predictor, i.e., the probability of correctly identifying buggy code, has a significant impact on bug detection effectiveness of SBST with a large effect size. On the other hand, the effect of precision, a measure for false alarms, is not of meaningful practical significance as indicated by a very small effect size. In particular, the SBST technique finds 7.5 less bugs on average (out of 420 bugs) for every 5% decrements of the recall. In the context of combining defect prediction and SBST, our recommendation for practice is to increase the recall of defect predictors at the expense of precision, while maintaining a precision of at least 75%. To account for the imprecision of defect predictors, in particular low recall values, SBST techniques should be designed to search for test cases that also cover the predicted non-buggy parts of the program, while prioritising the parts that have been predicted as buggy., Comment: 12 pages, 4 figures
- Published
- 2021
- Full Text
- View/download PDF
6. The Neighbours' Similar Fitness Property for Local Search
- Author
-
Wallace, Mark and Aleti, Aldeida
- Subjects
FOS: Computer and information sciences ,Artificial Intelligence (cs.AI) ,Discrete Mathematics (cs.DM) ,Computer Science - Artificial Intelligence ,Computer Science - Discrete Mathematics - Abstract
For most practical optimisation problems local search outperforms random sampling - despite the "No Free Lunch Theorem". This paper introduces a property of search landscapes termed Neighbours' Similar Fitness (NSF) that underlies the good performance of neighbourhood search in terms of local improvement. Though necessary, NSF is not sufficient to ensure that searching for improvement among the neighbours of a good solution is better than random search. The paper introduces an additional (natural) property which supports a general proof that, for NSF landscapes, neighbourhood search beats random search.
- Published
- 2020
7. E-APR: Mapping the Effectiveness of Automated Program Repair
- Author
-
Aleti, Aldeida and Martinez, Matias
- Subjects
Software Engineering (cs.SE) ,FOS: Computer and information sciences ,Computer Science - Software Engineering - Abstract
Automated Program Repair (APR) is a fast growing area with numerous new techniques being developed to tackle one of the most challenging software engineering problems. APR techniques have shown promising results, giving us hope that one day it will be possible for software to repair itself. In this paper, we focus on the problem of objective performance evaluation of APR techniques. We introduce a new approach, Explaining Automated Program Repair (E-APR), which identifies features of buggy programs that explain why a particular instance is difficult for an APR technique. E-APR is used to examine the diversity and quality of the buggy programs used by most researchers, and analyse the strengths and weaknesses of existing APR techniques. E-APR visualises an instance space of buggy programs, with each buggy program represented as a point in the space. The instance space is constructed to reveal areas of hard and easy buggy programs, and enables the strengths and weaknesses of APR techniques to be identified.
- Published
- 2020
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.