Author: "Fang, Chunrong" / Database: OpenAIRE - Searchworks@Jio Institute Digital Library Search Results

1. Backdooring Neural Code Search

Author: Sun, Weisong, Chen, Yuchen, Tao, Guanhong, Fang, Chunrong, Zhang, Xiangyu, Zhang, Quanjun, and Luo, Bin
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, I.2.2, Computer Science - Software Engineering, D.2.13, Artificial Intelligence (cs.AI), Computer Science - Computation and Language, Computer Science - Artificial Intelligence, 68T01, Computation and Language (cs.CL)
Abstract: Reusing off-the-shelf code snippets from online repositories is a common practice, which significantly enhances the productivity of software developers. To find desired code snippets, developers resort to code search engines through natural language queries. Neural code search models are hence behind many such engines. These models are based on deep learning and gain substantial attention due to their impressive performance. However, the security aspect of these models is rarely studied. Particularly, an adversary can inject a backdoor in neural code search models, which return buggy or even vulnerable code with security/privacy issues. This may impact the downstream software (e.g., stock trading systems and autonomous driving) and cause financial loss and/or life-threatening incidents. In this paper, we demonstrate such attacks are feasible and can be quite stealthy. By simply modifying one variable/function name, the attacker can make buggy/vulnerable code rank in the top 11%. Our attack BADCODE features a special trigger generation and injection procedure, making the attack more effective and stealthy. The evaluation is conducted on two neural code search models and the results show our attack outperforms baselines by 60%. Our user study demonstrates that our attack is more stealthy than the baseline by two times based on the F1 score., Accepted to the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)
Published: 2023

2. A Survey of Learning-based Automated Program Repair

Author: Zhang, Quanjun, Fang, Chunrong, Ma, Yuxiang, Sun, Weisong, and Chen, Zhenyu
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering
Abstract: Automated program repair (APR) aims to fix software bugs automatically and plays a crucial role in software development and maintenance. With the recent advances in deep learning (DL), an increasing number of APR techniques have been proposed to leverage neural networks to learn bug-fixing patterns from massive open-source code repositories. Such learning-based techniques usually treat APR as a neural machine translation (NMT) task, where buggy code snippets (i.e., source language) are translated into fixed code snippets (i.e., target language) automatically. Benefiting from the powerful capability of DL to learn hidden relationships from previous bug-fixing datasets, learning-based APR techniques have achieved remarkable performance. In this paper, we provide a systematic survey to summarize the current state-of-the-art research in the learning-based APR community. We illustrate the general workflow of learning-based APR techniques and detail the crucial components, including fault localization, patch generation, patch ranking, patch validation, and patch correctness phases. We then discuss the widely-adopted datasets and evaluation metrics and outline existing empirical studies. We discuss several critical aspects of learning-based APR techniques, such as repair domains, industrial deployment, and the open science issue. We highlight several practical guidelines on applying DL techniques for future APR studies, such as exploring explainable patch generation and utilizing code features. Overall, our paper can help researchers gain a comprehensive understanding about the achievements of the existing learning-based APR techniques and promote the practical application of these techniques. Our artifacts are publicly available at \url{https://github.com/QuanjunZhang/AwesomeLearningAPR}.
Published: 2023

3. Boosting Automated Patch Correctness Prediction via Pre-trained Language Model

Author: Zhang, Quanjun, Fang, Chunrong, Sun, Weisong, Liu, Yan, He, Tieke, Hao, Xiaodong, and Chen, Zhenyu
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering
Abstract: Automated program repair (APR) aims to fix software bugs automatically without human debugging efforts and plays a crucial role in software development and maintenance. Despite the recent significant progress, APR is still challenged by a long-standing overfitting problem (i.e., the generated patch is plausible but overfitting). Various techniques have thus been proposed to address the overfitting problem. Among them, leveraging deep learning approaches to predict patch correctness is emerging along with the available large-scale patch benchmarks recently. However, existing learning-based techniques mainly rely on manually-designed code features, which can be extremely costly and challenging to construct in practice. In this paper, we propose APPT, a pre-trained model-based automated patch correctness assessment technique, which treats the source code as token sequences without extra overhead to design hand-crafted features. In particular, APPT adopts a pre-trained model as the encoder stack, followed by an LSTM stack and a deep learning classifier. Although our idea is general and can be built on various pre-trained models, we implemente APPT based on the BERT model. We conduct an extensive experiment on 1,183 Defects4J patches and the results show that APPT achieves prediction accuracy of 79.0% and recall of 81.3%, outperforming the state-of-the-art technique CACHE by 3.6% and 4.8%. Our additional investigation on 49,694 real-world patches shows that APPT achieves the optimum performance (exceeding 99% in five common metrics for assessing patch classification techniques) compared with existing representation learning techniques. We also prove that adopting code pre-trained models can further provide substantial advancement (e.g., GraphCodeBERT-based APPT improves BERT-based APPT by 3.0% and 2.6% in precision and recall, respectively), highlighting the generalizability of APPT.
Published: 2023
Full Text: View/download PDF

4. Automatic Code Summarization via ChatGPT: How Far Are We?

Author: Sun, Weisong, Fang, Chunrong, You, Yudu, Miao, Yun, Liu, Yi, Li, Yuekang, Deng, Gelei, Huang, Shenghan, Chen, Yuchen, Zhang, Quanjun, Qian, Hanwei, Liu, Yang, and Chen, Zhenyu
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, 68T50, D.2.3
Abstract: To support software developers in understanding and maintaining programs, various automatic code summarization techniques have been proposed to generate a concise natural language comment for a given code snippet. Recently, the emergence of large language models (LLMs) has led to a great boost in the performance of natural language processing tasks. Among them, ChatGPT is the most popular one which has attracted wide attention from the software engineering community. However, it still remains unclear how ChatGPT performs in (automatic) code summarization. Therefore, in this paper, we focus on evaluating ChatGPT on a widely-used Python dataset called CSN-Python and comparing it with several state-of-the-art (SOTA) code summarization models. Specifically, we first explore an appropriate prompt to guide ChatGPT to generate in-distribution comments. Then, we use such a prompt to ask ChatGPT to generate comments for all code snippets in the CSN-Python test set. We adopt three widely-used metrics (including BLEU, METEOR, and ROUGE-L) to measure the quality of the comments generated by ChatGPT and SOTA models (including NCS, CodeBERT, and CodeT5). The experimental results show that in terms of BLEU and ROUGE-L, ChatGPT's code summarization performance is significantly worse than all three SOTA models. We also present some cases and discuss the advantages and disadvantages of ChatGPT in code summarization. Based on the findings, we outline several open challenges and opportunities in ChatGPT-based code summarization.
Published: 2023
Full Text: View/download PDF

5. Certifying Robustness of Convolutional Neural Networks with Tight Linear Approximation

Author: Xiao, Yuan, Bai, Tongtong, Gu, Mingzheng, Fang, Chunrong, and Chen, Zhenyu
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence, Cryptography and Security (cs.CR), Machine Learning (cs.LG)
Abstract: The robustness of neural network classifiers is becoming important in the safety-critical domain and can be quantified by robustness verification. However, at present, efficient and scalable verification techniques are always sound but incomplete. Therefore, the improvement of certified robustness bounds is the key criterion to evaluate the superiority of robustness verification approaches. In this paper, we present a Tight Linear approximation approach for robustness verification of Convolutional Neural Networks(Ti-Lin). For general CNNs, we first provide a new linear constraints for S-shaped activation functions, which is better than both existing Neuron-wise Tightest and Network-wise Tightest tools. We then propose Neuron-wise Tightest linear bounds for Maxpool function. We implement Ti-Lin, the resulting verification method. We evaluate it with 48 different CNNs trained on MNIST, CIFAR-10, and Tiny ImageNet datasets. Experimental results show that Ti-Lin significantly outperforms other five state-of-the-art methods(CNN-Cert, DeepPoly, DeepCert, VeriNet, Newise). Concretely, Ti-Lin certifies much more precise robustness bounds on pure CNNs with Sigmoid/Tanh/Arctan functions and CNNs with Maxpooling function with at most 63.70% and 253.54% improvement, respectively.
Published: 2022

6. Universally Adaptive Cross-Platform Reinforcement Learning Testing via GUI Image Understanding

Author: Yu, Shengcheng, Fang, Chunrong, Liu, Yulei, Zhang, Ziqian, Yun, Yexiao, Li, Xin, and Chen, Zhenyu
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering
Abstract: With the rapid development of the Internet, more and more applications (app) are playing an important role in various aspects of the world. Among all apps, mobile apps and web apps are dominant in people's daily life and all industries. In order to tackle the challenges in ensuring the app quality, many approaches have been adopted to improve app GUI testing, including random technologies, model-based technologies, etc. However, existing approaches are still insufficient in reaching high code coverage, constructing high quality models, and achieving generalizability. Besides, current approaches is heavily dependent on the execution platforms (i.e., Android, Web). Apps of distinct platforms share commonalities in GUI design, which inspires us to propose a platform-independent approach with the development of computer vision algorithms. In this paper, we propose UniRLTest. It is a reinforcement learning based approach utilizing a universal framework with computer vision algorithms to conduct automated testing on apps from different platforms. UniRLTest extracts the GUI widgets from GUI pages and characterizes the GUI corresponding layouts, embedding the GUI pages as states. UniRLTest explores apps with the guidance of a novelly designed curiosity-driven strategy, which uses a Q-network to estimate the values of specific states and actions to encourage more exploration in uncovered pages without platform dependency. The state embedding similarity is used to calculate the rewards of each exploration step. We conduct an empirical study on 20 mobile apps and 5 web apps, and the results show that UniRLTest can perform better than the baselines, especially in the exploration of new states.
Published: 2022

7. Test Case Prioritization Using Partial Attention

Author: Zhang, Quanjun, Fang, Chunrong, Sun, Weisong, Yu, Shengcheng, and Xu, Yutao
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering
Abstract: Test case prioritization (TCP) aims to reorder the regression test suite with a goal of increasing the fault detection rate. Various TCP techniques have been proposed based on different prioritization strategies. Among them, the greedy-based techniques are the most widely-used TCP techniques. However, existing greedy-based techniques usually reorder all candidate test cases in prioritization iterations, resulting in both efficiency and effectiveness problems. In this paper, we propose a generic partial attention mechanism, which adopts the previous priority values (i.e., the number of additionally-covered code units) to avoid considering all candidate test cases. Incorporating the mechanism with the additional-greedy strategy, we implement a novel coverage-based TCP technique based on partition ordering (OCP). OCP first groups the candidate test cases into different partitions and updates the partitions on the descending order. We conduct a comprehensive experiment on 19 versions of Java programs and 30 versions of C programs to compare the effectiveness and efficiency of OCP with six state-of-the-art TCP techniques: total-greedy, additional-greedy, lexicographical-greedy, unify-greedy, art-based, and search-based. The experimental results show that OCP achieves a better fault detection rate than the state-of-the-arts. Moreover, the time costs of OCP are found to achieve 85%-99% improvement than most state-of-the-arts.
Published: 2022

8. An Extractive-and-Abstractive Framework for Source Code Summarization

Author: Sun, Weisong, Fang, Chunrong, Chen, Yuchen, Zhang, Quanjun, Tao, Guanhong, Han, Tingxu, Ge, Yifei, You, Yudu, and Luo, Bin
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering, Artificial Intelligence (cs.AI), D.2.3, I.2.7, Computer Science - Artificial Intelligence
Abstract: (Source) Code summarization aims to automatically generate summaries/comments for a given code snippet in the form of natural language. Such summaries play a key role in helping developers understand and maintain source code. Existing code summarization techniques can be categorized into extractive methods and abstractive methods. The extractive methods extract a subset of important statements and keywords from the code snippet using retrieval techniques, and generate a summary that preserves factual details in important statements and keywords. However, such a subset may miss identifier or entity naming, and consequently, the naturalness of generated summary is usually poor. The abstractive methods can generate human-written-like summaries leveraging encoder-decoder models from the neural machine translation domain. The generated summaries however often miss important factual details. To generate human-written-like summaries with preserved factual details, we propose a novel extractive-and-abstractive framework. The extractive module in the framework performs a task of extractive code summarization, which takes in the code snippet and predicts important statements containing key factual details. The abstractive module in the framework performs a task of abstractive code summarization, which takes in the entire code snippet and important statements in parallel and generates a succinct and human-written-like natural language summary. We evaluate the effectiveness of our technique, called EACS, by conducting extensive experiments on three datasets involving six programming languages. Experimental results show that EACS significantly outperforms state-of-the-art techniques in terms of all three widely used metrics, including BLEU, METEOR, and ROUGH-L.
Published: 2022

9. Program Repair: Automated vs. Manual

Author: Zhang, Quanjun, Zhao, Yuan, Sun, Weisong, Fang, Chunrong, Wang, Ziyuan, and Zhang, Lingming
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering
Abstract: Various automated program repair (APR) techniques have been proposed to fix bugs automatically in the last decade. Although recent researches have made significant progress on the effectiveness and efficiency, it is still unclear how APR techniques perform with human intervention in a real debugging scenario. To bridge this gap, we conduct an extensive study to compare three state-of-the-art APR tools with manual program repair, and further investigate whether the assistance of APR tools (i.e., repair reports) can improve manual program repair. To that end, we recruit 20 participants for a controlled experiment, resulting in a total of 160 manual repair tasks and a questionnaire survey. The experiment reveals several notable observations that (1) manual program repair may be influenced by the frequency of repair actions sometimes; (2) APR tools are more efficient in terms of debugging time, while manual program repair tends to generate a correct patch with fewer attempts; (3) APR tools can further improve manual program repair regarding the number of correctly-fixed bugs, while there exists a negative impact on the patch correctness; (4) participants are used to consuming more time to identify incorrect patches, while they are still misguided easily; (5) participants are positive about the tools' repair performance, while they generally lack confidence about the usability in practice. Besides, we provide some guidelines for improving the usability of APR tools (e.g., the misleading information in reports and the observation of feedback).
Published: 2022

10. TranCS

Author: sun, Weisong, Fang, Chunrong, Chen, Yuchen, Tao, Guanhong, Han, Tingxu, and Zhang, Quanjun
Abstract: This is the replication package for the paper "Code search based on Context-aware Code Translation", accepted at ICSE 2022.The full package contains all the details needed to reproduce the result as we claimed in our paper. In our package, the README.md file describes the structure of the package and gives basic information on the content of this package. The software and hardware requirements are described in the requirements.txt and REQUIREMENTS.md files, respectively. And "Code Search based on Context-aware Code Translation.pdf" is a copy of our accepted paper in PDF format.For any additional information, contact the first author by e-mail: Dr.Weisong Sun at
Published: 2022
Full Text: View/download PDF

11. TranCS.zip

Author: sun, Weisong, Fang, Chunrong, Chen, Yuchen, Tao, Guanhong, Han, Tingxu, and Zhang, Quanjun
Abstract: This is the replication package for the paper "Code search based on Context-aware Code Translation", accepted at ICSE 2022.
Published: 2022
Full Text: View/download PDF

12. Investigating Coverage Guided Fuzzing with Mutation Testing

Author: Qian, Ruixiang, Zhang, Quanjun, Fang, Chunrong, and Guo, Lihua
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering, TheoryofComputation_LOGICSANDMEANINGSOFPROGRAMS
Abstract: Coverage guided fuzzing (CGF) is an effective testing technique which has detected hundreds of thousands of bugs from various software applications. It focuses on maximizing code coverage to reveal more bugs during fuzzing. However, a higher coverage does not necessarily imply a better fault detection capability. Triggering a bug involves not only exercising the specific program path but also reaching interesting program states in that path. In this paper, we use mutation testing to improve CGF in detecting bugs. We use mutation scores as feedback to guide fuzzing towards detecting bugs rather than just covering code. To evaluate our approach, we conduct a well-designed experiment on 5 benchmarks. We choose the state-of-the-art fuzzing technique Zest as baseline and construct two modified techniques on it using our approach. The experimental results show that our approach can improve CGF in both code coverage and bug detection., Comment: Accepted by Internetware 2022, conference, 10 pages
Published: 2022
Full Text: View/download PDF

13. Ruler

Author: Sun, Weisong, Tao, Guanhong, Han, Tingxu, Fang, Chunrong, and Zhang, Xiangyu
Abstract: The artifact of paper ' Ruler: Discriminative and Iterative Adversarial Training for Deep Neural Network Fairness ', published on ESEC/FSE 2022.
Published: 2022
Full Text: View/download PDF

14. Ruler-v3

Author: Tao, Guanhong, Sun, Weisong, Han, Tingxu, Fang, Chunrong, and Zhang, Xiangyu
Abstract: The artifact of paper ' Ruler: Discriminative and Iterative Adversarial Training for Deep Neural Network Fairness ', published on ESEC/FSE 2022.
Published: 2022
Full Text: View/download PDF

15. TranCS

Author: Sun, Weisong, Fang, Chunrong, Chen, Yuchen, Tao, Guanhong, Han, Tingxu, and Zhang, Quanjun
Abstract: This is the replication package for the paper "Code search based on Context-aware Code Translation", accepted at ICSE 2022.
Published: 2022
Full Text: View/download PDF

16. Ruler-v2

Author: Tao, Guanhong, Sun, Weisong, Han, Tingxu, Fang, Chunrong, and Zhang, Xiangyu
Abstract: The artifact of paper ' Ruler: Discriminative and Iterative Adversarial Training for Deep Neural Network Fairness ', published on ESEC/FSE 2022.
Published: 2022
Full Text: View/download PDF

17. TranCS

Author: Sun, Weisong, Fang, Chunrong, Chen, Yuchen, Tao, Guanhong, Han, Tingxu, and Zhang, Quanjun
Abstract: This is the replication package for the paper "Code search based on Context-aware Code Translation", accepted at ICSE 2022.
Published: 2022
Full Text: View/download PDF

18. Ruler-v2

Author: Tao, Guanhong, Sun, Weisong, Han, Tingxu, Fang, Chunrong, and Zhang, Xiangyu
Abstract: The artifact of paper ' Ruler: Discriminative and Iterative Adversarial Training for Deep Neural Network Fairness ', published on ESEC/FSE 2022.
Published: 2022
Full Text: View/download PDF

19. TranCS

Author: Sun, Weisong, Fang, Chunrong, Chen, Yuchen, Tao, Guanhong, Han, Tingxu, and Zhang, Quanjun
Abstract: This is the replication package for the paper "Code search based on Context-aware Code Translation", accepted at ICSE 2022.
Published: 2022
Full Text: View/download PDF

20. Ruler-v3

Author: Tao, Guanhong, Sun, Weisong, Han, Tingxu, Fang, Chunrong, and Zhang, Xiangyu
Abstract: The artifact of paper ' Ruler: Discriminative and Iterative Adversarial Training for Deep Neural Network Fairness ', published on ESEC/FSE 2022.
Published: 2022
Full Text: View/download PDF

21. Ruler

Author: Sun, Weisong, Tao, Guanhong, Han, Tingxu, Fang, Chunrong, and Zhang, Xiangyu
Abstract: The artifact of paper ' Ruler: Discriminative and Iterative Adversarial Training for Deep Neural Network Fairness ', published on ESEC/FSE 2022.
Published: 2022
Full Text: View/download PDF

22. TranCS

Author: sun, Weisong, Fang, Chunrong, Chen, Yuchen, Tao, Guanhong, Han, Tingxu, and Zhang, Quanjun
Abstract: This is the replication package for the paper "Code search based on Context-aware Code Translation", accepted at ICSE 2022.The full package contains all the details needed to reproduce the result as we claimed in our paper. In our package, the README.md file describes the structure of the package and gives basic information on the content of this package. The software and hardware requirements are described in the requirements.txt and REQUIREMENTS.md files, respectively. And "Code Search based on Context-aware Code Translation.pdf" is a copy of our accepted paper in PDF format.For any additional information, contact the first author by e-mail: Dr.Weisong Sun at
Published: 2022
Full Text: View/download PDF

23. TranCS.zip

Author: sun, Weisong, Fang, Chunrong, Chen, Yuchen, Tao, Guanhong, Han, Tingxu, and Zhang, Quanjun
Abstract: This is the replication package for the paper "Code search based on Context-aware Code Translation", accepted at ICSE 2022.
Published: 2022
Full Text: View/download PDF

24. BLESER: Bug Localization Based on Enhanced Semantic Retrieval

Author: Zou, Weiqin, Li, Enming, and Fang, Chunrong
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering
Abstract: Static bug localization techniques that locate bugs at method granularity have gained much attention from both researchers and practitioners. For a static method-level bug localization technique, a key but challenging step is to fully retrieve the semantics of methods and bug reports. Currently, existing studies mainly use the same bag-of-word space to represent the semantics of methods and bug reports without considering structure information of methods and textual contexts of bug reports, which largely and negatively affects bug localization performance. To address this problem, we develop BLESER, a new bug localization technique based on enhanced semantic retrieval. Specifically, we use an AST-based code embedding model (capturing code structure better) to retrieve the semantics of methods, and word embedding models (capturing textual contexts better) to represent the semantics of bug reports. Then, a deep learning model is built on the enhanced semantic representations. During model building, we compare five typical word embedding models in representing bug reports and try to explore the usefulness of re-sampling strategies and cost-sensitive strategies in handling class imbalance problems. We evaluate our BLESER on five Java projects from the Defects4J dataset. We find that: (1) On the whole, the word embedding model ELMo outperformed the other four models (including word2vec, BERT, etc.) in facilitating bug localization techniques. (2) Among four strategies aiming at solving class imbalance problems, the strategy ROS (random over-sampling) performed much better than the other three strategies (including random under-sampling, Focal Loss, etc.). (3) By integrating ELMo and ROS into BLESER, at method-level bug localization, we could achieve MAP of 0.108-0.504, MRR of 0.134-0.510, and Accuracy@1 of 0.125-0.5 on five Defects4J projects.
Published: 2021

25. Mobile App Crowdsourced Test Report Consistency Detection via Deep Image-and-Text Fusion Understanding

Author: Yu, Shengcheng, Fang, Chunrong, Zhang, Quanjun, Cao, Zhihao, Yun, Yexiao, Cao, Zhenfei, Mei, Kai, and Chen, Zhenyu
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering
Abstract: Crowdsourced testing, as a distinct testing paradigm, has attracted much attention in software testing, especially in mobile application (app) testing field. Compared with in-house testing, crowdsourced testing shows superiority with the diverse testing environments when faced with the mobile testing fragmentation problem. However, crowdsourced testing also encounters the low-quality test report problem caused by unprofessional crowdworkers involved with different expertise. In order to handle the submitted reports of uneven quality, app developers have to distinguish high-quality reports from low-quality ones to help the bug inspection. One kind of typical low-quality test report is inconsistent test reports, which means the textual descriptions are not focusing on the attached bug-occurring screenshots. According to our empirical survey, only 18.07% crowdsourced test reports are consistent. Inconsistent reports cause waste on mobile app testing. To solve the inconsistency problem, we propose ReCoDe to detect the consistency of crowdsourced test reports via deep image-and-text fusion understanding. ReCoDe is a two-stage approach that first classifies the reports based on textual descriptions into different categories according to the bug feature. In the second stage, ReCoDe has a deep understanding of the GUI image features of the app screenshots and then applies different strategies to handle different types of bugs to detect the consistency of the crowdsourced test reports. We conduct an experiment on a dataset with over 22k test reports to evaluate ReCoDe, and the results show the effectiveness of ReCoDe in detecting the consistency of crowdsourced test reports. Besides, a user study is conducted to prove the practical value of ReCoDe in effectively helping app developers improve the efficiency of reviewing the crowdsourced test reports.
Published: 2021
Full Text: View/download PDF

26. Graph-Based Fuzz Testing for Deep Learning Inference Engine

Author: Luo, Weisi, Chai, Dong, Run, Xiaoyue, Wang, Jiang, Fang, Chunrong, and Chen, Zhenyu
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering
Abstract: With the wide use of Deep Learning (DL) systems, academy and industry begin to pay attention to their quality. Testing is one of the major methods of quality assurance. However, existing testing techniques focus on the quality of DL models but lacks attention to the core underlying inference engines (i.e., frameworks and libraries). Inspired by the success stories of fuzz testing, we design a graph-based fuzz testing method to improve the quality of DL inference engines. This method is naturally followed by the graph structure of DL models. A novel operator-level coverage criterion based on graph theory is introduced and six different mutations are implemented to generate diversified DL models by exploring combinations of model structures, parameters, and data inputs. The Monte Carlo Tree Search (MCTS) is used to drive DL model generation without a training process. The experimental results show that the MCTS outperforms the random method in boosting operator-level coverage and detecting exceptions. Our method has discovered more than 40 different exceptions in three types of undesired behaviors: model conversion failure, inference failure, output comparison failure. The mutation strategies are useful to generate new valid test inputs, by up to 8.2% more operator-level coverage on average and 8.6 more exceptions captured.
Published: 2020
Full Text: View/download PDF

27. Testing as an Investment

Author: Xu, Xiaoran, Fang, Chunrong, Wu, Qing, Liu, Jia, and Chen, Zhenyu
Subjects: Computer Science - Software Engineering
Abstract: Software testing is an expensive and important task. Plenty of researches and industrial efforts have been invested on improving software testing techniques, including criteria, tools, etc. These studies can provide guidelines to select suitable test techniques for software engineers. However, in some engineering projects, business issues may be more important than technical ones, hence we need to lobby non-technical members to support our decisions. In this paper, a well-known investment model, Nelson-Siegel model, is introduced to evaluate and forecast the processes of testing with different testing criteria. Through this model, we provide a new perspective to understand short-term, medium-term, and long-term returns of investments throughout the process of testing. A preliminary experiment is conducted to investigate three testing criteria from the viewpoint of investments. The results show that statement-coverage criterion performs best in gaining long-term yields; the short-term and medium-term yields of testing depend on the scale of programs and the number of faults they contain., Comment: 6 pages, The 26th International Conference on Software Engineering and Knowledge Engineering (SEKE 2014)
Published: 2017

28. Testing as an Investment

Author: Xu, Xiaoran, Fang, Chunrong, Wu, Qing, Liu, Jia, and Chen, Zhenyu
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences
Abstract: Software testing is an expensive and important task. Plenty of researches and industrial efforts have been invested on improving software testing techniques, including criteria, tools, etc. These studies can provide guidelines to select suitable test techniques for software engineers. However, in some engineering projects, business issues may be more important than technical ones, hence we need to lobby non-technical members to support our decisions. In this paper, a well-known investment model, Nelson-Siegel model, is introduced to evaluate and forecast the processes of testing with different testing criteria. Through this model, we provide a new perspective to understand short-term, medium-term, and long-term returns of investments throughout the process of testing. A preliminary experiment is conducted to investigate three testing criteria from the viewpoint of investments. The results show that statement-coverage criterion performs best in gaining long-term yields; the short-term and medium-term yields of testing depend on the scale of programs and the number of faults they contain., 6 pages, The 26th International Conference on Software Engineering and Knowledge Engineering (SEKE 2014)
Published: 2017
Full Text: View/download PDF

29. FDFuzz: Applying Feature Detection to Fuzz Deep Learning Systems

Author: Wang Jie, Cao Kefan, Chen Jinxin, and Fang Chunrong
Subjects: business.industry, Computer science, Deep learning, Pattern recognition, Artificial intelligence, Fuzz testing, Neuron network, Safety, Risk, Reliability and Quality, business, Feature detection (computer vision)
Published: 2019
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

29 results on '"Fang, Chunrong"'

1. Backdooring Neural Code Search

2. A Survey of Learning-based Automated Program Repair

3. Boosting Automated Patch Correctness Prediction via Pre-trained Language Model

4. Automatic Code Summarization via ChatGPT: How Far Are We?

5. Certifying Robustness of Convolutional Neural Networks with Tight Linear Approximation

6. Universally Adaptive Cross-Platform Reinforcement Learning Testing via GUI Image Understanding

7. Test Case Prioritization Using Partial Attention

8. An Extractive-and-Abstractive Framework for Source Code Summarization

9. Program Repair: Automated vs. Manual

10. TranCS

11. TranCS.zip

12. Investigating Coverage Guided Fuzzing with Mutation Testing

13. Ruler

14. Ruler-v3

15. TranCS

16. Ruler-v2

17. TranCS

18. Ruler-v2

19. TranCS

20. Ruler-v3

21. Ruler

22. TranCS

23. TranCS.zip

24. BLESER: Bug Localization Based on Enhanced Semantic Retrieval

25. Mobile App Crowdsourced Test Report Consistency Detection via Deep Image-and-Text Fusion Understanding

26. Graph-Based Fuzz Testing for Deep Learning Inference Engine

27. Testing as an Investment

28. Testing as an Investment

29. FDFuzz: Applying Feature Detection to Fuzz Deep Learning Systems

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

Publisher

29 results on '"Fang, Chunrong"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources