Author: "Steffen Herbold" / Publisher: arxiv - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Steffen Herbold"' showing total 4 results

Start Over Author "Steffen Herbold" Publisher arxiv

4 results on '"Steffen Herbold"'

1. On the validity of pre-trained transformers for natural language processing in the software engineering domain

Author: Julian von der Mosel, Alexander Trautsch, and Steffen Herbold
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering, Computer Science - Machine Learning, Software, Machine Learning (cs.LG)
Abstract: Transformers are the current state-of-the-art of natural language processing in many domains and are using traction within software engineering research as well. Such models are pre-trained on large amounts of data, usually from the general domain. However, we only have a limited understanding regarding the validity of transformers within the software engineering domain, i.e., how good such models are at understanding words and sentences within a software engineering context and how this improves the state-of-the-art. Within this article, we shed light on this complex, but crucial issue. We compare BERT transformer models trained with software engineering data with transformers based on general domain data in multiple dimensions: their vocabulary, their ability to understand which words are missing, and their performance in classification tasks. Our results show that for tasks that require understanding of the software engineering context, pre-training with software engineering data is valuable, while general domain models are sufficient for general language understanding, also within the software engineering domain., Comment: Review status: submitted
Published: 2021
Full Text: View/download PDF

2. Exploring the relationship between performance metrics and cost saving potential of defect prediction models

Author: Steffen Tunkel and Steffen Herbold
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering, Software
Abstract: Context: Performance metrics are a core component of the evaluation of any machine learning model and used to compare models and estimate their usefulness. Recent work started to question the validity of many performance metrics for this purpose in the context of software defect prediction. Objective: Within this study, we explore the relationship between performance metrics and the cost saving potential of defect prediction models. We study whether performance metrics are suitable proxies to evaluate the cost saving capabilities and derive a theory for the relationship between performance metrics and cost saving potential. Methods: We measure performance metrics and cost saving potential in defect prediction experiments. We use a multinomial logit model, decision, and random forest to model the relationship between the metrics and the cost savings. Results: We could not find a stable relationship between cost savings and performance metrics. We attribute the lack of the relationship to the inability of performance metrics to account for the property that a small proportion of very large software artifacts are the main driver of the costs. Conclusion: Any defect prediction study interested in finding the best prediction model, must consider cost savings directly, because no reasonable claims regarding the economic benefits of defect prediction can be made otherwise., Comment: Under review
Published: 2021
Full Text: View/download PDF

3. A systematic mapping study of developer social network research

Author: Aynur Amirfallah, Fabian Trautsch, Steffen Herbold, and Jens Grabowski
Subjects: FOS: Computer and information sciences, Computer science, Developer social networks, 02 engineering and technology, External validity, Computer Science - Software Engineering, Software, 0502 economics and business, 0202 electrical engineering, electronic engineering, information engineering, Structure (mathematical logic), Social network, business.industry, 05 social sciences, 020207 software engineering, Data science, Replication (computing), Mapping study, Software Engineering (cs.SE), Literature survey, Hardware and Architecture, Systematic mapping, business, 050203 business & management, Information Systems
Abstract: Developer social networks (DSNs) are a tool for the analysis of community structures and collaborations between developers in software projects and software ecosystems. Within this paper, we present the results of a systematic mapping study on the use of DSNs in software engineering research. We identified 255 primary studies on DSNs. We mapped the primary studies to research directions, collected information about the data sources and the size of the studies, and conducted a bibliometric assessment. We found that nearly half of the research investigates the structure of developer communities. Other frequent topics are prediction systems build using DSNs, collaboration behavior between developers, and the roles of developers. Moreover, we determined that many publications use a small sample size regarding the number of projects, which could be problematic for the external validity of the research. Our study uncovered several open issues in the state of the art, e.g., studying inter-company collaborations, using multiple information sources for DSN research, as well as general lack of reporting guidelines or replication studies., Comment: Accepted at the Journal of Systems and Software
Published: 2019
Full Text: View/download PDF

4. Correction of 'A Comparative Study to Benchmark Cross-project Defect Prediction Approaches'

Author: Steffen Herbold, Jens Grabowski, and Alexander Trautsch
Subjects: FOS: Computer and information sciences, Computer science, business.industry, 020207 software engineering, 02 engineering and technology, Machine learning, computer.software_genre, Software Engineering (cs.SE), Computer Science - Software Engineering, 0202 electrical engineering, electronic engineering, information engineering, Benchmark (computing), Key (cryptography), Artificial intelligence, Cross project, business, computer, Software
Abstract: Unfortunately, the article "A Comparative Study to Benchmark Cross-project Defect Prediction Approaches" has a problem in the statistical analysis which was pointed out almost immediately after the pre-print of the article appeared online. While the problem does not negate the contribution of the the article and all key findings remain the same, it does alter some rankings of approaches used in the study. Within this correction, we will explain the problem, how we resolved it, and present the updated results.
Published: 2017
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

4 results on '"Steffen Herbold"'

1. On the validity of pre-trained transformers for natural language processing in the software engineering domain

2. Exploring the relationship between performance metrics and cost saving potential of defect prediction models

3. A systematic mapping study of developer social network research

4. Correction of 'A Comparative Study to Benchmark Cross-project Defect Prediction Approaches'

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

4 results on '"Steffen Herbold"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources