Author: "McIntosh, Shane" / Publication Year Range: Last 50 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"McIntosh, Shane"' showing total 307 results

1. A Systematic Survey on Debugging Techniques for Machine Learning Systems

Author: Nguyen, Thanh-Dat, Tian, Haoye, Le, Bach, Thongtanunam, Patanamon, and McIntosh, Shane
Subjects: Computer Science - Software Engineering
Abstract: Debugging ML software (i.e., the detection, localization and fixing of faults) poses unique challenges compared to traditional software largely due to the probabilistic nature and heterogeneity of its development process. Various methods have been proposed for testing, diagnosing, and repairing ML systems. However, the big picture informing important research directions that really address the dire needs of developers is yet to unfold, leaving several key questions unaddressed: (1) What faults have been targeted in the ML debugging research that fulfill developers' needs in practice? (2) How are these faults addressed? (3) What are the challenges in addressing the yet untargeted faults? In this paper, we conduct a systematic study of debugging techniques for machine learning systems. We first collect technical papers focusing on debugging components in machine learning software. We then map these papers to a taxonomy of faults to assess the current state of fault resolution identified in existing literature. Subsequently, we analyze which techniques are used to address specific faults based on the collected papers. This results in a comprehensive taxonomy that aligns faults with their corresponding debugging methods. Finally, we examine previously released transcripts of developer interviews to identify the challenges in resolving unfixed faults. Our analysis reveals that only 48 percent of the identified ML debugging challenges have been explicitly addressed by researchers, while 46.9 percent remain unresolved or unmentioned. In real-world applications, we found that 52.6 percent of issues reported on GitHub and 70.3 percent of problems discussed in interviews are still unaddressed by research in ML debugging. The study identifies 13 primary challenges in ML debugging, highlighting a significant gap between the identification of ML debugging issues and their resolution in practice.
Published: 2025

2. WIA-SZZ: Work Item Aware SZZ

Author: Perez-Rosero, Salomé, Dyer, Robert, Flint, Samuel W., McIntosh, Shane, and Srisa-an, Witawas
Subjects: Computer Science - Software Engineering
Abstract: Many software engineering maintenance tasks require linking a commit that induced a bug with the commit that later fixed that bug. Several existing SZZ algorithms provide a way to identify the potential commit that induced a bug when given a fixing commit as input. Prior work introduced the notion of a "work item", a logical grouping of commits that could be a single unit of work. Our key insight in this work is to recognize that a bug-inducing commit and the fix(es) for that bug together represent a "work item." It is not currently understood how these work items, which are logical groups of revisions addressing a single issue or feature, could impact the performance of algorithms such as SZZ. In this paper, we propose a heuristic that, given an input commit, uses information about changed methods to identify related commits that form a work item with the input commit. We hypothesize that given such a work-item-identifying heuristic, we can identify bug-inducing commits more accurately than existing SZZ approaches. We then build a new variant of SZZ that we call Work Item Aware SZZ (WIA-SZZ), which leverages our work-item-detecting heuristic to first suggest bug-inducing commits. If our heuristic fails to find any candidates, we then fall back to baseline variants of SZZ. We conduct a manual evaluation to assess the accuracy of our heuristic in identifying work items. Our evaluation reveals the heuristic is 64% accurate in finding work items, but most importantly it is able to find many bug-inducing commits. We then evaluate our approach on 821 repositories that have been previously used to study the performance of SZZ, comparing our work against six SZZ variants. That evaluation shows an improvement in F1 scores ranging from 2% to 9% or, when looking only at the subset of cases in which a work item was found, an improvement of 3% to 14%.
Published: 2024
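
Note: The fallback behaviour described in the abstract above (try the work item heuristic first, then a baseline SZZ variant if it yields no candidates) can be sketched roughly as follows. This is only an illustration of that control flow, not the authors' implementation; the function name and both callables are hypothetical placeholders that a reader would have to supply.

```python
from typing import Callable, Iterable, List


def identify_bug_inducing(
    fix_commit: str,
    work_item_heuristic: Callable[[str], Iterable[str]],  # hypothetical: returns candidate commits
    baseline_szz: Callable[[str], Iterable[str]],         # hypothetical: any baseline SZZ variant
) -> List[str]:
    """Return candidate bug-inducing commits for a fixing commit."""
    # First ask the work item heuristic for candidates.
    candidates = list(work_item_heuristic(fix_commit))
    if candidates:
        return candidates
    # The heuristic found nothing, so fall back to the baseline variant.
    return list(baseline_szz(fix_commit))
```

Passing the two strategies in as callables keeps the sketch independent of any particular SZZ implementation.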

3. On the Need to Monitor Continuous Integration Practices -- An Empirical Study

Author: Santos, Jadson, da Costa, Daniel Alencar, McIntosh, Shane, and Kulesza, Uirá
Subjects: Computer Science - Software Engineering
Abstract: Continuous Integration (CI) encompasses a set of widely adopted practices that enhance software development. However, there are indications that developers may not adequately monitor CI practices. Hence, this paper explores developers' perceptions regarding the monitoring of CI practices. To achieve this, we first perform a Document Analysis to assess developers' expressed need for practice monitoring in pull request comments generated by developers during the development process. After that, we conduct a survey among developers from 121 open-source projects to understand their perception of the significance of monitoring seven CI practices in their projects. Finally, we triangulate the emergent themes from our survey by performing a second Document Analysis to understand the extent of monitoring features supported by existing CI services. Our key findings indicate that: 1) the most frequently mentioned CI practice during the development process is "Test Coverage" (> 80%), while "Build Health" and "Time to Fix a Broken Build" present notable opportunities for monitoring CI practices; 2) developers do not adequately monitor all CI practices and express interest in monitoring additional practices; and 3) the most popular CI services currently offer limited native support for monitoring CI practices, requiring the use of third-party tools. Our results lead us to conclude that monitoring CI practices is often overlooked by both CI services and developers. Using third-party tools in conjunction with CI services is challenging: they monitor some redundant practices and still fall short of fully supporting the monitoring of CI practices. Therefore, CI services should implement CI practices monitoring, which would facilitate and encourage developers to monitor them.
Comment: Submitted to the Empirical Software Engineering Journal
Published: 2024

4. Revisiting the Performance of Deep Learning-Based Vulnerability Detection on Realistic Datasets

Author: Chakraborty, Partha, Arumugam, Krishna Kanth, Alfadel, Mahmoud, Nagappan, Meiyappan, and McIntosh, Shane
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security, Computer Science - Machine Learning, D.2, I.2
Abstract: The impact of software vulnerabilities on everyday software systems is significant. Despite deep learning models being proposed for vulnerability detection, their reliability is questionable. Prior evaluations show high recall/F1 scores of up to 99%, but these models underperform in practical scenarios, particularly when assessed on entire codebases rather than just the fixing commit. This paper introduces Real-Vul, a comprehensive dataset representing real-world scenarios for evaluating vulnerability detection models. Evaluating DeepWukong, LineVul, ReVeal, and IVDetect shows a significant drop in performance, with precision decreasing by up to 95 percentage points and F1 scores by up to 91 points. Furthermore, model performance fluctuates based on vulnerability characteristics, with better F1 scores for information leaks or code injection than for path resolution or predictable return values. The results highlight a significant performance gap that needs addressing before deploying deep learning-based vulnerability detection in practical settings. Overfitting is identified as a key issue, and an augmentation technique is proposed, potentially improving performance by up to 30%. Contributions include a dataset creation approach for better model evaluation, the Real-Vul dataset, and empirical evidence of deep learning models struggling in real-world settings.
Published: 2024
Full Text: View/download PDF

5. Quantifying and Characterizing Clones of Self-Admitted Technical Debt in Build Systems

Author: Xiao, Tao, Zeng, Zhili, Wang, Dong, Hata, Hideaki, McIntosh, Shane, and Matsumoto, Kenichi
Subjects: Computer Science - Software Engineering
Abstract: Self-Admitted Technical Debt (SATD) annotates development decisions that intentionally exchange long-term software artifact quality for short-term goals. Recent work explores the existence of SATD clones (duplicate or near duplicate SATD comments) in source code. Cloning of SATD in build systems (e.g., CMake and Maven) may propagate suboptimal design choices, threatening qualities of the build system that stakeholders rely upon (e.g., maintainability, reliability, repeatability). Hence, we conduct a large-scale study on 50,608 SATD comments extracted from Autotools, CMake, Maven, and Ant build systems to investigate the prevalence of SATD clones and to characterize their incidences. We observe that: (i) prior work suggests that 41-65% of SATD comments in source code are clones, but in our studied build system context, the rates range from 62% to 95%, suggesting that SATD clones are a more prevalent phenomenon in build systems than in source code; (ii) statements surrounding SATD clones are highly similar, with 76% of occurrences having similarity scores greater than 0.8; (iii) a quarter of SATD clones are introduced by the author of the original SATD statements; and (iv) among the most commonly cloned SATD comments, external factors (e.g., platform and tool configuration) are the most frequent locations, limitations in tools and libraries are the most frequent causes, and developers often copy SATD comments that describe issues to be fixed later. Our work presents the first step toward systematically understanding SATD clones in build systems and opens up avenues for future work, such as distinguishing different SATD clone behavior, as well as designing an automated recommendation system for repaying SATD effectively based on resolved clones.
Published: 2024
Full Text: View/download PDF

6. What Is an App Store? The Software Engineering Perspective

Author: Zhu, Wenhan, Proksch, Sebastian, German, Daniel M., Godfrey, Michael W., Li, Li, and McIntosh, Shane
Subjects: Computer Science - Software Engineering
Abstract: "App stores" are online software stores where end users may browse, purchase, download, and install software applications. By far, the best known app stores are associated with mobile platforms, such as Google Play for Android and Apple's App Store for iOS. The ubiquity of smartphones has led to mobile app stores becoming a touchstone experience of modern living. However, most app store research has concentrated on properties of the apps rather than the stores themselves. Today, there is a rich diversity of app stores and these stores have largely been overlooked by researchers: app stores exist on many distinctive platforms, are aimed at different classes of users, and have different end-goals beyond simply selling a standalone app to a smartphone user. We survey and characterize the broader dimensionality of app stores, and explore how and why they influence software development practices, such as system design and release management. We begin by collecting a set of app store examples from web search queries. By analyzing and curating the results, we derive a set of features common to app stores. We then build a dimensional model of app stores based on these features, and we fit each app store from our web search result set into this model. Next, we apply unsupervised clustering to the app stores to find their natural groupings. Our results suggest that app stores have become an essential stakeholder in modern software development. They control the distribution channel to end users and ensure that the applications are of suitable quality; in turn, this leads to developers adhering to various store guidelines when creating their applications. However, we found that the app stores' operational model could vary widely between stores, and this variability could in turn affect the generalizability of existing understanding of app stores.
Comment: 41 pages
Published: 2024
Full Text: View/download PDF

7. LLbezpeky: Leveraging Large Language Models for Vulnerability Detection

Author: Mathews, Noble Saji, Brus, Yelizaveta, Aafer, Yousra, Nagappan, Meiyappan, and McIntosh, Shane
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence, Computer Science - Software Engineering
Abstract: Despite the continued research and progress in building secure systems, Android applications continue to be riddled with vulnerabilities, necessitating effective detection methods. Current strategies involving static and dynamic analysis tools come with limitations, such as an overwhelming number of false positives and a limited scope of analysis, which make either difficult to adopt. Over the past years, machine learning based approaches have been extensively explored for vulnerability detection, but their real-world applicability is constrained by data requirements and feature engineering challenges. Large Language Models (LLMs), with their vast parameters, have shown tremendous potential in understanding semantics in human as well as programming languages. We dive into the efficacy of LLMs for detecting vulnerabilities in the context of Android security. We focus on building an AI-driven workflow to assist developers in identifying and rectifying vulnerabilities. Our experiments show that LLMs outperform our expectations in finding issues within applications, correctly flagging insecure apps in 91.67% of cases in the Ghera benchmark. We use inferences from our experiments towards building a robust and actionable vulnerability detection system and demonstrate its effectiveness. Our experiments also shed light on how various simple configurations can affect the True Positive (TP) and False Positive (FP) rates.
Comment: This project report was presented as a part of the course CS858 at the University of Waterloo under the supervision of Prof. Yousra Aafer
Published: 2024

8. Repeated Builds During Code Review: An Empirical Study of the OpenStack Community

Author: Maipradit, Rungroj, Wang, Dong, Thongtanunam, Patanamon, Kula, Raula Gaikovina, Kamei, Yasutaka, and McIntosh, Shane
Subjects: Computer Science - Software Engineering
Abstract: Code review is a popular practice where developers critique each others' changes. Since automated builds can identify low-level issues (e.g., syntactic errors, regression bugs), it is not uncommon for software organizations to incorporate automated builds in the code review process. In such code review deployment scenarios, submitted change sets must be approved for integration by both peer code reviewers and automated build bots. Since automated builds may produce an unreliable signal of the status of a change set (e.g., due to "flaky" or non-deterministic execution behaviour), code review tools, such as Gerrit, allow developers to request a "recheck", which repeats the build process without updating the change set. We conjecture that an unconstrained recheck command will waste time and resources if it is not applied judiciously. To explore how the recheck command is applied in a practical setting, in this paper, we conduct an empirical study of 66,932 code reviews from the OpenStack community. We quantitatively analyze (i) how often build failures are rechecked; (ii) the extent to which invoking recheck changes build failure outcomes; and (iii) how much waste is generated by invoking recheck. We observe that (i) 55% of code reviews invoke the recheck command after a failing build is reported; (ii) invoking the recheck command only changes the outcome of a failing build in 42% of the cases; and (iii) invoking the recheck command increases review waiting time by an average of 2,200% and equates to 187.4 compute years of waste, enough compute resources to compete with the oldest living land animal on earth.
Comment: conference
Published: 2023

9. Probing the toxicity of hydrothermal carbonised wastes on soil biota: Effect of reaction temperature and feedstock

Author: Luutu, Henry, Rose, Michael T., McIntosh, Shane, Van Zwieten, Lukas, and Rose, Terry J.
Published: 2024
Full Text: View/download PDF

10. Phytotoxicity induced by soil-applied hydrothermally-carbonised waste amendments: effect of reaction temperature, feedstock and soil nutrition

Author: Luutu, Henry, Rose, Michael T., McIntosh, Shane, Van Zwieten, Lukas, Weng, Han H., Pocock, Matt, and Rose, Terry J.
Published: 2023
Full Text: View/download PDF

11. Code Reviews with Divergent Review Scores: An Empirical Study of the OpenStack and Qt Communities

Author: Hirao, Toshiki, McIntosh, Shane, Ihara, Akinori, and Matsumoto, Kenichi
Subjects: Computer Science - Software Engineering
Abstract: Code review is a broadly adopted software quality practice where developers critique each others' patches. In addition to providing constructive feedback, reviewers may provide a score to indicate whether the patch should be integrated. Since reviewer opinions may differ, patches can receive both positive and negative scores. If reviews with divergent scores are not carefully resolved, they may contribute to a tense reviewing culture and may slow down integration. In this paper, we study patches with divergent review scores in the OPENSTACK and QT communities. Quantitative analysis indicates that patches with divergent review scores: (1) account for 15%-37% of patches that receive multiple review scores; (2) are integrated more often than they are abandoned; and (3) receive negative scores after positive ones in 70% of cases. Furthermore, a qualitative analysis indicates that patches with strongly divergent scores that: (4) are abandoned more often suffer from external issues (e.g., integration planning, content duplication) than patches with weakly divergent scores and patches without divergent scores; and (5) are integrated often address reviewer concerns indirectly (i.e., without changing patches). Our results suggest that review tooling should integrate with release schedules and detect concurrent development of similar patches to optimize review discussions with divergent scores. Moreover, patch authors should note that even the most divisive patches are often integrated through discussion, integration timing, and careful revision.
Comment: 2 pages, 1 table, Journal First, International Conference on Software Engineering 2021
Published: 2021
Full Text: View/download PDF

12. Assessing the Exposure of Software Changes: The DiPiDi Approach

Author: Meidani, Mehran, Lamothe, Maxime, and McIntosh, Shane
Subjects: Computer Science - Software Engineering
Abstract: Context: Changing a software application with many build-time configuration settings may introduce unexpected side-effects. For example, a change intended to be specific to a platform (e.g., Windows) or product configuration (e.g., community editions) might impact other platforms or configurations. Moreover, a change intended to apply to a set of platforms or configurations may be unintentionally limited to a subset. Indeed, understanding the exposure of source code changes is an important risk mitigation step in change-based development approaches. Objective: In this experiment, we seek to evaluate DiPiDi, a prototype implementation of our approach to assess the exposure of source code changes by statically analyzing build specifications. We focus our evaluation on the effectiveness and efficiency of developers when assessing the exposure of source code changes. Method: We will measure the effectiveness and efficiency of developers when performing five tasks in which they must identify the deliverable(s) and conditions under which a change will propagate. We will assign participants into three groups: without explicit tool support, supported by existing impact analysis tools, and supported by DiPiDi.
Published: 2021

13. Characterizing and Mitigating Self-Admitted Technical Debt in Build Systems

Author: Xiao, Tao, Wang, Dong, McIntosh, Shane, Hata, Hideaki, Kula, Raula Gaikovina, Ishio, Takashi, and Matsumoto, Kenichi
Subjects: Computer Science - Software Engineering
Abstract: Technical Debt is a metaphor used to describe the situation in which long-term software artifact quality is traded for short-term goals in software projects. In recent years, the concept of self-admitted technical debt (SATD) was proposed, which focuses on debt that is intentionally introduced and described by developers. Although prior work has made important observations about admitted technical debt in source code, little is known about SATD in build systems. In this paper, we set out to better understand the characteristics of SATD in build systems. To do so, through a qualitative analysis of 500 SATD comments in the Maven build system of 291 projects, we characterize SATD by location and rationale (reason and purpose). Our results show that limitations in tools and libraries, and complexities of dependency management are the most frequent causes, accounting for 50% and 24% of the comments. We also find that developers often document SATD as issues to be fixed later. As a first step towards the automatic detection of SATD rationale, we train classifiers to detect the two most frequently occurring reasons and the four most frequently occurring purposes of SATD in the content of comments in Maven build systems. The classifier performance is promising, achieving an F1-score of 0.71-0.79. Finally, within 16 identified 'ready-to-be-addressed' SATD instances, the three SATD submitted by pull requests and the five SATD submitted by issue reports were resolved after developers were made aware. Our work presents the first step towards understanding technical debt in build systems and opens up avenues for future work, such as tool support to track and manage SATD backlogs.
Published: 2021
Full Text: View/download PDF

14. Studying the impact of risk assessment analytics on risk awareness and code review performance

Author: Yu, Xueyao, Cogo, Filipe R., McIntosh, Shane, and Godfrey, Michael W.
Published: 2024
Full Text: View/download PDF

15. Lags in the Release, Adoption, and Propagation of npm Vulnerability Fixes

Author: Chinthanet, Bodin, Kula, Raula Gaikovina, McIntosh, Shane, Ishio, Takashi, Ihara, Akinori, and Matsumoto, Kenichi
Subjects: Computer Science - Software Engineering, Computer Science - Cryptography and Security
Abstract: Security vulnerabilities in third-party dependencies are a growing concern, not only for developers of the affected software but also because of the risks they pose to an entire software ecosystem (e.g., the Heartbleed vulnerability). Recent studies show that developers are slow to respond to the threat of vulnerability, sometimes taking four to eleven months to act. To ensure quick adoption and propagation of a release that contains the fix (fixing release), we conduct an empirical investigation to identify lags that may occur between the vulnerable release and its fixing release (package-side fixing release). Through a preliminary study of 231 package-side fixing releases of npm projects on GitHub, we observe that a fixing release is rarely released on its own, with up to 85.72% of the bundled commits being unrelated to a fix. We then compare the package-side fixing release with changes on the client side (client-side fixing release). Through an empirical study of the adoption and propagation tendencies of 1,290 package-side fixing releases that have an impact throughout a network of 1,553,325 releases of npm packages, we find that stale clients require additional migration effort, even if the package-side fixing release was quick (i.e., package patch landing). Furthermore, we show the influence of factors such as the branch that the package-side fixing release lands on and the severity of vulnerability on its propagation. In addition to identifying and characterizing these lags, this paper lays the groundwork for future research on how to mitigate lags in an ecosystem.
Comment: Published to Empirical Software Engineering Journal
Published: 2019
Full Text: View/download PDF

16. An empirical comparison of ethnic and gender diversity of DevOps and non-DevOps contributions to open-source projects

Author: Weeraddana, Nimmi Rashinika, Xu, Xiaoyan, Alfadel, Mahmoud, McIntosh, Shane, and Nagappan, Meiyappan
Published: 2023
Full Text: View/download PDF

17. The Impact of Automated Parameter Optimization on Defect Prediction Models

Author: Tantithamthavorn, Chakkrit, McIntosh, Shane, Hassan, Ahmed E., and Matsumoto, Kenichi
Subjects: Computer Science - Software Engineering
Abstract: Defect prediction models (classifiers that identify defect-prone software modules) have configurable parameters that control their characteristics (e.g., the number of trees in a random forest). Recent studies show that these classifiers underperform when default settings are used. In this paper, we study the impact of automated parameter optimization on defect prediction models. Through a case study of 18 datasets, we find that automated parameter optimization: (1) improves AUC performance by up to 40 percentage points; (2) yields classifiers that are at least as stable as those trained using default settings; (3) substantially shifts the importance ranking of variables, with as few as 28% of the top-ranked variables in optimized classifiers also being top-ranked in non-optimized classifiers; (4) yields optimized settings for 17 of the 20 most sensitive parameters that transfer among datasets without a statistically significant drop in performance; and (5) adds less than 30 minutes of additional computation to 12 of the 26 studied classification techniques. While widely-used classification techniques like random forest and support vector machines are not optimization-sensitive, traditionally overlooked techniques like C5.0 and neural networks can actually outperform widely-used techniques after optimization is applied. This highlights the importance of exploring the parameter space when using parameter-sensitive classification techniques.
Comment: 32 pages, accepted at IEEE Transactions on Software Engineering
Published: 2018

18. Crop fertilisation potential of phosphorus in hydrochars produced from sewage sludge

Author: McIntosh, Shane, Padilla, Ricardo Vasquez, Rose, Terry, Rose, Andrew L., Boukaka, Exhaussée, and Erler, Dirk
Published: 2022
Full Text: View/download PDF

19. Assessing the exposure of software changes: The DiPiDi approach

Author: Meidani, Mehran, Lamothe, Maxime, and McIntosh, Shane
Published: 2023
Full Text: View/download PDF

20. Plant growth responses to soil-applied hydrothermally-carbonised waste amendments: a meta-analysis

Author: Luutu, Henry, Rose, Michael T., McIntosh, Shane, Van Zwieten, Lukas, and Rose, Terry
Published: 2022
Full Text: View/download PDF

21. Extracting Build Changes with BUILDDIFF

Author: Macho, Christian, McIntosh, Shane, and Pinzger, Martin
Subjects: Computer Science - Software Engineering
Abstract: Build systems are an essential part of modern software engineering projects. As software projects change continuously, it is crucial to understand how the build system changes because neglecting its maintenance can lead to expensive build breakage. Recent studies have investigated the (co-)evolution of build configurations and reasons for build breakage, but they did this only on a coarse-grained level. In this paper, we present BUILDDIFF, an approach to extract detailed build changes from MAVEN build files and classify them into 95 change types. In a manual evaluation of 400 build-changing commits, we show that BUILDDIFF can extract and classify build changes with an average precision and recall of 0.96 and 0.98, respectively. We then present two studies using the build changes extracted from 30 open source Java projects to study the frequency and time of build changes. The results show that the top 10 most frequent change types account for 73% of the build changes. Among them, changes to version numbers and changes to dependencies of the projects occur most frequently. Furthermore, our results show that build changes occur frequently around releases. With these results, we provide the basis for further research, such as for analyzing the (co-)evolution of build files with other artifacts or improving effort estimation approaches. Furthermore, our detailed change information enables improvements of refactoring approaches for build configurations and improvements of models to identify error-prone build files.
Comment: Accepted at the International Conference of Mining Software Repositories (MSR), 2017
Published: 2017

22. Combined pyrolysis and sulphided NiMo/Al2O3 catalysed hydroprocessing in a multistage strategy for the production of biofuels from milk processing waste

Author: McIntosh, Shane, Nabi, Md Nurun, Moghaddam, Lalehvash, Brooks, Peter, Ghandehari, Parya Samadi, and Erler, Dirk
Published: 2021
Full Text: View/download PDF

23. The Classics Never Go Out of Style: An Empirical Study of Downgrades from the Bazel Build Technology

Author: Alfadel, Mahmoud and McIntosh, Shane
Published: 2024
Full Text: View/download PDF

24. How Trustworthy Is Your Continuous Integration (CI) Accelerator?: A Comparison of the Trustworthiness of CI Acceleration Products

Author: Zeng, Zhili, Xiao, Tao, Lamothe, Maxime, Hata, Hideaki, and McIntosh, Shane
Abstract: Continuous integration (CI) accelerators “turbo-charge” developer feedback cycles, but make mistakes. By replaying failing CI jobs, we study the trustworthiness of CI accelerators. Both studied products have limitations, but the program analysis product is more trustworthy than its machine learning counterpart.
Published: 2024
Full Text: View/download PDF

25. What is an app store? The software engineering perspective

Author: Zhu, Wenhan, Proksch, Sebastian, German, Daniel M., Godfrey, Michael W., Li, Li, and McIntosh, Shane
Published: 2024
Full Text: View/download PDF

26. Characterizing Timeout Builds in Continuous Integration

Author: Weeraddana, Nimmi, Alfadel, Mahmoud, and McIntosh, Shane
Published: 2024
Full Text: View/download PDF

27. The nature of build changes: An empirical study of Maven-based build systems

Author: Macho, Christian, Beyer, Stefanie, McIntosh, Shane, and Pinzger, Martin
Published: 2021
Full Text: View/download PDF

28. Mitigating the Uncertainty and Imprecision of Log-Based Code Coverage Without Requiring Additional Logging Statements

Author: Xu, Xiaoyan, Cogo, Filipe R., and McIntosh, Shane
Abstract: Understanding code coverage is an important precursor to software maintenance activities (e.g., better testing). Although modern code coverage tools provide key insights, they typically rely on code instrumentation, resulting in significant performance overhead. An alternative approach to code instrumentation is to process an application's source code and the associated log traces in tandem. This so-called “log-based code coverage” approach does not impose the same performance overhead as code instrumentation. Chen et al. proposed LogCoCo, a tool that implements log-based code coverage for Java. While LogCoCo breaks important new ground, it has fundamental limitations, namely: uncertainty due to the lack of logging statements in conditional branches, and imprecision caused by dependency injection. In this study, we propose Log2Cov, a tool that generates log-based code coverage for programs written in Python and addresses uncertainty and imprecision issues. We evaluate Log2Cov on three large and active open-source systems. More specifically, we compare the performance of Log2Cov to that of Coverage.py, an instrumentation-based coverage tool for Python. Our results indicate that 1) Log2Cov achieves high precision without introducing runtime overhead; and 2) uncertainty and imprecision can be reduced by up to 11% by statically analyzing the program's source code and execution logs, without requiring additional logging instrumentation from developers. While our enhancements make substantial improvements, we find that future work is needed to handle conditional statements and exception handling blocks to achieve parity with instrumentation-based approaches. We conclude the paper by drawing attention to these promising directions for future work.
Published: 2024
Full Text: View/download PDF

29. Characterizing the Prevalence, Distribution, and Duration of Stale Reviewer Recommendations

Author: Kazemi, Farshad, Lamothe, Maxime, and McIntosh, Shane
Abstract: The appropriate assignment of reviewers is a key factor in determining the value that organizations can derive from code review. While inappropriate reviewer recommendations can hinder the benefits of the code review process, identifying these assignments is challenging. Stale reviewers, i.e., those who no longer contribute to the project, are one type of reviewer recommendation that is certainly inappropriate. Understanding and minimizing this type of recommendation can thus enhance the benefits of the code review process. While recent work demonstrates the existence of stale reviewers, to the best of our knowledge, attempts have yet to be made to characterize and mitigate them. In this paper, we study the prevalence and potential effects. We then propose and assess a strategy to mitigate stale recommendations in existing code reviewer recommendation tools. By applying five code reviewer recommendation approaches (LearnRec, RetentionRec, cHRev, Sofia, and WLRRec) to three thriving open-source systems with 5,806 contributors, we observe that, on average, 12.59% of incorrect recommendations are stale due to developer turnover; however, fewer stale recommendations are made when the recency of contributions is considered by the recommendation objective function. We also investigate which reviewers appear in stale recommendations and observe that the top reviewers account for a considerable proportion of stale recommendations. For instance, in 15.31% of cases, the top-3 reviewers account for at least half of the stale recommendations. Finally, we study how long stale reviewers linger after the candidate leaves the project, observing that contributors who left the project 7.7 years ago are still suggested to review change sets. Based on our findings, we propose separating the reviewer contribution recency from the other factors that are used by the CRR objective function to filter out developers who have not contributed during a specified duration. 
By evaluating this strategy with different intervals, we assess the potential impact of this choice on the recommended reviewers. The proposed filter reduces the staleness of recommendations, i.e., the Staleness Reduction Ratio (SRR) improves between 21.44%–92.39%. Yet since the strategy may increase active reviewer workload, careful project-specific exploration of the impact of the cut-off setting is crucial.
Published: 2024
Full Text: View/download PDF

30. Understanding the quality and evolution of Android app build systems.

Author: Liu, Pei, Li, Li, Liu, Kui, McIntosh, Shane, and Grundy, John
Subjects: MOBILE app development, SOURCE code, SOFTWARE maintenance, COMPUTER software development, MOBILE apps
Abstract: Build systems are used to transform static source code into executable software. They play a crucial role in modern software development and maintenance. As such, much research effort has been invested in understanding the quality and evolution of build systems, including Apache ANT, Apache Maven, and Make‐based ones. However, the quality and evolution of build systems for mobile apps, such as on the Android platform, have not as yet been investigated in detail. Mobile app development, and the Android development context in particular, imposes unique constraints, such as different device conditions and capabilities. It presents unique challenges, such as frequently upgraded Android frameworks, which those who implement and maintain build systems must tackle. In this paper, we present an exploratory empirical study of the build systems of 5222 Android projects to better understand their quality and evolution. We (a) study the build technology choices that Android developers make (Gradle being recommended and the most popular choice), (b) explore the sustainability of the official Gradle build system (parts of build files are updated more frequently than others and the update of the special Gradle plugin would induce unrecommended configurations), and (c) analyze the quality of Gradle scripts for Android apps (more than half of the open‐source Android apps cannot be successfully built due to five common root causes). [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

31. Guest Editorial: Special Issue on Predictive Models and Data Analytics in Software Engineering

Author: Tosun, Ayse, McIntosh, Shane, Minku, Leandro, and Turhan, Burak
Published: 2020
Full Text: View/download PDF

32. The impact of rapid release cycles on the integration delay of fixed issues

Author: Costa, Daniel Alencar da, McIntosh, Shane, Treude, Christoph, Kulesza, Uirá, and Hassan, Ahmed E.
Published: 2018
Full Text: View/download PDF

33. Repeated Builds During Code Review: An Empirical Study of the OpenStack Community

Author: Maipradit, Rungroj, Wang, Dong, Thongtanunam, Patanamon, Kula, Raula Gaikovina, Kamei, Yasutaka, and McIntosh, Shane
Published: 2023
Full Text: View/download PDF

34. Understanding the quality and evolution of Android app build systems

Author: Liu, Pei, Li, Li, Liu, Kui, McIntosh, Shane, and Grundy, John
Published: 2023
Full Text: View/download PDF

35. Process options for conversion of Agave tequilana leaves into bioethanol

Author: Rijal, Deepa, Vancov, Tony, McIntosh, Shane, Ashwath, Nanjappa, and Stanley, Grant A.
Published: 2016
Full Text: View/download PDF

36. Lags in the release, adoption, and propagation of npm vulnerability fixes

Author: Chinthanet, Bodin, Kula, Raula Gaikovina, McIntosh, Shane, Ishio, Takashi, Ihara, Akinori, and Matsumoto, Kenichi
Abstract: Security vulnerability in third-party dependencies is a growing concern not only for developers of the affected software, but for the risks it poses to an entire software ecosystem, e.g., Heartbleed vulnerability. Recent studies show that developers are slow to respond to the threat of vulnerability, sometimes taking four to eleven months to act. To ensure quick adoption and propagation of a release that contains the fix (fixing release), we conduct an empirical investigation to identify lags that may occur between the vulnerable release and its fixing release (package-side fixing release). Through a preliminary study of 231 package-side fixing releases of npm projects on GitHub, we observe that a fixing release is rarely released on its own, with up to 85.72% of the bundled commits being unrelated to a fix. We then compare the package-side fixing release with changes on a client-side (client-side fixing release). Through an empirical study of the adoption and propagation tendencies of 1,290 package-side fixing releases that impact throughout a network of 1,553,325 releases of npm packages, we find that stale clients require additional migration effort, even if the package-side fixing release was quick (i.e., the package-side fixing release type is a patch). Furthermore, we show the influence of factors such as the branch that the package-side fixing release lands on and the severity of vulnerability on its propagation. In addition to these lags we identify and characterize, this paper lays the groundwork for future research on how to mitigate propagation lags in an ecosystem.
Published: 2023

37. Characterizing and Mitigating Self-Admitted Technical Debt in Build Systems

Author: Xiao, Tao, Wang, Dong, McIntosh, Shane, Hata, Hideaki, Kula, Raula Gaikovina, Ishio, Takashi, and Matsumoto, Kenichi
Abstract: Technical Debt is a metaphor used to describe the situation in which long-term software artifact quality is traded for short-term goals in software projects. In recent years, the concept of self-admitted technical debt (SATD) was proposed, which focuses on debt that is intentionally introduced and described by developers. Although prior work has made important observations about admitted technical debt in source code, little is known about SATD in build systems. In this paper, we set out to better understand the characteristics of SATD in build systems. To do so, through a qualitative analysis of 500 SATD comments in the Maven build system of 291 projects, we characterize SATD by location and rationale (reason and purpose). Our results show that limitations in tools and libraries, and complexities of dependency management are the most frequent causes, accounting for 50% and 24% of the comments. We also find that developers often document SATD as issues to be fixed later. As a first step towards the automatic detection of SATD rationale, we train classifiers to detect the two most frequently occurring reasons and the four most frequently occurring purposes of SATD in the content of comments in Maven build systems. The classifier performance is promising, achieving an F1-score of 0.71–0.79. Finally, within 16 identified ‘ready-to-be-addressed’ SATD instances, the three SATD submitted by pull requests and the five SATD submitted by issue reports were resolved after developers were made aware. Our work presents the first step towards understanding technical debt in build systems and opens up avenues for future work, such as tool support to track and manage SATD backlogs.
Published: 2023

38. The Impact of Mislabelling on the Performance and Interpretation of Defect Prediction Models

Author: Tantithamthavorn, Chakkrit, McIntosh, Shane, Hassan, Ahmed E., Ihara, Akinori, and Matsumoto, Kenichi
Abstract: The reliability of a prediction model depends on the quality of the data from which it was trained. Therefore, defect prediction models may be unreliable if they are trained using noisy data. Recent research suggests that randomly-injected noise that changes the classification (label) of software modules from defective to clean (and vice versa) can impact the performance of defect models. Yet, in reality, incorrectly labelled (i.e., mislabelled) issue reports are likely non-random. In this paper, we study whether mislabelling is random, and the impact that realistic mislabelling has on the performance and interpretation of defect models. Through a case study of 3,931 manually-curated issue reports from the Apache Jackrabbit and Lucene systems, we find that: (1) issue report mislabelling is not random; (2) precision is rarely impacted by mislabelled issue reports, suggesting that practitioners can rely on the accuracy of modules labelled as defective by models that are trained using noisy data; (3) however, models trained on noisy data typically achieve 56%-68% of the recall of models trained on clean data; and (4) only the metrics in top influence rank of our defect models are robust to the noise introduced by mislabelling, suggesting that the less influential metrics of models that are trained on noisy data should not be interpreted or used to make decisions. (2015 IEEE/ACM 37th IEEE International Conference on Software Engineering, 6-24 May 2015, Florence, Italy)
Published: 2023

39. App Store Feature Groups and Labeling of App Stores Based on Feature Groups

Author: Zhu, Wenhan, Proksch, Sebastian, German, Daniel, Godfrey, Michael, Li, Li, and McIntosh, Shane
Subjects: empirical software engineering, app store
Abstract: Data for our work on understanding app stores.
Published: 2023
Full Text: View/download PDF

40. An empirical study of the integration time of fixed issues

Author: da Costa, Daniel Alencar, McIntosh, Shane, Kulesza, Uirá, Hassan, Ahmed E., and Abebe, Surafel Lemma
Published: 2017
Full Text: View/download PDF

41. An empirical study of unspecified dependencies in make-based build systems

Author: Bezemer, Cor-Paul, McIntosh, Shane, Adams, Bram, German, Daniel M., and Hassan, Ahmed E.
Published: 2017
Full Text: View/download PDF

42. Review participation in modern code review: An empirical study of the android, Qt, and OpenStack projects

Author: Thongtanunam, Patanamon, McIntosh, Shane, Hassan, Ahmed E., and Iida, Hajimu
Published: 2017
Full Text: View/download PDF

43. Identifying and understanding header file hotspots in C/C++ build processes

Author: McIntosh, Shane, Adams, Bram, Nagappan, Meiyappan, and Hassan, Ahmed E.
Published: 2016
Full Text: View/download PDF

44. Studying just-in-time defect prediction using cross-project models

Author: Kamei, Yasutaka, Fukushima, Takafumi, McIntosh, Shane, Yamashita, Kazuhiro, Ubayashi, Naoyasu, and Hassan, Ahmed E.
Published: 2016
Full Text: View/download PDF

45. An empirical study of the impact of modern code review practices on software quality

Author: McIntosh, Shane, Kamei, Yasutaka, Adams, Bram, and Hassan, Ahmed E.
Published: 2016
Full Text: View/download PDF

46. Struvite Production from Dairy Processing Waste

Author: McIntosh, Shane, Hunt, Louise, Thompson Brewster, Emma, Rose, Andrew, Thornton, Aaron, and Erler, Dirk
Published: 2022
Full Text: View/download PDF

47. Characterizing and Mitigating Self-Admitted Technical Debt in Build Systems

Author: Xiao, Tao, Wang, Dong, McIntosh, Shane, Hata, Hideaki, Kula, Raula Gaikovina, Ishio, Takashi, and Matsumoto, Kenichi
Published: 2022
Full Text: View/download PDF

48. Exploring the Notion of Risk in Code Reviewer Recommendation

Author: Kazemi, Farshad, Lamothe, Maxime, and McIntosh, Shane
Published: 2022
Full Text: View/download PDF

49. A Large-Scale Empirical Study of the Relationship between Build Technology and Build Maintenance

Author: McIntosh, Shane, Nagappan, Meiyappan, Adams, Bram, Mockus, Audris, and Hassan, Ahmed E.
Published: 2015
Full Text: View/download PDF

50. Use of ionic liquids in converting lignocellulosic material to biofuels

Author: Vancov, Tony, Alston, Amy-Sue, Brown, Trevor, and McIntosh, Shane
Published: 2012
Full Text: View/download PDF
