Robust Federated Learning for execution time-based device model identification under label-flipping attack
- Authors
Sánchez Sánchez, Pedro Miguel; Huertas Celdrán, Alberto; Buendía Rubio, José R.; Bovet, Gérôme; Martínez Pérez, Gregorio
- Subjects
FOS: Computer and information sciences; Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Networks and Communications; Software
- Abstract
The explosion in computing device deployments experienced in recent years, driven by advances in technologies such as the Internet of Things (IoT) and 5G, has led to a global scenario with growing cybersecurity risks and threats. Among them, device spoofing and impersonation cyberattacks stand out due to their impact and the typically low complexity required to launch them. To address this issue, several solutions have emerged that identify device models and types by combining behavioral fingerprinting with Machine/Deep Learning (ML/DL) techniques. However, these solutions are not appropriate for scenarios where data privacy and protection are a must, as they require data centralization for processing. In this context, newer approaches such as Federated Learning (FL) have not yet been fully explored, especially when malicious clients are present in the scenario. The present work analyzes and compares the device model identification performance of a centralized DL model with that of an FL one, using execution time-based events. For experimental purposes, a dataset containing execution-time features of 55 Raspberry Pis belonging to four different models has been collected and published. Using this dataset, the proposed solution achieved 0.9999 accuracy in both the centralized and the federated setup, showing no performance decrease while preserving data privacy. Then, the impact of a label-flipping attack during federated model training is evaluated, using several aggregation mechanisms as countermeasures. Zeno and coordinate-wise median aggregation show the best performance, although it degrades greatly when the percentage of fully malicious clients (all training samples poisoned) grows beyond 50% (see the sketch after this record).
- Published
2023
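
As a clarifying aside to the abstract above, the following is a minimal NumPy sketch, not the authors' implementation, of the two ideas named there: a label-flipping attack on a client's local training labels, and coordinate-wise median aggregation as a robust alternative to plain averaging (FedAvg). The flipping rule, the toy update vectors, and helper names such as `flip_labels` and `coordinate_wise_median` are illustrative assumptions rather than details taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def flip_labels(y, n_classes):
    """Label-flipping attack: every label is remapped to another class,
    here with a simple deterministic shift (hypothetical flipping rule)."""
    return (y + 1) % n_classes

def coordinate_wise_median(client_updates):
    """Robust aggregation: take the median of every parameter coordinate
    across client updates, instead of the plain mean used by FedAvg."""
    return np.median(np.stack(client_updates, axis=0), axis=0)

# 1) The attack itself: a fully malicious client poisons all of its labels.
labels = rng.integers(0, 4, size=8)            # e.g. 4 device-model classes
print("clean labels:  ", labels)
print("flipped labels:", flip_labels(labels, n_classes=4))

# 2) Effect on aggregation: honest clients send updates near the "true"
#    direction, poisoned clients send updates pointing the opposite way.
true_update = np.ones(6)
n_clients, n_malicious = 10, 4
updates = [
    (-true_update if i < n_malicious else true_update)
    + rng.normal(scale=0.1, size=6)
    for i in range(n_clients)
]

print("FedAvg (mean):          ", np.mean(np.stack(updates), axis=0).round(2))
print("Coordinate-wise median: ", coordinate_wise_median(updates).round(2))
```

In this toy setup, with 4 of 10 clients fully poisoned, the plain mean is dragged toward zero while the per-coordinate median stays close to the honest update direction; once more than half of the clients are fully malicious the median itself fails, which is consistent with the degradation above 50% reported in the abstract.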