Author: "Tithi, Jesmin Jahan" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Tithi, Jesmin Jahan"' showing total 129 results

Start Over Author "Tithi, Jesmin Jahan"

129 results on '"Tithi, Jesmin Jahan"'

1. Enhancing Scalability and Performance in Influence Maximization with Optimized Parallel Processing

Author: Wu, Hanjiang, Xu, Huan, Park, Joongun, Tithi, Jesmin Jahan, Checconi, Fabio, Wolfson-Pou, Jordi, Petrini, Fabrizio, and Krishna, Tushar
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Data Structures and Algorithms
Abstract: Influence Maximization (IM) is vital in viral marketing and biological network analysis for identifying key influencers. Given its NP-hard nature, approximate solutions are employed. This paper addresses scalability challenges in scale-out shared memory system by focusing on the state-of-the-art Influence Maximization via Martingales (IMM) benchmark. To enhance the work efficiency of the current IMM implementation, we propose EFFICIENTIMM with key strategies, including new parallelization scheme, NUMA-aware memory usage, dynamic load balancing and fine-grained adaptive data structures. Benchmarking on a 128-core CPU system with 8 NUMA nodes, EFFICIENTIMM demonstrated significant performance improvements, achieving an average 5.9x speedup over Ripples across 8 diverse SNAP datasets, when compared to the best execution times of the original Ripples framework. Additionally, on the Youtube graph, EFFICIENTIMM demonstrates a better memory access pattern with 357.4x reduction in L1+L2 cache misses as compared to Ripples.
Published: 2024

2. Efficient Parallel Multi-Hop Reasoning: A Scalable Approach for Knowledge Graph Analysis

Author: Tithi, Jesmin Jahan, Checconi, Fabio, and Petrini, Fabrizio
Subjects: Computer Science - Artificial Intelligence, Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Data Structures and Algorithms, Computer Science - Machine Learning, Computer Science - Performance, H.4, C.4
Abstract: Multi-hop reasoning (MHR) is a process in artificial intelligence and natural language processing where a system needs to make multiple inferential steps to arrive at a conclusion or answer. In the context of knowledge graphs or databases, it involves traversing multiple linked entities and relationships to understand complex queries or perform tasks requiring a deeper understanding. Multi-hop reasoning is a critical function in various applications, including question answering, knowledge base completion, and link prediction. It has garnered significant interest in artificial intelligence, machine learning, and graph analytics. This paper focuses on optimizing MHR for time efficiency on large-scale graphs, diverging from the traditional emphasis on accuracy which is an orthogonal goal. We introduce a novel parallel algorithm that harnesses domain-specific learned embeddings to efficiently identify the top K paths between vertices in a knowledge graph to find the best answers to a three-hop query. Our contributions are: (1) We present a new parallel algorithm to enhance MHR performance, scalability and efficiency. (2) We demonstrate the algorithm's superior performance on leading-edge Intel and AMD architectures through empirical results. We showcase the algorithm's practicality through a case study on identifying academic affiliations of potential Turing Award laureates in Deep Learning, highlighting its capability to handle intricate entity relationships. This demonstrates the potential of our approach to enabling high-performance MHR, useful to navigate the growing complexity of modern knowledge graphs., Comment: 11 Pages with references
Published: 2024

3. Accelerating Sparse Tensor Decomposition Using Adaptive Linearized Representation

Author: Laukemann, Jan, Helal, Ahmed E., Anderson, S. Isaac Geronimo, Checconi, Fabio, Soh, Yongseok, Tithi, Jesmin Jahan, Ranadive, Teresa, Gravelle, Brian J, Petrini, Fabrizio, and Choi, Jee
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Data Structures and Algorithms, Computer Science - Performance
Abstract: High-dimensional sparse data emerge in many critical application domains such as cybersecurity, healthcare, anomaly detection, and trend analysis. To quickly extract meaningful insights from massive volumes of these multi-dimensional data, scientists employ unsupervised analysis tools based on tensor decomposition (TD) methods. However, real-world sparse tensors exhibit highly irregular shapes, data distributions, and sparsity, which pose significant challenges for making efficient use of modern parallel architectures. This study breaks the prevailing assumption that compressing sparse tensors into coarse-grained structures (i.e., tensor slices or blocks) or along a particular dimension/mode (i.e., mode-specific) is more efficient than keeping them in a fine-grained, mode-agnostic form. Our novel sparse tensor representation, Adaptive Linearized Tensor Order (ALTO), encodes tensors in a compact format that can be easily streamed from memory and is amenable to both caching and parallel execution. To demonstrate the efficacy of ALTO, we accelerate popular TD methods that compute the Canonical Polyadic Decomposition (CPD) model across a range of real-world sparse tensors. Additionally, we characterize the major execution bottlenecks of TD methods on multiple generations of the latest Intel Xeon Scalable processors, including Sapphire Rapids CPUs, and introduce dynamic adaptation heuristics to automatically select the best algorithm based on the sparse tensor characteristics. Across a diverse set of real-world data sets, ALTO outperforms the state-of-the-art approaches, achieving more than an order-of-magnitude speedup over the best mode-agnostic formats. Compared to the best mode-specific formats, which require multiple tensor copies, ALTO achieves more than 5.1x geometric mean speedup at a fraction (25%) of their storage., Comment: We extend the results of our previous ICS paper to significantly improve the parallel performance of the Canonical Polyadic Alternating Least Squares (CP-ALS) algorithm for normally distributed data and the Canonical Polyadic Alternating Poisson Regression (CP-APR) algorithm for non-negative count data
Published: 2024

4. Large Language Models Based Automatic Synthesis of Software Specifications

Author: Mandal, Shantanu, Chethan, Adhrik, Janfaza, Vahid, Mahmud, S M Farabi, Anderson, Todd A, Turek, Javier, Tithi, Jesmin Jahan, and Muzahid, Abdullah
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence
Abstract: Software configurations play a crucial role in determining the behavior of software systems. In order to ensure safe and error-free operation, it is necessary to identify the correct configuration, along with their valid bounds and rules, which are commonly referred to as software specifications. As software systems grow in complexity and scale, the number of configurations and associated specifications required to ensure the correct operation can become large and prohibitively difficult to manipulate manually. Due to the fast pace of software development, it is often the case that correct software specifications are not thoroughly checked or validated within the software itself. Rather, they are frequently discussed and documented in a variety of external sources, including software manuals, code comments, and online discussion forums. Therefore, it is hard for the system administrator to know the correct specifications of configurations due to the lack of clarity, organization, and a centralized unified source to look at. To address this challenge, we propose SpecSyn a framework that leverages a state-of-the-art large language model to automatically synthesize software specifications from natural language sources. Our approach formulates software specification synthesis as a sequence-to-sequence learning problem and investigates the extraction of specifications from large contextual texts. This is the first work that uses a large language model for end-to-end specification synthesis from natural language texts. Empirical results demonstrate that our system outperforms prior the state-of-the-art specification synthesis tool by 21% in terms of F1 score and can find specifications from single as well as multiple sentences.
Published: 2023

5. An Optimal Level-synchronous Shared-memory Parallel BFS Algorithm with Optimal parallel Prefix-sum Algorithm and its Implications for Energy Consumption

Author: Tithi, Jesmin Jahan, Fogel, Yonatan, and Chowdhury, Rezaul
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Computational Complexity, Computer Science - Data Structures and Algorithms
Abstract: We present a work-efficient parallel level-synchronous Breadth First Search (BFS) algorithm for shared-memory architectures which achieves the theoretical lower bound on parallel running time. The optimality holds regardless of the shape of the graph. We also demonstrate the implication of this optimality for the energy consumption of the program empirically. The key idea is never to use more processing cores than necessary to complete the work in any computation step efficiently. We keep the rest of the cores idle to save energy and to reduce other resource contentions (e.g., bandwidth, shared caches, etc). Our BFS does not use locks and atomic instructions and is easily extendible to shared-memory coprocessors., Comment: 2 pages, brief announcement
Published: 2022

6. Ridgeline: A 2D Roofline Model for Distributed Systems

Author: Checconi, Fabio, Tithi, Jesmin Jahan, and Petrini, Fabrizio
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Performance
Abstract: In this short paper, we introduce the Ridgeline model, an extension of the Roofline model [4] for distributed systems. The Roofline model targets shared memory systems, bounding the performance of a kernel based on its operational intensity, and the peak compute throughput and memory bandwidth of the execution system. In a distributed setting, with multiple communicating compute entities, the network must be taken into account to model the system behavior accurately. The Ridgeline aggregates information on compute, memory, and network limits in one 2D plot to show, in an intuitive way, which of the resources is the expected bottleneck. We show the applicability of the Ridgeline in a case study based on a data-parallel Multi-Layer Perceptron (MLP) instance., Comment: 5 pages
Published: 2022

7. Using Sentence Embeddings and Semantic Similarity for Seeking Consensus when Assessing Trustworthy AI

Author: Vetter, Dennis, Tithi, Jesmin Jahan, Westerlund, Magnus, Zicari, Roberto V., and Roig, Gemma
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence
Abstract: Assessing the trustworthiness of artificial intelligence systems requires knowledge from many different disciplines. These disciplines do not necessarily share concepts between them and might use words with different meanings, or even use the same words differently. Additionally, experts from different disciplines might not be aware of specialized terms readily used in other disciplines. Therefore, a core challenge of the assessment process is to identify when experts from different disciplines talk about the same problem but use different terminologies. In other words, the problem is to group problem descriptions (a.k.a. issues) with the same semantic meaning but described using slightly different terminologies. In this work, we show how we employed recent advances in natural language processing, namely sentence embeddings and semantic textual similarity, to support this identification process and to bridge communication gaps in interdisciplinary teams of experts assessing the trustworthiness of an artificial intelligence system used in healthcare.
Published: 2022

8. How to Assess Trustworthy AI in Practice

Author: Zicari, Roberto V., Amann, Julia, Bruneault, Frédérick, Coffee, Megan, Düdder, Boris, Hickman, Eleanore, Gallucci, Alessio, Gilbert, Thomas Krendl, Hagendorff, Thilo, van Halem, Irmhild, Hildt, Elisabeth, Holm, Sune, Kararigas, Georgios, Kringen, Pedro, Madai, Vince I., Mathez, Emilie Wiinblad, Tithi, Jesmin Jahan, Vetter, Dennis, Westerlund, Magnus, and Wurth, Renee
Subjects: Computer Science - Computers and Society
Abstract: This report is a methodological reflection on Z-Inspection$^{\small{\circledR}}$. Z-Inspection$^{\small{\circledR}}$ is a holistic process used to evaluate the trustworthiness of AI-based technologies at different stages of the AI lifecycle. It focuses, in particular, on the identification and discussion of ethical issues and tensions through the elaboration of socio-technical scenarios. It uses the general European Union's High-Level Expert Group's (EU HLEG) guidelines for trustworthy AI. This report illustrates for both AI researchers and AI practitioners how the EU HLEG guidelines for trustworthy AI can be applied in practice. We share the lessons learned from conducting a series of independent assessments to evaluate the trustworthiness of AI systems in healthcare. We also share key recommendations and practical suggestions on how to ensure a rigorous trustworthy AI assessment throughout the life-cycle of an AI system., Comment: On behalf of the Z-Inspection$^{\small{\circledR}}$ initiative (2022)
Published: 2022
Full Text: View/download PDF

9. Efficient, Out-of-Memory Sparse MTTKRP on Massively Parallel Architectures

Author: Nguyen, Andy, Helal, Ahmed E., Checconi, Fabio, Laukemann, Jan, Tithi, Jesmin Jahan, Soh, Yongseok, Ranadive, Teresa, Petrini, Fabrizio, and Choi, Jee W.
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Data Structures and Algorithms, Computer Science - Performance
Abstract: Tensor decomposition (TD) is an important method for extracting latent information from high-dimensional (multi-modal) sparse data. This study presents a novel framework for accelerating fundamental TD operations on massively parallel GPU architectures. In contrast to prior work, the proposed Blocked Linearized Coordinate (BLCO) format enables efficient out-of-memory computation of tensor algorithms using a unified implementation that works on a single tensor copy. Our adaptive blocking and linearization strategies not only meet the resource constraints of GPU devices, but also accelerate data indexing, eliminate control-flow and memory-access irregularities, and reduce kernel launching overhead. To address the substantial synchronization cost on GPUs, we introduce an opportunistic conflict resolution algorithm, in which threads collaborate instead of contending on memory access to discover and resolve their conflicting updates on-the-fly, without keeping any auxiliary information or storing non-zero elements in specific mode orientations. As a result, our framework delivers superior in-memory performance compared to prior state-of-the-art, and is the only framework capable of processing out-of-memory tensors. On the latest Intel and NVIDIA GPUs, BLCO achieves 2.12-2.6X geometric-mean speedup (with up to 33.35X speedup) over the state-of-the-art mixed-mode compressed sparse fiber (MM-CSF) on a range of real-world sparse tensors., Comment: Accepted to ICS 2022
Published: 2022
Full Text: View/download PDF

10. Lessons Learned from Assessing Trustworthy AI in Practice

Author: Vetter, Dennis, Amann, Julia, Bruneault, Frédérick, Coffee, Megan, Düdder, Boris, Gallucci, Alessio, Gilbert, Thomas Krendl, Hagendorff, Thilo, van Halem, Irmhild, Hickman, Eleanore, Hildt, Elisabeth, Holm, Sune, Kararigas, Georgios, Kringen, Pedro, Madai, Vince I., Wiinblad Mathez, Emilie, Tithi, Jesmin Jahan, Westerlund, Magnus, Wurth, Renee, and Zicari, Roberto V.
Published: 2023
Full Text: View/download PDF

11. A New Parallel Algorithm for Sinkhorn Word-Movers Distance and Its Performance on PIUMA and Xeon CPU

Author: Tithi, Jesmin Jahan and Petrini, Fabrizio
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Hardware Architecture, Computer Science - Machine Learning, Computer Science - Performance
Abstract: The Word Movers Distance (WMD) measures the semantic dissimilarity between two text documents by computing the cost of optimally moving all words of a source/query document to the most similar words of a target document. Computing WMD between two documents is costly because it requires solving an $O(V^3log(V))$ optimization problem where $V$ is the number of unique words in the document. Fortunately, WMD can be framed as an Earth Mover's Distance (EMD) for which the algorithmic complexity can be reduced to $O(V^2)$ by adding an entropy penalty to the optimization problem and solving it using the Sinkhorn-Knopp algorithm. Additionally, the computation can be made highly parallel by adopting a batching approach, i.e., computing the WMD of a single query document against multiple target documents at once. Sinkhorn WMD is a key kernel used in many ML/NLP applications. and usually gets implemented in Python. However, a straightforward Python implementation may leave significant performance on the table even though it may internally call optimized C++ BLAS routines. We present a new sparse {P}arallel {A}lgorithm for {S}inkhorn-Knopp {W}ord-movers {D}istance to compute the semantic distance of one document to many other documents by adopting the $O(V^2)$ EMD algorithm. We algorithmically transform $O(V^2)$ dense compute-heavy EMD version into an equivalent sparse one using new fused SDDMM-SpMM (sparse selection of dense-dense matrix-, sparse-dense matrix-multiplication) kernels. We implemented and optimized this algorithm for two very different architectures -- the new Intel Programmable Integrated Unified Memory Architecture (PIUMA) and Intel Xeon CPUs. We show that we were able to reach close to peak performance on both platforms., Comment: 11 Pages. arXiv admin note: substantial text overlap with arXiv:2005.06727
Published: 2021

12. Performance Optimization of SU3_Bench on Xeon and Programmable Integrated Unified Memory Architecture

Author: Tithi, Jesmin Jahan, Checconi, Fabio, Doerfler, Douglas, and Petrini, Fabrizio
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Hardware Architecture
Abstract: SU3\_Bench is a microbenchmark developed to explore performance portability across multiple programming models/methodologies using a simple, but nontrivial, mathematical kernel. This kernel has been derived from the MILC lattice quantum chromodynamics (LQCD) code. SU3\_Bench is bandwidth bound and generates regular compute and data access patterns. Therefore, on most traditional CPU and GPU-based systems, its performance is mainly determined by the achievable memory bandwidth. Although SU3\_Bench is a simple kernel, experience says its subtleties require a certain amount of tweaking to achieve peak performance for a given programming model and hardware, making performance portability challenging. In this paper, we share some of the challenges in obtaining the peak performance for SU3\_Bench on a state-of-the-art Intel Xeon machine, due to the nuances of variable definition, the nature of compiler-provided default constructors, how memory is accessed at object creation time, and the NUMA effects on the machine. We discuss how to tackle those challenges to improve SU3\_Bench's performance by $2\times$ compared to the original OpenMP implementation available at Github. This provides a valuable lesson for other similar kernels. Expanding on the performance portability aspects, we also show early results obtained porting SU3\_Bench to the new Intel Programmable Integrated Unified Memory Architecture (PIUMA), characterized by a more balanced flops-to-byte ratio. This paper shows that it is not the usual bandwidth or flops, rather the pipeline throughput, that determines SU3\_Bench's performance on PIUMA. Finally, we show how to improve performance on PIUMA and how that compares with the performance on Xeon, which has around one order of magnitude more flops-per-byte., Comment: 11 pages
Published: 2021

13. ALTO: Adaptive Linearized Storage of Sparse Tensors

Author: Helal, Ahmed E., Laukemann, Jan, Checconi, Fabio, Tithi, Jesmin Jahan, Ranadive, Teresa, Petrini, Fabrizio, and Choi, Jeewhan
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Data Structures and Algorithms, Computer Science - Performance
Abstract: The analysis of high-dimensional sparse data is becoming increasingly popular in many important domains. However, real-world sparse tensors are challenging to process due to their irregular shapes and data distributions. We propose the Adaptive Linearized Tensor Order (ALTO) format, a novel mode-agnostic (general) representation that keeps neighboring nonzero elements in the multi-dimensional space close to each other in memory. To generate the indexing metadata, ALTO uses an adaptive bit encoding scheme that trades off index computations for lower memory usage and more effective use of memory bandwidth. Moreover, by decoupling its sparse representation from the irregular spatial distribution of nonzero elements, ALTO eliminates the workload imbalance and greatly reduces the synchronization overhead of tensor computations. As a result, the parallel performance of ALTO-based tensor operations becomes a function of their inherent data reuse. On a gamut of tensor datasets, ALTO outperforms an oracle that selects the best state-of-the-art format for each dataset, when used in key tensor decomposition operations. Specifically, ALTO achieves a geometric mean speedup of 8X over the best mode-agnostic (coordinate and hierarchical coordinate) formats, while delivering a geometric mean compression ratio of 4.3X relative to the best mode-specific (compressed sparse fiber) formats., Comment: Accepted to ICS 2021
Published: 2021
Full Text: View/download PDF

14. Mapping Stencils on Coarse-grained Reconfigurable Spatial Architecture

Author: Tithi, Jesmin Jahan, Petrini, Fabrizio, Rong, Hongbo, Valentin, Andrei, and Ebeling, Carl
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Hardware Architecture, Computer Science - Performance
Abstract: Stencils represent a class of computational patterns where an output grid point depends on a fixed shape of neighboring points in an input grid. Stencil computations are prevalent in scientific applications engaging a significant portion of supercomputing resources. Therefore, it has been always important to optimize stencil programs for the best performance. A rich body of research has focused on optimizing stencil computations on almost all parallel architectures. Stencil applications have regular dependency patterns, inherent pipeline-parallelism, and plenty of data reuse. This makes these applications a perfect match for a coarse-grained reconfigurable spatial architecture (CGRA). A CGRA consists of many simple, small processing elements (PEs) connected with an on-chip network. Each PE can be configured to execute part of a stencil computation and all PEs run in parallel; the network can also be configured so that data loaded can be passed from a PE to a neighbor PE directly and thus reused by many PEs without register spilling and memory traffic. How to efficiently map a stencil computation to a CGRA is the key to performance. In this paper, we show a few unique and generalizable ways of mapping one- and multidimensional stencil computations to a CGRA, fully exploiting the data reuse opportunities and parallelism. Our simulation experiments demonstrate that these mappings are efficient and enable the CGRA to outperform state-of-the-art GPUs., Comment: 9 Pages
Published: 2020

15. PIUMA: Programmable Integrated Unified Memory Architecture

Author: Aananthakrishnan, Sriram, Ahmed, Nesreen K., Cave, Vincent, Cintra, Marcelo, Demir, Yigit, Bois, Kristof Du, Eyerman, Stijn, Fryman, Joshua B., Ganev, Ivan, Heirman, Wim, Hoppe, Hans-Christian, Howard, Jason, Hur, Ibrahim, Kodiyath, MidhunChandra, Jain, Samkit, Klowden, Daniel S., Landowski, Marek M., Montigny, Laurent, More, Ankit, Ossowski, Przemyslaw, Pawlowski, Robert, Pepperling, Nick, Petrini, Fabrizio, Sikora, Mariusz, Seshasayee, Balasubramanian, Smith, Shaden, Szkoda, Sebastian, Tayal, Sanjaya, Tithi, Jesmin Jahan, Vandriessche, Yves, and Wrosz, Izajasz P.
Subjects: Computer Science - Hardware Architecture, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: High performance large scale graph analytics is essential to timely analyze relationships in big data sets. Conventional processor architectures suffer from inefficient resource usage and bad scaling on graph workloads. To enable efficient and scalable graph analysis, Intel developed the Programmable Integrated Unified Memory Architecture (PIUMA). PIUMA consists of many multi-threaded cores, fine-grained memory and network accesses, a globally shared address space and powerful offload engines. This paper presents the PIUMA architecture, and provides initial performance estimations, projecting that a PIUMA node will outperform a conventional compute node by one to two orders of magnitude. Furthermore, PIUMA continues to scale across multiple nodes, which is a challenge in conventional multinode setups.
Published: 2020

16. MISIM: A Neural Code Semantics Similarity System Using the Context-Aware Semantics Structure

Author: Ye, Fangke, Zhou, Shengtian, Venkat, Anand, Marcus, Ryan, Tatbul, Nesime, Tithi, Jesmin Jahan, Hasabnis, Niranjan, Petersen, Paul, Mattson, Timothy, Kraska, Tim, Dubey, Pradeep, Sarkar, Vivek, and Gottschlich, Justin
Subjects: Computer Science - Machine Learning, Computer Science - Software Engineering, Statistics - Machine Learning
Abstract: Code semantics similarity can be used for many tasks such as code recommendation, automated software defect correction, and clone detection. Yet, the accuracy of such systems has not yet reached a level of general purpose reliability. To help address this, we present Machine Inferred Code Similarity (MISIM), a neural code semantics similarity system consisting of two core components: (i)MISIM uses a novel context-aware semantics structure, which was purpose-built to lift semantics from code syntax; (ii)MISIM uses an extensible neural code similarity scoring algorithm, which can be used for various neural network architectures with learned parameters. We compare MISIM to four state-of-the-art systems, including two additional hand-customized models, over 328K programs consisting of over 18 million lines of code. Our experiments show that MISIM has 8.08% better accuracy (using MAP@R) compared to the next best performing system., Comment: arXiv admin note: text overlap with arXiv:2003.11118
Published: 2020

17. An Efficient Shared-memory Parallel Sinkhorn-Knopp Algorithm to Compute the Word Mover's Distance

Author: Tithi, Jesmin Jahan and Petrini, Fabrizio
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing, Statistics - Machine Learning
Abstract: The Word Mover's Distance (WMD) is a metric that measures the semantic dissimilarity between two text documents by computing the cost of moving all words of a source/query document to the most similar words of a target document optimally. Computing WMD between two documents is costly because it requires solving an optimization problem that costs $O(V^3log(V))$ where $V$ is the number of unique words in the document. Fortunately, the WMD can be framed as the Earth Mover's Distance (EMD) (also known as the Optimal Transportation Distance) for which it has been shown that the algorithmic complexity can be reduced to $O(V^2)$ by adding an entropy penalty to the optimization problem and a similar idea can be adapted to compute WMD efficiently. Additionally, the computation can be made highly parallel by computing WMD of a single query document against multiple target documents at once (e.g., finding whether a given tweet is similar to any other tweets happened in a day). In this paper, we present a shared-memory parallel Sinkhorn-Knopp Algorithm to compute the WMD of one document against many other documents by adopting the $O(V^2)$ EMD algorithm. We used algorithmic transformations to change the original dense compute-heavy kernel to a sparse compute kernel and obtained $67\times$ speedup using $96$ cores on the state-of-the-art of Intel\textregistered{} 4-sockets Cascade Lake machine w.r.t. its sequential run. Our parallel algorithm is over $700\times$ faster than the naive parallel python code that internally uses optimized matrix library calls., Comment: 10 pages, 1 page for reference, total 11 pages
Published: 2020

18. Online and Real-time Object Tracking Algorithm with Extremely Small Matrices

Author: Tithi, Jesmin Jahan, Aananthakrishnan, Sriram, and Petrini, Fabrizio
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Online and Real-time Object Tracking is an interesting workload that can be used to track objects (e.g., car, human, animal) in a series of video sequences in real-time. For simple object tracking on edge devices, the output of object tracking could be as simple as drawing a bounding box around a detected object and in some cases, the input matrices used in such computation are quite small (e.g., 4x7, 3x3, 5x5, etc). As a result, the amount of actual work is low. Therefore, a typical multi-threading based parallelization technique can not accelerate the tracking application; instead, a throughput based parallelization technique where each thread operates on independent video sequences is more rewarding. In this paper, we share our experience in parallelizing a Simple Online and Real-time Tracking (SORT) application on shared-memory multicores., Comment: 5 Pages (4 Pages main paper, 5th page for reference), Accepted for presentation in WHPC 2020 Summit which got canceled for Corona. But it will not be published in Digital Library
Published: 2020

19. Context-Aware Parse Trees

Author: Ye, Fangke, Zhou, Shengtian, Venkat, Anand, Marcus, Ryan, Petersen, Paul, Tithi, Jesmin Jahan, Mattson, Tim, Kraska, Tim, Dubey, Pradeep, Sarkar, Vivek, and Gottschlich, Justin
Subjects: Computer Science - Programming Languages, Computer Science - Artificial Intelligence
Abstract: The simplified parse tree (SPT) presented in Aroma, a state-of-the-art code recommendation system, is a tree-structured representation used to infer code semantics by capturing program \emph{structure} rather than program \emph{syntax}. This is a departure from the classical abstract syntax tree, which is principally driven by programming language syntax. While we believe a semantics-driven representation is desirable, the specifics of an SPT's construction can impact its performance. We analyze these nuances and present a new tree structure, heavily influenced by Aroma's SPT, called a \emph{context-aware parse tree} (CAPT). CAPT enhances SPT by providing a richer level of semantic representation. Specifically, CAPT provides additional binding support for language-specific techniques for adding semantically-salient features, and language-agnostic techniques for removing syntactically-present but semantically-irrelevant features. Our research quantitatively demonstrates the value of our proposed semantically-salient features, enabling a specific CAPT configuration to be 39\% more accurate than SPT across the 48,610 programs we analyzed.
Published: 2020

20. SU3_Bench on a Programmable Integrated Unified Memory Architecture (PIUMA) and How that Differs from Standard NUMA CPUs

Author: Tithi, Jesmin Jahan, Checconi, Fabio, Doerfler, Douglas, Petrini, Fabrizio, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Varbanescu, Ana-Lucia, editor, Bhatele, Abhinav, editor, Luszczek, Piotr, editor, and Marc, Baboulin, editor
Published: 2022
Full Text: View/download PDF

21. Lessons Learned from Accelerating Quicksilver on Programmable Integrated Unified Memory Architecture (PIUMA) and How That’s Different from CPU

Author: Tithi, Jesmin Jahan, Petrini, Fabrizio, Richards, David F., Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Chamberlain, Bradford L., editor, Varbanescu, Ana-Lucia, editor, Ltaief, Hatem, editor, and Luszczek, Piotr, editor
Published: 2021
Full Text: View/download PDF

22. The Intel Programmable and Integrated Unified Memory Architecture Graph Analytics Processor

Author: Aananthakrishnan, Sriram, primary, Abedin, Shamsul, additional, Cavé, Vincent, additional, Checconi, Fabio, additional, Bois, Kristof Du, additional, Eyerman, Stijn, additional, Fryman, Joshua B., additional, Heirman, Wim, additional, Howard, Jason, additional, Hur, Ibrahim, additional, Jain, Samkit, additional, Landowski, Marek M., additional, Ma, Kevin, additional, Nelson, Jarrod A., additional, Pawlowski, Robert, additional, Petrini, Fabrizio, additional, Szkoda, Sebastian, additional, Tayal, Sanjaya, additional, Tithi, Jesmin Jahan, additional, and Vandriessche, Yves, additional
Published: 2023
Full Text: View/download PDF

23. An Efficient Cache-oblivious Parallel Viterbi Algorithm

Author: Chowdhury, Rezaul, Ganapathi, Pramod, Pradhan, Vivek, Tithi, Jesmin Jahan, Xiao, Yunpeng, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Dutot, Pierre-François, editor, and Trystram, Denis, editor
Published: 2016
Full Text: View/download PDF

24. Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams

Author: Soh, Yongseok, primary, Helal, Ahmed E., additional, Checconi, Fabio, additional, Laukemann, Jan, additional, Tithi, Jesmin Jahan, additional, Ranadive, Teresa, additional, Petrini, Fabrizio, additional, and Choi, Jee W., additional
Published: 2023
Full Text: View/download PDF

25. Assessing Trustworthy AI in Times of COVID-19: Deep Learning for Predicting a Multiregional Score Conveying the Degree of Lung Compromise in COVID-19 Patients

Author: Allahabadi, Himanshi, primary, Amann, Julia, additional, Balot, Isabelle, additional, Beretta, Andrea, additional, Binkley, Charles, additional, Bozenhard, Jonas, additional, Bruneault, Frederick, additional, Brusseau, James, additional, Candemir, Sema, additional, Cappellini, Luca Alessandro, additional, Chakraborty, Subrata, additional, Cherciu, Nicoleta, additional, Cociancig, Christina, additional, Coffee, Megan, additional, Ek, Irene, additional, Espinosa-Leal, Leonardo, additional, Farina, Davide, additional, Fieux-Castagnet, Genevieve, additional, Frauenfelder, Thomas, additional, Gallucci, Alessio, additional, Giuliani, Guya, additional, Golda, Adam, additional, van Halem, Irmhild, additional, Hildt, Elisabeth, additional, Holm, Sune, additional, Kararigas, Georgios, additional, Krier, Sebastien A., additional, Kuhne, Ulrich, additional, Lizzi, Francesca, additional, Madai, Vince I., additional, Markus, Aniek F., additional, Masis, Serg, additional, Mathez, Emilie Wiinblad, additional, Mureddu, Francesco, additional, Neri, Emanuele, additional, Osika, Walter, additional, Ozols, Matiss, additional, Panigutti, Cecilia, additional, Parent, Brendan, additional, Pratesi, Francesca, additional, Moreno-Sanchez, Pedro A., additional, Sartor, Giovanni, additional, Savardi, Mattia, additional, Signoroni, Alberto, additional, Sormunen, Hanna-Maria, additional, Spezzatti, Andy, additional, Srivastava, Adarsh, additional, Stephansen, Annette F., additional, Theng, Lau Bee, additional, Tithi, Jesmin Jahan, additional, Tuominen, Jarno, additional, Umbrello, Steven, additional, Vaccher, Filippo, additional, Vetter, Dennis, additional, Westerlund, Magnus, additional, Wurth, Renee, additional, and Zicari, Roberto V., additional
Published: 2022
Full Text: View/download PDF

26. Efficient, out-of-memory sparse MTTKRP on massively parallel architectures

Author: Nguyen, Andy, primary, Helal, Ahmed E., additional, Checconi, Fabio, additional, Laukemann, Jan, additional, Tithi, Jesmin Jahan, additional, Soh, Yongseok, additional, Ranadive, Teresa, additional, Petrini, Fabrizio, additional, and Choi, Jee W., additional
Published: 2022
Full Text: View/download PDF

27. Assessing Trustworthy AI in Times of COVID-19.:Deep Learning for Predicting a Multiregional Score Conveying the Degree of Lung Compromise in COVID-19 Patients

Author: Allahabadi, Himanshi, Amann, Julia, Balot, Isabelle, Beretta, Andrea, Binkley, Charles, Bozenhard, Jonas, Bruneault, Frederick, Brusseau, James, Candemir, Sema, Cappellini, Luca Alessandro, Chakraborty, Subrata, Cherciu, Nicoleta, Cociancig, Christina, Coffee, Megan, Ek, Irene, Espinosa-Leal, Leonardo, Farina, Davide, Fieux-Castagnet, Genevieve, Frauenfelder, Thomas, Gallucci, Alessio, Giuliani, Guya, Golda, Adam, van Halem, Irmhild, Hildt, Elisabeth, Holm, Sune, Kararigas, Georgios, Krier, Sebastien A, Kuhne, Ulrich, Lizzi, Francesca, Madai, Vince I, Markus, Aniek F, Masis, Serg, Mathez, Emilie Wiinblad, Mureddu, Francesco, Neri, Emanuele, Osika, Walter, Ozols, Matiss, Panigutti, Cecilia, Parent, Brendan, Pratesi, Francesca, Moreno-Sanchez, Pedro A, Sartor, Giovanni, Savardi, Mattia, Signoroni, Alberto, Sormunen, Hanna-Maria, Spezzatti, Andy, Srivastava, Adarsh, Stephansen, Annette F, Theng, Lau Bee, Tithi, Jesmin Jahan, Tuominen, Jarno, Umbrello, Steven, Vaccher, Filippo, Vetter, Dennis, Westerlund, Magnus, Wurth, Renee, Zicari, Roberto V, Allahabadi, Himanshi, Amann, Julia, Balot, Isabelle, Beretta, Andrea, Binkley, Charles, Bozenhard, Jonas, Bruneault, Frederick, Brusseau, James, Candemir, Sema, Cappellini, Luca Alessandro, Chakraborty, Subrata, Cherciu, Nicoleta, Cociancig, Christina, Coffee, Megan, Ek, Irene, Espinosa-Leal, Leonardo, Farina, Davide, Fieux-Castagnet, Genevieve, Frauenfelder, Thomas, Gallucci, Alessio, Giuliani, Guya, Golda, Adam, van Halem, Irmhild, Hildt, Elisabeth, Holm, Sune, Kararigas, Georgios, Krier, Sebastien A, Kuhne, Ulrich, Lizzi, Francesca, Madai, Vince I, Markus, Aniek F, Masis, Serg, Mathez, Emilie Wiinblad, Mureddu, Francesco, Neri, Emanuele, Osika, Walter, Ozols, Matiss, Panigutti, Cecilia, Parent, Brendan, Pratesi, Francesca, Moreno-Sanchez, Pedro A, Sartor, Giovanni, Savardi, Mattia, Signoroni, Alberto, Sormunen, Hanna-Maria, Spezzatti, Andy, Srivastava, Adarsh, Stephansen, Annette F, Theng, Lau Bee, Tithi, Jesmin Jahan, Tuominen, Jarno, Umbrello, Steven, Vaccher, Filippo, Vetter, Dennis, Westerlund, Magnus, Wurth, Renee, and Zicari, Roberto V
Abstract: This article's main contributions are twofold: 1) to demonstrate how to apply the general European Union's High-Level Expert Group's (EU HLEG) guidelines for trustworthy AI in practice for the domain of healthcare and 2) to investigate the research question of what does "trustworthy AI" mean at the time of the COVID-19 pandemic. To this end, we present the results of a post-hoc self-assessment to evaluate the trustworthiness of an AI system for predicting a multiregional score conveying the degree of lung compromise in COVID-19 patients, developed and verified by an interdisciplinary team with members from academia, public hospitals, and industry in time of pandemic. The AI system aims to help radiologists to estimate and communicate the severity of damage in a patient's lung from Chest X-rays. It has been experimentally deployed in the radiology department of the ASST Spedali Civili clinic in Brescia, Italy, since December 2020 during pandemic time. The methodology we have applied for our post-hoc assessment, called Z-Inspection®, uses sociotechnical scenarios to identify ethical, technical, and domain-specific issues in the use of the AI system in the context of the pandemic.
Published: 2022

28. Assessing Trustworthy AI in times of COVID-19:Deep Learning for predicting a multi-regional score conveying the degree of lung compromise in COVID-19 patients

Author: Allahabadi, Himanshi, Amann, Julia, Balot, Isabelle, Beretta, Andrea, Binkley, Charles, Bozenhard, Jonas, Bruneault, Frédérick, Brusseau, James, Candemir, Sema, Cappellini, Luca Alessandro, Chakraborty, Subrata, Cherciu, Nicoleta, Cociancig, Christina, Coffee, Megan, Ek, Irene, Espinosa-Leal, Leonardo, Farina, Davide, Fieux-Castagnet, Genevieve, Frauenfelder, Thomas, Gallucci, Alessio, Giuliani, Guya, Golda, Adam, Halem, Irmhild van, Hildt, Elisabeth, Holm, Sune, Kararigas, Georgios, Krier, Sebastien A., Kühne, Ulrich, Lizzi, Francesca, Madai, Vince I., Markus, Aniek F., Masis, Serg, Mathez, Emilie Wiinblad, Mureddu, Francesco, Neri, Emanuele, Osika, Walter, Ozols, Matiss, Panigutti, Cecilia, Parent, Brendan, Pratesi, Francesca, Moreno-Sánchez, Pedro A., Sartor, Giovanni, Savardi, Mattia, Signoroni, Alberto, Sormunen, Hanna, Spezzatti, Andy, Srivastava, Adarsh, Stephansen, Annette F., Theng, Lau Bee, Tithi, Jesmin Jahan, Tuominen, Jarno, Umbrello, Steven, Vaccher, Filippo, Vetter, Dennis, Westerlund, Magnus, Wurth, Renee, Zicari, Roberto V., Allahabadi, Himanshi, Amann, Julia, Balot, Isabelle, Beretta, Andrea, Binkley, Charles, Bozenhard, Jonas, Bruneault, Frédérick, Brusseau, James, Candemir, Sema, Cappellini, Luca Alessandro, Chakraborty, Subrata, Cherciu, Nicoleta, Cociancig, Christina, Coffee, Megan, Ek, Irene, Espinosa-Leal, Leonardo, Farina, Davide, Fieux-Castagnet, Genevieve, Frauenfelder, Thomas, Gallucci, Alessio, Giuliani, Guya, Golda, Adam, Halem, Irmhild van, Hildt, Elisabeth, Holm, Sune, Kararigas, Georgios, Krier, Sebastien A., Kühne, Ulrich, Lizzi, Francesca, Madai, Vince I., Markus, Aniek F., Masis, Serg, Mathez, Emilie Wiinblad, Mureddu, Francesco, Neri, Emanuele, Osika, Walter, Ozols, Matiss, Panigutti, Cecilia, Parent, Brendan, Pratesi, Francesca, Moreno-Sánchez, Pedro A., Sartor, Giovanni, Savardi, Mattia, Signoroni, Alberto, Sormunen, Hanna, Spezzatti, Andy, Srivastava, Adarsh, Stephansen, Annette F., Theng, Lau Bee, Tithi, Jesmin Jahan, Tuominen, Jarno, Umbrello, Steven, Vaccher, Filippo, Vetter, Dennis, Westerlund, Magnus, Wurth, Renee, and Zicari, Roberto V.
Abstract: The paper’s main contributions are twofold: to demonstrate how to apply the general European Union’s High-Level Expert Group’s (EU HLEG) guidelines for trustworthy AI in practice for the domain of healthcare; and to investigate the research question of what does “trustworthy AI” mean at the time of the COVID-19 pandemic. To this end, we present the results of a post-hoc self-assessment to evaluate the trustworthiness of an AI system for predicting a multi-regional score conveying the degree of lung compromise in COVID-19 patients, developed and verified by an interdisciplinary team with members from academia, public hospitals, and industry in time of pandemic. The AI system aims to help radiologists to estimate and communicate the severity of damage in a patient’s lung from Chest X-rays. It has been experimentally deployed in the radiology department of the ASST Spedali Civili clinic in Brescia (Italy) since December 2020 during pandemic time. The methodology we have applied for our post-hoc assessment, called Z-Inspection, uses socio-technical scenarios to identify ethical, technical and domain-specific issues in the use of the AI system in the context of the pandemic.
Published: 2022

29. An Efficient Cache-oblivious Parallel Viterbi Algorithm

Author: Chowdhury, Rezaul, primary, Ganapathi, Pramod, additional, Pradhan, Vivek, additional, Tithi, Jesmin Jahan, additional, and Xiao, Yunpeng, additional
Published: 2016
Full Text: View/download PDF

30. On assessing trustworthy AI in healthcare:Best practice for machine learning as a supportive tool to recognize cardiac arrest in emergency calls

Author: Zicari, Roberto V., Brusseau, James, Blomberg, Stig Nikolaj, Christensen, Helle Collatz, Coffee, Megan, Ganapini, Marianna B., Gerke, Sara, Gilbert, Thomas Krendl, Hickman, Eleanore, Hildt, Elisabeth, Holm, Sune, Kühne, Ulrich, Madai, Vince I., Osika, Walter, Spezzatti, Andy, Schnebel, Eberhard, Tithi, Jesmin Jahan, Vetter, Dennis, Westerlund, Magnus, Wurth, Renee, Amann, Julia, Antun, Vegard, Beretta, Valentina, Bruneault, Frédérick, Campano, Erik, Düdder, Boris, Gallucci, Alessio, Goffi, Emmanuel, Haase, Christoffer Bjerre, Hagendorff, Thilo, Kringen, Pedro, Möslein, Florian, Ottenheimer, Davi, Ozols, Matiss, Palazzani, Laura, Petrin, Martin, Tafur, Karin, Tørresen, Jim, Volland, Holger, and Kararigas, Georgios
Subjects: ComputingMethodologies_PATTERNRECOGNITION, GeneralLiterature_MISCELLANEOUS
Abstract: Artificial Intelligence (AI) has the potential to greatly improve the delivery of healthcare and other services that advance population health and wellbeing. However, the use of AI in healthcare also brings potential risks that may cause unintended harm. To guide future developments in AI, the High-Level Expert Group on AI set up by the European Commission (EC), recently published ethics guidelines for what it terms “trustworthy” AI. These guidelines are aimed at a variety of stakeholders, especially guiding practitioners toward more ethical and more robust applications of AI. In line with efforts of the EC, AI ethics scholarship focuses increasingly on converting abstract principles into actionable recommendations. However, the interpretation, relevance, and implementation of trustworthy AI depend on the domain and the context in which the AI system is used. The main contribution of this paper is to demonstrate how to use the general AI HLEG trustworthy AI guidelines in practice in the healthcare domain. To this end, we present a best practice of assessing the use of machine learning as a supportive tool to recognize cardiac arrest in emergency calls. The AI system under assessment is currently in use in the city of Copenhagen in Denmark. The assessment is accomplished by an independent team composed of philosophers, policy makers, social scientists, technical, legal, and medical experts. By leveraging an interdisciplinary team, we aim to expose the complex trade-offs and the necessity for such thorough human review when tackling socio-technical applications of AI in healthcare. For the assessment, we use a process to assess trustworthy AI, called 1Z-Inspection® to identify specific challenges and potential ethical trade-offs when we consider AI in practice.
Published: 2021

31. Co-Design of a Trustworthy AI System in Healthcare: Deep Learning Based Skin Lesion Classifier

Author: Zicari, Roberto V., primary, Ahmed, Sheraz, additional, Amann, Julia, additional, Braun, Stephan Alexander, additional, Brodersen, John, additional, Bruneault, Frédérick, additional, Brusseau, James, additional, Campano, Erik, additional, Coffee, Megan, additional, Dengel, Andreas, additional, Düdder, Boris, additional, Gallucci, Alessio, additional, Gilbert, Thomas Krendl, additional, Gottfrois, Philippe, additional, Goffi, Emmanuel, additional, Haase, Christoffer Bjerre, additional, Hagendorff, Thilo, additional, Hickman, Eleanore, additional, Hildt, Elisabeth, additional, Holm, Sune, additional, Kringen, Pedro, additional, Kühne, Ulrich, additional, Lucieri, Adriano, additional, Madai, Vince I., additional, Moreno-Sánchez, Pedro A., additional, Medlicott, Oriana, additional, Ozols, Matiss, additional, Schnebel, Eberhard, additional, Spezzatti, Andy, additional, Tithi, Jesmin Jahan, additional, Umbrello, Steven, additional, Vetter, Dennis, additional, Volland, Holger, additional, Westerlund, Magnus, additional, and Wurth, Renee, additional
Published: 2021
Full Text: View/download PDF

32. On Assessing Trustworthy AI in Healthcare. Machine Learning as a Supportive Tool to Recognize Cardiac Arrest in Emergency Calls

Author: Zicari, Roberto V., primary, Brusseau, James, additional, Blomberg, Stig Nikolaj, additional, Christensen, Helle Collatz, additional, Coffee, Megan, additional, Ganapini, Marianna B., additional, Gerke, Sara, additional, Gilbert, Thomas Krendl, additional, Hickman, Eleanore, additional, Hildt, Elisabeth, additional, Holm, Sune, additional, Kühne, Ulrich, additional, Madai, Vince I., additional, Osika, Walter, additional, Spezzatti, Andy, additional, Schnebel, Eberhard, additional, Tithi, Jesmin Jahan, additional, Vetter, Dennis, additional, Westerlund, Magnus, additional, Wurth, Renee, additional, Amann, Julia, additional, Antun, Vegard, additional, Beretta, Valentina, additional, Bruneault, Frédérick, additional, Campano, Erik, additional, Düdder, Boris, additional, Gallucci, Alessio, additional, Goffi, Emmanuel, additional, Haase, Christoffer Bjerre, additional, Hagendorff, Thilo, additional, Kringen, Pedro, additional, Möslein, Florian, additional, Ottenheimer, Davi, additional, Ozols, Matiss, additional, Palazzani, Laura, additional, Petrin, Martin, additional, Tafur, Karin, additional, Tørresen, Jim, additional, Volland, Holger, additional, and Kararigas, Georgios, additional
Published: 2021
Full Text: View/download PDF

33. ALTO

Author: Helal, Ahmed E., primary, Laukemann, Jan, additional, Checconi, Fabio, additional, Tithi, Jesmin Jahan, additional, Ranadive, Teresa, additional, Petrini, Fabrizio, additional, and Choi, Jeewhan, additional
Published: 2021
Full Text: View/download PDF

34. Z-Inspection®: A Process to Assess Trustworthy AI

Author: Zicari, Roberto V., primary, Brodersen, John, additional, Brusseau, James, additional, Dudder, Boris, additional, Eichhorn, Timo, additional, Ivanov, Todor, additional, Kararigas, Georgios, additional, Kringen, Pedro, additional, McCullough, Melissa, additional, Moslein, Florian, additional, Mushtaq, Naveed, additional, Roig, Gemma, additional, Sturtz, Norman, additional, Tolle, Karsten, additional, Tithi, Jesmin Jahan, additional, van Halem, Irmhild, additional, and Westerlund, Magnus, additional
Published: 2021
Full Text: View/download PDF

35. On assessing trustworthy AI in healthcare : Machine learning as a supportive tool to recognize cardiac arrest in emergency calls

Author: Zicari, Roberto V., Brusseau, James, Blomberg, Stig Nikolaj, Christensen, Helle Collatz, Coffee, Megan, Ganapini, Marianna B., Gerke, Sara, Gilbert, Thomas Krendl, Hickman, Eleanore, Hildt, Elisabeth, Holm, Sune, Kühne, Ulrich, Madai, Vince I., Osika, Walter, Spezzatti, Andy, Schnebel, Eberhard, Tithi, Jesmin Jahan, Vetter, Dennis, Westerlund, Magnus, Wurth, Renee, Amann, Julia, Antun, Vegard, Beretta, Valentina, Bruneault, Frédérick, Campano, Erik, Düdder, Boris, Gallucci, Alessio, Goffi, Emmanuel, Haase, Christoffer Bjerre, Hagendorff, Thilo, Kringen, Pedro, Möslein, Florian, Ottenheimer, Davi, Ozols, Matiss, Palazzani, Laura, Petrin, Martin, Tafur, Karin, Tørresen, Jim, Volland, Holger, Kararigas, Georgios, Zicari, Roberto V., Brusseau, James, Blomberg, Stig Nikolaj, Christensen, Helle Collatz, Coffee, Megan, Ganapini, Marianna B., Gerke, Sara, Gilbert, Thomas Krendl, Hickman, Eleanore, Hildt, Elisabeth, Holm, Sune, Kühne, Ulrich, Madai, Vince I., Osika, Walter, Spezzatti, Andy, Schnebel, Eberhard, Tithi, Jesmin Jahan, Vetter, Dennis, Westerlund, Magnus, Wurth, Renee, Amann, Julia, Antun, Vegard, Beretta, Valentina, Bruneault, Frédérick, Campano, Erik, Düdder, Boris, Gallucci, Alessio, Goffi, Emmanuel, Haase, Christoffer Bjerre, Hagendorff, Thilo, Kringen, Pedro, Möslein, Florian, Ottenheimer, Davi, Ozols, Matiss, Palazzani, Laura, Petrin, Martin, Tafur, Karin, Tørresen, Jim, Volland, Holger, and Kararigas, Georgios
Abstract: Artificial Intelligence (AI) has the potential to greatly improve the delivery of healthcare and other services that advance population health and wellbeing. However, the use of AI in healthcare also brings potential risks that may cause unintended harm. To guide future developments in AI, the High-Level Expert Group on AI set up by the European Commission (EC), recently published ethics guidelines for what it terms “trustworthy” AI. These guidelines are aimed at a variety of stakeholders, especially guiding practitioners toward more ethical and more robust applications of AI. In line with efforts of the EC, AI ethics scholarship focuses increasingly on converting abstract principles into actionable recommendations. However, the interpretation, relevance, and implementation of trustworthy AI depend on the domain and the context in which the AI system is used. The main contribution of this paper is to demonstrate how to use the general AI HLEG trustworthy AI guidelines in practice in the healthcare domain. To this end, we present a best practice of assessing the use of machine learning as a supportive tool to recognize cardiac arrest in emergency calls. The AI system under assessment is currently in use in the city of Copenhagen in Denmark. The assessment is accomplished by an independent team composed of philosophers, policy makers, social scientists, technical, legal, and medical experts. By leveraging an interdisciplinary team, we aim to expose the complex trade-offs and the necessity for such thorough human review when tackling socio-technical applications of AI in healthcare. For the assessment, we use a process to assess trustworthy AI, called 1Z-Inspection® to identify specific challenges and potential ethical trade-offs when we consider AI in practice.
Published: 2021
Full Text: View/download PDF

36. Co-design of a trustworthy AI system in healthcare : deep learning based skin lesion classifier

Author: Zicari, Roberto V., Ahmed, Sheraz, Amann, Julia, Braun, Stephan Alexander, Brodersen, John, Bruneault, Frédérick, Brusseau, James, Campano, Erik, Coffee, Megan, Dengel, Andreas, Düdder, Boris, Gallucci, Alessio, Gilbert, Thomas Krendl, Gottfrois, Philippe, Goffi, Emmanuel, Haase, Christoffer Bjerre, Hagendorff, Thilo, Hickman, Eleanore, Hildt, Elisabeth, Holm, Sune, Kringen, Pedro, Kühne, Ulrich, Lucieri, Adriano, Madai, Vince I., Moreno-Sánchez, Pedro A., Medlicott, Oriana, Ozols, Matiss, Schnebel, Eberhard, Spezzatti, Andy, Tithi, Jesmin Jahan, Umbrello, Steven, Vetter, Dennis, Volland, Holger, Westerlund, Magnus, Wurth, Renee, Zicari, Roberto V., Ahmed, Sheraz, Amann, Julia, Braun, Stephan Alexander, Brodersen, John, Bruneault, Frédérick, Brusseau, James, Campano, Erik, Coffee, Megan, Dengel, Andreas, Düdder, Boris, Gallucci, Alessio, Gilbert, Thomas Krendl, Gottfrois, Philippe, Goffi, Emmanuel, Haase, Christoffer Bjerre, Hagendorff, Thilo, Hickman, Eleanore, Hildt, Elisabeth, Holm, Sune, Kringen, Pedro, Kühne, Ulrich, Lucieri, Adriano, Madai, Vince I., Moreno-Sánchez, Pedro A., Medlicott, Oriana, Ozols, Matiss, Schnebel, Eberhard, Spezzatti, Andy, Tithi, Jesmin Jahan, Umbrello, Steven, Vetter, Dennis, Volland, Holger, Westerlund, Magnus, and Wurth, Renee
Abstract: This paper documents how an ethically aligned co-design methodology ensures trustworthiness in the early design phase of an artificial intelligence (AI) system component for healthcare. The system explains decisions made by deep learning networks analyzing images of skin lesions. The co-design of trustworthy AI developed here used a holistic approach rather than a static ethical checklist and required a multidisciplinary team of experts working with the AI designers and their managers. Ethical, legal, and technical issues potentially arising from the future use of the AI system were investigated. This paper is a first report on co-designing in the early design phase. Our results can also serve as guidance for other early-phase AI-similar tool developments.
Published: 2021
Full Text: View/download PDF

37. Z-Inspection®: A Process to Assess Trustworthy AI

Author: Zicari, Roberto V., Brodersen, John, Brusseau, James, Dudder, Boris, Eichhorn, Timo, Ivanov, Todor, Kararigas, Georgios, Kringen, Pedro, McCullough, Melissa, Moslein, Florian, Mushtaq, Naveed, Roig, Gemma, Sturtz, Norman, Tolle, Karsten, Tithi, Jesmin Jahan, Halem, Irmhild van, Westerlund, Magnus, Zicari, Roberto V., Brodersen, John, Brusseau, James, Dudder, Boris, Eichhorn, Timo, Ivanov, Todor, Kararigas, Georgios, Kringen, Pedro, McCullough, Melissa, Moslein, Florian, Mushtaq, Naveed, Roig, Gemma, Sturtz, Norman, Tolle, Karsten, Tithi, Jesmin Jahan, Halem, Irmhild van, and Westerlund, Magnus
Published: 2021

38. Co-design of a trustworthy AI system in healthcare:Deep learning based skin lesion classifier

Author: Zicari, Roberto V., Ahmed, Sheraz, Amann, Julia, Braun, Stephan Alexander, Brodersen, John, Bruneault, Frédérick, Brusseau, James, Campano, Erik, Coffee, Megan, Dengel, Andreas, Düdder, Boris, Gallucci, Alessio, Gilbert, Thomas Krendl, Gottfrois, Philippe, Goffi, Emmanuel, Haase, Christoffer Bjerre, Hagendorff, Thilo, Hickman, Eleanore, Hildt, Elisabeth, Holm, Sune, Kringen, Pedro, Kühne, Ulrich, Lucieri, Adriano, Madai, Vince I., Moreno-Sánchez, Pedro A., Medlicott, Oriana, Ozols, Matiss, Schnebel, Eberhard, Spezzatti, Andy, Tithi, Jesmin Jahan, Umbrello, Steven, Vetter, Dennis, Volland, Holger, Westerlund, Magnus, Wurth, Renee, Zicari, Roberto V., Ahmed, Sheraz, Amann, Julia, Braun, Stephan Alexander, Brodersen, John, Bruneault, Frédérick, Brusseau, James, Campano, Erik, Coffee, Megan, Dengel, Andreas, Düdder, Boris, Gallucci, Alessio, Gilbert, Thomas Krendl, Gottfrois, Philippe, Goffi, Emmanuel, Haase, Christoffer Bjerre, Hagendorff, Thilo, Hickman, Eleanore, Hildt, Elisabeth, Holm, Sune, Kringen, Pedro, Kühne, Ulrich, Lucieri, Adriano, Madai, Vince I., Moreno-Sánchez, Pedro A., Medlicott, Oriana, Ozols, Matiss, Schnebel, Eberhard, Spezzatti, Andy, Tithi, Jesmin Jahan, Umbrello, Steven, Vetter, Dennis, Volland, Holger, Westerlund, Magnus, and Wurth, Renee
Abstract: This paper documents how an ethically aligned co-design methodology ensures trustworthiness in the early design phase of an artificial intelligence (AI) system component for healthcare. The system explains decisions made by deep learning networks analyzing images of skin lesions. The co-design of trustworthy AI developed here used a holistic approach rather than a static ethical checklist and required a multidisciplinary team of experts working with the AI designers and their managers. Ethical, legal, and technical issues potentially arising from the future use of the AI system were investigated. This paper is a first report on co-designing in the early design phase. Our results can also serve as guidance for other early-phase AI-similar tool developments.
Published: 2021

39. Autogen: Automatic Discovery of Efficient Recursive Divide-8-Conquer Algorithms for Solving Dynamic Programming Problems

Author: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Chowdhury, Rezaul, Ganapathi, Pramod, Tschudi, Stephen, Tithi, Jesmin Jahan, Bachmeier, Charles, Leiserson, Charles E, Solar-Lezama, Armando, Kuszmaul, Bradley C, Tang, Yuan, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Chowdhury, Rezaul, Ganapathi, Pramod, Tschudi, Stephen, Tithi, Jesmin Jahan, Bachmeier, Charles, Leiserson, Charles E, Solar-Lezama, Armando, Kuszmaul, Bradley C, and Tang, Yuan
Abstract: © 2017 ACM. We present Autogen-an algorithm that for a wide class of dynamic programming (DP) problems automatically discovers highly efficient cache-oblivious parallel recursive divide-And-conquer algorithms from inefficient iterative descriptions of DP recurrences. Autogen analyzes the set of DP table locations accessed by the iterative algorithm when run on a DP table of small size and automatically identifies a recursive access pattern and a corresponding provably correct recursive algorithm for solving the DP recurrence.We use Autogen to autodiscover efficient algorithms for several well-known problems. Our experimental results show that several autodiscovered algorithms significantly outperform parallel looping and tiled loop-based algorithms. Also, these algorithms are less sensitive to fluctuations of memory and bandwidth compared with their looping counterparts, and their running times and energy profiles remain relatively more stable. To the best of our knowledge, Autogen is the first algorithm that can automatically discover new nontrivial divide-And-conquer algorithms.
Published: 2021

40. Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge Pruning

Author: Tithi, Jesmin Jahan, primary, Stasiak, Andrzej, additional, Aananthakrishnan, Sriram, additional, and Petrini, Fabrizio, additional
Published: 2020
Full Text: View/download PDF

41. Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge Pruning.

Author: Tithi, Jesmin Jahan, Stasiak, Andrzej, Aananthakrishnan, Sriram, and Petrini, Fabrizio
Published: 2020
Full Text: View/download PDF

42. Provably Efficient Scheduling of Cache-oblivious Wavefront Algorithms

Author: Chowdhury, Rezaul, primary, Ganapathi, Pramod, additional, Tang, Yuan, additional, and Tithi, Jesmin Jahan, additional
Published: 2017
Full Text: View/download PDF

43. Autogen

Author: Chowdhury, Rezaul, primary, Ganapathi, Pramod, additional, Tschudi, Stephen, additional, Tithi, Jesmin Jahan, additional, Bachmeier, Charles, additional, Leiserson, Charles E., additional, Solar-Lezama, Armando, additional, Kuszmaul, Bradley C., additional, and Tang, Yuan, additional
Published: 2017
Full Text: View/download PDF

44. POSTER

Author: Chowdhury, Rezaul, primary, Ganapathi, Pramod, additional, Tang, Yuan, additional, and Tithi, Jesmin Jahan, additional
Published: 2017
Full Text: View/download PDF

45. AUTOGEN

Author: Chowdhury, Rezaul, primary, Ganapathi, Pramod, additional, Tithi, Jesmin Jahan, additional, Bachmeier, Charles, additional, Kuszmaul, Bradley C., additional, Leiserson, Charles E., additional, Solar-Lezama, Armando, additional, and Tang, Yuan, additional
Published: 2016
Full Text: View/download PDF

46. Accelerated molecular mechanical and solvation energetics on multicore CPUs and manycore GPUs

Author: Cha, Deukhyun, primary, Zhang, Qin, additional, Tithi, Jesmin Jahan, additional, Rand, Alexander, additional, Chowdhury, Rezaul A., additional, and Bajaj, Chandrajit, additional
Published: 2015
Full Text: View/download PDF

47. Efficient computation of distance incorporated codon autocorrelation (DICA) score using fast Fourier transform

Author: Tithi, Jesmin Jahan, primary and Chowdhury, Rezaul, additional
Published: 2015
Full Text: View/download PDF

48. High-Performance Energy-Efficient Recursive Dynamic Programming with Matrix-Multiplication-Like Flexible Kernels

Author: Tithi, Jesmin Jahan, primary, Ganapathi, Pramod, additional, Talati, Aakrati, additional, Aggarwal, Sonal, additional, and Chowdhury, Rezaul, additional
Published: 2015
Full Text: View/download PDF

49. Cache-oblivious wavefront: improving parallelism of recursive dynamic programming algorithms without losing cache-efficiency

Author: Tang, Yuan, primary, You, Ronghui, additional, Kan, Haibin, additional, Tithi, Jesmin Jahan, additional, Ganapathi, Pramod, additional, and Chowdhury, Rezaul A., additional
Published: 2015
Full Text: View/download PDF

50. Improving Parallelism of Recursive Stencil Computations without Sacrificing Cache Performance

Author: Tang, Yuan, primary, You, Ronghui, additional, Kan, Haibin, additional, Tithi, Jesmin Jahan, additional, Ganapathi, Pramod, additional, and Chowdhury, Rezaul A., additional
Published: 2014
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

129 results on '"Tithi, Jesmin Jahan"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources