388 results on '"Campagner A."'
Search Results
2. Three-way decision in machine learning tasks: a systematic review
- Author
-
Campagner, Andrea, Milella, Frida, Ciucci, Davide, and Cabitza, Federico
- Published
- 2024
- Full Text
- View/download PDF
3. Second opinion machine learning for fast-track pathway assignment in hip and knee replacement surgery: the use of patient-reported outcome measures
- Author
-
Andrea Campagner, Frida Milella, Giuseppe Banfi, and Federico Cabitza
- Subjects
Medical machine learning ,Patient-reported outcome measures ,Second opinion ,Fast track ,Controllable AI ,Medical decision making ,Computer applications to medicine. Medical informatics ,R858-859.7 - Abstract
Abstract Background The frequency of hip and knee arthroplasty surgeries has been rising steadily in recent decades. This trend is attributed to an aging population, leading to increased demands on healthcare systems. Fast Track (FT) surgical protocols, perioperative procedures designed to expedite patient recovery and early mobilization, have demonstrated efficacy in reducing hospital stays, convalescence periods, and associated costs. However, the criteria for selecting patients for FT procedures have not fully capitalized on the available patient data, including patient-reported outcome measures (PROMs). Methods Our study focused on developing machine learning (ML) models to support decision making in assigning patients to FT procedures, utilizing data from patients’ self-reported health status. These models are specifically designed to predict the potential health status improvement in patients initially selected for FT. Our approach focused on techniques inspired by the concept of controllable AI. This includes eXplainable AI (XAI), which aims to make the model’s recommendations comprehensible to clinicians, and cautious prediction, a method used to alert clinicians about potential control losses, thereby enhancing the models’ trustworthiness and reliability. Results Our models were trained and tested using a dataset comprising 899 records from individual patients admitted to the FT program at IRCCS Ospedale Galeazzi-Sant’Ambrogio. After training and selecting hyper-parameters, the models were assessed using a separate internal test set. The interpretable models demonstrated performance on par or even better than the most effective ‘black-box’ model (Random Forest). These models achieved sensitivity, specificity, and positive predictive value (PPV) exceeding 70%, with an area under the curve (AUC) greater than 80%. The cautious prediction models exhibited enhanced performance while maintaining satisfactory coverage (over 50%). Further, when externally validated on a separate cohort from the same hospital-comprising patients from a subsequent time period-the models showed no pragmatically notable decline in performance. Conclusions Our results demonstrate the effectiveness of utilizing PROMs as basis to develop ML models for planning assignments to FT procedures. Notably, the application of controllable AI techniques, particularly those based on XAI and cautious prediction, emerges as a promising approach. These techniques provide reliable and interpretable support, essential for informed decision-making in clinical processes.
- Published
- 2024
- Full Text
- View/download PDF
4. Painting the black box white: experimental findings from applying XAI to an ECG reading setting
- Author
-
Cabitza, Federico, Cameli, Matteo, Campagner, Andrea, Natali, Chiara, and Ronzio, Luca
- Subjects
Computer Science - Artificial Intelligence ,I.2.1 ,H.5.2 - Abstract
The shift from symbolic AI systems to black-box, sub-symbolic, and statistical ones has motivated a rapid increase in the interest toward explainable AI (XAI), i.e. approaches to make black-box AI systems explainable to human decision makers with the aim of making these systems more acceptable and more usable tools and supports. However, we make the point that, rather than always making black boxes transparent, these approaches are at risk of \emph{painting the black boxes white}, thus failing to provide a level of transparency that would increase the system's usability and comprehensibility; or, even, at risk of generating new errors, in what we termed the \emph{white-box paradox}. To address these usability-related issues, in this work we focus on the cognitive dimension of users' perception of explanations and XAI systems. To this aim, we designed and conducted a questionnaire-based experiment by which we involved 44 cardiology residents and specialists in an AI-supported ECG reading task. In doing so, we investigated different research questions concerning the relationship between users' characteristics (e.g. expertise) and their perception of AI and XAI systems, including their trust, the perceived explanations' quality and their tendency to defer the decision process to automation (i.e. technology dominance), as well as the mutual relationships among these different dimensions. Our findings provide a contribution to the evaluation of AI-based support systems from a Human-AI interaction-oriented perspective and lay the ground for further investigation of XAI and its effects on decision making and user experience., Comment: 15 pages, 7 figures
- Published
- 2022
5. Everything is Varied: The Surprising Impact of Individual Variation on ML Robustness in Medicine
- Author
-
Campagner, Andrea, Famiglini, Lorenzo, Carobene, Anna, and Cabitza, Federico
- Subjects
Computer Science - Machine Learning - Abstract
In medical settings, Individual Variation (IV) refers to variation that is due not to population differences or errors, but rather to within-subject variation, that is the intrinsic and characteristic patterns of variation pertaining to a given instance or the measurement process. While taking into account IV has been deemed critical for proper analysis of medical data, this source of uncertainty and its impact on robustness have so far been neglected in Machine Learning (ML). To fill this gap, we look at how IV affects ML performance and generalization and how its impact can be mitigated. Specifically, we provide a methodological contribution to formalize the problem of IV in the statistical learning framework and, through an experiment based on one of the largest real-world laboratory medicine datasets for the problem of COVID-19 diagnosis, we show that: 1) common state-of-the-art ML models are severely impacted by the presence of IV in data; and 2) advanced learning strategies, based on data augmentation and data imprecisiation, and proper study designs can be effective at improving robustness to IV. Our findings demonstrate the critical relevance of correctly accounting for IV to enable safe deployment of ML in clinical settings.
- Published
- 2022
6. A Distributional Approach for Soft Clustering Comparison and Evaluation
- Author
-
Campagner, Andrea, Ciucci, Davide, and Denœux, Thierry
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
The development of external evaluation criteria for soft clustering (SC) has received limited attention: existing methods do not provide a general approach to extend comparison measures to SC, and are unable to account for the uncertainty represented in the results of SC algorithms. In this article, we propose a general method to address these limitations, grounding on a novel interpretation of SC as distributions over hard clusterings, which we call \emph{distributional measures}. We provide an in-depth study of complexity- and metric-theoretic properties of the proposed approach, and we describe approximation techniques that can make the calculations tractable. Finally, we illustrate our approach through a simple but illustrative experiment., Comment: This is the extended version of article "A Distributional Approach for Soft Clustering Comparison and Evaluation", accepted at BELIEF 2022 (http://hebergement.universite-paris-saclay.fr/belief2022/). Please cite the proceedings version of the article
- Published
- 2022
7. The use of machine learning for the prediction of response to follow-up in spine registries
- Author
-
Baroncini, Alice, Campagner, Andrea, Cabitza, Federico, Langella, Francesco, Barile, Francesca, Bellosta-López, Pablo, Compagnone, Domenico, Cecchinato, Riccardo, Damilano, Marco, Redaelli, Andrea, Vanni, Daniele, and Berjano, Pedro
- Published
- 2025
- Full Text
- View/download PDF
8. Unity Is Intelligence: A Collective Intelligence Experiment on ECG Reading to Improve Diagnostic Performance in Cardiology
- Author
-
Ronzio, Luca, Campagner, Andrea, Cabitza, Federico, and Gensini, Gian Franco
- Abstract
Medical errors have a huge impact on clinical practice in terms of economic and human costs. As a result, technology-based solutions, such as those grounded in artificial intelligence (AI) or collective intelligence (CI), have attracted increasing interest as a means of reducing error rates and their impacts. Previous studies have shown that a combination of individual opinions based on rules, weighting mechanisms, or other CI solutions could improve diagnostic accuracy with respect to individual doctors. We conducted a study to investigate the potential of this approach in cardiology and, more precisely, in electrocardiogram (ECG) reading. To achieve this aim, we designed and conducted an experiment involving medical students, recent graduates, and residents, who were asked to annotate a collection of 10 ECGs of various complexity and difficulty. For each ECG, we considered groups of increasing size (from three to 30 members) and applied three different CI protocols. In all cases, the results showed a statistically significant improvement (ranging from 9% to 88%) in terms of diagnostic accuracy when compared to the performance of individual readers; this difference held for not only large groups, but also smaller ones. In light of these results, we conclude that CI approaches can support the tasks mentioned above, and possibly other similar ones as well. We discuss the implications of applying CI solutions to clinical settings, such as cases of augmented 'second opinions' and decision-making.
- Published
- 2021
9. Learning from fuzzy labels: Theoretical issues and algorithmic solutions
- Author
-
Campagner, Andrea
- Published
- 2024
- Full Text
- View/download PDF
10. Toward a Perspectivist Turn in Ground Truthing for Predictive Computing
- Author
-
Basile, Valerio, Cabitza, Federico, Campagner, Andrea, and Fell, Michael
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Most Artificial Intelligence applications are based on supervised machine learning (ML), which ultimately grounds on manually annotated data. The annotation process is often performed in terms of a majority vote and this has been proved to be often problematic, as highlighted by recent studies on the evaluation of ML models. In this article we describe and advocate for a different paradigm, which we call data perspectivism, which moves away from traditional gold standard datasets, towards the adoption of methods that integrate the opinions and perspectives of the human subjects involved in the knowledge representation step of ML processes. Drawing on previous works which inspired our proposal we describe the potential of our proposal for not only the more subjective tasks (e.g. those related to human language) but also to tasks commonly understood as objective (e.g. medical decision making), and present the main advantages of adopting a perspectivist stance in ML, as well as possible disadvantages, and various ways in which such a stance can be implemented in practice. Finally, we share a set of recommendations and outline a research agenda to advance the perspectivist stance in ML., Comment: If you wish to cite this work, consider citing the AAAI 2023 proceedings version (https://doi.org/10.1609/aaai.v37i6.25840) and citing it in this way: Cabitza, F., Campagner, A., & Basile, V. (2023). Toward a Perspectivist Turn in Ground Truthing for Predictive Computing. Proceedings of the AAAI Conference on Artificial Intelligence, 37(6), 6860-6868. https://doi.org/10.1609/aaai.v37i6.25840
- Published
- 2021
- Full Text
- View/download PDF
11. Never tell me the odds: Investigating pro-hoc explanations in medical decision making
- Author
-
Cabitza, Federico, Natali, Chiara, Famiglini, Lorenzo, Campagner, Andrea, Caccavella, Valerio, and Gallazzi, Enrico
- Published
- 2024
- Full Text
- View/download PDF
12. Partially-defined equivalence relations: Relationship with orthopartitions and connection to rough sets
- Author
-
Boffa, Stefania, Campagner, Andrea, and Ciucci, Davide
- Published
- 2024
- Full Text
- View/download PDF
13. Evaluation of uncertainty quantification methods in multi-label classification: A case study with automatic diagnosis of electrocardiogram
- Author
-
Barandas, Marília, Famiglini, Lorenzo, Campagner, Andrea, Folgado, Duarte, Simão, Raquel, Cabitza, Federico, and Gamboa, Hugo
- Published
- 2024
- Full Text
- View/download PDF
14. A distributional framework for evaluation, comparison and uncertainty quantification in soft clustering
- Author
-
Campagner, Andrea, Ciucci, Davide, and Denœux, Thierry
- Published
- 2023
- Full Text
- View/download PDF
15. The use of machine learning for the prediction of response to follow-up in spine registries
- Author
-
A. Baroncini, A. Campagner, F. Langella, F. Cabitza, F. Barile, R. Cecchinato, M. Damilano, A. Redaelli, D. Vanni, D. Compagnone, and P. Berjano
- Subjects
Neurology. Diseases of the nervous system ,RC346-429 - Published
- 2024
- Full Text
- View/download PDF
16. Who wants accurate models? Arguing for a different metrics to take classification models seriously
- Author
-
Cabitza, Federico and Campagner, Andrea
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
With the increasing availability of AI-based decision support, there is an increasing need for their certification by both AI manufacturers and notified bodies, as well as the pragmatic (real-world) validation of these systems. Therefore, there is the need for meaningful and informative ways to assess the performance of AI systems in clinical practice. Common metrics (like accuracy scores and areas under the ROC curve) have known problems and they do not take into account important information about the preferences of clinicians and the needs of their specialist practice, like the likelihood and impact of errors and the complexity of cases. In this paper, we present a new accuracy measure, the H-accuracy (Ha), which we claim is more informative in the medical domain (and others of similar needs) for the elements it encompasses. We also provide proof that the H-accuracy is a generalization of the balanced accuracy and establish a relation between the H-accuracy and the Net Benefit. Finally, we illustrate an experimentation in two user studies to show the descriptive power of the Ha score and how complementary and differently informative measures can be derived from its formulation (a Python script to compute Ha is also made available)., Comment: https://github.com/AndreaCampagner/uncertainpy
- Published
- 2019
17. Painting the Black Box White: Experimental Findings from Applying XAI to an ECG Reading Setting
- Author
-
Federico Cabitza, Andrea Campagner, Chiara Natali, Enea Parimbelli, Luca Ronzio, and Matteo Cameli
- Subjects
explainable AI ,decision support systems ,ECG ,artificial intelligence ,XAI ,Computer engineering. Computer hardware ,TK7885-7895 - Abstract
The emergence of black-box, subsymbolic, and statistical AI systems has motivated a rapid increase in the interest regarding explainable AI (XAI), which encompasses both inherently explainable techniques, as well as approaches to make black-box AI systems explainable to human decision makers. Rather than always making black boxes transparent, these approaches are at risk of painting the black boxes white, thus failing to provide a level of transparency that would increase the system’s usability and comprehensibility, or even at risk of generating new errors (i.e., white-box paradox). To address these usability-related issues, in this work we focus on the cognitive dimension of users’ perception of explanations and XAI systems. We investigated these perceptions in light of their relationship with users’ characteristics (e.g., expertise) through a questionnaire-based user study involved 44 cardiology residents and specialists in an AI-supported ECG reading task. Our results point to the relevance and correlation of the dimensions of trust, perceived quality of explanations, and tendency to defer the decision process to automation (i.e., technology dominance). This contribution calls for the evaluation of AI-based support systems from a human–AI interaction-oriented perspective, laying the ground for further investigation of XAI and its effects on decision making and user experience.
- Published
- 2023
- Full Text
- View/download PDF
18. A cortico-collicular circuit for orienting to shelter during escape
- Author
-
Campagner, Dario, Vale, Ruben, Tan, Yu Lin, Iordanidou, Panagiota, Pavón Arocas, Oriol, Claudi, Federico, Stempel, A. Vanessa, Keshavarzi, Sepiedeh, Petersen, Rasmus S., Margrie, Troy W., and Branco, Tiago
- Published
- 2023
- Full Text
- View/download PDF
19. Rams, hounds and white boxes: Investigating human–AI collaboration protocols in medical diagnosis
- Author
-
Cabitza, Federico, Campagner, Andrea, Ronzio, Luca, Cameli, Matteo, Mandoli, Giulia Elena, Pastore, Maria Concetta, Sconfienza, Luca Maria, Folgado, Duarte, Barandas, Marília, and Gamboa, Hugo
- Published
- 2023
- Full Text
- View/download PDF
20. A general framework for evaluating and comparing soft clusterings
- Author
-
Campagner, Andrea, Ciucci, Davide, and Denœux, Thierry
- Published
- 2023
- Full Text
- View/download PDF
21. Quod erat demonstrandum? - Towards a typology of the concept of explanation for the design of explainable AI
- Author
-
Cabitza, Federico, Campagner, Andrea, Malgieri, Gianclaudio, Natali, Chiara, Schneeberger, David, Stoeger, Karl, and Holzinger, Andreas
- Published
- 2023
- Full Text
- View/download PDF
22. Aggregation models in ensemble learning: A large-scale comparison
- Author
-
Campagner, Andrea, Ciucci, Davide, and Cabitza, Federico
- Published
- 2023
- Full Text
- View/download PDF
23. Second opinion machine learning for fast-track pathway assignment in hip and knee replacement surgery: the use of patient-reported outcome measures
- Author
-
Campagner, A, Milella, F, Banfi, G, Cabitza, F, Campagner, Andrea, Milella, Frida, Banfi, Giuseppe, Cabitza, Federico, Campagner, A, Milella, F, Banfi, G, Cabitza, F, Campagner, Andrea, Milella, Frida, Banfi, Giuseppe, and Cabitza, Federico
- Abstract
Background: The frequency of hip and knee arthroplasty surgeries has been rising steadily in recent decades. This trend is attributed to an aging population, leading to increased demands on healthcare systems. Fast Track (FT) surgical protocols, perioperative procedures designed to expedite patient recovery and early mobilization, have demonstrated efficacy in reducing hospital stays, convalescence periods, and associated costs. However, the criteria for selecting patients for FT procedures have not fully capitalized on the available patient data, including patient-reported outcome measures (PROMs). Methods: Our study focused on developing machine learning (ML) models to support decision making in assigning patients to FT procedures, utilizing data from patients' self-reported health status. These models are specifically designed to predict the potential health status improvement in patients initially selected for FT. Our approach focused on techniques inspired by the concept of controllable AI. This includes eXplainable AI (XAI), which aims to make the model's recommendations comprehensible to clinicians, and cautious prediction, a method used to alert clinicians about potential control losses, thereby enhancing the models' trustworthiness and reliability. Results: Our models were trained and tested using a dataset comprising 899 records from individual patients admitted to the FT program at IRCCS Ospedale Galeazzi-Sant'Ambrogio. After training and selecting hyper-parameters, the models were assessed using a separate internal test set. The interpretable models demonstrated performance on par or even better than the most effective 'black-box' model (Random Forest). These models achieved sensitivity, specificity, and positive predictive value (PPV) exceeding 70%, with an area under the curve (AUC) greater than 80%. The cautious prediction models exhibited enhanced performance while maintaining satisfactory coverage (over 50%). Further, when externally validated on a
- Published
- 2024
24. Three-way decision in machine learning tasks: a systematic review
- Author
-
Campagner, A, Milella, F, Ciucci, D, Cabitza, F, Campagner A., Milella F., Ciucci D., Cabitza F., Campagner, A, Milella, F, Ciucci, D, Cabitza, F, Campagner A., Milella F., Ciucci D., and Cabitza F.
- Abstract
In this article, we survey the applications of Three-way decision theory (TWD) in machine learning (ML), focusing in particular on four tasks: weakly supervised learning and multi-source data management, missing data management, uncertainty quantification in classification, and uncertainty quantification in clustering. For each of these four tasks we present the results of a systematic review of the literature, by which we report on the main characteristics of the current state of the art, as well as on the quality of reporting and reproducibility level of the works found in the literature. To this aim, we discuss the main benefits, limitations and issues found in the reviewed articles, and we give clear indications and directions for quality improvement that are informed by validation, reporting, and reproducibility standards, guidelines and best practice that have recently emerged in the ML field. Finally, we discuss about the more promising and relevant directions for future research in regard to TWD.
- Published
- 2024
25. Dissimilar Similarities: Comparing Human and Statistical Similarity Evaluation in Medical AI
- Author
-
Torra, V, Narukawa, Y, Kikuchi, H, Cabitza, F, Famiglini, L, Campagner, A, Sconfienza, L, Fusco, S, Caccavella, V, Gallazzi, E, Cabitza F., Famiglini L., Campagner A., Sconfienza L. M., Fusco S., Caccavella V., Gallazzi E., Torra, V, Narukawa, Y, Kikuchi, H, Cabitza, F, Famiglini, L, Campagner, A, Sconfienza, L, Fusco, S, Caccavella, V, Gallazzi, E, Cabitza F., Famiglini L., Campagner A., Sconfienza L. M., Fusco S., Caccavella V., and Gallazzi E.
- Abstract
This study explores the concept of similarity in machine learning (ML) and its congruence with human judgment in medical contexts, focusing primarily on radiology. We conducted a user study involving two radiologists and two orthopedic and spine surgeons. These experts evaluated the similarity of 72 cases, selected from a larger dataset by an ML model based on Cosine and Euclidean distances, in comparison to 18 representative base cases of vertebral fractures. Our analysis focused on correlating these ML-derived distances with the experts’ assessments. The findings reveal that: (1) both Cosine and Euclidean distances had limited correlation with human judgments; (2) Cosine distances showed a marginally higher correlation than Euclidean distances; despite the limitations due to the small samples of evaluations and evaluators, our findings emphasize the necessity for ongoing research to enhance AI similarity metrics, aiming for greater human-centricity and relevance, particularly considering their critical role in ML training and inference. Our study’s implications are far-reaching, advocating for a comprehensive reevaluation of similarity assessments in AI to achieve a closer alignment with human cognitive processes, extending well beyond the realm of medical imaging.
- Published
- 2024
26. Learning from fuzzy labels: Theoretical issues and algorithmic solutions
- Author
-
Campagner, A, Campagner A., Campagner, A, and Campagner A.
- Abstract
In this article we study the problem of learning from fuzzy labels (LFL), a form of weakly supervised learning in which the supervision target is not precisely specified but is instead given in the form of possibility distributions, that express the imprecise knowledge of the annotating agent. While several approaches for LFL have been proposed in the literature, including generalized risk minimization (GRM), instance-based methods and pseudo label-based learning, both their theoretical properties and their empirical performance have scarcely been studied. We address this gap by: first, presenting a review of the previous results relative to the sample complexity and generalization bounds for GRM and instance-based methods; second, studying both their computational complexity, by proving in particular the impossibility of efficiently solving LFL using GRM, as well as impossibility theorems. We then propose a novel pseudo label-based learning method, called Random Resampling-based Learning (RRL), which directly draws from ensemble learning and possibility theory and study its learning- and complexity-theoretic properties, showing that it achieves guarantees similar to those for GRM while being computationally efficient. Finally, we study the empirical performance of several state-of-the-art LFL algorithms on wide set of synthetic and real-world benchmark datasets, by which we confirm the effectiveness of the proposed RRL method. Additionally, we describe directions for future research, and highlight opportunities for further interaction between machine learning and uncertainty representation theories.
- Published
- 2024
27. Ensemble Predictors: Possibilistic Combination of Conformal Predictors for Multivariate Time Series Classification
- Author
-
Campagner, A, Barandas, M, Folgado, D, Gamboa, H, Cabitza, F, Campagner, Andrea, Barandas, Marília, Folgado, Duarte, Gamboa, Hugo, Cabitza, Federico, Campagner, A, Barandas, M, Folgado, D, Gamboa, H, Cabitza, F, Campagner, Andrea, Barandas, Marília, Folgado, Duarte, Gamboa, Hugo, and Cabitza, Federico
- Abstract
In this article we propose a conceptual framework to study ensembles of conformal predictors (CP), that we call Ensemble Predictors (EP). Our approach is inspired by the application of imprecise probabilities in information fusion. Based on the proposed framework, we study, for the first time in the literature, the theoretical properties of CP ensembles in a general setting, by focusing on simple and commonly used possibilistic combination rules. We also illustrate the applicability of the proposed methods in the setting of multivariate time-series classification, showing that these methods provide better performance (in terms of both robustness, conservativeness, accuracy and running time) than both standard classification algorithms and other combination rules proposed in the literature, on a large set of benchmarks from the UCR time series archive.
- Published
- 2024
28. Aggregation operators on shadowed sets
- Author
-
Boffa, Stefania, Campagner, Andrea, Ciucci, Davide, and Yao, Yiyu
- Published
- 2022
- Full Text
- View/download PDF
29. Belief functions and rough sets: Survey and new insights
- Author
-
Campagner, Andrea, Ciucci, Davide, and Denœux, Thierry
- Published
- 2022
- Full Text
- View/download PDF
30. Multisensory coding of angular head velocity in the retrosplenial cortex
- Author
-
Keshavarzi, Sepiedeh, Bracey, Edward F., Faville, Richard A., Campagner, Dario, Tyson, Adam L., Lenzi, Stephen C., Branco, Tiago, and Margrie, Troy W.
- Published
- 2022
- Full Text
- View/download PDF
31. The need to move away from agential-AI: Empirical investigations, useful concepts and open issues
- Author
-
Cabitza, Federico, Campagner, Andrea, and Simone, Carla
- Published
- 2021
- Full Text
- View/download PDF
32. Three-way decision and conformal prediction: Isomorphisms, differences and theoretical properties of cautious learning approaches
- Author
-
Campagner, Andrea, Cabitza, Federico, Berjano, Pedro, and Ciucci, Davide
- Published
- 2021
- Full Text
- View/download PDF
33. Rough set-based feature selection for weakly labeled data
- Author
-
Campagner, Andrea, Ciucci, Davide, and Hüllermeier, Eyke
- Published
- 2021
- Full Text
- View/download PDF
34. Ground truthing from multi-rater labeling with three-way decision and possibility theory
- Author
-
Campagner, Andrea, Ciucci, Davide, Svensson, Carl-Magnus, Figge, Marc Thilo, and Cabitza, Federico
- Published
- 2021
- Full Text
- View/download PDF
35. Sub-cortical neural coding during active sensation in the mouse
- Author
-
Campagner, Dario, Petersen, Rasmus, and Furber, Stephen
- Subjects
612.8 ,whisker system ,neural coding ,active sensation ,behavior ,Mechanical model ,Decision making - Abstract
Two fundamental questions in the investigation of any sensory system are what physical signals drive its primary sensory neurons and how such signals are encoded by the successive neural levels during natural behaviour. Due to the complexity of experiments with awake, actively sensing animals, most previous studies focused on anesthetized animals, where the motor component of sensation is abolished and therefore those questions are so far largely unanswered. The aim of this thesis is to exploit recent advance in electrophysiological, behavioural and computational techniques to address those questions in the sub-cortical whisker system of the mouse. To determine the input to the whisker system, in Chapter 2 I recorded from primary whisker afferents (PWAs) of awake, head-fixed mice as they explored a pole with their whiskers, and simultaneously measured both whisker motion and forces with high-speed videography. To predict PWA firing, I used Generalised Linear Models. I found that PWA responses were poorly predicted by whisker angle, but well predicted by rotational force (moment) acting on the whiskers. This concept of âmoment encodingâ could account for the activity of PWAs under diverse conditions - whisking in air, active whisker-mediated touch and passive whisker deflection. The discovery that PWAs encode moment raises the question of how mice employ moment to control their tactile behaviours. In Chapter 3 I therefore measured moment at the base of the whiskers of head-fixed mice, performing a novel behavioural task, which involved whisker-based object localisation. I then tested which features of moment during whiskerobject touch could predict mouse choice. By using probabilistic classifiers, I discovered that mouse choices could be accurately predicted from moment magnitude and direction during touch, combined with a non-sensory variable - the mouse choice in the previous trial. Finally, in Chapter 4 I asked how tactile coding generalized to whisker system sub-cortical brains regions during a natural active whisker-based behaviour. I therefore combined a naturalistic whisker-guided navigation task and extracellular recording with a novel generation of high density silicon probes (O3 Neuropixel probes) and studied how touch and locomotion were encoded by the whisker first (ventral posterior nucleus, VPM) and higher order thalamic relay (posterior complex, PO) and hypothalamic regions and (zona incerta, ZI). Using multiple linear regressions, I found that neurons in the relay nucleus VPM encoded not only touch, but also locomotion signals. Similarly, neurons in the higherorder regions PO and ZI were driven by both touch and locomotion. My study showed that in the awake animal, in the central part of the whisker system, peripheral signals were preserved, but were encoded concomitantly with motor variables, such as locomotion. In summary, in this thesis I identified the mechanical variable representing the major sensory input to the whisker system. I showed that mice are able to employ it to guide behaviour and found that correlate of this signal was encoded by central neurons of the whisker system in VPM, PO and ZI, concomitantly with locomotion.
- Published
- 2017
36. The unbearable (technical) unreliability of automated facial emotion recognition
- Author
-
Federico Cabitza, Andrea Campagner, and Martina Mattioli
- Subjects
General Works - Abstract
Emotion recognition, and in particular acial emotion recognition (FER), is among the most controversial applications of machine learning, not least because of its ethical implications for human subjects. In this article, we address the controversial conjecture that machines can read emotions from our facial expressions by asking whether this task can be performed reliably. This means, rather than considering the potential harms or scientific soundness of facial emotion recognition systems, focusing on the reliability of the ground truths used to develop emotion recognition systems, assessing how well different human observers agree on the emotions they detect in subjects’ faces. Additionally, we discuss the extent to which sharing context can help observers agree on the emotions they perceive on subjects’ faces. Briefly, we demonstrate that when large and heterogeneous samples of observers are involved, the task of emotion detection from static images crumbles into inconsistency. We thus reveal that any endeavour to understand human behaviour from large sets of labelled patterns is over-ambitious, even if it were technically feasible. We conclude that we cannot speak of actual accuracy for facial emotion recognition systems for any practical purposes.
- Published
- 2022
- Full Text
- View/download PDF
37. The three-way-in and three-way-out framework to treat and exploit ambiguity in data
- Author
-
Campagner, Andrea, Cabitza, Federico, and Ciucci, Davide
- Published
- 2020
- Full Text
- View/download PDF
38. As if sand were stone. New concepts and metrics to probe the ground on which to build trustable AI
- Author
-
Federico Cabitza, Andrea Campagner, and Luca Maria Sconfienza
- Subjects
Gold standard ,Explainable AI ,Machine learning ,Reliability ,Usable AI ,Computer applications to medicine. Medical informatics ,R858-859.7 - Abstract
Abstract Background We focus on the importance of interpreting the quality of the labeling used as the input of predictive models to understand the reliability of their output in support of human decision-making, especially in critical domains, such as medicine. Methods Accordingly, we propose a framework distinguishing the reference labeling (or Gold Standard) from the set of annotations from which it is usually derived (the Diamond Standard). We define a set of quality dimensions and related metrics: representativeness (are the available data representative of its reference population?); reliability (do the raters agree with each other in their ratings?); and accuracy (are the raters’ annotations a true representation?). The metrics for these dimensions are, respectively, the degree of correspondence, Ψ, the degree of weighted concordance ϱ, and the degree of fineness, Φ. We apply and evaluate these metrics in a diagnostic user study involving 13 radiologists. Results We evaluate Ψ against hypothesis-testing techniques, highlighting that our metrics can better evaluate distribution similarity in high-dimensional spaces. We discuss how Ψ could be used to assess the reliability of new predictions or for train-test selection. We report the value of ϱ for our case study and compare it with traditional reliability metrics, highlighting both their theoretical properties and the reasons that they differ. Then, we report the degree of fineness as an estimate of the accuracy of the collected annotations and discuss the relationship between this latter degree and the degree of weighted concordance, which we find to be moderately but significantly correlated. Finally, we discuss the implications of the proposed dimensions and metrics with respect to the context of Explainable Artificial Intelligence (XAI). Conclusion We propose different dimensions and related metrics to assess the quality of the datasets used to build predictive models and Medical Artificial Intelligence (MAI). We argue that the proposed metrics are feasible for application in real-world settings for the continuous development of trustable and interpretable MAI systems.
- Published
- 2020
- Full Text
- View/download PDF
39. Ordinal labels in machine learning: a user-centered approach to improve data validity in medical settings
- Author
-
Andrea Seveso, Andrea Campagner, Davide Ciucci, and Federico Cabitza
- Subjects
Ordinal scales ,Machine learning ,Fuzzy sets ,Ground truth ,Computer applications to medicine. Medical informatics ,R858-859.7 - Abstract
Abstract Background Despite the vagueness and uncertainty that is intrinsic in any medical act, interpretation and decision (including acts of data reporting and representation of relevant medical conditions), still little research has focused on how to explicitly take this uncertainty into account. In this paper, we focus on the representation of a general and wide-spread medical terminology, which is grounded on a traditional and well-established convention, to represent severity of health conditions (for instance, pain, visible signs), ranging from Absent to Extreme. Specifically, we will study how both potential patients and doctors perceive the different levels of the terminology in both quantitative and qualitative terms, and if the embedded user knowledge could improve the representation of ordinal values in the construction of machine learning models. Methods To this aim, we conducted a questionnaire-based research study involving a relatively large sample of 1,152 potential patients and 31 clinicians to represent numerically the perceived meaning of standard and widely-applied labels to describe health conditions. Using these collected values, we then present and discuss different possible fuzzy-set based representations that address the vagueness of medical interpretation by taking into account the perceptions of domain experts. We also apply the findings of this user study to evaluate the impact of different encodings on the predictive performance of common machine learning models in regard to a real-world medical prognostic task. Results We found significant differences in the perception of pain levels between the two user groups. We also show that the proposed encodings can improve the performances of specific classes of models, and discuss when this is the case. Conclusions In perspective, our hope is that the proposed techniques for ordinal scale representation and ordinal encoding may be useful to the research community, and also that our methodology will be applied to other widely used ordinal scales for improving validity of datasets and bettering the results of machine learning tasks.
- Published
- 2020
- Full Text
- View/download PDF
40. Assessment of Fast-Track Pathway in Hip and Knee Replacement Surgery by Propensity Score Matching on Patient-Reported Outcomes
- Author
-
Andrea Campagner, Frida Milella, Stefania Guida, Susan Bernareggi, Giuseppe Banfi, and Federico Cabitza
- Subjects
fast track ,propensity score analysis ,patient-reported outcome measure ,orthopedic ,rehabilitation ,Medicine (General) ,R5-920 - Abstract
Total hip (THA) and total knee (TKA) arthroplasty procedures have steadily increased over the past few decades, and their use is expected to grow further, mainly due to an increasing number of elderly patients. Cost-containment strategies, supporting a rapid recovery with a positive functional outcomes, high patient satisfaction, and enhanced patient reported outcomes, are needed. A Fast Track surgical procedure (FT) is a coordinated perioperative approach aimed at expediting early mobilization and recovery following surgery and, accordingly, shortening the length of hospital stay (LOS), convalescence and costs. In this view, rapid rehabilitation surgery optimizes traditional rehabilitation methods by integrating evidence-based practices into the procedure. The aim of the present study was to compare the effectiveness of Fast Track versus Care-as-Usual surgical procedures and pathways (including rehabilitation) on a mid-term patient-reported outcome (PROs), the SF12 (with regard both to Physical and Mental Scores), 3 months after hip or knee replacement surgery, with the use of Propensity score-matching (PSM) analysis to address the issue of the comparability of the groups in a non-randomized study. We were interested in the evaluation of the entire pathways, including the postoperative rehabilitation stage, therefore, we only used early home discharge as a surrogate to differentiate between the Fast Track and Care-as-Usual rehabilitation pathways. Our study shows that the entire Fast Track pathway, which includes the post-operative rehabilitation stage, has a significantly positive impact on physical health-related status (SF12 Physical Scores), as perceived by patients 3 months after hip or knee replacement surgery, as opposed to the standardized program, both in terms of the PROs score and the relative improvements observed, as compared with the minimum clinically important difference. This result encourages additional research into the effects of Fast Track rehabilitation on the entire process of care for patients undergoing hip or knee arthroplasty, focusing only on patient-reported outcomes.
- Published
- 2023
- Full Text
- View/download PDF
41. Evidence-based XAI: An empirical approach to design more effective and explainable decision support systems
- Author
-
Famiglini, L, Campagner, A, Barandas, M, La Maida, G, Gallazzi, E, Cabitza, F, La Maida, GA, Famiglini, L, Campagner, A, Barandas, M, La Maida, G, Gallazzi, E, Cabitza, F, and La Maida, GA
- Abstract
This paper proposes a user study aimed at evaluating the impact of Class Activation Maps (CAMs) as an eXplainable AI (XAI) method in a radiological diagnostic task, the detection of thoracolumbar (TL) fractures from vertebral X-rays. In particular, we focus on two oft-neglected features of CAMs, that is granularity and coloring, in terms of what features, lower-level vs higher-level, should the maps highlight and adopting which coloring scheme, to bring better impact to the decision-making process, both in terms of diagnostic accuracy (that is effectiveness) and of user-centered dimensions, such as perceived confidence and utility (that is satisfaction), depending on case complexity, AI accuracy, and user expertise. Our findings show that lower-level features CAMs, which highlight more focused anatomical landmarks, are associated with higher diagnostic accuracy than higher-level features CAMs, particularly among experienced physicians. Moreover, despite the intuitive appeal of semantic CAMs, traditionally colored CAMs consistently yielded higher diagnostic accuracy across all groups. Our results challenge some prevalent assumptions in the XAI field and emphasize the importance of adopting an evidence-based and human-centered approach to design and evaluate AI- and XAI-assisted diagnostic tools. To this aim, the paper also proposes a hierarchy of evidence framework to help designers and practitioners choose the XAI solutions that optimize performance and satisfaction on the basis of the strongest evidence available or to focus on the gaps in the literature that need to be filled to move from opinionated and eminence-based research to one more based on empirical evidence and end-user work and preferences.
- Published
- 2024
42. Never tell me the odds: Investigating pro-hoc explanations in medical decision making
- Author
-
Cabitza, F, Natali, C, Famiglini, L, Campagner, A, Caccavella, V, Gallazzi, E, Cabitza, F, Natali, C, Famiglini, L, Campagner, A, Caccavella, V, and Gallazzi, E
- Abstract
This paper examines a kind of explainable AI, centered around what we term pro-hoc explanations, that is a form of support that consists of offering alternative explanations (one for each possible outcome) instead of a specific post-hoc explanation following specific advice. Specifically, our support mechanism utilizes explanations by examples, featuring analogous cases for each category in a binary setting. Pro-hoc explanations are an instance of what we called frictional AI, a general class of decision support aimed at achieving a useful compromise between the increase of decision effectiveness and the mitigation of cognitive risks, such as over-reliance, automation bias and deskilling. To illustrate an instance of frictional AI, we conducted an empirical user study to investigate its impact on the task of radiological detection of vertebral fractures in x-rays. Our study engaged 16 orthopedists in a ‘human-first, second-opinion’ interaction protocol. In this protocol, clinicians first made initial assessments of the x-rays without AI assistance and then provided their final diagnosis after considering the pro-hoc explanations. Our findings indicate that physicians, particularly those with less experience, perceived pro-hoc XAI support as significantly beneficial, even though it did not notably enhance their diagnostic accuracy. However, their increased confidence in final diagnoses suggests a positive overall impact. Given the promisingly high effect size observed, our results advocate for further research into pro-hoc explanations specifically, and into the broader concept of frictional AI.
- Published
- 2024
43. Partially-defined equivalence relations: Relationship with orthopartitions and connection to rough sets
- Author
-
Boffa, S, Campagner, A, Ciucci, D, Boffa, S, Campagner, A, and Ciucci, D
- Abstract
We introduce partially-defined equivalence relations as a type of equivalence relation that incorporates uncertainty. In these relations, certain pairs of objects are not definitively determined to be related or unrelated. The relationship with orthopartitions is put forward, providing the conditions under which an orthopartition can be transformed into an equivalent partially-defined equivalence relation and vice versa. Additionally, we explore their connection with reducts in rough set theory, offering insights into the characterization of similarity reducts in terms of orthopartitions.
- Published
- 2024
44. Evaluation of uncertainty quantification methods in multi-label classification: A case study with automatic diagnosis of electrocardiogram
- Author
-
Barandas, M, Famiglini, L, Campagner, A, Folgado, D, Simao, R, Cabitza, F, Gamboa, H, Barandas, M, Famiglini, L, Campagner, A, Folgado, D, Simao, R, Cabitza, F, and Gamboa, H
- Abstract
Artificial Intelligence (AI) use in automated Electrocardiogram (ECG) classification has continuously attracted the research community's interest, motivated by their promising results. Despite their great promise, limited attention has been paid to the robustness of their results, which is a key element for their implementation in clinical practice. Uncertainty Quantification (UQ) is a critical for trustworthy and reliable AI, particularly in safety-critical domains such as medicine. Estimating uncertainty in Machine Learning (ML) model predictions has been extensively used for Out-of-Distribution (OOD) detection under single-label tasks. However, the use of UQ methods in multi-label classification remains underexplored. This study goes beyond developing highly accurate models comparing five uncertainty quantification methods using the same Deep Neural Network (DNN) architecture across various validation scenarios, including internal and external validation as well as OOD detection, taking multi-label ECG classification as the example domain. We show the importance of external validation and its impact on classification performance, uncertainty estimates quality, and calibration. Ensemble-based methods yield more robust uncertainty estimations than single network or stochastic methods. Although current methods still have limitations in accurately quantifying uncertainty, particularly in the case of dataset shift, incorporating uncertainty estimates with a classification with a rejection option improves the ability to detect such changes. Moreover, we show that using uncertainty estimates as a criterion for sample selection in active learning setting results in greater improvements in classification performance compared to random sampling.
- Published
- 2024
45. External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count
- Author
-
Campagner, Andrea, Carobene, Anna, and Cabitza, Federico
- Published
- 2021
- Full Text
- View/download PDF
46. Studying human-AI collaboration protocols: the case of the Kasparov’s law in radiological double reading
- Author
-
Cabitza, Federico, Campagner, Andrea, and Sconfienza, Luca Maria
- Published
- 2021
- Full Text
- View/download PDF
47. Aggregation models in ensemble learning: A large-scale comparison
- Author
-
Andrea Campagner, Davide Ciucci, Federico Cabitza, Campagner, A, Ciucci, D, and Cabitza, F
- Subjects
Social choice theory ,Hardware and Architecture ,Ensemble learning ,Collective intelligence ,Signal Processing ,Uncertainty management ,Information fusion ,Aggregation method ,Software ,Information Systems - Abstract
In this work we present a large-scale comparison of 21 learning and aggregation methods proposed in the ensemble learning, social choice theory (SCT), information fusion and uncertainty management (IF-UM) and collective intelligence (CI) fields, based on a large collection of 40 benchmark datasets. The results of this comparison show that Bagging-based approaches reported performances comparable with XGBoost, and significantly outperformed other Boosting methods. In particular, ExtraTree-based approaches were as accurate as both XGBoost and Decision Tree-based ones while also being more computationally efficient. We also show how standard Bagging-based and IF-UM-inspired approaches outperformed the approaches based on CI and SCT. IF-UM-inspired approaches, in particular, reported the best performance (together with standard ExtraTrees), as well as the strongest resistance to label noise (together with XGBoost). Based on our results, we provide useful indications on the practical effectiveness of different state-of-the-art ensemble and aggregation methods in general settings.
- Published
- 2023
- Full Text
- View/download PDF
48. AI Shall Have No Dominion: on How to Measure Technology Dominance in AI-supported Human decision-making
- Author
-
Schmidt, A, Väänänen, K, Cabitza, F, Campagner, A, Angius, R, Natali, C, Reverberi, F, Cabitza, Federico, Campagner, Andrea, Angius, Riccardo, Natali, Chiara, Reverberi, Franco, Schmidt, A, Väänänen, K, Cabitza, F, Campagner, A, Angius, R, Natali, C, Reverberi, F, Cabitza, Federico, Campagner, Andrea, Angius, Riccardo, Natali, Chiara, and Reverberi, Franco
- Abstract
In this article, we propose a conceptual and methodological framework for measuring the impact of the introduction of AI systems in decision settings, based on the concept of technological dominance, i.e. the influence that an AI system can exert on human judgment and decisions. We distinguish between a negative component of dominance (automation bias) and a positive one (algorithm appreciation) by focusing on and systematizing the patterns of interaction between human judgment and AI support, or reliance patterns, and their associated cognitive effects. We then define statistical approaches for measuring these dimensions of dominance, as well as corresponding qualitative visualizations. By reporting about four medical case studies, we illustrate how the proposed methods can be used to inform assessments of dominance and of related cognitive biases in real-world settings. Our study lays the groundwork for future investigations into the effects of introducing AI support into naturalistic and collaborative decision-making.
- Published
- 2023
49. Biomarkers for Mixed Dementia: a hard bone to bite? Preliminary analyses and promising results for a debated topic
- Author
-
Fracasso, F, Gasparini, F, Milella, F, Campagner, A, Famiglini, L, Arosio, B, Rossi, P, Annoni, G, Cabitza, F, Campagner A., Famiglini L., Arosio B., Rossi P., Annoni G., Cabitza F., Fracasso, F, Gasparini, F, Milella, F, Campagner, A, Famiglini, L, Arosio, B, Rossi, P, Annoni, G, Cabitza, F, Campagner A., Famiglini L., Arosio B., Rossi P., Annoni G., and Cabitza F.
- Abstract
Dementia refers to a group of neurodegenerative disorders that impact the cognitive function of an increasing number of individuals. Because of the variety of manifestations, the idea of mixed dementia has recently garnered increased awareness and attention from the scientific community. In this work, we describe a high-quality dataset, as well as the findings of a preliminary analysis devoted to investigating the potential of computational methods that are highly indicative of mixed dementia. We will specifically describe the findings of a phenotypic stratification analysis, based on clustering approaches, that highlights possibly significant aspects of mixed dementia, paving the way for further research devoted to the application of Machine Learning techniques to the robust and early diagnosis of mixed dementia.
- Published
- 2023
50. The Impact of Gender and Personality in Human-AI Teaming: The Case of Collaborative Question Answering
- Author
-
Milella, F, Natali, C, Scantamburlo, T, Campagner, A, Cabitza, F, Milella F., Natali C., Scantamburlo T., Campagner A., Cabitza F., Milella, F, Natali, C, Scantamburlo, T, Campagner, A, Cabitza, F, Milella F., Natali C., Scantamburlo T., Campagner A., and Cabitza F.
- Abstract
This paper discusses the results of an exploratory study aimed at investigating the impact of conversational agents (CAs) and specifically their agential characteristics on collaborative decision-making processes. The study involved 29 participants divided into 8 small teams engaged in a question-and-answer trivia-style game with the support of a text-based CA, characterized by two independent binary variables: personality (gentle and cooperative vs blunt and uncooperative) and gender (female vs male). A semi-structured group interview was conducted at the end of the experimental sessions to investigate the perceived utility and level of satisfaction with the CAs. Our results show that when users interact with a gentle and cooperative CA, their user satisfaction is higher. Furthermore, female CAs are perceived as more useful and satisfying to interact with than male CAs. We show that group performance improves through interaction with the CAs, confirming that a stereotype favoring the female with a gentle and cooperative personality combination exists in regard to perceived satisfaction, even though this does not lead to greater perceived utility. Our study extends the current debate about the possible correlation between CA characteristics and human acceptance and suggests future research to investigate the role of gender bias and related biases in human-AI teaming.
- Published
- 2023
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.