1,586 results on '"Davidson, Ian"'
Search Results
2. Foundations for Unfairness in Anomaly Detection -- Case Studies in Facial Imaging Data
- Author
-
Livanos, Michael and Davidson, Ian
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Deep anomaly detection (AD) is perhaps the most controversial of data analytic tasks as it identifies entities that are then specifically targeted for further investigation or exclusion. Also controversial is the application of AI to facial imaging data. This work explores the intersection of these two areas to understand two core questions: "Who" these algorithms are being unfair to and equally important "Why". Recent work has shown that deep AD can be unfair to different groups despite being unsupervised with a recent study showing that for portraits of people: men of color are far more likely to be chosen to be outliers. We study the two main categories of AD algorithms: autoencoder-based and single-class-based which effectively try to compress all the instances with those that can not be easily compressed being deemed to be outliers. We experimentally verify sources of unfairness such as the under-representation of a group (e.g. people of color are relatively rare), spurious group features (e.g. men are often photographed with hats), and group labeling noise (e.g. race is subjective). We conjecture that lack of compressibility is the main foundation and the others cause it but experimental results show otherwise and we present a natural hierarchy amongst them., Comment: 16 pages, 8 figures, AAAI/ACM AIES24
- Published
- 2024
3. ChaosMining: A Benchmark to Evaluate Post-Hoc Local Attribution Methods in Low SNR Environments
- Author
-
Shi, Ge, Kan, Ziwen, Smucny, Jason, and Davidson, Ian
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
In this study, we examine the efficacy of post-hoc local attribution methods in identifying features with predictive power from irrelevant ones in domains characterized by a low signal-to-noise ratio (SNR), a common scenario in real-world machine learning applications. We developed synthetic datasets encompassing symbolic functional, image, and audio data, incorporating a benchmark on the {\it (Model \(\times\) Attribution\(\times\) Noise Condition)} triplet. By rigorously testing various classic models trained from scratch, we gained valuable insights into the performance of these attribution methods in multiple conditions. Based on these findings, we introduce a novel extension to the notable recursive feature elimination (RFE) algorithm, enhancing its applicability for neural networks. Our experiments highlight its strengths in prediction and feature selection, alongside limitations in scalability. Further details and additional minor findings are included in the appendix, with extensive discussions. The codes and resources are available at \href{https://github.com/geshijoker/ChaosMining/}{URL}., Comment: 19 pages, 10 figures, submission to Neurips 2024
- Published
- 2024
4. Times and Spaces Never Dreamed of in Diane di Prima’s Revolutionary Letters
- Author
-
Davidson, Ian
- Published
- 2019
- Full Text
- View/download PDF
5. Identification and Uses of Deep Learning Backbones via Pattern Mining
- Author
-
Livanos, Michael and Davidson, Ian
- Subjects
Computer Science - Artificial Intelligence - Abstract
Deep learning is extensively used in many areas of data mining as a black-box method with impressive results. However, understanding the core mechanism of how deep learning makes predictions is a relatively understudied problem. Here we explore the notion of identifying a backbone of deep learning for a given group of instances. A group here can be instances of the same class or even misclassified instances of the same class. We view each instance for a given group as activating a subset of neurons and attempt to find a subgraph of neurons associated with a given concept/group. We formulate this problem as a set cover style problem and show it is intractable and presents a highly constrained integer linear programming (ILP) formulation. As an alternative, we explore a coverage-based heuristic approach related to pattern mining, and show it converges to a Pareto equilibrium point of the ILP formulation. Experimentally we explore these backbones to identify mistakes and improve performance, explanation, and visualization. We demonstrate application-based results using several challenging data sets, including Bird Audio Detection (BAD) Challenge and Labeled Faces in the Wild (LFW), as well as the classic MNIST data., Comment: 9 pages, 6 figures, published SIAM SDM24
- Published
- 2024
6. Structural and functional effects of global invasion pressure on benthic marine communities—patterns, challenges and priorities
- Author
-
Zaiko, Anastasija, Cardeccia, Alice, Carlton, James T., Clark, Graeme F., Creed, Joel C., Davidson, Ian, Floerl, Oliver, Galil, Bella, Grosholz, Edwin, Hopkins, Grant A., Johnston, Emma L., Kotta, Jonne, Marchini, Agnese, Ojaveer, Henn, Ruiz, Gregory, Therriault, Thomas W., and Inglis, Graeme J.
- Published
- 2024
7. Cooperative Knowledge Distillation: A Learner Agnostic Approach
- Author
-
Livanos, Michael, Davidson, Ian, and Wong, Stephen
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Knowledge distillation is a simple but powerful way to transfer knowledge between a teacher model to a student model. Existing work suffers from at least one of the following key limitations in terms of direction and scope of transfer which restrict its use: all knowledge is transferred from teacher to student regardless of whether or not that knowledge is useful, the student is the only one learning in this exchange, and typically distillation transfers knowledge only from a single teacher to a single student. We formulate a novel form of knowledge distillation in which many models can act as both students and teachers which we call cooperative distillation. The models cooperate as follows: a model (the student) identifies specific deficiencies in it's performance and searches for another model (the teacher) who encodes learned knowledge into instructional virtual instances via counterfactual instance generation. Because different models may have different strengths and weaknesses, all models can act as either students or teachers (cooperation) when appropriate and only distill knowledge in areas specific to their strengths (focus). Since counterfactuals as a paradigm are not tied to any specific algorithm, we can use this method to distill knowledge between learners of different architectures, algorithms, and even feature spaces. We demonstrate that our approach not only outperforms baselines such as transfer learning, self-supervised learning, and multiple knowledge distillation algorithms on several datasets, but it can also be used in settings where the aforementioned techniques cannot., Comment: 8 pages, 7 figures, AAAI24
- Published
- 2024
8. Double Clad Antiresonant Hollow Core Fiber and Its Comparison with other Fibres for Multiphoton Micro-Endoscopy
- Author
-
Szwaj, Marzanna, Davidson, Ian A, Johnson, Peter B, Jasion, Greg, Jung, Yongmin, Sandoghchi, Seyed Reza, Herdzik, Krzysztof P, Bourdakos, Konstantinos N, Wheeler, Natalie V, Mulvad, Hans Christian, Richardson, David J, Poletti, Francesco, and Mahajan, Sumeet
- Subjects
Physics - Optics ,Physics - Applied Physics ,Physics - Instrumentation and Detectors - Abstract
In this work, we study a new hollow-core (air-filled) double-clad anti-resonant fiber (DC-ARF) as a potent candidate for multiphoton micro-endoscopy. We compare the fiber characteristics with a single-clad anti-resonant fiber (SC-ARF) and a solid core fiber (SCF). While the DC-ARF and the SC-ARF enable low-loss (<0.2 dBm-1), close to dispersion-free excitation pulse delivery (<10% pulse width increase at 900 nm per 1 m fiber) without any induced non-linearities, the SCF resulted in spectral broadening and pulse-stretching (> 2000% of pulse width increase at 900 nm per 1 m fiber). An ideal optical fiber endoscope needs to be several meters long and should enable both excitation and collection through the fiber. Therefore, we performed multiphoton imaging on endoscopy-compatible 1 m and 3 m lengths of fiber in the back-scattered geometry, wherein the signals were collected either directly (non-descanned detection) or through the fiber (descanned detection). Second harmonic images were collected from barium titanate crystals as well as from biological samples (rat tail tendon). In non-descanned detection conditions, the ARFs outperformed the SCF by up to 10 times in terms of signal-to-noise ratio of images. Significantly, only the DC-ARF, due to its high numerical aperture (0.45) and wide-collection bandwidth (>1 um), could provide images in the de-scanned detection configuration desirable for endoscopy. Thus, our systematic characterization and comparison of different optical fibres under different image collection configurations, confirms and establishes the utility of DC-ARFs for high-performing label-free multiphoton imaging based micro-endoscopy., Comment: 29 pages, 13 figures
- Published
- 2023
9. Mode attraction, rejection and control in nonlinear multimode optics
- Author
-
Ji, Kunhao, Davidson, Ian, Sahu, Jayantha, Richardson, David. J., Wabnitz, Stefan, and Guasoni, Massimiliano
- Subjects
Physics - Optics - Abstract
Novel fundamental notions helping in the interpretation of the complex dynamics of nonlinear systems are essential to our understanding and ability to exploit them. In this work we predict and demonstrate experimentally a fundamental property of Kerr-nonlinear media, which we name mode rejection and takes place when two intense counter-propagating beams interact in a multimode waveguide. In stark contrast to mode attraction phenomena, mode rejection leads to the selective suppression of a spatial mode in the forward beam, which is controlled via the counter-propagating backward beam. Starting from this observation we generalise the ideas of attraction and rejection in nonlinear multimode systems of arbitrary dimension, which paves the way towards a more general idea of all-optical mode control. These ideas represent universal tools to explore novel dynamics and applications in a variety of optical and non-optical nonlinear systems. Coherent beam combination in polarization-maintaining multicore fibres is demonstrated as example.
- Published
- 2023
10. Anthropogenic Vector Ecology and Management to Combat Disease Spread in Aquaculture
- Author
-
Lovett, Bailey, Cahill, Patrick, Fletcher, Lauren, Cunningham, Shaun, and Davidson, Ian
- Published
- 2024
- Full Text
- View/download PDF
11. Assessing European Foreign Policy
- Author
-
Davidson, Ian and Gordon, Philip H.
- Published
- 2011
12. On-target delivery of intense ultrafast laser pulses through hollow-core anti-resonant fibers
- Author
-
Lekosiotis, Athanasios, Belli, Federico, Brahms, Christian, Sabbah, Mohammed, Sakr, Hesham, Davidson, Ian A., Poletti, Francesco, and Travers, John C.
- Subjects
Physics - Optics - Abstract
We report the flexible on-target delivery of 800 nm wavelength, 5 GW peak power, 40 fs duration laser pulses through an evacuated and tightly coiled 10 m long hollow-core nested anti-resonant fiber by positively chirping the input pulses to compensate for the anomalous dispersion of the fiber. Near-transform-limited output pulses with high beam quality and a guided peak intensity of 3 PW/cm2 were achieved by suppressing plasma effects in the residual gas by pre-pumping the fiber after evacuation. This appears to cause a long-term removal of molecules from the fiber core. Identifying the fluence at the fiber core-wall interface as the damage origin, we scaled the coupled energy to 2.1 mJ using a short piece of larger-core fiber to obtain 20 GW at the fiber output. This scheme can pave the way towards the integration of anti-resonant fibers in mJ-level nonlinear optical experiments and laser-source development.
- Published
- 2023
13. Autism
- Author
-
Davidson, Ian A., primary, Doherty, Mary, additional, and Haydon, Clair, additional
- Published
- 2024
- Full Text
- View/download PDF
14. Scalable Spectral Clustering with Group Fairness Constraints
- Author
-
Wang, Ji, Lu, Ding, Davidson, Ian, and Bai, Zhaojun
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
There are synergies of research interests and industrial efforts in modeling fairness and correcting algorithmic bias in machine learning. In this paper, we present a scalable algorithm for spectral clustering (SC) with group fairness constraints. Group fairness is also known as statistical parity where in each cluster, each protected group is represented with the same proportion as in the entirety. While FairSC algorithm (Kleindessner et al., 2019) is able to find the fairer clustering, it is compromised by high costs due to the kernels of computing nullspaces and the square roots of dense matrices explicitly. We present a new formulation of underlying spectral computation by incorporating nullspace projection and Hotelling's deflation such that the resulting algorithm, called s-FairSC, only involves the sparse matrix-vector products and is able to fully exploit the sparsity of the fair SC model. The experimental results on the modified stochastic block model demonstrate that s-FairSC is comparable with FairSC in recovering fair clustering. Meanwhile, it is sped up by a factor of 12 for moderate model sizes. s-FairSC is further demonstrated to be scalable in the sense that the computational costs of s-FairSC only increase marginally compared to the SC without fairness constraints.
- Published
- 2022
15. Towards Auditing Unsupervised Learning Algorithms and Human Processes For Fairness
- Author
-
Davidson, Ian and Ravi, S. S.
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computers and Society ,Computer Science - Machine Learning - Abstract
Existing work on fairness typically focuses on making known machine learning algorithms fairer. Fair variants of classification, clustering, outlier detection and other styles of algorithms exist. However, an understudied area is the topic of auditing an algorithm's output to determine fairness. Existing work has explored the two group classification problem for binary protected status variables using standard definitions of statistical parity. Here we build upon the area of auditing by exploring the multi-group setting under more complex definitions of fairness., Comment: 22 pages, 3 figures
- Published
- 2022
16. Explainable Clustering via Exemplars: Complexity and Efficient Approximation Algorithms
- Author
-
Davidson, Ian, Livanos, Michael, Gourru, Antoine, Walker, Peter, Velcin, Julien, and Ravi, S. S.
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Explainable AI (XAI) is an important developing area but remains relatively understudied for clustering. We propose an explainable-by-design clustering approach that not only finds clusters but also exemplars to explain each cluster. The use of exemplars for understanding is supported by the exemplar-based school of concept definition in psychology. We show that finding a small set of exemplars to explain even a single cluster is computationally intractable; hence, the overall problem is challenging. We develop an approximation algorithm that provides provable performance guarantees with respect to clustering quality as well as the number of exemplars used. This basic algorithm explains all the instances in every cluster whilst another approximation algorithm uses a bounded number of exemplars to allow simpler explanations and provably covers a large fraction of all the instances. Experimental results show that our work is useful in domains involving difficult to understand deep embeddings of images and text., Comment: 22 pages; 4 figures
- Published
- 2022
17. Identification and Uses of Deep Learning Backbones via Pattern Mining
- Author
-
Livanos, Michael, primary and Davidson, Ian, additional
- Published
- 2024
- Full Text
- View/download PDF
18. An Exemplars-Based Approach for Explainable Clustering: Complexity and Efficient Approximation Algorithms
- Author
-
Davidson, Ian, primary, Livanos, Michael, additional, Gourru, Antoine, additional, Walker, Peter, additional, and Ravi, Julien Velcin S. S., additional
- Published
- 2024
- Full Text
- View/download PDF
19. Are factors that predict conversion to psychosis associated with initial transition to a high risk state? An adolescent brain cognitive development study analysis
- Author
-
Smucny, Jason, Wood, Avery, Davidson, Ian N., and Carter, Cameron S.
- Published
- 2024
- Full Text
- View/download PDF
20. Benefits and Harms of Interventions to Improve Anxiety, Depression, and Other Mental Health Outcomes for Autistic People: A Systematic Review and Network Meta-Analysis of Randomised Controlled Trials
- Author
-
Linden, Audrey, Best, Lawrence, Elise, Freya, Roberts, Danielle, Branagan, Aoife, Tay, Yong Boon Ernest, Crane, Laura, Cusack, James, Davidson, Brian, Davidson, Ian, Hearst, Caroline, Mandy, William, Rai, Dheeraj, Smith, Edward, and Gurusamy, Kurinchi
- Abstract
Mental health difficulties are prevalent in autistic people with [approximately]14%-50% having experienced depression and [approximately]40%-80% having experienced anxiety disorders. Identifying interventions that improve autistic people's mental health is a top priority. However, at present, there is no high-quality network meta-analysis of benefits and harms of different interventions. We conducted a systematic review and network meta-analysis of randomised controlled trials, searching MEDLINE, EMBASE, other databases, and trial registers until 17 October 2020. We included randomised controlled trials reporting anxiety or depression in a suitable format. We calculated effect estimates and 95% credible intervals using Bayesian network meta-analysis. Our search identified 13,794 reports, of which 71 randomised controlled trials (3630 participants) were eligible for inclusion. All trials had high risk of bias. The follow-up period ranged from 1 to 24 months. Evidence indicates uncertainty about the effects of different interventions, with more high-quality evidence needed. Available evidence suggests that some forms of cognitive behavioural therapy may decrease anxiety and depression scores in autistic children and adults; mindfulness therapy may decrease anxiety and depression scores in autistic adults with previous mental health conditions; and behavioural interventions may provide some benefit for depression in autistic children. We recommend that autistic people are given access to mental health interventions available to non-autistic people, following principles of person-centred care.
- Published
- 2023
- Full Text
- View/download PDF
21. Making clusterings fairer by post-processing: algorithms, complexity results and experiments
- Author
-
Davidson, Ian, Bai, Zilong, Tran, Cindy Mylinh, and Ravi, S. S.
- Published
- 2023
- Full Text
- View/download PDF
22. Data augmentation with Mixup: Enhancing performance of a functional neuroimaging-based prognostic deep learning classifier in recent onset psychosis
- Author
-
Smucny, Jason, Shi, Ge, Lesh, Tyler A, Carter, Cameron S, and Davidson, Ian
- Subjects
Biological Psychology ,Psychology ,Bioengineering ,Biomedical Imaging ,Clinical Research ,Neurosciences ,4.1 Discovery and preclinical testing of markers and technologies ,Mental health ,Humans ,Deep Learning ,Prognosis ,Functional Neuroimaging ,Neuroimaging ,Psychotic Disorders ,Cognitive control ,Ensemble learning ,fMRI ,Machine learning ,Schizophrenia ,Transfer learning ,Biological psychology ,Clinical and health psychology - Abstract
Although deep learning holds great promise as a prognostic tool in psychiatry, a limitation of the method is that it requires large training sample sizes to achieve replicable accuracy. This is problematic for fMRI datasets as they are typically small due to the considerable time, cost, and resources necessary to obtain them. A recently developed self-supervised learning method called Mixup may help overcome this challenge. In Mixup, the learner combines pairs of training instances to produce a virtual third instance that is a linear combination of the two instances and their labels. This procedure is also well-suited to the coregistered images typically found in fMRI datasets. Here we compared performance of a task fMRI-based deep learner with Mixup vs without Mixup on predicting response to treatment in recent onset psychosis. Whole brain fMRI time series data were extracted from a cognitive control task in 82 patients with recent onset psychosis and used to predict "Improver" (n = 47) vs "Non-Improver" (n = 35) status, with Improver defined as showing a 20 % reduction in total Brief Psychiatric Rating Scale score after 1 year of treatment. Mixup significantly improved performance (accuracy without Mixup: 76.5 % [95 % CI: 75.9-77.1 %]; accuracy with Mixup: 80.1 % [95 % CI: 79.4-80.8 %]). Ablation showed the improvement was due to improvement in both Improvers and Non-Improvers. These results suggest that using Mixup may significantly improve performance and reduce overfitting of fMRI-based prognostic deep learners and may also help overcome the small sample size challenge inherent to many neuroimaging datasets.
- Published
- 2022
23. Deep Learning in Neuroimaging: Overcoming Challenges With Emerging Approaches
- Author
-
Smucny, Jason, Shi, Ge, and Davidson, Ian
- Subjects
Biomedical and Clinical Sciences ,Clinical Sciences ,Behavioral and Social Science ,Brain Disorders ,Networking and Information Technology R&D (NITRD) ,Biomedical Imaging ,Clinical Research ,Machine Learning and Artificial Intelligence ,Data Science ,Basic Behavioral and Social Science ,Bioengineering ,Mental Health ,Neurosciences ,4.1 Discovery and preclinical testing of markers and technologies ,Mental health ,Good Health and Well Being ,deep learning ,mixup data augmentation ,transfer learning ,explainable AI ,fMRI ,Public Health and Health Services ,Psychology ,Clinical sciences - Abstract
Deep learning (DL) is of great interest in psychiatry due its potential yet largely untapped ability to utilize multidimensional datasets (such as fMRI data) to predict clinical outcomes. Typical DL methods, however, have strong assumptions, such as large datasets and underlying model opaqueness, that are suitable for natural image prediction problems but not medical imaging. Here we describe three relatively novel DL approaches that may help accelerate its incorporation into mainstream psychiatry research and ultimately bring it into the clinic as a prognostic tool. We first introduce two methods that can reduce the amount of training data required to develop accurate models. These may prove invaluable for fMRI-based DL given the time and monetary expense required to acquire neuroimaging data. These methods are (1) transfer learning - the ability of deep learners to incorporate knowledge learned from one data source (e.g., fMRI data from one site) and apply it toward learning from a second data source (e.g., data from another site), and (2) data augmentation (via Mixup) - a self-supervised learning technique in which "virtual" instances are created. We then discuss explainable artificial intelligence (XAI), i.e., tools that reveal what features (and in what combinations) deep learners use to make decisions. XAI can be used to solve the "black box" criticism common in DL and reveal mechanisms that ultimately produce clinical outcomes. We expect these techniques to greatly enhance the applicability of DL in psychiatric research and help reveal novel mechanisms and potential pathways for therapeutic intervention in mental illness.
- Published
- 2022
24. Deep Fair Discriminative Clustering
- Author
-
Zhang, Hongjing and Davidson, Ian
- Subjects
Computer Science - Machine Learning ,Computer Science - Computers and Society ,Statistics - Machine Learning - Abstract
Deep clustering has the potential to learn a strong representation and hence better clustering performance compared to traditional clustering methods such as $k$-means and spectral clustering. However, this strong representation learning ability may make the clustering unfair by discovering surrogates for protected information which we empirically show in our experiments. In this work, we study a general notion of group-level fairness for both binary and multi-state protected status variables (PSVs). We begin by formulating the group-level fairness problem as an integer linear programming formulation whose totally unimodular constraint matrix means it can be efficiently solved via linear programming. We then show how to inject this solver into a discriminative deep clustering backbone and hence propose a refinement learning algorithm to combine the clustering goal with the fairness objective to learn fair clusters adaptively. Experimental results on real-world datasets demonstrate that our model consistently outperforms state-of-the-art fair clustering algorithms. Our framework shows promising results for novel clustering tasks including flexible fairness constraints, multi-state PSVs and predictive clustering.
- Published
- 2021
25. Deep Descriptive Clustering
- Author
-
Zhang, Hongjing and Davidson, Ian
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Recent work on explainable clustering allows describing clusters when the features are interpretable. However, much modern machine learning focuses on complex data such as images, text, and graphs where deep learning is used but the raw features of data are not interpretable. This paper explores a novel setting for performing clustering on complex data while simultaneously generating explanations using interpretable tags. We propose deep descriptive clustering that performs sub-symbolic representation learning on complex data while generating explanations based on symbolic data. We form good clusters by maximizing the mutual information between empirical distribution on the inputs and the induced clustering labels for clustering objectives. We generate explanations by solving an integer linear programming that generates concise and orthogonal descriptions for each cluster. Finally, we allow the explanation to inform better clustering by proposing a novel pairwise loss with self-generated constraints to maximize the clustering and explanation module's consistency. Experimental results on public data demonstrate that our model outperforms competitive baselines in clustering performance while offering high-quality cluster-level explanations., Comment: Paper accepted at IJCAI 2021
- Published
- 2021
26. A Framework for Deep Constrained Clustering
- Author
-
Zhang, Hongjing, Zhan, Tianyang, Basu, Sugato, and Davidson, Ian
- Subjects
Computer Science - Machine Learning - Abstract
The area of constrained clustering has been extensively explored by researchers and used by practitioners. Constrained clustering formulations exist for popular algorithms such as k-means, mixture models, and spectral clustering but have several limitations. A fundamental strength of deep learning is its flexibility, and here we explore a deep learning framework for constrained clustering and in particular explore how it can extend the field of constrained clustering. We show that our framework can not only handle standard together/apart constraints (without the well documented negative effects reported earlier) generated from labeled side information but more complex constraints generated from new types of side information such as continuous values and high-level domain knowledge. Furthermore, we propose an efficient training paradigm that is generally applicable to these four types of constraints. We validate the effectiveness of our approach by empirical results on both image and text datasets. We also study the robustness of our framework when learning with noisy constraints and show how different components of our framework contribute to the final performance. Our source code is available at $\href{https://github.com/blueocean92/deep_constrained_clustering}{\text{URL}}$., Comment: Data Mining and Knowledge Discovery, 2021. arXiv admin note: substantial text overlap with arXiv:1901.10061
- Published
- 2021
27. NZ Camel Corps badges : some extremely rare
- Author
-
Davidson, Ian
- Published
- 2008
28. Towards Fair Deep Anomaly Detection
- Author
-
Zhang, Hongjing and Davidson, Ian
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Statistics - Machine Learning - Abstract
Anomaly detection aims to find instances that are considered unusual and is a fundamental problem of data science. Recently, deep anomaly detection methods were shown to achieve superior results particularly in complex data such as images. Our work focuses on deep one-class classification for anomaly detection which learns a mapping only from the normal samples. However, the non-linear transformation performed by deep learning can potentially find patterns associated with social bias. The challenge with adding fairness to deep anomaly detection is to ensure both making fair and correct anomaly predictions simultaneously. In this paper, we propose a new architecture for the fair anomaly detection approach (Deep Fair SVDD) and train it using an adversarial network to de-correlate the relationships between the sensitive attributes and the learned representations. This differs from how fairness is typically added namely as a regularizer or a constraint. Further, we propose two effective fairness measures and empirically demonstrate that existing deep anomaly detection methods are unfair. We show that our proposed approach can remove the unfairness largely with minimal loss on the anomaly detection performance. Lastly, we conduct an in-depth analysis to show the strength and limitations of our proposed model, including parameter analysis, feature visualization, and run-time analysis., Comment: Accepted for publication at the ACM Conference on Fairness, Accountability, and Transparency 2021 (ACM FAccT'21)
- Published
- 2020
29. Block Model Guided Unsupervised Feature Selection
- Author
-
Bai, Zilong, Nguyen, Hoa, and Davidson, Ian
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Feature selection is a core area of data mining with a recent innovation of graph-driven unsupervised feature selection for linked data. In this setting we have a dataset $\mathbf{Y}$ consisting of $n$ instances each with $m$ features and a corresponding $n$ node graph (whose adjacency matrix is $\mathbf{A}$) with an edge indicating that the two instances are similar. Existing efforts for unsupervised feature selection on attributed networks have explored either directly regenerating the links by solving for $f$ such that $f(\mathbf{y}_i,\mathbf{y}_j) \approx \mathbf{A}_{i,j}$ or finding community structure in $\mathbf{A}$ and using the features in $\mathbf{Y}$ to predict these communities. However, graph-driven unsupervised feature selection remains an understudied area with respect to exploring more complex guidance. Here we take the novel approach of first building a block model on the graph and then using the block model for feature selection. That is, we discover $\mathbf{F}\mathbf{M}\mathbf{F}^T \approx \mathbf{A}$ and then find a subset of features $\mathcal{S}$ that induces another graph to preserve both $\mathbf{F}$ and $\mathbf{M}$. We call our approach Block Model Guided Unsupervised Feature Selection (BMGUFS). Experimental results show that our method outperforms the state of the art on several real-world public datasets in finding high-quality features for clustering., Comment: Published at KDD2020
- Published
- 2020
- Full Text
- View/download PDF
30. Efficient Algorithms for Generating Provably Near-Optimal Cluster Descriptors for Explainability
- Author
-
Sambaturu, Prathyush, Gupta, Aparna, Davidson, Ian, Ravi, S. S., Vullikanti, Anil, and Warren, Andrew
- Subjects
Computer Science - Data Structures and Algorithms ,Computer Science - Artificial Intelligence ,Computer Science - Discrete Mathematics ,Mathematics - Optimization and Control ,68W25, 68T01, 68R05 ,G.2 ,I.2 ,F.2 - Abstract
Improving the explainability of the results from machine learning methods has become an important research goal. Here, we study the problem of making clusters more interpretable by extending a recent approach of [Davidson et al., NeurIPS 2018] for constructing succinct representations for clusters. Given a set of objects $S$, a partition $\pi$ of $S$ (into clusters), and a universe $T$ of tags such that each element in $S$ is associated with a subset of tags, the goal is to find a representative set of tags for each cluster such that those sets are pairwise-disjoint and the total size of all the representatives is minimized. Since this problem is NP-hard in general, we develop approximation algorithms with provable performance guarantees for the problem. We also show applications to explain clusters from datasets, including clusters of genomic sequences that represent different threat levels.
- Published
- 2020
31. A Graph-Based Approach for Active Learning in Regression
- Author
-
Zhang, Hongjing, Ravi, S. S., and Davidson, Ian
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Active learning aims to reduce labeling efforts by selectively asking humans to annotate the most important data points from an unlabeled pool and is an example of human-machine interaction. Though active learning has been extensively researched for classification and ranking problems, it is relatively understudied for regression problems. Most existing active learning for regression methods use the regression function learned at each active learning iteration to select the next informative point to query. This introduces several challenges such as handling noisy labels, parameter uncertainty and overcoming initially biased training data. Instead, we propose a feature-focused approach that formulates both sequential and batch-mode active regression as a novel bipartite graph optimization problem. We conduct experiments on both noise-free and noisy settings. Our experimental results on benchmark data sets demonstrate the effectiveness of our proposed approach., Comment: SDM 2020 camera-ready. 9 pages, 4 figures, links to supplementary material available at https://sdm2020.s3-us-west-1.amazonaws.com/supplementary.pdf
- Published
- 2020
32. Direct VCSEL interconnection with a hollow core fiber
- Author
-
Jung, Yongmin, Meng, Jing, Harrington, Kerrianne, Sakr, Hesham, Davidson, Ian A., Liang, Sijing, Jasion, Gregory, Poletti, Francesco, and Richardson, David J.
- Published
- 2023
- Full Text
- View/download PDF
33. Comparing machine and deep learning‐based algorithms for prediction of clinical improvement in psychosis with functional magnetic resonance imaging
- Author
-
Smucny, Jason, Davidson, Ian, and Carter, Cameron S
- Subjects
Biological Psychology ,Biomedical and Clinical Sciences ,Psychology ,Brain Disorders ,Mental Health ,Mental Illness ,Schizophrenia ,Serious Mental Illness ,Networking and Information Technology R&D (NITRD) ,Machine Learning and Artificial Intelligence ,Clinical Research ,Mental health ,Adolescent ,Adult ,Bipolar Disorder ,Deep Learning ,Dorsolateral Prefrontal Cortex ,Executive Function ,Female ,Follow-Up Studies ,Functional Neuroimaging ,Humans ,Machine Learning ,Magnetic Resonance Imaging ,Male ,Outcome Assessment ,Health Care ,Psychomotor Performance ,Support Vector Machine ,Young Adult ,cognitive control ,frontoparietal ,neuroimaging ,prognosis ,schizophrenia ,Neurosciences ,Cognitive Sciences ,Experimental Psychology ,Biological psychology ,Cognitive and computational psychology - Abstract
Previous work using logistic regression suggests that cognitive control-related frontoparietal activation in early psychosis can predict symptomatic improvement after 1 year of coordinated specialty care with 66% accuracy. Here, we evaluated the ability of six machine learning (ML) algorithms and deep learning (DL) to predict "Improver" status (>20% improvement on Brief Psychiatric Rating Scale [BPRS] total score at 1-year follow-up vs. baseline) and continuous change in BPRS score using the same functional magnetic resonance imaging-based features (frontoparietal activations during the AX-continuous performance task) in the same sample (individuals with either schizophrenia (n = 65, 49M/16F, mean age 20.8 years) or Type I bipolar disorder (n = 17, 9M/8F, mean age 21.6 years)). 138 healthy controls were included as a reference group. "Shallow" ML methods included Naive Bayes, support vector machine, K Star, AdaBoost, J48 decision tree, and random forest. DL included an explainable artificial intelligence (XAI) procedure for understanding results. The best overall performances (70% accuracy for the binary outcome and root mean square error = 9.47 for the continuous outcome) were achieved using DL. XAI revealed left DLPFC activation was the strongest feature used to make binary classification decisions, with a classification activation threshold (adjusted beta = .017) intermediate to the healthy control mean (adjusted beta = .15, 95% CI = -0.02 to 0.31) and patient mean (adjusted beta = -.13, 95% CI = -0.37 to 0.11). Our results suggest DL is more powerful than shallow ML methods for predicting symptomatic improvement. The left DLPFC may be a functional target for future biomarker development as its activation was particularly important for predicting improvement.
- Published
- 2021
34. Coverage-based Outlier Explanation
- Author
-
Wu, Yue, Akoglu, Leman, and Davidson, Ian
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Outlier detection is a core task in data mining with a plethora of algorithms that have enjoyed wide scale usage. Existing algorithms are primarily focused on detection, that is the identification of outliers in a given dataset. In this paper we explore the relatively under-studied problem of the outlier explanation problem. Our goal is, given a dataset that is already divided into outliers and normal instances, explain what characterizes the outliers. We explore the novel direction of a semantic explanation that a domain expert or policy maker is able to understand. We formulate this as an optimization problem to find explanations that are both interpretable and pure. Through experiments on real-world data sets, we quantitatively show that our method can efficiently generate better explanations compared with rule-based learners.
- Published
- 2019
35. Assigning cause for emerging diseases of aquatic organisms
- Author
-
Hutson, Kate S., Davidson, Ian C., Bennett, Jerusha, Poulin, Robert, and Cahill, Patrick L.
- Published
- 2023
- Full Text
- View/download PDF
36. Towards Fair Deep Clustering With Multi-State Protected Variables
- Author
-
Wang, Bokun and Davidson, Ian
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Fair clustering under the disparate impact doctrine requires that population of each protected group should be approximately equal in every cluster. Previous work investigated a difficult-to-scale pre-processing step for $k$-center and $k$-median style algorithms for the special case of this problem when the number of protected groups is two. In this work, we consider a more general and practical setting where there can be many protected groups. To this end, we propose Deep Fair Clustering, which learns a discriminative but fair cluster assignment function. The experimental results on three public datasets with different types of protected attribute show that our approach can steadily improve the degree of fairness while only having minor loss in terms of clustering quality., Comment: under review as a conference paper at icml 2019
- Published
- 2019
37. A Framework for Deep Constrained Clustering -- Algorithms and Advances
- Author
-
Zhang, Hongjing, Basu, Sugato, and Davidson, Ian
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
The area of constrained clustering has been extensively explored by researchers and used by practitioners. Constrained clustering formulations exist for popular algorithms such as k-means, mixture models, and spectral clustering but have several limitations. A fundamental strength of deep learning is its flexibility, and here we explore a deep learning framework for constrained clustering and in particular explore how it can extend the field of constrained clustering. We show that our framework can not only handle standard together/apart constraints (without the well documented negative effects reported earlier) generated from labeled side information but more complex constraints generated from new types of side information such as continuous values and high-level domain knowledge., Comment: Updated for ECML/PKDD 2019
- Published
- 2019
38. Salmon farm biofouling and potential health impacts to fish from stinging cnidarians
- Author
-
Fletcher, Lauren M., Davidson, Ian C., Bucknall, Bethany G., and Atalah, Javier
- Published
- 2023
- Full Text
- View/download PDF
39. Global marine biosecurity and ship lay-ups: intensifying effects of trade disruptions
- Author
-
Ruiz, Gregory M., Galil, Bella S., Davidson, Ian C., Donelan, Sarah C., Miller, A. Whitman, Minton, Mark S., Muirhead, Jim R., Ojaveer, Henn, Tamburri, Mario N., and Carlton, James T.
- Published
- 2022
- Full Text
- View/download PDF
40. The lover and the tribe
- Author
-
Davidson, Ian, primary
- Published
- 2023
- Full Text
- View/download PDF
41. On The Equivalence of Tries and Dendrograms - Efficient Hierarchical Clustering of Traffic Data
- Author
-
Kuo, Chia-Tung and Davidson, Ian
- Subjects
Computer Science - Databases ,Computer Science - Artificial Intelligence - Abstract
The widespread use of GPS-enabled devices generates voluminous and continuous amounts of traffic data but analyzing such data for interpretable and actionable insights poses challenges. A hierarchical clustering of the trips has many uses such as discovering shortest paths, common routes and often traversed areas. However, hierarchical clustering typically has time complexity of $O(n^2 \log n)$ where $n$ is the number of instances, and is difficult to scale to large data sets associated with GPS data. Furthermore, incremental hierarchical clustering is still a developing area. Prefix trees (also called tries) can be efficiently constructed and updated in linear time (in $n$). We show how a specially constructed trie can compactly store the trips and further show this trie is equivalent to a dendrogram that would have been built by classic agglomerative hierarchical algorithms using a specific distance metric. This allows creating hierarchical clusterings of GPS trip data and updating this hierarchy in linear time. %we can extract a meaningful kernel and can also interpret the structure as clusterings of differing granularity as one progresses down the tree. We demonstrate the usefulness of our proposed approach on a real world data set of half a million taxis' GPS traces, well beyond the capabilities of agglomerative clustering methods. Our work is not limited to trip data and can be used with other data with a string representation.
- Published
- 2018
42. Probabilistic Formulations of Regression with Mixed Guidance
- Author
-
Gress, Aubrey and Davidson, Ian
- Subjects
Computer Science - Learning ,Statistics - Machine Learning - Abstract
Regression problems assume every instance is annotated (labeled) with a real value, a form of annotation we call \emph{strong guidance}. In order for these annotations to be accurate, they must be the result of a precise experiment or measurement. However, in some cases additional \emph{weak guidance} might be given by imprecise measurements, a domain expert or even crowd sourcing. Current formulations of regression are unable to use both types of guidance. We propose a regression framework that can also incorporate weak guidance based on relative orderings, bounds, neighboring and similarity relations. Consider learning to predict ages from portrait images, these new types of guidance allow weaker forms of guidance such as stating a person is in their 20s or two people are similar in age. These types of annotations can be easier to generate than strong guidance. We introduce a probabilistic formulation for these forms of weak guidance and show that the resulting optimization problems are convex. Our experimental results show the benefits of these formulations on several data sets., Comment: Appeared in ICDM 2016
- Published
- 2018
43. Transfer Regression via Pairwise Similarity Regularization
- Author
-
Gress, Aubrey and Davidson, Ian
- Subjects
Computer Science - Learning - Abstract
Transfer learning methods address the situation where little labeled training data from the "target" problem exists, but much training data from a related "source" domain is available. However, the overwhelming majority of transfer learning methods are designed for simple settings where the source and target predictive functions are almost identical, limiting the applicability of transfer learning methods to real world data. We propose a novel, weaker, property of the source domain that can be transferred even when the source and target predictive functions diverge. Our method assumes the source and target functions share a Pairwise Similarity property, where if the source function makes similar predictions on a pair of instances, then so will the target function. We propose Pairwise Similarity Regularization Transfer, a flexible graph-based regularization framework which can incorporate this modeling assumption into standard supervised learning algorithms. We show how users can encode domain knowledge into our regularizer in the form of spatial continuity, pairwise "similarity constraints" and how our method can be scaled to large data sets using the Nystrom approximation. Finally, we present positive and negative results on real and synthetic data sets and discuss when our Pairwise Similarity transfer assumption seems to hold in practice.
- Published
- 2017
44. A Practical On-Line Solution of Control Ash Deposition
- Author
-
Davidson, Ian S., primary
- Published
- 2022
- Full Text
- View/download PDF
45. Recreational boats routinely transfer organisms and promote marine bioinvasions
- Author
-
Ashton, Gail V., Zabin, Chela J., Davidson, Ian C., and Ruiz, Gregory M.
- Published
- 2022
- Full Text
- View/download PDF
46. Dense Transformer Networks
- Author
-
Li, Jun, Chen, Yongjun, Cai, Lei, Davidson, Ian, and Ji, Shuiwang
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Learning ,Computer Science - Neural and Evolutionary Computing ,Statistics - Machine Learning - Abstract
The key idea of current deep learning methods for dense prediction is to apply a model on a regular patch centered on each pixel to make pixel-wise predictions. These methods are limited in the sense that the patches are determined by network architecture instead of learned from data. In this work, we propose the dense transformer networks, which can learn the shapes and sizes of patches from data. The dense transformer networks employ an encoder-decoder architecture, and a pair of dense transformer modules are inserted into each of the encoder and decoder paths. The novelty of this work is that we provide technical solutions for learning the shapes and sizes of patches from data and efficiently restoring the spatial correspondence required for dense prediction. The proposed dense transformer modules are differentiable, thus the entire network can be trained. We apply the proposed networks on natural and biological image segmentation tasks and show superior performance is achieved in comparison to baseline methods.
- Published
- 2017
47. Stronger increase of methane emissions from coastal wetlands by non‐native Spartina alterniflora than non‐native Phragmites australis.
- Author
-
Fuchs, Andrea, Davidson, Ian C., Megonigal, J. Patrick, Devaney, John L., Simkanin, Christina, Noyce, Genevieve L., Lu, Meng, and Cott, Grace M.
- Subjects
- *
GREENHOUSE gases , *COASTAL wetlands , *CARBON in soils , *ATMOSPHERIC carbon dioxide , *SPARTINA alterniflora , *WETLAND restoration , *PHRAGMITES - Abstract
Societal Impact Statement Summary The invasive species S. alterniflora and P. australis are fast growing coastal wetland plants sequestering large amounts of carbon in the soil and protect coastlines against erosion and storm surges. In this global analysis, we found that
Spartina andPhragmites increase methane but not nitrous oxide emissions, withPhragmites having a lesser effect. The impact of the invasive species on emissions differed greatly among different types of native plant groups, providing valuable information to managers and policymakers during coastal wetland planning and restoration efforts. Further, our estimated net emissions per wetland plant group facilitate regional and national blue carbon estimates. Globally, Spartina alterniflora and Phragmites australis are among the most pervasive invasive plants in coastal wetland ecosystems. Both species sequester large amounts of atmospheric carbon dioxide (CO2) and biogenic carbon in soils but also support production and emission of methane (CH4). In this study, we investigated the magnitude of their net greenhouse gas (GHG) release from invaded and non‐invaded habitats. We conducted a meta‐analysis of GHG fluxes associated with these two species and related soil carbon content and plant biomass in invaded coastal wetlands. Our results show that both invasive species increase CH4 fluxes compared to uninvaded coastal wetlands, but they do not significantly affect CO2 and N2O fluxes. The magnitude of emissions fromSpartina andPhragmites differs among native habitats. GHG fluxes, soil carbon and plant biomass ofSpartina ‐invaded habitats were highest compared to uninvaded mudflats and succulent forb‐dominated wetlands, while being lower compared to uninvaded mangroves (except for CH4). This meta‐analysis highlights the important role of individual plant traits as drivers of change by invasive species on plant‐mediated carbon cycles. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
48. Towards Description of Block Model on Graph
- Author
-
Bai, Zilong, Ravi, S. S., Davidson, Ian, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Hutter, Frank, editor, Kersting, Kristian, editor, Lijffijt, Jefrey, editor, and Valera, Isabel, editor
- Published
- 2021
- Full Text
- View/download PDF
49. 404. Predictors of Conversion to Psychosis Also Predict Transition to a High Risk State: An Adolescent Brain and Cognitive Development Study Analysis
- Author
-
Smucny, Jason, primary, Wood, Avery, additional, Davidson, Ian, additional, and Carter, Cameron, additional
- Published
- 2024
- Full Text
- View/download PDF
50. Double-Clad Antiresonant Hollow-Core Fiber and Its Comparison with Other Fibers for Multiphoton Micro-Endoscopy
- Author
-
Szwaj, Marzanna, primary, Davidson, Ian A., additional, Johnson, Peter B., additional, Jasion, Greg, additional, Jung, Yongmin, additional, Sandoghchi, Seyed Reza, additional, Herdzik, Krzysztof P., additional, Bourdakos, Konstantinos N., additional, Wheeler, Natalie V., additional, Mulvad, Hans Christian, additional, Richardson, David J., additional, Poletti, Francesco, additional, and Mahajan, Sumeet, additional
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.