27 results on '"equivariance"'
Search Results
2. Learning Temporally Equivariance for Degenerative Disease Progression in OCT by Predicting Future Representations
- Author
-
Emre, Taha, Chakravarty, Arunava, Lachinov, Dmitrii, Rivail, Antoine, Schmidt-Erfurth, Ursula, Bogunović, Hrvoje, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Linguraru, Marius George, editor, Dou, Qi, editor, Feragen, Aasa, editor, Giannarou, Stamatia, editor, Glocker, Ben, editor, Lekadir, Karim, editor, and Schnabel, Julia A., editor
- Published
- 2024
- Full Text
- View/download PDF
3. Self Supervised Contrastive Learning Combining Equivariance and Invariance
- Author
-
Yang, Longze, Yang, Yan, Jin, Hu, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Zhang, Wenjie, editor, Tung, Anthony, editor, Zheng, Zhonglong, editor, Yang, Zhengyi, editor, Wang, Xiaoyang, editor, and Guo, Hongjie, editor
- Published
- 2024
- Full Text
- View/download PDF
4. Complex-Valued FastICA Estimator with a Weighted Unitary Constraint: A Robust and Equivariant Estimator.
- Author
-
E, Jianwei and Yang, Mingshu
- Subjects
- *
BLIND source separation , *DIGITAL signal processing , *COMPLEX numbers , *BEHAVIORAL assessment , *MISSING data (Statistics) - Abstract
Independent component analysis (ICA), as a statistical and computational approach, has been successfully applied to digital signal processing. Performance analysis for the ICA approach is perceived as a challenging task to work on. This contribution concerns the complex-valued FastICA algorithm in the range of ICA over the complex number domain. The focus is on the robust and equivariant behavior analysis of the complex-valued FastICA estimator. Although the complex-valued FastICA algorithm as well as its derivatives have been widely used methods for approaching the complex blind signal separation problem, rigorous mathematical treatments of the robust measurement and equivariance for the complex-valued FastICA estimator are still missing. This paper strictly analyzes the robustness against outliers and separation performance depending on the global system. We begin with defining the influence function (IF) of complex-valued FastICA functional and followed by deriving its closed-form expression. Then, we prove that the complex-valued FastICA algorithm based on the optimizing cost function is linear-equivariant, depending only on the source signals. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Symmetry-aware Neural Architecture for Embodied Visual Navigation.
- Author
-
Liu, Shuang, Suganuma, Masanori, and Okatani, Takayuki
- Subjects
- *
DEEP reinforcement learning , *NAVIGATION , *REINFORCEMENT learning , *AERONAUTICAL navigation - Abstract
The existing methods for addressing visual navigation employ deep reinforcement learning as the standard tool for the task. However, they tend to be vulnerable to statistical shifts between the training and test data, resulting in poor generalization over novel environments that are out-of-distribution from the training data. In this study, we attempt to improve the generalization ability by utilizing the inductive biases available for the task. Employing the active neural SLAM that learns policies with the advantage actor-critic method as the base framework, we first point out that the mappings represented by the actor and the critic should satisfy specific symmetries. We then propose a network design for the actor and the critic to inherently attain these symmetries. Specifically, we use G-convolution instead of the standard convolution and insert the semi-global polar pooling layer, which we newly design in this study, in the last section of the critic network. Our method can be integrated into existing methods that utilize intermediate goals and 2D occupancy maps. Experimental results show that our method improves generalization ability by a good margin over visual exploration and object goal navigation, which are two main embodied visual navigation tasks. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Boosting deep neural networks with geometrical prior knowledge: a survey.
- Author
-
Rath, Matthias and Condurache, Alexandru Paul
- Abstract
Deep neural networks achieve state-of-the-art results in many different problem settings by exploiting vast amounts of training data. However, collecting, storing and—in the case of supervised learning—labelling the data is expensive and time-consuming. Additionally, assessing the networks’ generalization abilities or predicting how the inferred output changes under input transformations is complicated since the networks are usually treated as a black box. Both of these problems can be mitigated by incorporating prior knowledge into the neural network. One promising approach, inspired by the success of convolutional neural networks in computer vision tasks, is to incorporate knowledge about symmetric geometrical transformations of the problem to solve that affect the output in a predictable way. This promises an increased data efficiency and more interpretable network outputs. In this survey, we try to give a concise overview about different approaches that incorporate geometrical prior knowledge into neural networks. Additionally, we connect those methods to 3D object detection for autonomous driving, where we expect promising results when applying those methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Computational aspects of experimental designs in multiple-group mixed models.
- Author
-
Prus, Maryna and Filová, Lenka
- Abstract
We extend the equivariance and invariance conditions for construction of optimal designs to multiple-group mixed models and, hence, derive the support of optimal designs for first- and second-order models on a symmetric square. Moreover, we provide a tool for computation of D- and L-efficient exact designs in multiple-group mixed models by adapting the algorithm of Harman et al. (Appl Stoch Models Bus Ind, 32:3–17, 2016). We show that this algorithm can be used both for size-constrained problems and also in settings that require multiple resource constraints on the design, such as cost constraints or marginal constraints. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. The Universal Equivariance Properties of Exotic Aromatic B-Series
- Author
-
Laurent, Adrien and Munthe-Kaas, Hans
- Published
- 2024
- Full Text
- View/download PDF
9. ℤ 2 × ℤ 2 Equivariant Quantum Neural Networks: Benchmarking against Classical Neural Networks.
- Author
-
Dong, Zhongtian, Comajoan Cara, Marçal, Dahale, Gopal Ramesh, Forestano, Roy T., Gleyzer, Sergei, Justice, Daniel, Kong, Kyoungchul, Magorsch, Tom, Matchev, Konstantin T., Matcheva, Katia, and Unlu, Eyup B.
- Subjects
- *
ARTIFICIAL neural networks , *LARGE Hadron Collider , *NETWORK performance , *DEEP learning , *TASK performance - Abstract
This paper presents a comparative analysis of the performance of Equivariant Quantum Neural Networks (EQNNs) and Quantum Neural Networks (QNNs), juxtaposed against their classical counterparts: Equivariant Neural Networks (ENNs) and Deep Neural Networks (DNNs). We evaluate the performance of each network with three two-dimensional toy examples for a binary classification task, focusing on model complexity (measured by the number of parameters) and the size of the training dataset. Our results show that the Z 2 × Z 2 EQNN and the QNN provide superior performance for smaller parameter sets and modest training data samples. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
10. A Comparison between Invariant and Equivariant Classical and Quantum Graph Neural Networks.
- Author
-
Forestano, Roy T., Comajoan Cara, Marçal, Dahale, Gopal Ramesh, Dong, Zhongtian, Gleyzer, Sergei, Justice, Daniel, Kong, Kyoungchul, Magorsch, Tom, Matchev, Konstantin T., Matcheva, Katia, and Unlu, Eyup B.
- Subjects
- *
GRAPH neural networks , *QUANTUM graph theory , *MACHINE learning , *LARGE Hadron Collider , *COLLISIONS (Nuclear physics) - Abstract
Machine learning algorithms are heavily relied on to understand the vast amounts of data from high-energy particle collisions at the CERN Large Hadron Collider (LHC). The data from such collision events can naturally be represented with graph structures. Therefore, deep geometric methods, such as graph neural networks (GNNs), have been leveraged for various data analysis tasks in high-energy physics. One typical task is jet tagging, where jets are viewed as point clouds with distinct features and edge connections between their constituent particles. The increasing size and complexity of the LHC particle datasets, as well as the computational models used for their analysis, have greatly motivated the development of alternative fast and efficient computational paradigms such as quantum computation. In addition, to enhance the validity and robustness of deep networks, we can leverage the fundamental symmetries present in the data through the use of invariant inputs and equivariant layers. In this paper, we provide a fair and comprehensive comparison of classical graph neural networks (GNNs) and equivariant graph neural networks (EGNNs) and their quantum counterparts: quantum graph neural networks (QGNNs) and equivariant quantum graph neural networks (EQGNN). The four architectures were benchmarked on a binary classification task to classify the parton-level particle initiating the jet. Based on their area under the curve (AUC) scores, the quantum networks were found to outperform the classical networks. However, seeing the computational advantage of quantum networks in practice may have to wait for the further development of quantum technology and its associated application programming interfaces (APIs). [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. Rotation-equivariant spherical vector networks for objects recognition with unknown poses.
- Author
-
Chen, Hao, Zhao, Jieyu, and Zhang, Qiang
- Subjects
- *
CONVOLUTIONAL neural networks , *OBJECT recognition (Computer vision) , *ROUTING algorithms , *POSE estimation (Computer vision) - Abstract
Analyzing 3D objects without pose priors using neural networks is challenging. In view of the shortcoming that spherical convolutional networks lack the construction of a part–whole hierarchy with rotation equivariance for 3D object recognition with unknown poses, which generates whole rotation-equivariant features that cannot be effectively preserved, a rotation-equivariant part–whole hierarchy spherical vector network is proposed in this paper. In our experiments, we map a 3D object onto the unit sphere, construct an ordered list of vectors from the convolutional layers of the rotation-equivariant spherical convolutional network and then construct a part–whole hierarchy to classify the 3D object using the proposed rotation-equivariant routing algorithm. The experimental results show that the proposed method improves not only the recognition of 3D objects with known poses, but also the recognition of 3D objects with unknown poses compared to previous spherical convolutional neural networks. This finding validates the construction of the rotation-equivariant part–whole hierarchy. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. Randomization Tests for Peer Effects in Group Formation Experiments.
- Author
-
Basse, Guillaume, Ding, Peng, Feller, Avi, and Toulis, Panos
- Subjects
GROUP formation ,PEERS ,PERMUTATION groups ,EDUCATIONAL background ,CAUSAL inference - Abstract
Measuring the effect of peers on individuals' outcomes is a challenging problem, in part because individuals often select peers who are similar in both observable and unobservable ways. Group formation experiments avoid this problem by randomly assigning individuals to groups and observing their responses; for example, do first‐year students have better grades when they are randomly assigned roommates who have stronger academic backgrounds? In this paper, we propose randomization‐based permutation tests for group formation experiments, extending classical Fisher Randomization Tests to this setting. The proposed tests are justified by the randomization itself, require relatively few assumptions, and are exact in finite samples. This approach can also complement existing strategies, such as linear‐in‐means models, by using a regression coefficient as the test statistic. We apply the proposed tests to two recent group formation experiments. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. Galois equivariant functions on Galois orbits in large p-adic fields.
- Author
-
ALEXANDRU, VICTOR and VÂJÂITU, MARIAN
- Subjects
ORBITS (Astronomy) ,ANALYTIC functions ,ALGEBRAIC fields ,ORTHONORMAL basis ,DIFFERENTIABLE functions ,P-adic analysis - Abstract
Given a prime number p let C
p be the topological completion of the algebraic closure of the field of p-adic numbers. Let O(T) be the Galois orbit of a transcendental element T of Cp with respect to the absolute Galois group. Our aim is to study the class of Galois equivariant functions defined on O(T) with values in Cp . We show that each function from this class is continuous and we characterize the class of Lipschitz functions, respectively the class of differentiable functions, with respect to a new orthonormal basis. Then we discuss some aspects related to analytic continuation for the functions of this class. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
14. Complex-Valued FastICA Estimator with a Weighted Unitary Constraint: A Robust and Equivariant Estimator
- Author
-
Jianwei E and Mingshu Yang
- Subjects
complex-valued independent component analysis ,FastICA algorithm ,robustness ,equivariance ,Mathematics ,QA1-939 - Abstract
Independent component analysis (ICA), as a statistical and computational approach, has been successfully applied to digital signal processing. Performance analysis for the ICA approach is perceived as a challenging task to work on. This contribution concerns the complex-valued FastICA algorithm in the range of ICA over the complex number domain. The focus is on the robust and equivariant behavior analysis of the complex-valued FastICA estimator. Although the complex-valued FastICA algorithm as well as its derivatives have been widely used methods for approaching the complex blind signal separation problem, rigorous mathematical treatments of the robust measurement and equivariance for the complex-valued FastICA estimator are still missing. This paper strictly analyzes the robustness against outliers and separation performance depending on the global system. We begin with defining the influence function (IF) of complex-valued FastICA functional and followed by deriving its closed-form expression. Then, we prove that the complex-valued FastICA algorithm based on the optimizing cost function is linear-equivariant, depending only on the source signals.
- Published
- 2024
- Full Text
- View/download PDF
15. ℤ2 × ℤ2 Equivariant Quantum Neural Networks: Benchmarking against Classical Neural Networks
- Author
-
Zhongtian Dong, Marçal Comajoan Cara, Gopal Ramesh Dahale, Roy T. Forestano, Sergei Gleyzer, Daniel Justice, Kyoungchul Kong, Tom Magorsch, Konstantin T. Matchev, Katia Matcheva, and Eyup B. Unlu
- Subjects
quantum computing ,deep learning ,quantum machine learning ,equivariance ,invariance ,supervised learning ,Mathematics ,QA1-939 - Abstract
This paper presents a comparative analysis of the performance of Equivariant Quantum Neural Networks (EQNNs) and Quantum Neural Networks (QNNs), juxtaposed against their classical counterparts: Equivariant Neural Networks (ENNs) and Deep Neural Networks (DNNs). We evaluate the performance of each network with three two-dimensional toy examples for a binary classification task, focusing on model complexity (measured by the number of parameters) and the size of the training dataset. Our results show that the Z2×Z2 EQNN and the QNN provide superior performance for smaller parameter sets and modest training data samples.
- Published
- 2024
- Full Text
- View/download PDF
16. A Comparison between Invariant and Equivariant Classical and Quantum Graph Neural Networks
- Author
-
Roy T. Forestano, Marçal Comajoan Cara, Gopal Ramesh Dahale, Zhongtian Dong, Sergei Gleyzer, Daniel Justice, Kyoungchul Kong, Tom Magorsch, Konstantin T. Matchev, Katia Matcheva, and Eyup B. Unlu
- Subjects
quantum computing ,deep learning ,quantum machine learning ,equivariance ,invariance ,supervised learning ,Mathematics ,QA1-939 - Abstract
Machine learning algorithms are heavily relied on to understand the vast amounts of data from high-energy particle collisions at the CERN Large Hadron Collider (LHC). The data from such collision events can naturally be represented with graph structures. Therefore, deep geometric methods, such as graph neural networks (GNNs), have been leveraged for various data analysis tasks in high-energy physics. One typical task is jet tagging, where jets are viewed as point clouds with distinct features and edge connections between their constituent particles. The increasing size and complexity of the LHC particle datasets, as well as the computational models used for their analysis, have greatly motivated the development of alternative fast and efficient computational paradigms such as quantum computation. In addition, to enhance the validity and robustness of deep networks, we can leverage the fundamental symmetries present in the data through the use of invariant inputs and equivariant layers. In this paper, we provide a fair and comprehensive comparison of classical graph neural networks (GNNs) and equivariant graph neural networks (EGNNs) and their quantum counterparts: quantum graph neural networks (QGNNs) and equivariant quantum graph neural networks (EQGNN). The four architectures were benchmarked on a binary classification task to classify the parton-level particle initiating the jet. Based on their area under the curve (AUC) scores, the quantum networks were found to outperform the classical networks. However, seeing the computational advantage of quantum networks in practice may have to wait for the further development of quantum technology and its associated application programming interfaces (APIs).
- Published
- 2024
- Full Text
- View/download PDF
17. Equivariant tensor network potentials
- Author
-
M Hodapp and A Shapeev
- Subjects
machine learning ,interatomic potential ,tensor network ,equivariance ,Computer engineering. Computer hardware ,TK7885-7895 ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Machine-learning interatomic potentials (MLIPs) have made a significant contribution to the recent progress in the fields of computational materials and chemistry due to the MLIPs’ ability of accurately approximating energy landscapes of quantum-mechanical models while being orders of magnitude more computationally efficient. However, the computational cost and number of parameters of many state-of-the-art MLIPs increases exponentially with the number of atomic features. Tensor (non-neural) networks, based on low-rank representations of high-dimensional tensors, have been a way to reduce the number of parameters in approximating multidimensional functions, however, it is often not easy to encode the model symmetries into them. In this work we develop a formalism for rank-efficient equivariant tensor networks (ETNs), i.e. tensor networks that remain invariant under actions of SO(3) upon contraction. All the key algorithms of tensor networks like orthogonalization of cores and DMRG-based algorithms carry over to our equivariant case. Moreover, we show that many elements of modern neural network architectures like message passing, pulling, or attention mechanisms, can in some form be implemented into the ETNs. Based on ETNs, we develop a new class of polynomial-based MLIPs that demonstrate superior performance over existing MLIPs for multicomponent systems.
- Published
- 2024
- Full Text
- View/download PDF
18. On the universality of Sn -equivariant k-body gates
- Author
-
Sujay Kazi, Martín Larocca, and M Cerezo
- Subjects
quantum neural networks ,equivariance ,quantum computing ,universality ,Science ,Physics ,QC1-999 - Abstract
The importance of symmetries has recently been recognized in quantum machine learning from the simple motto: if a task exhibits a symmetry (given by a group $\mathfrak{G}$ ), the learning model should respect said symmetry. This can be instantiated via $\mathfrak{G}$ -equivariant quantum neural networks (QNNs), i.e. parametrized quantum circuits whose gates are generated by operators commuting with a given representation of $\mathfrak{G}$ . In practice, however, there might be additional restrictions to the types of gates one can use, such as being able to act on at most k qubits. In this work we study how the interplay between symmetry and k -bodyness in the QNN generators affect its expressiveness for the special case of $\mathfrak{G} = S_n$ , the symmetric group. Our results show that if the QNN is generated by one- and two-body S _n -equivariant gates, the QNN is semi-universal but not universal. That is, the QNN can generate any arbitrary special unitary matrix in the invariant subspaces, but has no control over the relative phases between them. Then, we show that in order to reach universality one needs to include n -body generators (if n is even) or $(n-1)$ -body generators (if n is odd). As such, our results brings us a step closer to better understanding the capabilities and limitations of equivariant QNNs.
- Published
- 2024
- Full Text
- View/download PDF
19. Generalization in Deep RL for TSP Problems via Equivariance and Local Search
- Author
-
Ouyang, Wenbin, Wang, Yisen, Weng, Paul, and Han, Shaochen
- Published
- 2024
- Full Text
- View/download PDF
20. Equivariant graph convolutional neural networks for the representation of homogenized anisotropic microstructural mechanical response.
- Author
-
Patel, Ravi, Safta, Cosmin, and Jones, Reese E.
- Subjects
- *
CONVOLUTIONAL neural networks , *STRUCTURAL optimization , *MATERIAL plasticity , *COMPOSITE materials , *MICROSTRUCTURE - Abstract
Composite materials with different microstructural material symmetries are common in engineering applications where grain structure, alloying and particle/fiber packing are optimized via controlled manufacturing. In fact these microstructural tunings can be done throughout a part to achieve functional gradation and optimization at a structural level. To predict the performance of particular microstructural configuration and thereby overall performance, constitutive models of materials with microstructure are needed. In this work we provide neural network architectures that provide effective homogenization models of materials with anisotropic components. These models satisfy equivariance and material symmetry principles inherently through a combination of equivariant and tensor basis operations. We demonstrate them on datasets of stochastic volume elements with different textures and phases where the material undergoes elastic and plastic deformation, and show that the these network architectures provide significant performance improvements. [Display omitted] [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
21. UIR-ES: An unsupervised underwater image restoration framework with equivariance and stein unbiased risk estimator.
- Author
-
Zhu, Jiacheng, Wen, Junjie, Hong, Duanqin, Lin, Zhanpeng, and Hong, Wenxing
- Subjects
- *
ARTIFICIAL neural networks , *IMAGE reconstruction , *LEARNING strategies , *PRIOR learning , *SELF-efficacy - Abstract
Underwater imaging faces challenges for enhancing object visibility and restoring true colors due to the absorptive and scattering characteristics of water. Underwater image restoration (UIR) seeks solutions to restore clean images from degraded ones, providing significant utility in downstream tasks. Recently, data-driven UIR has garnered much attention due to the potent expressive capabilities of deep neural networks (DNNs). These DNNs are supervised, relying on a large amount of labeled training samples. However, acquiring such data is expensive or even impossible in real-world underwater scenarios. While recent researches suggest that unsupervised learning is effective in UIR, none of these frameworks consider signal physical priors. In this work, we present a novel physics-inspired unsupervised UIR framework empowered by equivariance and unbiased estimation techniques. Specifically, equivariance stems from the invariance, inherent in natural signals to enhance data-efficient learning. Given that degraded images invariably contain noise, we propose a noise-tolerant loss for unsupervised UIR based on the Stein unbiased risk estimator to achieve an accurate estimation of the data consistency. Extensive experiments on the benchmark UIR datasets, including the UIEB and RUIE datasets, validate the superiority of the proposed method in terms of quantitative scores, visual outcomes, and generalization ability, compared to state-of-the-art counterparts. Moreover, our method demonstrates even comparable performance with the supervised model. • Introduction of UIR-ES, a fully unsupervised learning framework for UIR. • Proposing scene radiance equivariant learning and observation equivariant learning strategies. • Using SURE to estimate data consistency for improved performance. • Superior UIR performance over previous unsupervised learning methods on two UIR datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. A geometric approach to robust medical image segmentation.
- Author
-
Santhirasekaram, Ainkaran, Winkler, Mathias, Rockall, Andrea, and Glocker, Ben
- Subjects
- *
CARDIAC magnetic resonance imaging , *MAGNETIC resonance imaging , *DEEP learning , *IMAGE segmentation , *GROUP theory - Abstract
Robustness of deep learning segmentation models is crucial for their safe incorporation into clinical practice. However, these models can falter when faced with distributional changes. This challenge is evident in magnetic resonance imaging (MRI) scans due to the diverse acquisition protocols across various domains, leading to differences in image characteristics such as textural appearances. We posit that the restricted anatomical differences between subjects could be harnessed to refine the latent space into a set of shape components. The learned set then aims to encompass the relevant anatomical shape variation found within the patient population. We explore this by utilising multiple MRI sequences to learn texture invariant and shape equivariant features which are used to construct a shape dictionary using vector quantisation. We investigate shape equivariance to a number of different types of groups. We hypothesise and prove that the greater the group order, i.e., the denser the constraint, the better becomes the model robustness. We achieve shape equivariance either with a contrastive based approach or by imposing equivariant constraints on the convolutional kernels. The resulting shape equivariant dictionary is then sampled to compose the segmentation output. Our method achieves state-of-the-art performance for the task of single domain generalisation for prostate and cardiac MRI segmentation. Code is available at https://github.com/AinkaranSanthi/A_Geometric_Perspective_For_Robust_Segmentation. • Geometric constraints in the latent space of a deep learning for Robust Segmentation. • We hypothesis and prove group equivariant constraints in the latent space improves robustness. • A discrete equivariant shape latent space is sampled to construct the segmentation map. • Method demonstrated in the task of single domain generalisation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. On Symmetries and Metrics in Geometric Inference
- Author
-
Marchetti, Giovanni Luca and Marchetti, Giovanni Luca
- Abstract
Spaces of data naturally carry intrinsic geometry. Statistics and machine learning can leverage on this rich structure in order to achieve efficiency and semantic generalization. Extracting geometry from data is therefore a fundamental challenge which by itself defines a statistical, computational and unsupervised learning problem. To this end, symmetries and metrics are two fundamental objects which are ubiquitous in continuous and discrete geometry. Both are suitable for data-driven approaches since symmetries arise as interactions and are thus collectable in practice while metrics can be induced locally from the ambient space. In this thesis, we address the question of extracting geometry from data by leveraging on symmetries and metrics. Additionally, we explore methods for statistical inference exploiting the extracted geometric structure. On the metric side, we focus on Voronoi tessellations and Delaunay triangulations, which are classical tools in computational geometry. Based on them, we propose novel non-parametric methods for machine learning and statistics, focusing on theoretical and computational aspects. These methods include an active version of the nearest neighbor regressor as well as two high-dimensional density estimators. All of them possess convergence guarantees due to the adaptiveness of Voronoi cells. On the symmetry side, we focus on representation learning in the context of data acted upon by a group. Specifically, we propose a method for learning equivariant representations which are guaranteed to be isomorphic to the data space, even in the presence of symmetries stabilizing data. We additionally explore applications of such representations in a robotics context, where symmetries correspond to actions performed by an agent. Lastly, we provide a theoretical analysis of invariant neural networks and show how the group-theoretical Fourier transform emerges in their weights. This addresses the problem of symmetry discovery in a self-supervise, Datamängder innehar en naturlig inneboende geometri. Statistik och maskininlärning kan dra nytta av denna rika struktur för att uppnå effektivitet och semantisk generalisering. Att extrahera geometri ifrån data är därför en grundläggande utmaning som i sig definierar ett statistiskt, beräkningsmässigt och oövervakat inlärningsproblem. För detta ändamål är symmetrier och metriker två grundläggande objekt som är allestädes närvarande i kontinuerlig och diskret geometri. Båda är lämpliga för datadrivna tillvägagångssätt eftersom symmetrier uppstår som interaktioner och är därmed i praktiken samlingsbara medan metriker kan induceras lokalt ifrån det omgivande rummet. I denna avhandling adresserar vi frågan om att extrahera geometri ifrån data genom att utnyttja symmetrier och metriker. Dessutom utforskar vi metoder för statistisk inferens som utnyttjar den extraherade geometriska strukturen. På den metriska sidan fokuserar vi på Voronoi-tessellationer och Delaunay-trianguleringar, som är klassiska verktyg inom beräkningsgeometri. Baserat på dem föreslår vi nya icke-parametriska metoder för maskininlärning och statistik, med fokus på teoretiska och beräkningsmässiga aspekter. Dessa metoder inkluderar en aktiv version av närmaste grann-regressorn samt två högdimensionella täthetsskattare. Alla dessa besitter konvergensgarantier på grund av Voronoi-cellernas anpassningsbarhet. På symmetrisidan fokuserar vi på representationsinlärning i sammanhanget av data som påverkas av en grupp. Specifikt föreslår vi en metod för att lära sig ekvivarianta representationer som garanteras vara isomorfa till datarummet, även i närvaro av symmetrier som stabiliserar data. Vi utforskar även tillämpningar av sådana representationer i ett robotiksammanhang, där symmetrier motsvarar handlingar utförda av en agent. Slutligen tillhandahåller vi en teoretisk analys av invarianta neuronnät och visar hur den gruppteoretiska Fouriertransformen framträder i deras vikter. Detta adresserar problemet med, QC 20240304
- Published
- 2024
24. On Color and Symmetries for Data Efficient Deep Learning
- Author
-
Lengyel, A. (author) and Lengyel, A. (author)
- Abstract
Computer vision algorithms are getting more advanced by the day and slowly approach human-like capabilities, such as detecting objects in cluttered scenes and recognizing facial expressions. Yet, computers learn to perform these tasks very differently from humans. Where humans can generalize between different lighting conditions or geometric orientations with ease, computers require vast amounts of training data to adapt from day to night images, or even to recognize a cat hanging upside-down. This requires additional data, annotations and compute power, increasing the development costs of useful computer vision models. This thesis is therefore concerned with reducing the data and compute hunger of computer vision algorithms by incorporating prior knowledge into the model architecture. Knowledge that is built in no longer needs to be learned from data. This thesis considers various knowledge priors. To improve the robustness of deep learning models to changes in illumination, we make use of color invariant representations derived from physics-based reflection models. We find that a color invariant input layer effectively normalizes the feature map activations throughout the entire network, thereby reducing the distribution shift that normally occurs between day and night images. Equivariance has proven to be a useful network property for improving data efficiency. We introduce the color equivariant convolution, where spatial features are explicitly shared between different colors. This improves generalization to out-of-distribution colors, and therefore reduces the amount of required training data. We subsequently investigate Group Equivariant Convolutions (GConvs). First, we discover that GConv filters learn redundant symmetries, which can be hard-coded using separable convolutions. This preserves equivariance to rotation and mirroring, and improves data and compute efficiency. We also explore the notion of approximate equivariance in GCo, Pattern Recognition and Bioinformatics
- Published
- 2024
25. Learning solid dynamics with graph neural network.
- Author
-
Li, Bohao, Du, Bowen, Ye, Junchen, Huang, Jiajing, Sun, Leilei, and Feng, Jinyan
- Subjects
- *
GRAPH neural networks , *PHYSICAL laws , *DEEP learning , *SOLIDS - Abstract
Deep learning has shown great promise in solid physic dynamic simulation. By incorporating physical laws, recent works have further improved performance. However, existing methods rarely conform to macrophysics and incur computational costs. Furthermore, the velocity direction features of the current model have not been decomposed, forcing the model to learn vector operations. To address the aforementioned problems, we propose a S olid- G raph N etwork (Solid-GN), designed for more accurate and efficient learning of solid dynamics. Firstly, we design a novel and cost-effective symmetric direction encoding system that achieves macroscopic equivariance in 2D space by representing same relative directional relationships among particle pairs and their flipping version. Secondly, we propose a message-passing mechanism incorporated with the contact process, facilitating an accurate description of interactions through the decomposition of normal and tangential effects. Lastly, four novel datasets for solid dynamics with non-uniform radius particles are released, thereby enabling more complex and realistic physical simulations. Experimental evaluations conducted on five datasets demonstrate our model's performance concerning predictive accuracy, macroscopic equivariance, and computational cost over the state-of-the-art ones. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. The Hodge realization of the polylogarithm and the Shintani generating class for totally real fields.
- Author
-
Bannai, Kenichi, Bekki, Hohto, Hagihara, Kei, Ohshita, Tatsuya, Yamada, Kazuki, and Yamamoto, Shuji
- Subjects
- *
ZETA functions , *TORUS , *TORSION , *INTEGERS , *LOGICAL prediction , *DEDEKIND sums - Abstract
In this article, we construct the Hodge realization of the polylogarithm class in the equivariant Deligne–Beilinson cohomology of a certain algebraic torus associated to a totally real field. We then prove that the de Rham realization of this polylogarithm gives the Shintani generating class , a cohomology class generating the values of the Lerch zeta functions of the totally real field at nonpositive integers. Inspired by this result, we give a conjecture concerning the specialization of this polylogarithm class at torsion points, and discuss its relation to the Beilinson conjecture for Hecke characters of totally real fields. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. Attention-gated 3D CapsNet for robust hippocampal segmentation.
- Author
-
Poiret C, Bouyeure A, Patil S, Boniteau C, Duchesnay E, Grigis A, Lemaitre F, and Noulhiane M
- Abstract
Purpose: The hippocampus is organized in subfields (HSF) involved in learning and memory processes and widely implicated in pathologies at different ages of life, from neonatal hypoxia to temporal lobe epilepsy or Alzheimer's disease. Getting a highly accurate and robust delineation of sub-millimetric regions such as HSF to investigate anatomo-functional hypotheses is a challenge. One of the main difficulties encountered by those methodologies is related to the small size and anatomical variability of HSF, resulting in the scarcity of manual data labeling. Recently introduced, capsule networks solve analogous problems in medical imaging, providing deep learning architectures with rotational equivariance. Nonetheless, capsule networks are still two-dimensional and unassessed for the segmentation of HSF., Approach: We released a public 3D Capsule Network (3D-AGSCaps, https://github.com/clementpoiret/3D-AGSCaps) and compared it to equivalent architectures using classical convolutions on the automatic segmentation of HSF on small and atypical datasets (incomplete hippocampal inversion, IHI). We tested 3D-AGSCaps on three datasets with manually labeled hippocampi., Results: Our main results were: (1) 3D-AGSCaps produced segmentations with a better Dice Coefficient compared to CNNs on rotated hippocampi ( p = 0.004 , cohen's d = 0.179 ); (2) on typical subjects, 3D-AGSCaps produced segmentations with a Dice coefficient similar to CNNs while having 15 times fewer parameters (2.285M versus 35.069M). This may greatly facilitate the study of atypical subjects, including healthy and pathological cases like those presenting an IHI., Conclusion: We expect our newly introduced 3D-AGSCaps to allow a more accurate and fully automated segmentation on atypical populations, small datasets, as well as on and large cohorts where manual segmentations are nearly intractable., (© 2024 Society of Photo-Optical Instrumentation Engineers (SPIE).)
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.