1,033 results on '"learning systems"'
Search Results
2. GP3: A Sampling-based Analysis Framework for Gaussian Processes
- Author
-
Lederer, Armin, Kessler, Markus, and Hirche, Sandra
- Published
- 2020
- Full Text
- View/download PDF
3. SuspAct: novel suspicious activity prediction based on deep learning in the real-time environment.
- Author
-
Kansal, Sachin, Jain, Akshat Kumar, Biswas, Moyukh, Bansal, Shaurya, Mahindru, Namay, and Kansal, Priya
- Subjects
- *
DEEP learning , *MACHINE learning , *CRIME prevention , *PLURALITY voting , *PUBLIC safety , *VIDEO surveillance - Abstract
In today's evolving landscape of video surveillance, our study introduces SuspAct, an innovative ensemble model designed to detect suspicious activities in real time swiftly. Leveraging advanced Long-term Recurrent Convolutional Networks (LRCN), SuspAct represents a significant advancement in intelligent surveillance technology. By combining insights from various LRCN models through the Majority Voting ensemble technique, SuspAct enhances its overall robustness, outperforming traditional surveillance methods. Through rigorous experimentation on large-scale datasets, we demonstrate SuspAct's superiority in proactive crime prevention, showcasing its potential to revolutionize security protocols and contribute substantially to public safety. Our work addresses the challenges posed by the escalating volume of video data and lays a strong foundation for future advancements in intelligent video surveillance technology. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Interactive control algorithm for shoulder-amputated prosthesis and object based on reinforcement learning.
- Author
-
Li, Baojiang, Ye, Haiyan, Guo, Yuting, Wang, Haiyan, Qiu, Shengjie, and Bai, Jibo
- Subjects
- *
INTELLIGENT control systems , *REINFORCEMENT learning , *ARTIFICIAL hands , *MACHINE learning , *REINFORCEMENT (Psychology) , *RESIDUAL limbs , *PROSTHETICS - Abstract
As a prosthesis made to compensate for the residual loss of the amputee's limb, the shoulder disarticulation upper limb prosthesis replaces the missing arm function of the shoulder amputee to a certain extent. However, the current upper limb prosthesis mainly interacts with the outside world through the prosthetic hand for grasping and gripping, and the interaction between other parts and the environment is often neglected, which is not in line with the use habits of the human arm. To address this problem, this paper proposes a reinforcement learning–based method for controlling the forearm interaction of a shoulder-disconnected upper limb prosthesis, and analyzes and solves the forces during the interaction, reducing the impact of uncertainty on interaction actions and accelerating training while ensuring the stability of handheld items. We evaluated the performance of the control method during the interaction between the upper limb prosthesis and the external environment through simulation experiments. After the training, the bionic arm was able to push the object into the target range for different objects and pushing distance requirements, which showed the good control effect of the method. Also, the control method can be applied to improve the interaction between the robotic arm and the environment. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Artificial intelligence in dentistry
- Subjects
artificial intelligence ,data ,learning systems ,machine learning ,Dentistry ,RK1-715 - Published
- 2025
- Full Text
- View/download PDF
6. Application of analysis of variance to determine important features of signals for diagnostic classifiers of displacement pumps
- Author
-
Jarosław Konieczny, Waldemar Łatas, and Jerzy Stojek
- Subjects
Learning systems ,Machine learning ,Diagnostics ,Signal analysis ,Multi-piston pump ,Vibration ,Medicine ,Science - Abstract
Abstract This paper presents the use of one-way analysis of variance ANOVA as an effective tool for ranking the features calculated from diagnostic signals and evaluates their impact on the accuracy of the machine learning system's classification of displacement pump wear.The first part includes a review of contemporary diagnostic systems and a description of typical damage of multi-piston displacement pumps and Its causes. The work also contains description of a diagnostic experiment which was conducted in order to obtain the matrix of vibration signals and the matrix of pressures measured at selected locations on the pump housing and at the pump pressure line. The measured signals were subjected to time–frequency analysis. The features of signals calculated in the time and frequency domains were ranked using the ANOVA. The next step involved the use the available classifiers in pump wear evaluation, conducting tests and assessing their effectiveness in terms of the ranking of features and the origin of diagnostic signals.
- Published
- 2024
- Full Text
- View/download PDF
7. Proactive ransomware prevention in pervasive IoMT via hybrid machine learning.
- Author
-
Tariq, Usman and Tariq, Bilal
- Subjects
MACHINE learning ,RANSOMWARE ,FEATURE extraction ,INTERNET of things - Abstract
Advancements in information and communications technology (ICT) have fundamentally transformed computing, notably through the internet of things (IoT) and its healthcare-focused branch, the internet of medical things (IoMT). These technologies, while enhancing daily life, face significant security risks, including ransomware. To counter this, the authors present a scalable, hybrid machine learning framework that effectively identifies IoMT ransomware attacks, conserving the limited resources of IoMT devices. To assess the effectiveness of their proposed solution, the authors undertook an experiment using a state-of-the-art dataset. Their framework demonstrated superiority over conventional detection methods, achieving an impressive 87% accuracy rate. Building on this foundation, the framework integrates a multi-faceted feature extraction process that discerns between benign and malign actions, with a subsequent in-depth analysis via a neural network. This advanced analysis is pivotal in precisely detecting and terminating ransomware threats, offering a robust solution to secure the IoMT ecosystem. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. Generative AI for Cyber Security: Analyzing the Potential of ChatGPT, DALL-E, and Other Models for Enhancing the Security Space
- Author
-
Siva Sai, Utkarsh Yashvardhan, Vinay Chamola, and Biplab Sikdar
- Subjects
Security ,Artificial Intelligence ,machine learning ,natural language processing ,learning systems ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
This research paper intends to provide real-life applications of Generative AI (GAI) in the cybersecurity domain. The frequency, sophistication and impact of cyber threats have continued to rise in today’s world. This ever-evolving threat landscape poses challenges for organizations and security professionals who continue looking for better solutions to tackle these threats. GAI technology provides an effective way for them to address these issues in an automated manner with increasing efficiency. It enables them to work on more critical security aspects which require human intervention, while GAI systems deal with general threat situations. Further, GAI systems can better detect novel malware and threatening situations than humans. This feature of GAI, when leveraged, can lead to higher robustness of the security system. Many tech giants like Google, Microsoft etc., are motivated by this idea and are incorporating elements of GAI in their cybersecurity systems to make them more efficient in dealing with ever-evolving threats. Many cybersecurity tools like Google Cloud Security AI Workbench, Microsoft Security Copilot, SentinelOne Purple AI etc., have come into the picture, which leverage GAI to develop more straightforward and robust ways to deal with emerging cybersecurity perils. With the advent of GAI in the cybersecurity domain, one also needs to take into account the limitations and drawbacks that such systems have. This paper also provides some of the limitations of GAI, like periodically giving wrong results, costly training, the potential of GAI being used by malicious actors for illicit activities etc.
- Published
- 2024
- Full Text
- View/download PDF
9. Adaptive stochastic model predictive control via network ensemble learning.
- Author
-
Xiong, Weiliang, He, Defeng, Mu, Jianbin, and Wang, Xiuli
- Subjects
- *
STOCHASTIC models , *PREDICTION models , *LINEAR systems , *BAYESIAN analysis , *EXPONENTIAL stability , *MACHINE learning , *ADAPTIVE control systems , *FEEDFORWARD neural networks - Abstract
This paper proposes a novel ensemble learning-based adaptive stochastic model predictive control (SMPC) algorithm for constrained linear systems with unknown nonlinear terms and random disturbances. The ensemble network combining a feedforward neural network and a Bayesian network is used to offline learn the nonlinear dynamics and disturbance distribution parameters. Then, the mixed-tube scheme is designed to cope with input constraints and state chance constraints while decreasing computational demands and conservativeness. The reliability of the stochastic tube is guaranteed using the Hoeffding inequality-based verification mechanism, which results in a chance constraint with double probabilities. The feasibility and exponential stability of the SMPC are rigorously proven. A numerical example verifies the merits of the proposed algorithm in terms of the control performance and the feasible domain. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Information Losses in Neural Classifiers From Sampling
- Author
-
Foggo, Brandon, Yu, Nanpeng, Shi, Jie, and Gao, Yuanqi
- Subjects
Information and Computing Sciences ,Machine Learning ,Neural networks ,Machine learning ,Training ,Random variables ,Training data ,Probability distribution ,Learning systems ,Deep learning ,information theory ,large deviations theory ,mutual information ,statistical learning theory ,cs.LG ,stat.ML ,Artificial Intelligence & Image Processing ,Artificial intelligence - Abstract
This article considers the subject of information losses arising from the finite data sets used in the training of neural classifiers. It proves a relationship between such losses as the product of the expected total variation of the estimated neural model with the information about the feature space contained in the hidden representation of that model. It then bounds this expected total variation as a function of the size of randomly sampled data sets in a fairly general setting, and without bringing in any additional dependence on model complexity. It ultimately obtains bounds on information losses that are less sensitive to input compression and in general much smaller than existing bounds. This article then uses these bounds to explain some recent experimental findings of information compression in neural networks that cannot be explained by previous work. Finally, this article shows that not only are these bounds much smaller than existing ones, but they also correspond well with experiments.
- Published
- 2020
11. Classification of Wear State for a Positive Displacement Pump Using Deep Machine Learning †.
- Author
-
Konieczny, Jarosław, Łatas, Waldemar, and Stojek, Jerzy
- Subjects
- *
HYDRAULIC control systems , *MACHINE learning , *DEEP learning , *RECIPROCATING pumps , *ENERGY dissipation , *TIME-frequency analysis , *CLASSIFICATION - Abstract
Hydraulic power systems are commonly used in heavy industry (usually highly energy-intensive) and are often associated with high power losses. Designing a suitable system to allow an early assessment of the wear conditions of components in a hydraulic system (e.g., an axial piston pump) can effectively contribute to reducing energy losses during use. This paper presents the application of a deep machine learning system to determine the efficiency state of a multi-piston positive displacement pump. Such pumps are significant in high-power hydraulic systems. The correct operation of the entire hydraulic system often depends on its proper functioning. The wear and tear of individual pump components usually leads to a decrease in the pump's operating pressure and volumetric losses, subsequently resulting in a decrease in overall pump efficiency and increases in vibration and pump noise. This in turn leads to an increase in energy losses throughout the hydraulic system, which releases excess heat. Typical failures of the discussed pumps and their causes are described after reviewing current research work using deep machine learning. Next, the test bench on which the diagnostic experiment was conducted and the selected operating signals that were recorded are described. The measured signals were subjected to a time–frequency analysis, and their features, calculated in terms of the time and frequency domains, underwent a significance ranking using the minimum redundancy maximum relevance (MRMR) algorithm. The next step was to design a neural network structure to classify the wear state of the pump and to test and evaluate the effectiveness of the network's recognition of the pump's condition. The whole study was summarized with conclusions. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
12. Graph Lifelong Learning: A Survey.
- Author
-
Febrinanto, Falih Gozi, Xia, Feng, Moore, Kristen, Thapa, Chandra, and Aggarwal, Charu
- Abstract
Graph learning is a popular approach for perfor ming machine learning on graph-structured data. It has revolutionized the machine learning ability to model graph data to address downstream tasks. Its application is wide due to the availability of graph data ranging from all types of networks to information systems. Most graph learning methods assume that the graph is static and its complete structure is known during training. This limits their applicability since they cannot be applied to problems where the underlying graph grows over time and/or new tasks emerge incrementally. Such applications require a lifelong learning approach that can learn the graph continuously and accommodate new information whilst retaining previously learned knowledge. Lifelong learning methods that enable continuous learning in regular domains like images and text cannot be directly applied to continuously evolving graph data, due to its irregular structure. As a result, graph lifelong learning is gaining attention from the research community. This survey paper provides a comprehensive overview of recent advancements in graph lifelong learning, including the categorization of existing methods, and the discussions of potential applications and open research problems. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
13. A Hybrid Deep Network Framework for Android Malware Detection.
- Author
-
Zhu, Hui-Juan, Wang, Liang-Min, Zhong, Sheng, Li, Yang, and Sheng, Victor S.
- Subjects
- *
DEEP learning , *MACHINE learning , *MALWARE , *SUPPORT vector machines , *BLENDED learning , *CLASSIFICATION algorithms - Abstract
Android is a growing target for malicious software (malware) because of its popularity and functionality. Malware poses a serious threat to users’ privacy, money, equipment and file integrity. A series of data-driven malware detection methods were proposed. However, there exist two key challenges for these methods: (1) how to learn effective feature representation from raw data; (2) how to reduce the dependence on the prior knowledge or human labors in feature learning. Inspired by the success of deep learning methods in the feature representation learning community, we propose a malware detection framework which starts with learning rich-features by a novel unsupervised feature learning algorithm Merged Sparse Auto-Encoder (MSAE). In order to extract more compact and discriminative feature from the rich-features to further boost the malware detection capability, a hybrid deep network learning algorithm Stacked Hybrid Learning MSAE and SDAE (SHLMD) is established by further incorporating a classical deep learning method Stacked Denoising Auto-encoders (SDAE). After that, we feed the feature learned by MSAE and SHLMD respectively to classification algorithms, e.g., Support Vector Machine (SVM) or K-NearestNeighbor (KNN), to train a malware detection model. Evaluation results on two real-world datasets demonstrate that SHLMD achieves 94.46 and 90.57 percent accuracy respectively, which outperforms the classical unsupervised feature representation learning Sparse Auto-encoder (SAE). MSAE performs similarly to SAE. SHLMD can further improve the performance of MSAE and the supervised fine-tuned method SDAE. Besides, we compare the performance of our methods with that of state-of-the-art detection approaches, including classical deep-learning-based methods. Extensive experiments show that our proposed methods are effective enough to detect Android malware. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
14. Safety Assurance of Artificial Intelligence-Based Systems: A Systematic Literature Review on the State of the Art and Guidelines for Future Work
- Author
-
Antonio V. Silva Neto, Joao B. Camargo, Jorge R. Almeida, and Paulo S. Cugnasca
- Subjects
Artificial intelligence ,formal verification ,learning systems ,machine learning ,neural networks ,product safety engineering ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
The objective of this research is to present the state of the art of the safety assurance of Artificial Intelligence (AI)-based systems and guidelines on future correlated work. For this purpose, a Systematic Literature Review comprising 5090 peer-reviewed references relating safety to AI has been carried out, with focus on a 329-reference subset in which the safety assurance of AI-based systems is directly conveyed. From 2016 onwards, the safety assurance of AI-based systems has experienced significant effervescence and leaned towards five main approaches: performing black-box testing, using safety envelopes, designing fail-safe AI, combining white-box analyses with explainable AI, and establishing a safety assurance process throughout systems’ lifecycles. Each of these approaches has been discussed in this paper, along with their features, pros and cons. Finally, guidelines for future research topics have also been presented. They result from an analysis based on both the cross-fertilization among the reviewed references and the authors’ experience with safety and AI. Among 15 research themes, these guidelines reinforce the need for deepening guidelines for the safety assurance of AI-based systems by, e.g., analyzing datasets from a safety perspective, designing explainable AI, setting and justifying AI hyperparameters, and assuring the safety of hardware-implemented AI-based systems.
- Published
- 2022
- Full Text
- View/download PDF
15. Emulating AC OPF Solvers With Neural Networks.
- Author
-
Baker, Kyri
- Subjects
- *
ELECTRICAL load , *ARTIFICIAL neural networks , *MATHEMATICAL optimization , *ACROMIOCLAVICULAR joint - Abstract
Using machine learning to obtain solutions to AC optimal power flow has recently been a very active area of research due to the astounding speedups that result from bypassing traditional optimization techniques. However, generally ensuring feasibility of the resulting predictions while maintaining these speedups is a challenging, unsolved problem. In this letter, we train a neural network to emulate an iterative solver in order to cheaply and approximately iterate towards the optimum. Once we are close to convergence, we then solve a power flow to obtain an overall AC-feasible solution. Results shown for networks up to 1,354 buses indicate the proposed method is capable of finding feasible, near-optimal solutions to AC OPF in milliseconds on a laptop computer. In addition, it is shown that the proposed method can find “difficult” AC OPF solutions that cause flat-start or DC-warm started algorithms to diverge. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
16. Argumentation-Based Online Incremental Learning.
- Author
-
Ayoobi, Hamed, Cao, Ming, Verbrugge, Rineke, and Verheij, Bart
- Subjects
- *
ONLINE education , *REINFORCEMENT learning , *HUMAN-robot interaction , *ONLINE algorithms , *MACHINE learning , *FAILED states - Abstract
The environment around general-purpose service robots has a dynamic nature. Accordingly, even the robot’s programmer cannot predict all the possible external failures which the robot may confront. This research proposes an online incremental learning method that can be further used to autonomously handle external failures originating from a change in the environment. Existing research typically offers special-purpose solutions. Furthermore, the current incremental online learning algorithms cannot generalize well with just a few observations. In contrast, our method extracts a set of hypotheses, which can then be used for finding the best recovery behavior at each failure state. The proposed argumentation-based online incremental learning approach uses an abstract and bipolar argumentation framework to extract the most relevant hypotheses and model the defeasibility relation between them. This leads to a novel online incremental learning approach that overcomes the addressed problems and can be used in different domains including robotic applications. We have compared our proposed approach with state-of-the-art online incremental learning approaches, an approximation-based reinforcement learning method, and several online contextual bandit algorithms. The experimental results show that our approach learns more quickly with a lower number of observations and also has higher final precision than the other methods. Note to Practitioners—This work proposes an online incremental learning method that learns faster by using a lower number of failure states than other state-of-the-art approaches. The resulting technique also has higher final learning precision than other methods. Argumentation-based online incremental learning generates an explainable set of rules which can be further used for human-robot interaction. Moreover, testing the proposed method using a publicly available dataset suggests wider applicability of the proposed incremental learning method outside the robotics field wherever an online incremental learner is required. The limitation of the proposed method is that it aims for handling discrete feature values. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
17. Adaptive Neighborhood Metric Learning.
- Author
-
Song, Kun, Han, Junwei, Cheng, Gong, Lu, Jiwen, and Nie, Feiping
- Subjects
- *
MACHINE learning , *NEIGHBORHOODS , *DEEP learning , *DISTANCE education - Abstract
In this paper, we reveal that metric learning would suffer from serious inseparable problem if without informative sample mining. Since the inseparable samples are often mixed with hard samples, current informative sample mining strategies used to deal with inseparable problem may bring up some side-effects, such as instability of objective function, etc. To alleviate this problem, we propose a novel distance metric learning algorithm, named adaptive neighborhood metric learning (ANML). In ANML, we design two thresholds to adaptively identify the inseparable similar and dissimilar samples in the training procedure, thus inseparable sample removing and metric parameter learning are implemented in the same procedure. Due to the non-continuity of the proposed ANML, we develop an ingenious function, named log-exp mean function to construct a continuous formulation to surrogate it, which can be efficiently solved by the gradient descent method. Similar to Triplet loss, ANML can be used to learn both the linear and deep embeddings. By analyzing the proposed method, we find it has some interesting properties. For example, when ANML is used to learn the linear embedding, current famous metric learning algorithms such as the large margin nearest neighbor (LMNN) and neighbourhood components analysis (NCA) are the special cases of the proposed ANML by setting the parameters different values. When it is used to learn deep features, the state-of-the-art deep metric learning algorithms such as Triplet loss, Lifted structure loss, and Multi-similarity loss become the special cases of ANML. Furthermore, the log-exp mean function proposed in our method gives a new perspective to review the deep metric learning methods such as Prox-NCA and N-pairs loss. At last, promising experimental results demonstrate the effectiveness of the proposed method. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
18. Toward Region-Aware Attention Learning for Scene Graph Generation.
- Author
-
Liu, An-An, Tian, Hongshuo, Xu, Ning, Nie, Weizhi, Zhang, Yongdong, and Kankanhalli, Mohan
- Subjects
- *
MACHINE learning , *VISUAL perception , *ARTIFICIAL neural networks - Abstract
Scene graph generation (SGGen) is a challenging task due to a complex visual context of an image. Intuitively, the human visual system can volitionally focus on attended regions by salient stimuli associated with visual cues. For example, to infer the relationship between man and horse, the interaction between human leg and horseback can provide strong visual evidence to predict the predicate ride. Besides, the attended region face can also help to determine the object man. Till now, most of the existing works studied the SGGen by extracting coarse-grained bounding box features while understanding fine-grained visual regions received limited attention. To mitigate the drawback, this article proposes a region-aware attention learning method. The key idea is to explicitly construct the attention space to explore salient regions with the object and predicate inferences. First, we extract a set of regions in an image with the standard detection pipeline. Each region regresses to an object. Second, we propose the object-wise attention graph neural network (GNN), which incorporates attention modules into the graph structure to discover attended regions for object inference. Third, we build the predicate-wise co-attention GNN to jointly highlight subject’s and object’s attended regions for predicate inference. Particularly, each subject-object pair is connected with one of the latent predicates to construct one triplet. The proposed intra-triplet and inter-triplet learning mechanism can help discover the pair-wise attended regions to infer predicates. Extensive experiments on two popular benchmarks demonstrate the superiority of the proposed method. Additional ablation studies and visualization further validate its effectiveness. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
19. Broad Learning Based Dynamic Fuzzy Inference System With Adaptive Structure and Interpretable Fuzzy Rules.
- Author
-
Bai, Kaiyuan, Zhu, Xiaomin, Wen, Shiping, Zhang, Runtong, and Zhang, Wenyu
- Subjects
FUZZY logic ,SMART structures ,FUZZY systems ,PARSIMONIOUS models ,ARTIFICIAL neural networks ,MACHINE learning - Abstract
This article investigates the feasibility of applying the broad learning system (BLS) to realize a novel Takagi–Sugeno–Kang (TSK) neuro-fuzzy model, namely a broad learning based dynamic fuzzy inference system (BL-DFIS). It not only improves the accuracy and interpretability of neuro-fuzzy models but also solves the challenging problem that models are incapable of determining the optimal architecture autonomously. BL-DFIS first accomplishes a TSK fuzzy system under the framework of BLS, in which an extreme learning machine auto-encoder is employed to obtain feature representation in a fast and analytical way, and an interpretable linguistic fuzzy rule is integrated into the enhancement node to ensure the high interpretability of the system. Meanwhile, the extended-enhancement unit is designed to achieve the first-order TSK fuzzy system. In addition, a dynamic incremental learning algorithm with internal pruning and updating mechanism is developed for the learning of BL-DFIS, which enables the system to automatically assemble the optimal structure to obtain a compact rule base and an excellent classification performance. Experiments on benchmark datasets demonstrate that the proposed BL-DFIS can achieve a better classification performance than some state-of-the-art nonfuzzy and neuro-fuzzy methods, simultaneously using the most parsimonious model structure. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
20. Unsupervised Meta Learning With Multiview Constraints for Hyperspectral Image Small Sample set Classification.
- Author
-
Gao, Kuiliang, Liu, Bing, Yu, Xuchu, and Yu, Anzhu
- Subjects
- *
DEEP learning , *SUPERVISED learning , *MACHINE learning , *CLASSIFICATION - Abstract
The difficulties of obtaining sufficient labeled samples have always been one of the factors hindering deep learning models from obtaining high accuracy in hyperspectral image (HSI) classification. To reduce the dependence of deep learning models on training samples, meta learning methods have been introduced, effectively improving the classification accuracy in small sample set scenarios. However, the existing methods based on meta learning still need to construct a labeled source data set with several pre-collected HSIs, and must utilize a large number of labeled samples for meta-training, which is actually time-consuming and labor-intensive. To solve this problem, this paper proposes a novel unsupervised meta learning method with multiview constraints for HSI small sample set classification. Specifically, the proposed method first builds an unlabeled source data set using unlabeled HSIs. Then, multiple spatial-spectral multiview features of each unlabeled sample are generated to construct tasks for unsupervised meta learning. Finally, the designed residual relation network is used for meta-training and small sample set classification based on the voting strategy. Compared with existing supervised meta learning methods for HSI classification, our method can only utilize HSIs without any label for unsupervised meta learning, which significantly reduces the number of requisite labeled samples in the whole classification process. To verify the effectiveness of the proposed method, extensive experiments are carried out on 8 public HSIs in the cross-domain and in-domain classification scenarios. The statistical results demonstrate that, compared with existing supervised meta learning methods and other advanced classification models, the proposed method can achieve competitive or better classification performance in small sample set scenarios. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
21. The Secrets of Data Science Deployments.
- Author
-
Fayyad, Usama M. and Fayyad, Usama
- Subjects
PRAGMATICS ,DATA science ,MACHINE learning ,INDUSTRIAL costs ,COVID-19 pandemic ,TRUST - Abstract
Much attention is paid to data science and machine learning as an effective means for getting value out of data and as a means for dealing with the large amounts of data we are accumulating at companies and organizations. This has gained importance with the major waves of digitization we have seen, especially with the COVID-19 pandemic accelerating digital everything. However, in reality, most machine learning models, despite achieving good technical solutions to predictive problems wind up not being deployed. The reasons for this are many and have their origin in data scientists and machine learning practitioners not paying enough attention to issues of deployment in production. The issues range all the way from establishing trust by business stakeholders and users, to failure to explain why models work and when they do not, to failing to appreciate the importance of establishing a robust quality data pipeline, to ignoring many constraints that apply to deployed models, and finally to a lack of understanding the true cost of production deployment and the associated ROI. We discuss many of these problems and we provide what we believe is a pragmatic approach to getting data science models successfully deployed in working environments. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
22. Practices and Infrastructures for Machine Learning Systems: An Interview Study in Finnish Organizations.
- Author
-
Muiruri, Dennis, Lwakatare, Lucy Ellen, Nurminen, Jukka K., and Mikkonen, Tommi
- Subjects
- *
INSTRUCTIONAL systems , *ARTIFICIAL intelligence , *MACHINE learning , *SOFTWARE engineering , *SOFTWARE engineers - Abstract
Using interviews, we investigated the practices and toolchains for machine learning (ML)-enabled systems from 16 organizations across various domains in Finland. We observed some well-established artificial intelligence engineering approaches, but practices and tools are still needed for the testing and monitoring of ML-enabled systems. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
23. Nonlocal Self-Similarity-Based Hyperspectral Remote Sensing Image Denoising With 3-D Convolutional Neural Network.
- Author
-
Wang, Zhicheng, Ng, Michael K., Zhuang, Lina, Gao, Lianru, and Zhang, Bing
- Subjects
- *
CONVOLUTIONAL neural networks , *IMAGE denoising , *REMOTE sensing , *THREE-dimensional imaging , *DEEP learning , *MACHINE learning - Abstract
Recently, deep-learning-based denoising methods for hyperspectral images (HSIs) have been comprehensively studied and achieved impressive performance because they can effectively extract complex and nonlinear image features. Compared with deep-learning-based methods, the nonlocal similarity-based denoising methods are more suitable for images containing edges or regular textures. We propose a powerful HSI denoising method, termed non-local 3-D convolutional neural network (NL-3DCNN), combining traditional machine learning and deep learning techniques. NL-3DCNN exploits the high spectral correlation of an HSI using subspace representation, and the corresponding representation coefficients are termed eigenimages. The high spatial correlation in eigenimages is exploited by grouping nonlocal similar patches, which are denoised by a 3-D convolutional neural network. The numerical and graphical denoising results of the simulated and real data show that the proposed method is superior to the state-of-the-art methods. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
24. A Learning-Based AoA Estimation Method for Device-Free Localization.
- Author
-
Hong, Ke, Wang, Tianyu, Liu, Junchen, Wang, Yu, and Shen, Yuan
- Abstract
Device-free localization (DFL), an important aspect in integrated sensing and communication, can be achieved through exploiting multipath components in ultra-wide bandwidth systems. However, incorrect identification of multipath components in the channel impulse responses will lead to large angle-of-arrival (AoA) estimation errors and subsequently poor localization performance. This letter proposes a learning-based AoA estimation method to improve the DFL accuracy. In the proposed method, we first design a classifier to identify the multipath components and then exploit the phase-difference-of-arrival to mitigate the AoA estimation error through a multilayer perceptron. Our learning-based method is validated using the datasets collected by ultra-wide bandwidth arrays, which significantly outperforms conventional methods in terms of AoA estimation and localization performance. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
25. Incremental Motor Skill Learning and Generalization From Human Dynamic Reactions Based on Dynamic Movement Primitives and Fuzzy Logic System.
- Author
-
Lu, Zhenyu, Wang, Ning, Li, Miao, and Yang, Chenguang
- Subjects
FUZZY logic ,FUZZY systems ,RADIAL basis functions ,IMPEDANCE control ,MACHINE learning ,MOTOR learning ,STIMULUS generalization - Abstract
Different from previous work on single skill learning from human demonstrations, an incremental motor skill learning, generalization and control method based on dynamic movement primitives (DMP) and broad learning system (BLS) is proposed for extracting both ordinary skills and instant reactive skills from demonstrations, the latter of which is usually generated to avoid a sudden danger (e.g., touching a hot cup). The method is completed in three steps. First, the ordinary skills are basically learned from demonstrations in normal cases by using DMP. Then, the incremental learning idea of BLS is combined with DMP to achieve multistylistic reactive skill learning such that the forcing function of the ordinary skills will be reasonably extended into multiple stylistic functions by adding enhancement terms and updating weights of the radial basis function kernels. Finally, electromyography signals are collected from human muscles and processed to achieve stiffness factors. By using fuzzy logic system, the two kinds of skills learned are integrated and generalized in new cases such that not only start, end and scaling factors but also the environmental conditions, robot reactive strategies and impedance control factors will be generalized to lead to various reactions. To verify the effectiveness of the proposed method, an obstacle avoidance experiment that enables robots to approach destinations flexibly in various situations with barriers will be undertaken. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
26. Interactive Transfer Learning-Assisted Fuzzy Neural Network.
- Author
-
Han, Honggui, Liu, Hongxu, Liu, Zheng, and Qiao, Junfei
- Subjects
FUZZY neural networks ,INTERACTIVE learning ,MACHINE learning ,BENCHMARK problems (Computer science) ,ARTIFICIAL neural networks - Abstract
Transfer learning algorithm can provide a framework to utilize the previous knowledge to train fuzzy neural network (FNN). However, the performance of TL-based FNN will be destroyed by the knowledge over-fitting problem in the learning process. To solve this problem, an interactive transfer learning (ITL) algorithm, which can alleviate the negative transfer among different domains to improve the learning performance of FNN, is designed and analyzed in this article. This ITL-assisted FNN (ITL-FNN) contains the following advantages. First, a knowledge filter algorithm is developed to reconstruct the knowledge in source scene by balancing the matching accuracy and diversity. Then, the knowledge from source scene can fit the instance of target scene with suitable accuracy. Second, a self-balancing mechanism is designed to balance the driven information between the source and target scenes. Then, the knowledge can be refitted to reduce the useless information. Third, a structural competition algorithm is proposed to adjust the knowledge of FNN. Then, the proposed ITL-FNN can achieve compact structure to improve the generalization performance. Finally, some benchmark problems and industrial applications are provided to demonstrate the merits of ITL-FNN. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
27. Linear Quadratic Tracking With Reinforcement Learning Based Reference Trajectory Optimization for the Lunar Hopper in Simulated Environment
- Author
-
Toshiki Tanaka, Heidar Malki, and Marzia Cescon
- Subjects
Learning systems ,linear feedback control systems ,machine learning ,state feedback ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
In this work, we provide a novel optimal guidance and control strategy for lunar hopper obstacle-avoidance, descent, and landing problem and demonstrate its behavior using numerical simulations. More specifically, the major contributions of this paper are three-fold: 1) proposed a feedback-based reference trajectory design for lunar hopper guidance, 2) developed the mathematical models and equations of linear quadratic tracking (LQT) controller for lunar hopper control, and 3) developed a method using reinforcement learning to optimize the designed reference trajectory in conjunction with the designed LQT controller, the so- called linear quadratic tracking with reinforcement learning based reference trajectory optimization (LQT-RTO). We demonstrated the LQT-RTO under a 2-dimensional (2D) lunar hopper simulation environment with 1) the LQT with heuristic reference trajectory design (LQT-HTD) and 2) reinforcement learning (reinforcement learning based controller, or RLC). We confirmed by numerical simulation that the LTQ-RTO outperformed the LQT-HTD in terms of fuel consumption, and outperformed the RLC in terms of landing success rate. Lastly, we provided theoretical interpretation to the simulation results.
- Published
- 2021
- Full Text
- View/download PDF
28. Role-Based Graph Embeddings.
- Author
-
Ahmed, Nesreen K., Rossi, Ryan A., Lee, John Boaz, Willke, Theodore L., Zhou, Rong, Kong, Xiangnan, and Eldardiry, Hoda
- Subjects
- *
RANDOM walks , *TASK analysis - Abstract
Random walks are at the heart of many existing node embedding and network representation learning methods. However, such methods have many limitations that arise from the use of traditional random walks, e.g., the embeddings resulting from these methods capture proximity (communities) among the vertices as opposed to structural similarity (roles). Furthermore, the embeddings are unable to transfer to new nodes and graphs as they are tied to node identity. To overcome these limitations, we introduce the Role2Vec framework based on the proposed notion of attributed random walks to learn structural role-based embeddings. Notably, the framework serves as a basis for generalizing any walk-based method. The Role2Vec framework enables these methods to be more widely applicable by learning inductive functions that capture the structural roles in the graph. Furthermore, the original methods are recovered as a special case of the framework when each vertex is mapped to its own function that uniquely identifies it. Finally, the Role2Vec framework is shown to be effective with an average AUC improvement of 17.8 percent for link prediction while requiring on average 853x less space than existing methods on a variety of graphs from different domains. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
29. Incremental Learning With Open-Set Recognition for Remote Sensing Image Scene Classification.
- Author
-
Liu, Weiwei, Nie, Xiangli, Zhang, Bo, and Sun, Xian
- Subjects
- *
MACHINE learning , *REMOTE sensing , *IMAGE recognition (Computer vision) , *TEXT recognition , *MNEMONICS - Abstract
Image scene classification aiming to assign specific semantic labels for each image is vitally important for the applications of remote sensing (RS) data. In real world, since the observation environment is open and dynamic, RS images are collected sequentially and the numbers of images and classes grow rapidly over time. Most existing scene classification methods are offline learning algorithms, which are inefficient and unscalable for this scenario. In this article, an incremental learning with open-set recognition (ILOSR) framework is proposed for RS image scene classification in the open and dynamic environment, which can identify the unknown classes from a stream of data and learn these new classes incrementally. Specifically, a controllable convex hull-based exemplar selection strategy is designed to address the catastrophic forgetting issue in incremental learning, which can reduce training time and memory footprint effectively. In addition, a new loss function based on prototype learning and uncertainty measurement is proposed for OSR to enhance the interclass discrimination and intraclass compactness of the learned deep features. Experimental results on real RS datasets demonstrate that the proposed method can not only outperform the state-of-the-art approaches on offline classification, incremental learning, and OSR problem separately but also achieve better and more stable performance in the experiments for ILOSR. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
30. Pearson Correlation Coefficient-Based Performance Enhancement of Broad Learning System for Stock Price Prediction.
- Author
-
Li, Guanzhi, Zhang, Aining, Zhang, Qizhi, Wu, Di, and Zhan, Choujun
- Abstract
Accurate prediction of a stock price is a challenging task due to the complexity, chaos, and non-linearity nature of financial systems. In this brief, we proposed a multi-indicator feature selection method for stock price prediction based on Pearson correlation coefficient (PCC) and Broad Learning System (BLS), named the PCC-BLS framework. Firstly, PCC was used to select the input features from 35 features, including original stock price, technical indicators, and financial indicators. Secondly, these screened input features were used for rapid information feature extraction and training a BLS. Four stocks recorded on the Shanghai Stock Exchange or Shenzhen Stock Exchange were adopted to evaluate the performance of the proposed method. In addition, we compared the forecasting results with ten machine learning methods, including Support Vector Regression (SVR), Adaptive Boosting (Adaboost), Bootstrap aggregating (Bagging), Random Forest (RF), Gradient Boosting Decision Tree (GBDT), Multi-layer Perceptron (MLP), Convolutional Neural Network (CNN), and Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU) and Broad Learning System (BLS). Among all algorithms used in this brief, the proposed model showed the best performance with the highest model fitting ability. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
31. Cognitive Balance for Fog Computing Resource in Internet of Things: An Edge Learning Approach.
- Author
-
Liao, Siyi, Wu, Jun, Mumtaz, Shahid, Li, Jianhua, Morello, Rosario, and Guizani, Mohsen
- Subjects
INTERNET of things ,COGNITIVE computing ,MACHINE learning ,QUALITY of service ,COGNITIVE science ,COMPUTER architecture - Abstract
Currently, the highly dynamic fog computing resource requirements introduced by the diverse services of the Internet of Things (IoT) result in an imbalance between computing resource providers and consumers. However, current computing resource scheduling schemes cannot cognize the dynamic resources available and do not possess decision-making or management capabilities, which leads to inefficient use of computing resources and a decreased quality of service (QoS). Balancing computing resources cognitively at the IoT edge remains unresolved. In this paper, a cognition-centric fog computing resource balancing (CFCRB) scheme is proposed for edge intelligence-enabled IoT. First, we propose a cognitive balance architecture with a cognition plane, which includes service demand monitoring, policy processing and knowledge storage of cognitive fog resources. Second, we propose the fog functions structure with sensing, interaction and learning functionalities, realizing the knowledge-based proactive discovery and dynamic orchestration of resource sharing nodes. Finally, a distributed edge learning algorithm is proposed to construct knowledge of the balance between computing resource helpers and requesters in cognitive fogs, which is further proved with mathematics. The simulation results indicate the efficiency of the proposed scheme. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
32. A Deep Transfer Learning Based Architecture for Brain Tumor Classification Using MR Images.
- Author
-
Badjie, Bakary and Ülker, Ezgi Deniz
- Subjects
DEEP learning ,BRAIN tumors ,TUMOR classification ,MAGNETIC resonance imaging ,CONVOLUTIONAL neural networks ,MACHINE learning - Abstract
Deep Learning (DL) is becoming more popular in the healthcare sectors due to the exponential growth of data availability and its excellent performance in diagnosing various diseases. This paper has aimed to design a new possible brain tumor diagnostic model to improve accuracy and reliability of radiology. In this paper, an advanced deep learning algorithm is used to detect and classify brain tumors in magnetic resonance (MR) images. Diagnosing brain tumors in radiology is a significant issue, yet it is a difficult and time-consuming procedure that radiologists must pass through. The reliability of their assessment relies completely on their knowledge and personal judgements which are in most cases inaccurate. As a possible remedy to the growing concern in diagnosing brain tumors accurately, in this work a deep learning method is applied to classify the brain tumor MR images with very high performance accuracy. The research leveraged a transfer learning model known as AlexNet's convolutional neural network (CNN) to perform this operation. Our method helps to improve robustness, efficiencies and accuracy in the healthcare sector with the ability to automate the entire diagnostic process with the overall accuracy of 99.62%. Additionally, our model has the ability to detect and classify tumors at their different stages and magnitudes. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
33. Dynamic Double Classifiers Approximation for Cross-Domain Recognition.
- Author
-
Fang, Xiaozhao, Han, Na, Zhou, Guoxu, Teng, Shohua, Xu, Yong, and Xie, Shenli
- Abstract
In general, existing cross-domain recognition methods mainly focus on changing the feature representation of data or modifying the classifier parameter and their efficiencies are indicated by the better performance. However, most existing methods do not simultaneously integrate them into a unified optimization objective for further improving the learning efficiency. In this article, we propose a novel cross-domain recognition algorithm framework by integrating both of them. Specifically, we reduce the discrepancies in both the conditional distribution and marginal distribution between different domains in order to learn a new feature representation which pulls the data from different domains closer on the whole. However, the data from different domains but the same class cannot interlace together enough and thus it is not reasonable to mix them for training a single classifier. To this end, we further propose to learn double classifiers on the respective domain and require that they dynamically approximate to each other during learning. This guarantees that we finally learn a suitable classifier from the double classifiers by using the strategy of classifier fusion. The experiments show that the proposed method outperforms over the state-of-the-art methods. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
34. Large-Scale Nonlinear AUC Maximization via Triply Stochastic Gradients.
- Author
-
Dang, Zhiyuan, Li, Xiang, Gu, Bin, Deng, Cheng, and Huang, Heng
- Subjects
- *
KERNEL functions , *MACHINE learning , *NONLINEAR equations , *SCALABILITY , *APPROXIMATION algorithms - Abstract
Learning to improve AUC performance for imbalanced data is an important machine learning research problem. Most methods of AUC maximization assume that the model function is linear in the original feature space. However, this assumption is not suitable for nonlinear separable problems. Although there have been some nonlinear methods of AUC maximization, scaling up nonlinear AUC maximization is still an open question. To address this challenging problem, in this paper, we propose a novel large-scale nonlinear AUC maximization method (named as TSAM) based on the triply stochastic gradient descents. Specifically, we first use the random Fourier feature to approximate the kernel function. After that, we use the triply stochastic gradients w.r.t. the pairwise loss and random feature to iteratively update the solution. Finally, we prove that TSAM converges to the optimal solution with the rate of $ \mathcal {O}(1/t)$ O (1 / t) after $t$ t iterations. Experimental results on a variety of benchmark datasets not only confirm the scalability of TSAM, but also show a significant reduction of computational time compared with existing batch learning algorithms, while retaining the similar generalization performance. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
35. On the Sufficient Condition for Solving the Gap-Filling Problem Using Deep Convolutional Neural Networks.
- Author
-
Peppert, Felix, Von Kleist, Max, Schutte, Christof, and Sunkara, Vikram
- Subjects
- *
CONVOLUTIONAL neural networks , *COMPUTER vision , *PROBLEM solving , *IMAGE segmentation , *MATHEMATICAL proofs - Abstract
Deep convolutional neural networks (DCNNs) are routinely used for image segmentation of biomedical data sets to obtain quantitative measurements of cellular structures like tissues. These cellular structures often contain gaps in their boundaries, leading to poor segmentation performance when using DCNNs like the U-Net. The gaps can usually be corrected by post-hoc computer vision (CV) steps, which are specific to the data set and require a disproportionate amount of work. As DCNNs are Universal Function Approximators, it is conceivable that the corrections should be obsolete by selecting the appropriate architecture for the DCNN. In this article, we present a novel theoretical framework for the gap-filling problem in DCNNs that allows the selection of architecture to circumvent the CV steps. Combining information-theoretic measures of the data set with a fundamental property of DCNNs, the size of their receptive field, allows us to formulate statements about the solvability of the gap-filling problem independent of the specifics of model training. In particular, we obtain mathematical proof showing that the maximum proficiency of filling a gap by a DCNN is achieved if its receptive field is larger than the gap length. We then demonstrate the consequence of this result using numerical experiments on a synthetic and real data set and compare the gap-filling ability of the ubiquitous U-Net architecture with variable depths. Our code is available at https://github.com/ai-biology/dcnn-gap-filling. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
36. Prediction With Unpredictable Feature Evolution.
- Author
-
Hou, Bo-Jian, Zhang, Lijun, and Zhou, Zhi-Hua
- Subjects
- *
FORECASTING , *LEARNING goals - Abstract
Learning with feature evolution studies the scenario where the features of the data streams can evolve, i.e., old features vanish and new features emerge. Its goal is to keep the model always performing well even when the features happen to evolve. To tackle this problem, canonical methods assume that the old features will vanish simultaneously and the new features themselves will emerge simultaneously as well. They also assume that there is an overlapping period where old and new features both exist when the feature space starts to change. However, in reality, the feature evolution could be unpredictable, which means that the features can vanish or emerge arbitrarily, causing the overlapping period incomplete. In this article, we propose a novel paradigm: prediction with unpredictable feature evolution (PUFE) where the feature evolution is unpredictable. To address this problem, we fill the incomplete overlapping period and formulate it as a new matrix completion problem. We give a theoretical bound on the least number of observed entries to make the overlapping period intact. With this intact overlapping period, we leverage an ensemble method to take the advantage of both the old and new feature spaces without manually deciding which base models should be incorporated. Theoretical and experimental results validate that our method can always follow the best base models and, thus, realize the goal of learning with feature evolution. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
37. Face Sketch Synthesis Using Regularized Broad Learning System.
- Author
-
Li, Ping, Sheng, Bin, and Chen, C. L. Philip
- Subjects
- *
INSTRUCTIONAL systems , *MACHINE learning , *FEATURE selection , *COMPUTATIONAL complexity , *HUMAN facial recognition software , *FACE - Abstract
There are two main categories of face sketch synthesis: data- and model-driven. The data-driven method synthesizes sketches from training photograph–sketch patches at the cost of detail loss. The model-driven method can preserve more details, but the mapping from photographs to sketches is a time-consuming training process, especially when the deep structures require to be refined. We propose a face sketch synthesis method via regularized broad learning system (RBLS). The broad learning-based system directly transforms photographs into sketches with rich details preserved. Also, the incremental learning scheme of broad learning system (BLS) ensures that our method easily increases feature mappings and remodels the network without retraining when the extracted feature mapping nodes are not sufficient. Besides, a Bayesian estimation-based regularization is introduced with the BLS to aid further feature selection and improve the generalization ability and robustness. Various experiments on the CUHK student data set and Aleix Robert (AR) data set demonstrated the effectiveness and efficiency of our RBLS method. Unlike existing methods, our method synthesizes high-quality face sketches much efficiently and greatly reduces computational complexity both in the training and test processes. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
38. A High-Efficient Hybrid Physics-Informed Neural Networks Based on Convolutional Neural Network.
- Author
-
Fang, Zhiwei
- Subjects
- *
CONVOLUTIONAL neural networks , *FINITE volume method , *PARTIAL differential equations , *AUTOMATIC differentiation , *INVERSE problems , *MACHINE learning - Abstract
In this article, we develop a hybrid physics-informed neural network (hybrid PINN) for partial differential equations (PDEs). We borrow the idea from the convolutional neural network (CNN) and finite volume methods. Unlike the physics-informed neural network (PINN) and its variations, the method proposed in this article uses an approximation of the differential operator to solve the PDEs instead of automatic differentiation (AD). The approximation is given by a local fitting method, which is the main contribution of this article. As a result, our method has been proved to have a convergent rate. This will also avoid the issue that the neural network gives a bad prediction, which sometimes happened in PINN. To the author’s best knowledge, this is the first work that the machine learning PDE’s solver has a convergent rate, such as in numerical methods. The numerical experiments verify the correctness and efficiency of our algorithm. We also show that our method can be applied in inverse problems and surface PDEs, although without proof. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
39. Memory Attention Networks for Skeleton-Based Action Recognition.
- Author
-
Li, Ce, Xie, Chunyu, Zhang, Baochang, Han, Jungong, Zhen, Xiantong, and Chen, Jie
- Subjects
- *
CONVOLUTIONAL neural networks , *MEMORY , *MACHINE learning , *SOURCE code , *INFORMATION modeling , *SKELETON - Abstract
Skeleton-based action recognition has been extensively studied, but it remains an unsolved problem because of the complex variations of skeleton joints in 3-D spatiotemporal space. To handle this issue, we propose a newly temporal-then-spatial recalibration method named memory attention networks (MANs) and deploy MANs using the temporal attention recalibration module (TARM) and spatiotemporal convolution module (STCM). In the TARM, a novel temporal attention mechanism is built based on residual learning to recalibrate frames of skeleton data temporally. In the STCM, the recalibrated sequence is transformed or encoded as the input of CNNs to further model the spatiotemporal information of skeleton sequence. Based on MANs, a new collaborative memory fusion module (CMFM) is proposed to further improve the efficiency, leading to the collaborative MANs (C-MANs), trained with two streams of base MANs. TARM, STCM, and CMFM form a single network seamlessly and enable the whole network to be trained in an end-to-end fashion. Comparing with the state-of-the-art methods, MANs and C-MANs improve the performance significantly and achieve the best results on six data sets for action recognition. The source code has been made publicly available at https://github.com/memory-attention-networks. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
40. On Training Traffic Predictors via Broad Learning Structures: A Benchmark Study.
- Author
-
Liu, Di, Baldi, Simone, Yu, Wenwu, Cao, Jinde, and Huang, Wei
- Subjects
- *
RECURRENT neural networks , *MACHINE learning , *COMPUTING platforms , *MAGNITUDE (Mathematics) , *COMPUTER architecture - Abstract
A fast architecture for real-time (i.e., minute-based) training of a traffic predictor is studied, based on the so-called broad learning system (BLS) paradigm. The study uses various traffic datasets by the California Department of Transportation, and employs a variety of standard algorithms (LASSO regression, shallow and deep neural networks, stacked autoencoders, convolutional, and recurrent neural networks) for comparison purposes: all algorithms are implemented in MATLAB on the same computing platform. The study demonstrates a BLS training process two-three orders of magnitude faster (tens of seconds against tens-hundreds of thousands of seconds), allowing unprecedented real-time capabilities. Additional comparisons with the extreme learning machine architecture, a learning algorithm sharing some features with BLS, confirm the fast training of least-square training as compared to gradient training. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
41. Online Rule-Based Classifier Learning on Dynamic Unlabeled Multivariate Time Series Data.
- Author
-
He, Guoliang, Xin, Xin, Peng, Rong, Han, Min, Wang, Juan, and Wu, Xiaoqun
- Subjects
- *
TIME series analysis , *ONLINE education , *ONLINE algorithms , *CLASSIFICATION algorithms , *MACHINE learning , *NAIVE Bayes classification - Abstract
Traditional classification learning algorithms have several limitations: 1) they are time consuming for the large-scale training multivariate time-series (MTS) data, and unsuitable for the dynamically added training data; 2) as the number of the training MTS data becomes larger, they could not achieve the desired classification accuracy; 3) most of them do not consider how to make use of the unlabeled samples to enhance the classifier performance; and 4) due to the high dimension of MTS and complex relationship among variables, existing online learning algorithms are not effective to update shapelet-based association rules. Up to now, few work touched online classification learning for dynamically added unlabeled examples. To efficiently address these issues, we propose an online rule-based classifier learning framework on dynamically added unlabeled MTS data (ORCL-U). This framework integrates a confidence-based labeling strategy (CLS) and an online rule-based classifier learning approach (ORBCL). Extensive experiments on ten datasets show the effectiveness and efficiency of our proposed approach. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
42. Local Discriminant Subspace Learning for Gas Sensor Drift Problem.
- Author
-
Yi, Zhengkun, Shang, Wanfeng, Xu, Tiantian, Guo, Shifeng, and Wu, Xinyu
- Subjects
- *
GAS detectors , *FISHER discriminant analysis - Abstract
Sensor drift is one of the severe issues that gas sensors suffer from. To alleviate the sensor drift problem, a gas sensor drift compensation approach is proposed based on local discriminant subspace projection (LDSP). The proposed approach aims to find a subspace to reduce the distribution difference between two domains, i.e., the source and target domain. Similar to domain regularized component analysis (DRCA) which is a recently proposed sensor drift correction method, the mean distribution discrepancy is minimized in the common subspace in our approach. LDSP extends DRCA in two aspects, i.e., it not only takes the label information of the source data into consideration to reduce the possibility of the case that samples in the subspace with different class labels stay close to each other, but also borrows the idea of locality-preserving projection to deal with multimodal data. Specifically, inspired by local Fisher discriminant analysis (LFDA), the label information is utilized to maximize the local between-class variance of source data in the latent common subspace and simultaneously minimize the local within-class variance. The formulation of LDSP is a generalized eigenvalue problem that can be readily solved. The experimental results have shown the proposed method outperforms other gas sensor drift compensation methods in terms of classification accuracy on two public gas sensor drift datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
43. Analysis and Variants of Broad Learning System.
- Author
-
Zhang, Liang, Li, Jiahao, Lu, Guoqing, Shen, Peiyi, Bennamoun, Mohammed, Shah, Syed Afaq Ali, Miao, Qiguang, Zhu, Guangming, Li, Ping, and Lu, Xiaoyuan
- Subjects
- *
MACHINE learning , *INSTRUCTIONAL systems , *COMPRESSED sensing , *FEATURE extraction , *DEEP learning , *LEARNING ability , *STREET addresses - Abstract
The broad learning system (BLS) is designed based on the technology of compressed sensing and pseudo-inverse theory, and consists of feature nodes and enhancement nodes, has been proposed recently. Compared with the popular deep learning structures, such as deep neural networks, BLS has the ability of rapid incremental learning and can remodel the system without the usual tedious retraining process. However, given that BLS is still in its infancy, it still needs analysis, improvements, and verification. In this article, we first analyze the principle of fast incremental learning ability of BLS in depth. Second, in order to provide an in-depth analysis of the BLS structure, according to the novel structure design concept of deep neural networks, we present four brand-new BLS variant networks and their incremental realizations. Third, based on our analysis of the effect of feature nodes and enhancement nodes, a new BLS structure with a semantic feature extraction layer has been proposed, which is called SFEBLS. The experimental results show that SFEBLS and its variants can increase the accuracy rate on the NORB dataset 6.18%, Fashion-MNIST dataset by 3.15%, ORL data by 5.00%, street view house number dataset by 12.88%, and CIFAR-10 dataset by 18.42%, respectively, and the four brand-new BLS variant networks also obviously outperform the original BLS. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
44. Learning Meta-Distance for Sequences by Learning a Ground Metric via Virtual Sequence Regression.
- Author
-
Su, Bing and Wu, Ying
- Subjects
- *
METAGENOMICS , *SEQUENCE alignment , *ARTIFICIAL neural networks - Abstract
Distance between sequences is structural by nature because it needs to establish the temporal alignments among the temporally correlated vectors in sequences with varying lengths. Generally, distances for sequences heavily depend on the ground metric between the vectors in sequences to infer the alignments and hence can be viewed as meta-distances upon the ground metric. Learning such meta-distance from multi-dimensional sequences is appealing but challenging. We propose to learn the meta-distance through learning a ground metric for the vectors in sequences. The learning samples are sequences of vectors for which how the ground metric between vectors induces the meta-distance is given. The objective is that the meta-distance induced by the learned ground metric produces large values for sequences from different classes and small values for those from the same class. We formulate the ground metric as a parameter of the meta-distance and regress each sequence to an associated pre-generated virtual sequence w.r.t. the meta-distance, where the virtual sequences for sequences of different classes are well-separated. We develop general iterative solutions to learn both the Mahalanobis metric and the deep metric induced by a neural network for any ground-metric-based sequence distance. Experiments on several sequence datasets demonstrate the effectiveness and efficiency of the proposed methods. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
45. A BERT-based Question Answering Architecture for Spanish Language.
- Author
-
Gutierrez Ramos, Robert C., Calderón-Vilca, Hugo D., and Cárdenas-Mariño, Flor C.
- Subjects
SPANISH language ,LANGUAGE & languages ,ARTIFICIAL neural networks ,ARTIFICIAL intelligence ,MACHINE learning - Abstract
QA systems have had various approaches to achieve their goal of solving naturally formed questions, recent works use state of the art techniques such as neural networks, QA systems in different languages are increasing, as evidenced, they are advancing at different rates, despite the fact that there are efforts to increase research in this type of systems. In this research we analyze the main aspects of the contributions to Question Answering and present an architecture that is capable of answering questions in Spanish. The initial question is received by the system, which may or may not have a document corpus from which to extract the answer, if it does not have a specified document corpus, the Answer Generation module returns the answer to the initial question. The purpose of the system is to provide answers to factoid questions posed by users through a web and mobile platform. BI-LSTM was used for document retrieval and BERT was used to generate the answers. We tested the architecture with ten thousand questions reaching an accuracy of 0.7856. The result improved by entering QA to a more specialized BERT model adapted for the Spanish language, the multilingual version of BERT and the Spanish version of BETO were used. [ABSTRACT FROM AUTHOR]
- Published
- 2022
46. Boltzmann Machine Learning and Regularization Methods for Inferring Evolutionary Fields and Couplings From a Multiple Sequence Alignment.
- Abstract
The inverse Potts problem to infer a Boltzmann distribution for homologous protein sequences from their single-site and pairwise amino acid frequencies recently attracts a great deal of attention in the studies of protein structure and evolution. We study regularization and learning methods and how to tune regularization parameters to correctly infer interactions in Boltzmann machine learning. Using $L_2$ L 2 regularization for fields, group $L_1$ L 1 for couplings is shown to be very effective for sparse couplings in comparison with $L_2$ L 2 and $L_1$ L 1 . Two regularization parameters are tuned to yield equal values for both the sample and ensemble averages of evolutionary energy. Both averages smoothly change and converge, but their learning profiles are very different between learning methods. The Adam method is modified to make stepsize proportional to the gradient for sparse couplings and to use a soft-thresholding function for group $L_1$ L 1 . It is shown by first inferring interactions from protein sequences and then from Monte Carlo samples that the fields and couplings can be well recovered, but that recovering the pairwise correlations in the resolution of a total energy is harder for the natural proteins than for the protein-like sequences. Selective temperature for folding/structural constrains in protein evolution is also estimated. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
47. Multi-View Representation Learning With Deep Gaussian Processes.
- Author
-
Sun, Shiliang, Dong, Wenbo, and Liu, Qiuyang
- Subjects
- *
DEEP learning , *MACHINE learning , *GLOBAL Positioning System - Abstract
Multi-view representation learning is a promising and challenging research topic, which aims to integrate multiple data information from different views to improve the learning performance. The recent deep Gaussian processes (DGPs) have the advantages of good uncertainty estimates, powerful non-linear mapping ability and great generalization capability, which can be used as an excellent data representation learning method. However, DGPs only focus on single view data and are rarely applied to the multi-view scenario. In this paper, we propose a multi-view representation learning algorithm with deep Gaussian processes (named MvDGPs), which inherits the advantages of deep Gaussian processes and multi-view representation learning, and can learn more effective representation of multi-view data. The MvDGPs consist of two stages. The first stage is multi-view data representation learning, which is mainly used to learn more comprehensive representations of multi-view data. The second stage is classifier design, which aims to select an appropriate classifier to better employ the representations obtained in the first stage. In contrast with DGPs, MvDGPs support asymmetrical modeling depths for different views of data, resulting in better characterizations of the discrepancies among different views. Experimental results on real-world multi-view data sets verify the effectiveness of the proposed algorithm, which indicates that MvDGPs can integrate the complementary information in multiple views to discover a good representation of the data. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
48. Margin Distribution Analysis.
- Author
-
Wang, Jun and Zhou, Zhi-Hua
- Subjects
- *
MACHINE learning , *LINEAR equations , *CONCEPT learning - Abstract
Margin is an important concept in machine learning; theoretical analyses further reveal that the distribution of margin plays a more critical role than the minimum margin in generalization power. Recently, several approaches have achieved performance breakthroughs by optimizing the margin distribution, but their computational cost, which is usually higher than before, still hinders them to be widely applied. In this article, we propose margin distribution analysis (MDA), which optimizes the margin distribution more simply by maximizing the margin mean and minimizing the margin variance simultaneously. MDA is efficient and resistive to class-imbalance naturally, since its objective distinguishes the margin means of different classes and can be broken up into two linear equations. In practice, it can also cooperate with other frameworks such as reweight-minimization when facing complex circumstances with noise and outliers. Empirical studies validate the superiority of MDA in real-world data sets, and demonstrate that simple approaches can also perform competitively by optimizing margin distribution. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
49. Global Negative Correlation Learning: A Unified Framework for Global Optimization of Ensemble Models.
- Author
-
Perales-Gonzalez, Carlos, Fernandez-Navarro, Francisco, Carbonero-Ruz, Mariano, and Perez-Rodriguez, Javier
- Subjects
- *
GLOBAL optimization , *ERROR functions , *ANALYTICAL solutions , *MACHINE learning , *LEARNING communities , *RADIAL basis functions - Abstract
Ensembles are a widely implemented approach in the machine learning community and their success is traditionally attributed to the diversity within the ensemble. Most of these approaches foster diversity in the ensemble by data sampling or by modifying the structure of the constituent models. Despite this, there is a family of ensemble models in which diversity is explicitly promoted in the error function of the individuals. The negative correlation learning (NCL) ensemble framework is probably the most well-known algorithm within this group of methods. This article analyzes NCL and reveals that the framework actually minimizes the combination of errors of the individuals of the ensemble instead of minimizing the residuals of the final ensemble. We propose a novel ensemble framework, named global negative correlation learning (GNCL), which focuses on the optimization of the global ensemble instead of the individual fitness of its components. An analytical solution for the parameters of base regressors based on the NCL framework and the global error function proposed is also provided under the assumption of fixed basis functions (although the general framework could also be instantiated for neural networks with nonfixed basis functions). The proposed ensemble framework is evaluated by extensive experiments with regression and classification data sets. Comparisons with other state-of-the-art ensemble methods confirm that GNCL yields the best overall performance. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
50. Online Reinforcement Learning Control by Direct Heuristic Dynamic Programming: From Time-Driven to Event-Driven.
- Author
-
Zhao, Qingtao, Si, Jennie, and Sun, Jian
- Subjects
- *
HEURISTIC programming , *DYNAMIC programming , *REINFORCEMENT learning , *ONLINE education , *LYAPUNOV functions , *MACHINE learning - Abstract
In this work, time-driven learning refers to the machine learning method that updates parameters in a prediction model continuously as new data arrives. Among existing approximate dynamic programming (ADP) and reinforcement learning (RL) algorithms, the direct heuristic dynamic programming (dHDP) has been shown an effective tool as demonstrated in solving several complex learning control problems. It continuously updates the control policy and the critic as system states continuously evolve. It is therefore desirable to prevent the time-driven dHDP from updating due to insignificant system event such as noise. Toward this goal, we propose a new event-driven dHDP. By constructing a Lyapunov function candidate, we prove the uniformly ultimately boundedness (UUB) of the system states and the weights in the critic and the control policy networks. Consequently, we show the approximate control and cost-to-go function approaching Bellman optimality within a finite bound. We also illustrate how the event-driven dHDP algorithm works in comparison to the original time-driven dHDP. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.