1,917 results
Search Results
2. Towards an Ontology of Requirements for Pervasive Games Based Learning Systems: A Requirements Engineering Perspective
- Author
-
Mejbri, Yemna, Khemaja, Maha, Raies, Kaouther, Akan, Ozgur, Series editor, Bellavista, Paolo, Series editor, Cao, Jiannong, Series editor, Coulson, Geoffrey, Series editor, Dressler, Falko, Series editor, Ferrari, Domenico, Series editor, Gerla, Mario, Series editor, Kobayashi, Hisashi, Series editor, Palazzo, Sergio, Series editor, Sahni, Sartaj, Series editor, Shen, Xuemin (Sherman), Series editor, Stan, Mircea, Series editor, Xiaohua, Jia, Series editor, Zomaya, Albert Y., Series editor, Vaz de Carvalho, Carlos, editor, Escudeiro, Paula, editor, and Coelho, António, editor
- Published
- 2017
- Full Text
- View/download PDF
3. Mobile, secure, and programmable networking : third international conference, MSPN 2017, Paris, France, June 29-30, 2017, revised selected papers
- Author
-
Bouzefrane, Samia, Banerjee, Soumya, Sailhan, Françoise, Boumerdassi, Selma, Renault, Eric, CEDRIC. Réseaux et Objets Connectés (CEDRIC - ROC), Centre d'études et de recherche en informatique et communications (CEDRIC), Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise (ENSIIE)-Conservatoire National des Arts et Métiers [CNAM] (CNAM)-Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise (ENSIIE)-Conservatoire National des Arts et Métiers [CNAM] (CNAM), Birla Institute of Technology [Mesra] (BIT Mesra), Réseaux, Systèmes, Services, Sécurité (R3S-SAMOVAR), Services répartis, Architectures, MOdélisation, Validation, Administration des Réseaux (SAMOVAR), Institut Mines-Télécom [Paris] (IMT)-Télécom SudParis (TSP)-Institut Mines-Télécom [Paris] (IMT)-Télécom SudParis (TSP), Département Réseaux et Services Multimédia Mobiles (RS2M), Institut Mines-Télécom [Paris] (IMT)-Télécom SudParis (TSP), Centre National de la Recherche Scientifique (CNRS), and Lecture Notes in Computer Science, volume 10566
- Subjects
Artificial intelligence ,Learning systems ,User inferfaces ,Applied computing ,Wireless sensor networks ,Telecommunication traffic ,Big data ,[INFO.INFO-NI]Computer Science [cs]/Networking and Internet Architecture [cs.NI] ,Feature selection ,Cryptography ,Wireless telecommunication systems ,Workflow system ,Data security ,ComputingMilieux_MISCELLANEOUS - Abstract
International audience
- Published
- 2017
4. A Three-Layered Mutually Reinforced Model for Personalized Citation Recommendation.
- Author
-
Cai, Xiaoyan, Han, Junwei, Li, Wenjie, Zhang, Renxian, Pan, Shirui, and Yang, Libin
- Subjects
COMPUTATIONAL complexity ,MACHINE learning ,CLUSTER analysis (Statistics) - Abstract
Fast-growing scientific papers pose the problem of rapidly and accurately finding a list of reference papers for a given manuscript. Citation recommendation is an indispensable technique to overcome this obstacle. In this paper, we propose a citation recommendation approach via mutual reinforcement on a three-layered graph, in which each paper, author or venue is represented as a vertex in the paper layer, author layer, and venue layer, respectively. For personalized recommendation, we initiate the random walk separately for each query researcher. However, this has a high computational complexity due to the large graph size. To solve this problem, we apply a three-layered interactive clustering approach to cluster related vertices in the graph. Personalized citation recommendations are then made on the subgraph, generated by the clusters associated with each researcher’s needs. When evaluated on the ACL anthology network, DBLP, and CiteSeer ML data sets, the performance of our proposed model-based citation recommendation approach is comparable with that of other state-of-the-art citation recommendation approaches. The results also demonstrate that the personalized recommendation approach is more effective than the nonpersonalized recommendation approach. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
5. Sentiment analysis and research based on two‐channel parallel hybrid neural network model with attention mechanism.
- Author
-
Chen, Na, Sun, Yanqiu, and Yan, Yan
- Subjects
CONVOLUTIONAL neural networks ,SENTIMENT analysis ,RECURRENT neural networks ,LANGUAGE models ,TRANSFORMER models - Abstract
Concerning the problems that the traditional Convolutional Neural Network (CNN) ignores contextual semantic information, and the traditional Recurrent Neural Network (RNN) has information memory loss and vanishing gradient, this paper proposes a Bi‐directional Encoder Representations from Transformers (BERT)‐based dual‐channel parallel hybrid neural network model for text sentiment analysis. The BERT model is used to convert text into word vectors; the dual‐channel parallel hybrid neural network model constructed by CNN and Bi‐directional Long Short‐Term Memory (BiLSTM) extracts local and global semantic features of the text, which can obtain more comprehensive sentiment features; the attention mechanism enables some words to get more attention that highlights important words and improves the model's sentiment classification ability. Finally, the dual‐channel output features are fused for sentiment classification. The experimental results on the hotel review datasets show that the Accuracy of the proposed model in sentiment classification reaches 92.35% and the F1 score reaches 91.59%. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
6. Dual Sparse Structured Subspaces and Graph Regularisation for Particle Swarm Optimisation-Based Multi-Label Feature Selection.
- Author
-
Demir, Kaan, Nguyen, Bach Hoai, Xue, Bing, and Zhang, Mengjie
- Abstract
Many real-world classification problems are becoming multi-label in nature, i.e., multiple class labels are assigned to an instance simultaneously. Multi-label classification is a challenging problem due to the involvement of three forms of interactions, i.e., feature-to-feature, feature-to-label, and label-to-label interactions. What further complicates the problem is that not all features are useful, and some can deteriorate the classification performance. Sparsity-based methods have been widely used to address multi-label feature selection due to their efficiency and effectiveness. However, most (if not all) existing methods do not consider the three forms of interactions simultaneously, which could hinder their ability to achieve good performance. Moreover, most existing methods are gradient-based, which are prone to getting stuck at local optima. This paper proposes a new sparsity-based feature selection approach that can simultaneously consider all three forms of interactions. Furthermore, this paper develops a novel sparse learning method based on particle swarm optimisation that can avoid local optima. The proposed method is compared against the state-of-the-art multi-label feature selection methods in terms of multi-label classification performance. The results show that our method performed significantly better in selecting high-quality feature subsets with respect to various feature subset sizes. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. A Novel Contact State Estimation Method for Robot Manipulation Skill Learning via Environment Dynamics and Constraints Modeling.
- Author
-
Liu, Xing, Huang, Panfeng, and Liu, Zhengxiong
- Subjects
CLASSROOM environment ,ROBOT motion ,ROBOTS ,GEOMETRIC modeling ,ROBOT control systems ,HUMAN beings - Abstract
Nowadays the robot manipulation skills are usually learned by human demonstration via trajectory-level learning, which somewhat lacks robustness and generalization. In this paper, we propose a novel contact state level learning method for robot manipulation skill acquisition via human demonstration. The robot-environment contact states are described via environment dynamics modelling and geometric constraints modelling for flexible contact and rigid contact cases, respectively. During human demonstration process, the robot-environment interaction force, the robot position, and velocity data are collected. After that, the environment dynamics and geometric constraints modelling methods are presented to determine the contact state changes during the robot manipulation process. Then the robot manipulator learns the contact state information rather than specific manipulation trajectory. On this basis, the manipulation control law using active exploration method is presented to control the robot during the button pressing process and peg-hole-insertion process, respectively. Finally, the performance of the presented methodology has been verified via experimental studies. Note to Practitioners—Intelligent robots will become the right assistants of human beings in the future, especially in various areas of manipulation occasions. The important premise of realizing this vision is that the robots should have certain ability of manipulation skill learning. A lot of research has been carried out in this field, many of which are focusing on trajectory level manipulation skill learning and reproduction. Other than the trajectory level learning, human beings can learn many other higher levels of manipulation skills, such as the contact state level and semantic level learning, which makes the learning results more robust and general. In this paper, the contact state estimation and learning method via environment dynamics and geometric constraints modelling is presented to learn the robot manipulation skill based on the contact state transition conditions. In this way, the robot needs less data in the skill learning process, and the trajectory level learning is avoided. After learning the contact state level manipulation skill, the lower trajectory level command is autonomously generated. Experiments on button pressing and peg-hole-insertion tasks by KUKA iiwa robot have obtained very good results. Other than the button pressing and peg-hole-insertion tasks, the presented methodology can be applied to many other manipulation tasks, as long as there are contact state changes in the manipulation process. The work of this paper lays a foundation for the robot learning of higher-level manipulation skills. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
8. Guest Editorial: Designing Technologies to Support Professional and Workplace Learning for Situated Practice.
- Author
-
Pammer-Schindler, Viktoria, Ley, Tobias, Kimmerle, Joachim, and Littlejohn, Allison
- Abstract
The papers in this special section focus on designing technologies to support professional and workplace learning in situated practices. In an era of global, organizational, and technological change, all of which are transforming the world of work, professional and workplace learning are critical for employability and organizational competitiveness. A range of fundamental transformations is changing how people work. Digital technologies are replacing human labor and, at the same time, are accelerating the expansion of job roles and work practices. Work is becoming increasingly specialized, which means that professionals in collaborative and networked ways across discipline and organization boundaries. In parallel, labor is increasingly decentralized, making decisionmaking more distributed and raising the need for remote communication and collaboration. Subsequently, work is becoming more independent from time and place, as people connect, collaborate, and work via digital technologies. These changes come with a need for substantial and continuous workplace learning, and with the need for changes in how workplace learning happens. Of course, digital technologies are already used to provide learning and training in workplaces. However, most of these learning technologies have been developed for formal education (e.g., K- 12 and higher education) rather than in workplace contexts. There is a need to understand and evidence workplace learning needs and to further develop technologies that can support and scale workplace learning. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
9. Autonomous Generation of Service Strategy for Household Tasks: A Progressive Learning Method With A Priori Knowledge and Reinforcement Learning.
- Author
-
Zhang, Mengyang, Tian, Guohui, Gao, Huanbing, and Zhang, Ying
- Subjects
REINFORCEMENT learning ,REWARD (Psychology) ,ARTIFICIAL neural networks - Abstract
Human beings tend to learn unknown knowledge in a gradual process, from the basic to the complex. Based on this point, we propose a progressive learning method for producing service strategies according to requests, with a hierarchical priori knowledge and reinforcement learning. Service strategy aims to guide how to perform home services and takes into consideration the relationship between actions and objects in home environment. In this paper, strategy generation is regarded as a text generation problem in question answering (QA). Firstly, a hierarchical priori knowledge with service-object correlation at the bottom and action-object correlation at the top is constructed to assist the understanding on the relationship of objects and actions in service strategies. Service-object correlation guides how to select proper objects with the correct order, while action-object correlation associates actions in strategies according to selected objects. Based on the hierarchical priori knowledge, a progressive learning method is proposed to make the model produce effective strategies with a sequential cognition, from service-object correlation (objects) to action-object correlation (actions). After that, reinforcement learning is employed to enhance the progressive guidance, by designing rewards in terms of the hierarchical priori knowledge. Finally, the proposed method is tested with both comparative experiments and ablation studies, and the experimental results demonstrate the superiority in producing comprehensive and logical strategies, indicating that the progressive learning method in our paper can further improve the QA performance. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
10. Robust Monocular Pose Tracking of Less-Distinct Objects Based on Contour-Part Model.
- Author
-
Sun, Xiaoliang, Zhou, Jiexin, Zhang, Wenlong, Wang, Zi, and Yu, Qifeng
- Subjects
MONOCULARS ,PROBLEM solving ,SINGLE-degree-of-freedom systems ,IMAGE segmentation - Abstract
Due to the lack of distinctiveness to the background, robust monocular 6DOF pose tracking of less-distinct objects remains an important open problem. In this paper, we firstly analyze the object distinctiveness in the tracking process. Then, we propose a novel contour-part model based robust monocular pose tracking method for less-distinct objects. This paper uses the traditional contour feature for pose tracking in a novel strategy called contour part model. First, the contour part model is built by segmenting the projected contour rendered from the 3D model into contour segments of a certain length adaptively according to the Shi-Tomasi cornerness scores. Then, the correspondence is detected in the input image for each contour part by gradient orientation based template matching. Finally, pose tracking is achieved by solving the PnP problem. Experiment results on semi-synthetic and real images show that the proposed method performs better than the existing methods when pose tracking less-distinct objects and shows great robustness toward interference. Additionally, we combine the edge feature and the regional feature in a simple strategy. The overall performance of the fused method is boosted according to the experimental results on the RBOT dataset. The fact shows that the fusion of the edge and regional features has great potential for improving tracking performance. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
11. Transparent human – (non-) transparent technology? The Janus-faced call for transparency in AI-based health care technologies.
- Author
-
Ott, Tabea and Dabrock, Peter
- Subjects
MEDICAL technology ,ARTIFICIAL intelligence ,DIGNITY ,MEDICAL care - Abstract
The use of Artificial Intelligence and Big Data in health care opens up new opportunities for the measurement of the human. Their application aims not only at gathering more and better data points but also at doing it less invasive. With this change in health care towards its extension to almost all areas of life and its increasing invisibility and opacity, new questions of transparency arise. While the complex human-machine interactions involved in deploying and using AI tend to become non-transparent, the use of these technologies makes the patient seemingly transparent. Papers on the ethical implementation of AI plead for transparency but neglect the factor of the “transparent patient” as intertwined with AI. Transparency in this regard appears to be Janus-faced: The precondition for receiving help - e.g., treatment advice regarding the own health - is to become transparent for the digitized health care system. That is, for instance, to donate data and become visible to the AI and its operators. The paper reflects on this entanglement of transparent patients and (non-) transparent technology. It argues that transparency regarding both AI and humans is not an ethical principle per se but an infraethical concept. Further, it is no sufficient basis for avoiding harm and human dignity violations. Rather, transparency must be enriched by intelligibility following Judith Butler’s use of the term. Intelligibility is understood as an epistemological presupposition for recognition and the ensuing humane treatment. Finally, the paper highlights ways to testify intelligibility in dealing with AI in health care ex ante, ex post, and continuously. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
12. On the Dynamics of Hopfield Neural Networks on Unit Quaternions.
- Author
-
Valle, Marcos Eduardo and de Castro, Fidelis Zanetti
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,NEURAL circuitry - Abstract
In this paper, we first address the dynamics of the elegant multivalued quaternionic Hopfield neural network (MV-QHNN) proposed by Minemoto et al. Contrary to what was expected, we show that the MV-QHNN, as well as one of its variation, does not always come to rest at an equilibrium state under the usual conditions. In fact, we provide simple examples in which the network yields a periodic sequence of quaternionic state vectors. Afterward, we turn our attention to the continuous-valued quaternionic Hopfield neural network (CV-QHNN), which can be derived from the MV-QHNN by means of a limit process. The CV-QHNN can be implemented more easily than the MV-QHNN model. Furthermore, the asynchronous CV-QHNN always settles down into an equilibrium state under the usual conditions. Theoretical issues are all illustrated by examples in this paper. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
13. Learning From Multiple Imperfect Instructors in Sensor Networks.
- Author
-
Virani, Nurali, Phoha, Shashi, and Ray, Asok
- Subjects
KERNEL (Mathematics) ,SEQUENTIAL learning ,MACHINE learning - Abstract
This paper presents a sequential learning framework for sensors in a network, where a few sensors assume the role of an instructor to train other sensors in the network. The instructors provide estimated labels for measurements of new sensors. These labels are possibly noisy, because a classifier of the instructor may not be perfect. A recursive density estimator is proposed to obtain the true measurement model (i.e., the observation density conditioned on the label) in spite of the training with noisy labels. Specifically, this paper answers the question “Can a sensor train other sensors?”, provides necessary conditions for sensors to act as instructors, presents a sequential learning framework using recursive nonparametric kernel density estimation, and provides a convergence rate for the expected error in an observation density. The underlying concepts are illustrated and validated with simulation results. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
14. Learning Entropy as a Learning-Based Information Concept
- Author
-
Ivo Bukovsky, Witold Kinsner, and Noriyasu Homma
- Subjects
Computer science ,General Physics and Astronomy ,lcsh:Astrophysics ,02 engineering and technology ,01 natural sciences ,Novelty detection ,010305 fluids & plasmas ,information ,learning systems ,lcsh:QB460-466 ,0103 physical sciences ,non-probabilistic entropy ,0202 electrical engineering, electronic engineering, information engineering ,Entropy (information theory) ,Learning based ,lcsh:Science ,learning ,business.industry ,Novelty ,Probabilistic logic ,Cognition ,Concept Paper ,lcsh:QC1-999 ,Informatics ,lcsh:Q ,020201 artificial intelligence & image processing ,Artificial intelligence ,Information measure ,business ,lcsh:Physics ,novelty detection - Abstract
Recently, a novel concept of a non-probabilistic novelty detection measure, based on a multi-scale quantification of unusually large learning efforts of machine learning systems, was introduced as learning entropy (LE). The key finding with LE is that the learning effort of learning systems is quantifiable as a novelty measure for each individually observed data point of otherwise complex dynamic systems, while the model accuracy is not a necessary requirement for novelty detection. This brief paper extends the explanation of LE from the point of an informatics approach towards a cognitive (learning-based) information measure emphasizing the distinction from Shannon’s concept of probabilistic information. Fundamental derivations of learning entropy and of its practical estimations are recalled and further extended. The potentials, limitations, and, thus, the current challenges of LE are discussed.
- Published
- 2019
15. A Deep Transfer Learning Based Architecture for Brain Tumor Classification Using MR Images.
- Author
-
Badjie, Bakary and Ülker, Ezgi Deniz
- Subjects
DEEP learning ,BRAIN tumors ,TUMOR classification ,MAGNETIC resonance imaging ,CONVOLUTIONAL neural networks ,MACHINE learning - Abstract
Deep Learning (DL) is becoming more popular in the healthcare sectors due to the exponential growth of data availability and its excellent performance in diagnosing various diseases. This paper has aimed to design a new possible brain tumor diagnostic model to improve accuracy and reliability of radiology. In this paper, an advanced deep learning algorithm is used to detect and classify brain tumors in magnetic resonance (MR) images. Diagnosing brain tumors in radiology is a significant issue, yet it is a difficult and time-consuming procedure that radiologists must pass through. The reliability of their assessment relies completely on their knowledge and personal judgements which are in most cases inaccurate. As a possible remedy to the growing concern in diagnosing brain tumors accurately, in this work a deep learning method is applied to classify the brain tumor MR images with very high performance accuracy. The research leveraged a transfer learning model known as AlexNet's convolutional neural network (CNN) to perform this operation. Our method helps to improve robustness, efficiencies and accuracy in the healthcare sector with the ability to automate the entire diagnostic process with the overall accuracy of 99.62%. Additionally, our model has the ability to detect and classify tumors at their different stages and magnitudes. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
16. ES-DQN: A Learning Method for Vehicle Intelligent Speed Control Strategy Under Uncertain Cut-In Scenario.
- Author
-
Chen, Qingyun, Zhao, Wanzhong, Li, Lin, Wang, Chunyan, and Chen, Feng
- Subjects
INTELLIGENT control systems ,REINFORCEMENT learning ,REWARD (Psychology) ,ADAPTIVE control systems ,AUTONOMOUS vehicles - Abstract
Uncertain cut-in maneuver of vehicles from adjacent lanes makes it difficult for vehicle's automatic speed control strategy to make judgments and effective control decisions. In this paper, an intelligent speed control strategy for uncertain cut-in scenarios is established based on a basic autonomous driving system. This strategy judges cut-in maneuver from surrounding vehicles and outputs adaptive control action under current environment according to Q value of state-action pair based on a Q network. In addition, according to the analysis of cut-in scenarios, the Q network is trained based on a novel reinforcement learning method named as experience screening deep Q-learning network (ES-DQN). The proposed ES-DQN is an extension of double deep Q-learning network (DDQN) algorithm, and includes two parts: experience screening and policy learning. Based on the experience screened from the experience screening part, the proposed learning method can train an intelligent speed control strategy which has stronger adaptability and control effect in uncertain cut-in scenarios. According to simulation results, the proposed intelligent speed control strategy trained by ES-DQN has better performance under uncertain cut-in scenarios than DDQN method and traditional ACC strategy. Meanwhile, by adjusting weight value in reward function, the system can realize different control target. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
17. Graph Lifelong Learning: A Survey.
- Author
-
Febrinanto, Falih Gozi, Xia, Feng, Moore, Kristen, Thapa, Chandra, and Aggarwal, Charu
- Abstract
Graph learning is a popular approach for perfor ming machine learning on graph-structured data. It has revolutionized the machine learning ability to model graph data to address downstream tasks. Its application is wide due to the availability of graph data ranging from all types of networks to information systems. Most graph learning methods assume that the graph is static and its complete structure is known during training. This limits their applicability since they cannot be applied to problems where the underlying graph grows over time and/or new tasks emerge incrementally. Such applications require a lifelong learning approach that can learn the graph continuously and accommodate new information whilst retaining previously learned knowledge. Lifelong learning methods that enable continuous learning in regular domains like images and text cannot be directly applied to continuously evolving graph data, due to its irregular structure. As a result, graph lifelong learning is gaining attention from the research community. This survey paper provides a comprehensive overview of recent advancements in graph lifelong learning, including the categorization of existing methods, and the discussions of potential applications and open research problems. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. Doubly Nonparametric Sparse Nonnegative Matrix Factorization Based on Dependent Indian Buffet Processes.
- Author
-
Xuan, Junyu, Lu, Jie, Zhang, Guangquan, Xu, Richard Yi Da, and Luo, Xiangfeng
- Subjects
SPARSE matrices ,FACTORIZATION ,CLUSTER theory (Nuclear physics) ,GAUSSIAN function ,COPULA functions - Abstract
Sparse nonnegative matrix factorization (SNMF) aims to factorize a data matrix into two optimized nonnegative sparse factor matrices, which could benefit many tasks, such as document-word co-clustering. However, the traditional SNMF typically assumes the number of latent factors (i.e., dimensionality of the factor matrices) to be fixed. This assumption makes it inflexible in practice. In this paper, we propose a doubly sparse nonparametric NMF framework to mitigate this issue by using dependent Indian buffet processes (dIBP). We apply a correlation function for the generation of two stick weights associated with each column pair of factor matrices while still maintaining their respective marginal distribution specified by IBP. As a consequence, the generation of two factor matrices will be columnwise correlated. Under this framework, two classes of correlation function are proposed: 1) using bivariate Beta distribution and 2) using Copula function. Compared with the single IBP-based NMF, this paper jointly makes two factor matrices nonparametric and sparse, which could be applied to broader scenarios, such as co-clustering. This paper is seen to be much more flexible than Gaussian process-based and hierarchial Beta process-based dIBPs in terms of allowing the two corresponding binary matrix columns to have greater variations in their nonzero entries. Our experiments on synthetic data show the merits of this paper compared with the state-of-the-art models in respect of factorization efficiency, sparsity, and flexibility. Experiments on real-world data sets demonstrate the efficiency of this paper in document-word co-clustering tasks. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
19. Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey.
- Author
-
Jing, Longlong and Tian, Yingli
- Subjects
VISUAL learning ,SUPERVISED learning ,DEEP learning ,COMPUTER vision - Abstract
Large-scale labeled data are generally required to train deep neural networks in order to obtain better performance in visual feature learning from images or videos for computer vision applications. To avoid extensive cost of collecting and annotating large-scale datasets, as a subset of unsupervised learning methods, self-supervised learning methods are proposed to learn general image and video features from large-scale unlabeled data without using any human-annotated labels. This paper provides an extensive review of deep learning-based self-supervised general visual feature learning methods from images or videos. First, the motivation, general pipeline, and terminologies of this field are described. Then the common deep neural network architectures that used for self-supervised learning are summarized. Next, the schema and evaluation metrics of self-supervised learning methods are reviewed followed by the commonly used datasets for images, videos, audios, and 3D data, as well as the existing self-supervised visual feature learning methods. Finally, quantitative performance comparisons of the reviewed methods on benchmark datasets are summarized and discussed for both image and video feature learning. At last, this paper is concluded and lists a set of promising future directions for self-supervised visual feature learning. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
20. On the Work of the Institute of Control Sciences of the Russian Academy of Sciences in the Field of Pattern Recognition Theory and Applications in the 20th Century.
- Author
-
Mandel, A. S. and Mikhalsky, A. I.
- Abstract
Brief historical background on the establishment and activities of the Institute of Control Sciences (IPU RAS). The paper presents key findings of the Institute (obtained mainly in the 20th century) in the field of pattern recognition and of related analysis of complex data. It focuses on four areas of research including (a) the method of potential functions, (b) the theory of learning and self-learning systems, (c) the generalized portrait method and recovery of dependences based on empirical data, and (d) automatic classification methods and expert classification analysis. Relations between these areas are studied. The pioneers in the field are named (M.A. Aizerman, E.M. Braverman, L.I. Rozonoer, Ya.Z. Tsypkin, V.N. Vapnik, A.Ya. Chervonenkis, I.B. Muchnik, and A.A. Dorofeyuk among others) and brief biographical notes on the life and scientific work of these scientists are presented. The follow-ups of the results thus obtained are shown. The bibliography of publications by the Institute's researchers in leading journals of Russia on pattern recognition problems and related complex data analysis tasks is provided. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
21. Adaptive stochastic model predictive control via network ensemble learning.
- Author
-
Xiong, Weiliang, He, Defeng, Mu, Jianbin, and Wang, Xiuli
- Subjects
STOCHASTIC models ,PREDICTION models ,LINEAR systems ,BAYESIAN analysis ,EXPONENTIAL stability ,MACHINE learning ,ADAPTIVE control systems ,FEEDFORWARD neural networks - Abstract
This paper proposes a novel ensemble learning-based adaptive stochastic model predictive control (SMPC) algorithm for constrained linear systems with unknown nonlinear terms and random disturbances. The ensemble network combining a feedforward neural network and a Bayesian network is used to offline learn the nonlinear dynamics and disturbance distribution parameters. Then, the mixed-tube scheme is designed to cope with input constraints and state chance constraints while decreasing computational demands and conservativeness. The reliability of the stochastic tube is guaranteed using the Hoeffding inequality-based verification mechanism, which results in a chance constraint with double probabilities. The feasibility and exponential stability of the SMPC are rigorously proven. A numerical example verifies the merits of the proposed algorithm in terms of the control performance and the feasible domain. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
22. On solving single elevator-like problems using a learning automata-based paradigm
- Author
-
Ghaleb, Omar and Oommen, B. John
- Published
- 2021
- Full Text
- View/download PDF
23. Guest Editorial Deep Learning Models for Industry Informatics.
- Author
-
Agrawal, Dharma Prakash, Gupta, Brij Bhooshan, Wang, Haoxiang, Chang, Xiaojun, Yamaguchi, Shingo, and Perez, Gregorio Martinez
- Abstract
This papers in this special issue mainly focus on deep learning models for industry informatics, addressing both original algorithmic development and new applications of deep learning. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
24. Analysis With Histogram of Connectivity: For Automated Evaluation of Piping Layout.
- Author
-
Tan, Wei Chian, Chen, I-Ming, Pan, Sinno Jialin, and Tan, Hoon Kiang
- Subjects
HISTOGRAMS ,PIPING -- Design & construction ,DESCRIPTOR systems ,SUPPORT vector machines ,MACHINE learning - Abstract
An autonomous framework to evaluate layout of a piping design in the form of piping and instrumentation diagram (P&ID) according to a set of standards of marine and offshore industry is proposed. The method starts with transforming a P&ID into a vector x in R^d . Transformation is done based on a concept introduced for piping known as Histogram of Connectivity. The proposed descriptor captures two essential properties of P&ID: attributes of each component and connectivity among the components. Next, linear support vector machine (SVM) is used to learn a classifier from existing compliant and noncompliant designs. Subsequently, the linear classifier can be used to check if an unseen design complies with the standards. In addition, to enable follow up on noncompliant design including correction or modification, a method to analyze the reason of noncompliance prediction by the learned SVM model is introduced. The method has demonstrated encouraging performance in two challenging data sets of designs created with advice from experienced engineers in the industry, based on International Convention for the Prevention of Pollution from Ships (MARPOL) and Rules for Classification of Ships of Lloyd’s Register. Note to Practitioners—This paper is motivated by need of marine and offshore industry for automated solution for design appraisal. This paper aims to address this issue by using a machine learning-based approach. Some compliant and noncompliant designs are provided to a developed algorithm for a machine (or computer) to learn. After learning is completed, the machine is able to classify unseen designs as compliant or noncompliant. As highlighted in this paper, the developed method has demonstrated encouraging performance in two case studies, including specific parts in MARPOL and Rules of Lloyd’s Register. For adoption by industry, necessary steps include collecting some designs (compliant and noncompliant) available in an organization and feeding these into the developed method for learning by machine before it can predict. With ability of highlighting possible connections that cause noncompliance, follow up and correction on a noncompliant design is made possible. [ABSTRACT FROM PUBLISHER]
- Published
- 2018
- Full Text
- View/download PDF
25. The Boundedness Conditions for Model-Free HDP($\lambda$).
- Author
-
Al-Dabooni, Seaar and Wunsch, Donald
- Subjects
LYAPUNOV stability ,NONLINEAR systems ,ARTIFICIAL neural networks ,PENDULUMS ,STABILITY criterion ,QUALITY factor ,ERROR correction (Information theory) - Abstract
This paper provides the stability analysis for a model-free action-dependent heuristic dynamic programing (HDP) approach with an eligibility trace long-term prediction parameter ($\lambda $). HDP($\lambda $) learns from more than one future reward. Eligibility traces have long been popular in Q-learning. This paper proves and demonstrates that they are worthwhile to use with HDP. In this paper, we prove its uniformly ultimately bounded (UUB) property under certain conditions. Previous works present a UUB proof for traditional HDP [HDP($\lambda =0$)], but we extend the proof with the $\lambda $ parameter. By using Lyapunov stability, we demonstrate the boundedness of the estimated error for the critic and actor neural networks as well as learning rate parameters. Three case studies demonstrate the effectiveness of HDP($\lambda $). The trajectories of the internal reinforcement signal nonlinear system are considered as the first case. We compare the results with the performance of HDP and traditional temporal difference [TD($\lambda $)] with different $\lambda $ values. The second case study is a single-link inverted pendulum. We investigate the performance of the inverted pendulum by comparing HDP($\lambda $) with regular HDP, with different levels of noise. The third case study is a 3-D maze navigation benchmark, which is compared with state action reward state action, Q($\lambda $), HDP, and HDP($\lambda $). All these simulation results illustrate that HDP($\lambda $) has a competitive performance; thus this contribution is not only UUB but also useful in comparison with traditional HDP. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
26. Dualityfree Methods for Stochastic Composition Optimization.
- Author
-
Liu, Liu, Liu, Ji, and Tao, Dacheng
- Subjects
REINFORCEMENT learning ,STATISTICAL learning ,MACHINE learning ,CONJUGATE gradient methods ,EMBEDDINGS (Mathematics) ,ARTIFICIAL intelligence ,ALGORITHMS - Abstract
In this paper, we consider the composition optimization with two expected-value functions in the form of $({1}/{n})\sum _{i = 1}^{n} F_{i}\left({({1}/{m})\sum _{j = 1}^{m} G_{j}(x)}\right)+R(x)$ , which formulates many important problems in statistical learning and machine learning such as solving Bellman equations in reinforcement learning and nonlinear embedding. Full gradient- or classical stochastic gradient descent-based optimization algorithms are unsuitable or computationally expensive to solve this problem due to the inner expectation $({1}/{m})\sum _{j = 1}^{m} G_{j}(x)$. We propose a dualityfree-based stochastic composition method that combines the variance reduction methods to address the stochastic composition problem. We apply the stochastic variance reduction gradient- and stochastic average gradient algorithm-based methods to estimate the inner function and the dualityfree method to estimate the outer function. We prove the linear convergence rate not only for the convex composition problem but also for the case that the individual outer functions are nonconvex, while the objective function is strongly convex. We also provide the results of experiments that show the effectiveness of our proposed methods. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
27. Optimization of Distributions Differences for Classification.
- Author
-
Bonyadi, Mohammad Reza, Tieng, Quang M., and Reutens, David C.
- Subjects
MULTIDISCIPLINARY design optimization ,DISTRIBUTION (Probability theory) ,QUASI-Newton methods - Abstract
In this paper, we introduce a new classification algorithm called the optimization of distribution differences (ODD). The algorithm aims to find a transformation from the feature space to a new space where the instances in the same class are as close as possible to one another, whereas the gravity centers of these classes are as far as possible from one another. This aim is formulated as a multiobjective optimization problem that is solved by a hybrid of an evolutionary strategy and the quasi-Newton method. The choice of the transformation function is flexible and could be any continuous space function. We experiment with a linear and a nonlinear transformation in this paper. We show that the algorithm can outperform eight other classification methods, namely naive Bayes, support vector machines, linear discriminant analysis, multilayer perceptrons, decision trees, and $k$ -nearest neighbors, and two recently proposed classification methods, in 12 standard classification data sets. Our results show that the method is less sensitive to the imbalanced number of instances compared with these methods. We also show that ODD maintains its performance better than other classification methods in these data sets and hence offers a better generalization ability. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
28. Theoretical Study of Oscillator Neurons in Recurrent Neural Networks.
- Author
-
Zhang, Lei, Yi, Zhang, and Amari, Shun-ichi
- Subjects
ARTIFICIAL neural networks ,NEURONS - Abstract
Neurons in a network can be both active or inactive. Given a subset of neurons in a network, is it possible for the subset of neurons to evolve to form an active oscillator by applying some external periodic stimulus? Furthermore, can these oscillator neurons be observable, that is, is it a stable oscillator? This paper explores such possibility, finding that an important property: any subset of neurons can be intermittently co-activated to form a stable oscillator by applying some external periodic input without any condition. Thus, the existing of intermittently active oscillator neurons is an essential property possessed by the networks. Moreover, this paper shows that, under some conditions, a subset of neurons can be fully co-activated to form a stable oscillator. Such neurons are called selectable oscillator neurons. Necessary and sufficient conditions are established for a subset of neurons to be selectable oscillator neurons in linear threshold recurrent neuron networks. It is proved that a subset of neurons forms selectable oscillator neurons if and only if the real part of each eigenvalue of the associated synaptic connection weight submatrix of the network is not larger than one. This simple condition makes the concept of selectable oscillator neurons tractable. The selectable oscillator neurons can be regarded as memories stored in the synaptic connections of networks, which enables to find a new perspective of memories in neural networks, different from the equilibrium-type attractors. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
29. Modified Sparse Linear-Discriminant Analysis via Nonconvex Penalties.
- Author
-
Cai, Jia and Huang, Xiaolin
- Subjects
DISCRIMINANT analysis ,ORTHOGONAL matching pursuit ,SPARSE approximations - Abstract
This paper considers the linear-discriminant analysis (LDA) problem in the undersampled situation, in which the number of features is very large and the number of observations is limited. Sparsity is often incorporated in the solution of LDA to make a well interpretation of the results. However, most of the existing sparse LDA algorithms pursue sparsity by means of the $\ell _{1}$ -norm. In this paper, we give elaborate analysis for nonconvex penalties, including the $\ell _{0}$ -based and the sorted $\ell _{1}$ -based LDA methods. The latter one can be regarded as a bridge between the $\ell _{0}$ and $\ell _{1}$ penalties. These nonconvex penalty-based LDA algorithms are evaluated on the gene expression array and face database, showing high classification accuracy on real-world problems. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
30. ST-InNet: Deep Spatio-Temporal Inception Networks for Traffic Flow Prediction in Smart Cities.
- Author
-
Dai, Fei, Huang, Penggui, Mo, Qi, Xu, Xiaolong, Bilal, Muhammad, and Song, Houbing
- Abstract
Traffic flow prediction plays a critical role in reducing traffic congestion in transportation systems. However, accurate traffic flow prediction becomes challenging due to the impact of complex spatio-temporal (ST) correlations and the diversity of ST correlations. When modeling complicated ST correlations, researchers usu did not take the diversity of ST correlations into consideration, resulting in poor prediction accuracy. In this paper, we propose ST-InNet, a deep spatio-temporal Inception network for collectively predicting traffic flow in each city region. Specifically, ST-InNet employs two Inception networks to simultaneously capture various spatial and temporal correlations of traffic data, including temporal closeness, temporal periodicity, nearby spatial dependencies, and distant spatial dependencies. For the diversity of spatial correlations, ST-InNet presents an improved variant of an Inception module to explicitly capture the different contributions of spatial correlations for each region. For the diversity of temporal correlations, ST-InNet designs a fusion component to explicitly model the varying contributions of temporal correlations on prediction. The experiments are conducted on a real-world traffic dataset in Nanjing, demonstrating that ST-InNet outperforms five state-of-the-art baselines in short-term and long-term traffic flow predictions with an average accuracy improvement of 32.09% and 30.97%, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
31. Interactive control algorithm for shoulder-amputated prosthesis and object based on reinforcement learning.
- Author
-
Li, Baojiang, Ye, Haiyan, Guo, Yuting, Wang, Haiyan, Qiu, Shengjie, and Bai, Jibo
- Subjects
- *
INTELLIGENT control systems , *REINFORCEMENT learning , *ARTIFICIAL hands , *MACHINE learning , *REINFORCEMENT (Psychology) , *RESIDUAL limbs , *PROSTHETICS - Abstract
As a prosthesis made to compensate for the residual loss of the amputee's limb, the shoulder disarticulation upper limb prosthesis replaces the missing arm function of the shoulder amputee to a certain extent. However, the current upper limb prosthesis mainly interacts with the outside world through the prosthetic hand for grasping and gripping, and the interaction between other parts and the environment is often neglected, which is not in line with the use habits of the human arm. To address this problem, this paper proposes a reinforcement learning–based method for controlling the forearm interaction of a shoulder-disconnected upper limb prosthesis, and analyzes and solves the forces during the interaction, reducing the impact of uncertainty on interaction actions and accelerating training while ensuring the stability of handheld items. We evaluated the performance of the control method during the interaction between the upper limb prosthesis and the external environment through simulation experiments. After the training, the bionic arm was able to push the object into the target range for different objects and pushing distance requirements, which showed the good control effect of the method. Also, the control method can be applied to improve the interaction between the robotic arm and the environment. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. Fast Matrix Factorization With Nonuniform Weights on Missing Data.
- Author
-
He, Xiangnan, Tang, Jinhui, Du, Xiaoyu, Hong, Richang, Ren, Tongwei, and Chua, Tat-Seng
- Subjects
MATRIX decomposition ,SINGULAR value decomposition ,STATISTICAL weighting ,DATA entry ,LEAST squares ,RECOMMENDER systems ,MACHINE learning - Abstract
Matrix factorization (MF) has been widely used to discover the low-rank structure and to predict the missing entries of data matrix. In many real-world learning systems, the data matrix can be very high dimensional but sparse. This poses an imbalanced learning problem since the scale of missing entries is usually much larger than that of the observed entries, but they cannot be ignored due to the valuable negative signal. For efficiency concern, existing work typically applies a uniform weight on missing entries to allow a fast learning algorithm. However, this simplification will decrease modeling fidelity, resulting in suboptimal performance for downstream applications. In this paper, we weight the missing data nonuniformly, and more generically, we allow any weighting strategy on the missing data. To address the efficiency challenge, we propose a fast learning method, for which the time complexity is determined by the number of observed entries in the data matrix rather than the matrix size. The key idea is twofold: 1) we apply truncated singular value decomposition on the weight matrix to get a more compact representation of the weights and 2) we learn MF parameters with elementwise alternating least squares (eALS) and memorize the key intermediate variables to avoid repeating computations that are unnecessary. We conduct extensive experiments on two recommendation benchmarks, demonstrating the correctness, efficiency, and effectiveness of our fast eALS method. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
33. Features Combined From Hundreds of Midlayers: Hierarchical Networks With Subnetwork Nodes.
- Author
-
Yang, Yimin and Wu, Q. M. Jonathan
- Subjects
LEARNING strategies ,ARTIFICIAL neural networks ,ITERATIVE learning control - Abstract
In this paper, we believe that the mixed selectivity of neuron in the top layer encodes distributed information produced from other neurons to offer a significant computational advantage over recognition accuracy. Thus, this paper proposes a hierarchical network framework that the learning behaviors of features combined from hundreds of midlayers. First, a subnetwork neuron, which itself could be constructed by other nodes, is functional as a subspace features extractor. The top layer of a hierarchical network needs subspace features produced by the subnetwork neurons to get rid of factors that are not relevant, but at the same time, to recast the subspace features into a mapping space so that the hierarchical network can be processed to generate more reliable cognition. Second, this paper shows that with noniterative learning strategy, the proposed method has a wider and shallower structure, providing a significant role in generalization performance improvements. Hence, compared with other state-of-the-art methods, multiple channel features with the proposed method could provide a comparable or even better performance, which dramatically boosts the learning speed. Our experimental results show that our platform can provide a much better generalization performance than 55 other state-of-the-art methods. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
34. Adaptive Neighborhood Metric Learning.
- Author
-
Song, Kun, Han, Junwei, Cheng, Gong, Lu, Jiwen, and Nie, Feiping
- Subjects
MACHINE learning ,NEIGHBORHOODS ,DEEP learning ,DISTANCE education - Abstract
In this paper, we reveal that metric learning would suffer from serious inseparable problem if without informative sample mining. Since the inseparable samples are often mixed with hard samples, current informative sample mining strategies used to deal with inseparable problem may bring up some side-effects, such as instability of objective function, etc. To alleviate this problem, we propose a novel distance metric learning algorithm, named adaptive neighborhood metric learning (ANML). In ANML, we design two thresholds to adaptively identify the inseparable similar and dissimilar samples in the training procedure, thus inseparable sample removing and metric parameter learning are implemented in the same procedure. Due to the non-continuity of the proposed ANML, we develop an ingenious function, named log-exp mean function to construct a continuous formulation to surrogate it, which can be efficiently solved by the gradient descent method. Similar to Triplet loss, ANML can be used to learn both the linear and deep embeddings. By analyzing the proposed method, we find it has some interesting properties. For example, when ANML is used to learn the linear embedding, current famous metric learning algorithms such as the large margin nearest neighbor (LMNN) and neighbourhood components analysis (NCA) are the special cases of the proposed ANML by setting the parameters different values. When it is used to learn deep features, the state-of-the-art deep metric learning algorithms such as Triplet loss, Lifted structure loss, and Multi-similarity loss become the special cases of ANML. Furthermore, the log-exp mean function proposed in our method gives a new perspective to review the deep metric learning methods such as Prox-NCA and N-pairs loss. At last, promising experimental results demonstrate the effectiveness of the proposed method. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
35. BTWalk: Branching Tree Random Walk for Multi-Order Structured Network Embedding.
- Author
-
Xiong, Hao and Yan, Junchi
- Subjects
TREE branches ,RANDOM walks ,LEARNING strategies ,SAMPLING (Process) ,TASK analysis - Abstract
Multi-order proximity is useful for effective network embedding. In contrast to many previous works that only consider order-level weights, this paper proposes to explore a more expressive node-level weighting mechanism to encode the diverse local structure, with a scalable and theoretically justified sampling strategy for its learning. Specifically, we start with a formal definition of multi-order proximity matrix which leads to our new multi-order objective based on Laplacian Eigenmaps and Skip-Gram. Then we instantiate the node-specific multi-order weights in the objective with the help of neighborhood size estimation, which indicates node-specific multi-order information. For objective learning, it is implicitly fulfilled with our proposed branching tree-like random walk strategy termed by BTWalk, which differs from the dominant chain-like walk in existing sampling techniques. BTWalk is designed by a synergetic combination of BFS (breadth-first search) and DFS (depth-first search), which is modulated according to the weights of the considered proximity orders. We theoretically analyze its cost-efficiency, and further propose the so-called Vec4Cross framework that incorporates joint node embedding and network alignment for two partially overlapped networks based on the seed matchings, whereby BTWalk is also adopted for embedding. Promising experimental results are obtained on real-world datasets across popular tasks. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
36. TDPN: Texture and Detail-Preserving Network for Single Image Super-Resolution.
- Author
-
Cai, Qing, Li, Jinxing, Li, Huafeng, Yang, Yee-Hong, Wu, Feng, and Zhang, David
- Subjects
HIGH resolution imaging ,CONVOLUTIONAL neural networks ,GENERATIVE adversarial networks ,SIGNAL-to-noise ratio - Abstract
Single image super-resolution (SISR) using deep convolutional neural networks (CNNs) achieves the state-of-the-art performance. Most existing SISR models mainly focus on pursuing high peak signal-to-noise ratio (PSNR) and neglect textures and details. As a result, the recovered images are often perceptually unpleasant. To address this issue, in this paper, we propose a texture and detail-preserving network (TDPN), which focuses not only on local region feature recovery but also on preserving textures and details. Specifically, the high-resolution image is recovered from its corresponding low-resolution input in two branches. First, a multi-reception field based branch is designed to let the network fully learn local region features by adaptively selecting local region features in different reception fields. Then, a texture and detail-learning branch supervised by the textures and details decomposed from the ground-truth high resolution image is proposed to provide additional textures and details for the super-resolution process to improve the perceptual quality. Finally, we introduce a gradient loss into the SISR field and define a novel hybrid loss to strengthen boundary information recovery and to avoid overly smooth boundary in the final recovered high-resolution image caused by using only the MAE loss. More importantly, the proposed method is model-agnostic, which can be applied to most off-the-shelf SISR networks. The experimental results on public datasets demonstrate the superiority of our TDPN on most state-of-the-art SISR methods in PSNR, SSIM and perceptual quality. We will share our code on https://github.com/tocaiqing/TDPN. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
37. PSO-Based Fuzzy Markup Language for Student Learning Performance Evaluation and Educational Application.
- Author
-
Lee, Chang-Shing, Wang, Mei-Hui, Wang, Chi-Shiang, Teytaud, Olivier, Liu, Jialin, Lin, Su-Wei, and Hung, Pi-Hsia
- Subjects
PARTICLE swarm optimization ,FUZZY sets ,XML (Extensible Markup Language) ,ITEM response theory ,BAYESIAN analysis - Abstract
Fuzzy relationships exist between students’ learning performance with various abilities and a test item. However, the challenges in implementing adaptive assessment agents are obtaining sufficient items, efficient and accurate computerized estimation, and a substantial feedback agent. Additionally, the agent must immediately estimate students’ ability item by item, which places a considerable burden on the server, especially for a group test. Hence, the implementation of an adaptive assessment agent is more difficult in practice. This paper proposes an agent with particle swarm optimization (PSO) based on a fuzzy markup language (FML) for students’ learning performance evaluation and educational applications, and the proposed agent is according to the response data from a conventional test and an item response theory (IRT)-based three-parameter logistic model. First, we apply a Gauss–Seidel based parameter estimation mechanism to estimate the items’ parameters according to the response data, and then to compare its results with those of an IRT-based Bayesian parameter estimation mechanism. In addition, we propose a static-IRT test assembly mechanism to assemble a form for the conventional test. The presented FML-based dynamic assessment mechanism infers the probability of making a correct response to the item for a student with various abilities. Moreover, this paper also proposes a novel PSO-based FML (PFML) learning mechanism for optimizing the parameters between items and students. Finally, we adopt a K-fold cross-validation mechanism to evaluate the performance of the proposed agent. Experimental results show that the novel PFML learning mechanism for the parameter estimation and learning optimization performs favorably. We believe the proposed PFML will be a reference for education research and pedagogy and an important colearning mechanism for future human–machine educational applications. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
38. SteeringLoss: A Cost-Sensitive Loss Function for the End-to-End Steering Estimation.
- Author
-
Yuan, Wei, Yang, Ming, Li, Hao, Wang, Chunxiang, and Wang, Bing
- Abstract
Imbalanced training is a challenge in the field of autonomous driving. For the steering estimation task, imbalanced training is the core reason that an end-to-end model cannot estimate sharp steering value well. Inspired by researches on the steering estimation, this paper proposes a novel loss function to train a high-performance end-to-end model for handling the imbalanced training problem, which is named SteeringLoss. Firstly, the imbalanced distribution of the steering value for driving datasets is analyzed, which is similar to the double long-tailed distribution. Secondly, with the feature of distribution, this paper designs a cost-sensitive loss function step by step, the new loss function can improve the impact of the sharp steering value while maintaining the impact of the small steering value. Thirdly, three typical steering estimation models are established with SteeringLoss for demonstration, including the CNN model, the CNN-LSTM model and the 3DCNN-LSTM model. Finally, three experiments are designed for SteeringLoss: Experiment I demonstrates SteeringLoss can avoid imbalanced training problem; Experiment II discusses the finetuning principle of SteeringLoss and gives the basic guideline for using SteeringLoss; Experiment III shows the results on different typical end-to-end steering estimation models, which shows effectiveness of SteeringLoss for all the models and gives different solutions for the steering estimation. Moreover, the SteeringLoss is suitable for the imbalanced training with similar distributions of datasets besides end-to-end steering estimation, which indicates the potential value of the research on SteeringLoss in the future. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
39. Data-Driven Learning Control for Stochastic Nonlinear Systems: Multiple Communication Constraints and Limited Storage.
- Author
-
Shen, Dong
- Subjects
DATA mining ,NONLINEAR systems ,ARTIFICIAL neural networks - Abstract
This paper proposes a data-driven learning control method for stochastic nonlinear systems under random communication conditions, including data dropouts, communication delays, and packet transmission disordering. A renewal mechanism is added to the buffer to regulate the arrived packets, and a recognition mechanism is introduced to the controller for the selection of suitable update packets. Both intermittent and successive update schemes are proposed based on the conventional P-type iterative learning control algorithm, and are shown to converge to the desired input with probability one. The convergence and effectiveness of the proposed algorithms are verified by means of illustrative simulations. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
40. A Pairwise Proximity Learning-Based Ant Colony Algorithm for Dynamic Vehicle Routing Problems.
- Author
-
Xiang, Xiaoshu, Tian, Ye, Zhang, Xingyi, Xiao, Jianhua, and Jin, Yaochu
- Abstract
Dynamic vehicle routing problems (DVRPs) have become a hot research topic due to their significance in logistics, although it is still very challenging for existing algorithms to solve DVRPs due to the dynamically changing customer requests during the optimization. In this paper, we propose a pairwise proximity learning-based ant colony algorithm, termed PPL-ACO, for tackling DVRPs. In PPL-ACO, a pairwise proximity learning method is suggested to predict the local visiting order of customers in the optimal route after the occurrence of changes, which is on the basis of learning from the optimal routes found before the changes occur. A radial basis function network is used to learn the local visiting order of customers based on the proximity between each pair of customer nodes, by which the optimal routes can be quickly tracked after changes occur. Experimental results on 22 popular DVRP instances show that the proposed PPL-ACO significantly outperforms four state-of-the-art approaches to DVRPs. More interestingly, the results on five large-scale DVRP instances demonstrate the superiority of the proposed PPL-ACO in solving large-scale DVPRs with up to 1000 customers. The results on a real case of Nankai Strict, Tianjin, China also verifies that the proposed PPL-ACO is more effective and efficient than the four compared approaches in solving real-world DVRPs. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
41. Byzantine-Resilient Decentralized Stochastic Gradient Descent.
- Author
-
Guo, Shangwei, Zhang, Tianwei, Yu, Han, Xie, Xiaofei, Ma, Lei, Xiang, Tao, and Liu, Yang
- Subjects
FAULT tolerance (Engineering) ,DEEP learning ,INSTRUCTIONAL systems - Abstract
Decentralized learning has gained great popularity to improve learning efficiency and preserve data privacy. Each computing node makes equal contribution to collaboratively learn a Deep Learning model. The elimination of centralized Parameter Servers (PS) can effectively address many issues such as privacy, performance bottleneck and single-point-failure. However, how to achieve Byzantine Fault Tolerance in decentralized learning systems is rarely explored, although this problem has been extensively studied in centralized systems. In this paper, we present an in-depth study towards the Byzantine resilience of decentralized learning systems with two contributions. First, from the adversarial perspective, we theoretically illustrate that Byzantine attacks are more dangerous and feasible in decentralized learning systems: even one malicious participant can arbitrarily alter the models of other participants by sending carefully crafted updates to its neighbors. Second, from the defense perspective, we propose Ubar, a novel algorithm to enhance decentralized learning with Byzantine Fault Tolerance. Specifically, Ubar provides a Uniform Byzantine-resilient Aggregation Rule for benign nodes to select the useful parameter updates and filter out the malicious ones in each training iteration. It guarantees that each benign node in a decentralized system can train a correct model under very strong Byzantine attacks with an arbitrary number of faulty nodes. We conduct extensive experiments on standard image classification tasks and the results indicate that Ubar can effectively defeat both simple and sophisticated Byzantine attacks with higher performance efficiency than existing solutions. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
42. Online Learning for Network Constrained Demand Response Pricing in Distribution Systems.
- Author
-
Mieth, Robert and Dvorkin, Yury
- Abstract
Flexible demand response (DR) resources can be leveraged to accommodate the stochasticity of some distributed energy resources. This paper develops an online learning approach that continuously estimates price sensitivities of residential DR participants and produces such price signals to the DR participants that ensure a desired level of DR capacity. The proposed learning approach incorporates the dispatch decisions on DR resources into the distributionally robust chance-constrained optimal power flow (OPF) framework. This integration is shown to adequately remunerate DR resources and co-optimize the dispatch of DR and conventional generation resources. The distributionally robust chance-constrained formulation only relies on empirical data acquired over time and makes no restrictive assumptions on the underlying distribution of the demand uncertainty. The distributional robustness also allows for robustifying the otpimal solution against systematically misestimating empirically learned parameters. The effectiveness of the proposed learning approach is shown via numerical experiments. The paper is accompanied by the code and data supplement released for public use. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
43. Addressing the Item Cold-Start Problem by Attribute-Driven Active Learning.
- Author
-
Zhu, Yu, Lin, Jinghao, He, Shibi, Wang, Beidou, Guan, Ziyu, Liu, Haifeng, and Cai, Deng
- Subjects
MACHINE learning ,FILTERING software ,RECOMMENDER systems ,TASK analysis ,MATRIX decomposition ,FORECASTING - Abstract
In recommender systems, cold-start issues are situations where no previous events, e.g., ratings, are known for certain users or items. In this paper, we focus on the item cold-start problem. Both content information (e.g., item attributes) and initial user ratings are valuable for seizing users’ preferences on a new item. However, previous methods for the item cold-start problem either (1) incorporate content information into collaborative filtering to perform hybrid recommendation, or (2) actively select users to rate the new item without considering content information and then do collaborative filtering. In this paper, we propose a novel recommendation scheme for the item cold-start problem by leveraging both active learning and items’ attribute information. Specifically, we design useful user selection criteria based on items’ attributes and users’ rating history, and combine the criteria in an optimization framework for selecting users. By exploiting the feedback ratings, users’ previous ratings and items’ attributes, we then generate accurate rating predictions for the other unselected users. Experimental results on two real-world datasets show the superiority of our proposed method over traditional methods. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
44. Multi-Role Project (MRP): A New Project-Based Learning Method for STEM.
- Author
-
Warin, Bruno, Talbi, Omar, Kolski, Christophe, and Hoogstoel, Frederic
- Subjects
STEM education ,SOFTWARE engineering education ,INTERNET content management systems ,KNOWLEDGE management ,EDUCATION research - Abstract
This paper presents the “Multi-Role Project” method (MRP), a broadly applicable project-based learning method, and describes its implementation and evaluation in the context of a Science, Technology, Engineering, and Mathematics (STEM) course. The MRP method is designed around a meta-principle that considers the project learning activity as a role-playing game based on two projects: a learning project and an engineering project. The meta-principle is complemented by five principles that provide a framework to guide the working practices of student teams: distribution of responsibilities; regular interactions and solicitations within the team; anticipation and continuous improvement; positive interdependence and alternating individual/collective work; and open communication and content management. This paper presents the implementation of MRP in a course teaching software engineering, UML language, and project management. The results show that MRP helped the course's students to acquire important professional knowledge and skills, experience near-real-world professional realities, and develop their abilities to work both in teams and autonomously. [ABSTRACT FROM PUBLISHER]
- Published
- 2016
- Full Text
- View/download PDF
45. Deep Learning Movement Intent Decoders Trained With Dataset Aggregation for Prosthetic Limb Control.
- Author
-
Dantas, Henrique, Warren, David J., Wendelken, Suzanne M., Davis, Tyler S., Clark, Gregory A., and Mathews, V John
- Subjects
MULTILAYER perceptrons ,ARTIFICIAL neural networks ,DEEP learning ,ARTIFICIAL hands ,KALMAN filtering ,BIOMEDICAL signal processing - Abstract
Significance: The performance of traditional approaches to decoding movement intent from electromyograms (EMGs) and other biological signals commonly degrade over time. Furthermore, conventional algorithms for training neural network based decoders may not perform well outside the domain of the state transitions observed during training. The work presented in this paper mitigates both these problems, resulting in an approach that has the potential to substantially improve the quality of life of the people with limb loss. Objective: This paper presents and evaluates the performance of four decoding methods for volitional movement intent from intramuscular EMG signals. Methods: The decoders are trained using the dataset aggregation (DAgger) algorithm, in which the training dataset is augmented during each training iteration based on the decoded estimates from previous iterations. Four competing decoding methods, namely polynomial Kalman filters (KFs), multilayer perceptron (MLP) networks, convolutional neural networks (CNN), and long short-term memory (LSTM) networks, were developed. The performances of the four decoding methods were evaluated using EMG datasets recorded from two human volunteers with transradial amputation. Short-term analyses, in which the training and cross-validation data came from the same dataset, and long-term analyses, in which the training and testing were done in different datasets, were performed. Results: Short-term analyses of the decoders demonstrated that CNN and MLP decoders performed significantly better than KF and LSTM decoders, showing an improvement of up to 60% in the normalized mean-square decoding error in cross-validation tests. Long-term analyses indicated that the CNN, MLP, and LSTM decoders performed significantly better than a KF-based decoder at most analyzed cases of temporal separations (0–150 days) between the acquisition of the training and testing datasets. Conclusion: The short-term and long-term performances of MLP- and CNN-based decoders trained with DAgger demonstrated their potential to provide more accurate and naturalistic control of prosthetic hands than alternate approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
46. Deep Learning for Fast and Spatially Constrained Tissue Quantification From Highly Accelerated Data in Magnetic Resonance Fingerprinting.
- Author
-
Fang, Zhenghan, Chen, Yong, Liu, Mingxia, Xiang, Lei, Zhang, Qian, Wang, Qian, Lin, Weili, and Shen, Dinggang
- Subjects
DEEP learning ,MAGNETIC resonance ,HUMAN body ,RESOURCE recovery facilities ,FEATURE extraction - Abstract
Magnetic resonance fingerprinting (MRF) is a quantitative imaging technique that can simultaneously measure multiple important tissue properties of human body. Although MRF has demonstrated improved scan efficiency as compared to conventional techniques, further acceleration is still desired for translation into routine clinical practice. The purpose of this paper is to accelerate MRF acquisition by developing a new tissue quantification method for MRF that allows accurate quantification with fewer sampling data. Most of the existing approaches use the MRF signal evolution at each individual pixel to estimate tissue properties, without considering the spatial association among neighboring pixels. In this paper, we propose a spatially constrained quantification method that uses the signals at multiple neighboring pixels to better estimate tissue properties at the central pixel. Specifically, we design a unique two-step deep learning model that learns the mapping from the observed signals to the desired properties for tissue quantification, i.e.: 1) with a feature extraction module for reducing the dimension of signals by extracting a low-dimensional feature vector from the high-dimensional signal evolution and 2) a spatially constrained quantification module for exploiting the spatial information from the extracted feature maps to generate the final tissue property map. A corresponding two-step training strategy is developed for network training. The proposed method is tested on highly undersampled MRF data acquired from human brains. Experimental results demonstrate that our method can achieve accurate quantification for T1 and T2 relaxation times by using only 1/4 time points of the original sequence (i.e., four times of acceleration for MRF acquisition). [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
47. Learning Analytics for Learning Design: A Systematic Literature Review of Analytics-Driven Design to Enhance Learning.
- Author
-
Mangaroska, Katerina and Giannakos, Michail
- Abstract
As the fields of learning analytics and learning design mature, the convergence and synergies between the two are becoming an important area for research. This paper intends to summarize the main outcomes of a systematic review of empirical evidence on learning analytics for learning design. Moreover, this paper presents an overview of what and how learning analytics have been used to inform learning design decisions and in what contexts. The search was performed in seven academic databases, resulting in 43 papers included in the main analysis. The results from the review depict the ongoing design patterns and learning phenomena that emerged from the synergy that learning analytics and learning design impose on the current status of learning technologies. Finally, this review stresses that future research should consider developing a framework on how to capture and systematize learning design data grounded in learning analytics and learning theory, and document what learning design choices made by educators influence subsequent learning activities and performances over time. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
48. Invertibility-Driven Interpolation Filter for Video Coding.
- Author
-
Yan, Ning, Liu, Dong, Li, Houqiang, Li, Bin, Li, Li, and Wu, Feng
- Subjects
HILBERT-Huang transform ,VIDEO coding ,INTERPOLATION algorithms ,INTERPOLATION - Abstract
Motion compensation with fractional motion vector has been widely utilized in the video coding standards. The fractional samples are usually generated by fractional interpolation filters. Traditional interpolation filters are usually designed based on the signal processing theory with the assumption of band-limited signal, which cannot effectively capture the non-stationary property of video content and cannot adapt to the variety of video quality. In this paper, we reveal an intuitive property of the fractional interpolation problem, named invertibility. That is, the fractional interpolation filters should not only generate fractional samples from integer samples but also recover the integer samples from the fractional samples in an invertible manner. We prove in theory that the invertibility in the spatial domain is equivalent to the constant magnitude in the Fourier transform domain. Driven by the invertibility, we then develop a learning-based method to solve the fractional interpolation problem. Inspired by the advances of convolutional neural network (CNN), we propose to establish an end-to-end scheme using CNN to train invertibility-driven interpolation filter (InvIF). Different from the previous learning-based methods, the proposed training scheme does not need hand-crafted “ground truth” of fractional samples. The proposed InvIF is integrated into high efficiency video coding (HEVC), and extensive experiments are conducted to verify its effectiveness. The experimental results show that the proposed method can achieve on average 4.7% and 3.6% BD-rate reduction compared with the HEVC anchor, under low-delay-B and random-access configurations, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
49. Model Learning for Multistep Backward Prediction in Dyna- ${Q}$ Learning.
- Author
-
Hwang, Kao-Shing, Jiang, Wei-Cheng, Chen, Yu-Jen, and Hwang, Iris
- Subjects
REINFORCEMENT learning ,SIMULATION methods & models - Abstract
A model-based reinforcement learning (RL) method which interplays direct and indirect learning to update ${Q}$ functions is proposed. The environment is approximated by a virtual model that can predict the transition to the next state and the reward of the domain. This virtual model is used to train ${Q}$ functions to accelerate policy learning. Lookup table methods are usually used to establish such environmental models, but these methods need to collect tremendous amounts of experiences to enumerate responses of the environment. In this paper, a stochastic model learning method based on tree structures is presented. To model the transition probability, an online clustering method is applied to equip the model learning method with the abilities to evaluate the transition probability. By the virtual model, the RL method produces simulated experience in the stage of indirect learning. Since simulated transitions and backups are more usefully focused by working backward from the state-action, the pair estimated ${Q}$ value of which changes significantly, the useful one-step backups are actions that lead directly into the one state whose value has already obviously been changed. This, however, may induce a false positive; that is, a backup state may be an invalid state, such as an absorbing or terminal state, especially in cases where the changes of ${Q}$ values at the planning stage are still needed to put back for ranking even though they are based on a simulated experience and are possibly erroneous. It is obvious that when the agent is attracted to generate simulated experience around the area of these absorbing states, the learning efficiency is deteriorated. This paper proposes three detecting methods to solve this problem. Moreover, the policy learning can speed up. The effectiveness and generality of our method is further demonstrated in three numerical simulations. The simulation results demonstrate that the training rate of our method is obviously improved. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
50. Domain Invariant and Class Discriminative Feature Learning for Visual Domain Adaptation.
- Author
-
Li, Shuang, Song, Shiji, Huang, Gao, Ding, Zhengming, and Wu, Cheng
- Subjects
INVARIANTS (Mathematics) ,LEARNING ,MATHEMATICAL symmetry ,DATA analysis ,ARTIFICIAL neural networks - Abstract
Domain adaptation manages to build an effective target classifier or regression model for unlabeled target data by utilizing the well-labeled source data but lying different distributions. Intuitively, to address domain shift problem, it is crucial to learn domain invariant features across domains, and most existing approaches have concentrated on it. However, they often do not directly constrain the learned features to be class discriminative for both source and target data, which is of vital importance for the final classification. Therefore, in this paper, we put forward a novel feature learning method for domain adaptation to construct both domain invariant and class discriminative representations, referred to as DICD. Specifically, DICD is to learn a latent feature space with important data properties preserved, which reduces the domain difference by jointly matching the marginal and class-conditional distributions of both domains, and simultaneously maximizes the inter-class dispersion and minimizes the intra-class scatter as much as possible. Experiments in this paper have demonstrated that the class discriminative properties will dramatically alleviate the cross-domain distribution inconsistency, which further boosts the classification performance. Moreover, we show that exploring both domain invariance and class discriminativeness of the learned representations can be integrated into one optimization framework, and the optimal solution can be derived effectively by solving a generalized eigen-decomposition problem. Comprehensive experiments on several visual cross-domain classification tasks verify that DICD can outperform the competitors significantly. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.