Publication Year Range: Last 50 years / Topic: learning systems - Searchworks@Jio Institute Digital Library Search Results

Showing total 2,795 results

Start Over Topic learning systems Publication Year Range Last 50 years

2,795 results

1. Empowering Positive Behaviors: A Gamification-Based Approach

Author: Garbayo-Ágreda, Iñigo, Padilla-Zea, Natalia, Burgos, Daniel, Barbosa, Simone Diniz Junqueira, Editorial Board Member, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Kotenko, Igor, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Yuan, Junsong, Founding Editor, Burgos, Daniel, editor, Cimitile, Marta, editor, Ducange, Pietro, editor, Pecori, Riccardo, editor, Picerno, Pietro, editor, Raviolo, Paolo, editor, and Stracke, Christian M., editor
Published: 2019
Full Text: View/download PDF

2. Towards an Ontology of Requirements for Pervasive Games Based Learning Systems: A Requirements Engineering Perspective

Author: Mejbri, Yemna, Khemaja, Maha, Raies, Kaouther, Akan, Ozgur, Series editor, Bellavista, Paolo, Series editor, Cao, Jiannong, Series editor, Coulson, Geoffrey, Series editor, Dressler, Falko, Series editor, Ferrari, Domenico, Series editor, Gerla, Mario, Series editor, Kobayashi, Hisashi, Series editor, Palazzo, Sergio, Series editor, Sahni, Sartaj, Series editor, Shen, Xuemin (Sherman), Series editor, Stan, Mircea, Series editor, Xiaohua, Jia, Series editor, Zomaya, Albert Y., Series editor, Vaz de Carvalho, Carlos, editor, Escudeiro, Paula, editor, and Coelho, António, editor
Published: 2017
Full Text: View/download PDF

3. Applying Reinforcement Learning to Multi-robot System Behavior Coordination

Author: Sanz, Yolanda, de Lope, Javier, Maravall, Darío, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Moreno-Díaz, Roberto, editor, Pichler, Franz, editor, and Quesada-Arencibia, Alexis, editor
Published: 2009
Full Text: View/download PDF

4. Understanding Collaborative Learning in Networked Organizations

Author: Gundry, John and Kaye, Anthony R., editor
Published: 1992
Full Text: View/download PDF

5. Mobile, secure, and programmable networking : third international conference, MSPN 2017, Paris, France, June 29-30, 2017, revised selected papers

Author: Bouzefrane, Samia, Banerjee, Soumya, Sailhan, Françoise, Boumerdassi, Selma, Renault, Eric, CEDRIC. Réseaux et Objets Connectés (CEDRIC - ROC), Centre d'études et de recherche en informatique et communications (CEDRIC), Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise (ENSIIE)-Conservatoire National des Arts et Métiers [CNAM] (CNAM)-Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise (ENSIIE)-Conservatoire National des Arts et Métiers [CNAM] (CNAM), Birla Institute of Technology [Mesra] (BIT Mesra), Réseaux, Systèmes, Services, Sécurité (R3S-SAMOVAR), Services répartis, Architectures, MOdélisation, Validation, Administration des Réseaux (SAMOVAR), Institut Mines-Télécom [Paris] (IMT)-Télécom SudParis (TSP)-Institut Mines-Télécom [Paris] (IMT)-Télécom SudParis (TSP), Département Réseaux et Services Multimédia Mobiles (RS2M), Institut Mines-Télécom [Paris] (IMT)-Télécom SudParis (TSP), Centre National de la Recherche Scientifique (CNRS), and Lecture Notes in Computer Science, volume 10566
Subjects: Artificial intelligence, Learning systems, User inferfaces, Applied computing, Wireless sensor networks, Telecommunication traffic, Big data, [INFO.INFO-NI]Computer Science [cs]/Networking and Internet Architecture [cs.NI], Feature selection, Cryptography, Wireless telecommunication systems, Workflow system, Data security, ComputingMilieux_MISCELLANEOUS
Abstract: International audience
Published: 2017

6. A Three-Layered Mutually Reinforced Model for Personalized Citation Recommendation.

Author: Cai, Xiaoyan, Han, Junwei, Li, Wenjie, Zhang, Renxian, Pan, Shirui, and Yang, Libin
Subjects: COMPUTATIONAL complexity, MACHINE learning, CLUSTER analysis (Statistics)
Abstract: Fast-growing scientific papers pose the problem of rapidly and accurately finding a list of reference papers for a given manuscript. Citation recommendation is an indispensable technique to overcome this obstacle. In this paper, we propose a citation recommendation approach via mutual reinforcement on a three-layered graph, in which each paper, author or venue is represented as a vertex in the paper layer, author layer, and venue layer, respectively. For personalized recommendation, we initiate the random walk separately for each query researcher. However, this has a high computational complexity due to the large graph size. To solve this problem, we apply a three-layered interactive clustering approach to cluster related vertices in the graph. Personalized citation recommendations are then made on the subgraph, generated by the clusters associated with each researcher’s needs. When evaluated on the ACL anthology network, DBLP, and CiteSeer ML data sets, the performance of our proposed model-based citation recommendation approach is comparable with that of other state-of-the-art citation recommendation approaches. The results also demonstrate that the personalized recommendation approach is more effective than the nonpersonalized recommendation approach. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

7. Sentiment analysis and research based on two‐channel parallel hybrid neural network model with attention mechanism.

Author: Chen, Na, Sun, Yanqiu, and Yan, Yan
Subjects: CONVOLUTIONAL neural networks, SENTIMENT analysis, RECURRENT neural networks, LANGUAGE models, TRANSFORMER models
Abstract: Concerning the problems that the traditional Convolutional Neural Network (CNN) ignores contextual semantic information, and the traditional Recurrent Neural Network (RNN) has information memory loss and vanishing gradient, this paper proposes a Bi‐directional Encoder Representations from Transformers (BERT)‐based dual‐channel parallel hybrid neural network model for text sentiment analysis. The BERT model is used to convert text into word vectors; the dual‐channel parallel hybrid neural network model constructed by CNN and Bi‐directional Long Short‐Term Memory (BiLSTM) extracts local and global semantic features of the text, which can obtain more comprehensive sentiment features; the attention mechanism enables some words to get more attention that highlights important words and improves the model's sentiment classification ability. Finally, the dual‐channel output features are fused for sentiment classification. The experimental results on the hotel review datasets show that the Accuracy of the proposed model in sentiment classification reaches 92.35% and the F1 score reaches 91.59%. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

8. V. Lifschitz, ed., formalizing common sense: papers by John McCarthy

Author: Varol Akman
Subjects: Computation theory, Problem solving, Artificial intelligence, Advice-taker systems, Learning systems, Knowledge representation, Pattern recognition, Computer programming, Formal logic, Common-sense reasoning
Abstract: A review is presented of Lifschitz's collection of seventeen papers written by McCarthy on the subject of common sense. The book opens with a fine overview of McCarthy's research in artificial intelligence (AI). Lifschitz offers an admirably succinct account of the development of McCarthy's ideas on common sense from the early days of AI to his current work. Lifschitz's introduction is especially useful in appreciating the dramatically original and permanently influential nature of McCarthy's work. While McCarthy's papers collected in this volume were written over the span of almost three decades, Lifschitz correctly observes that the underlying concern has always been the same: to understand and model the intellectual ability realized by human common sense.
Published: 1995

9. Fostering organizational sustainability through dialogic interaction

Author: Wals, Arjen E.J., Schwarzin, Lisa, and Smith, Peter A.C.
Published: 2012
Full Text: View/download PDF

10. Dual Sparse Structured Subspaces and Graph Regularisation for Particle Swarm Optimisation-Based Multi-Label Feature Selection.

Author: Demir, Kaan, Nguyen, Bach Hoai, Xue, Bing, and Zhang, Mengjie
Abstract: Many real-world classification problems are becoming multi-label in nature, i.e., multiple class labels are assigned to an instance simultaneously. Multi-label classification is a challenging problem due to the involvement of three forms of interactions, i.e., feature-to-feature, feature-to-label, and label-to-label interactions. What further complicates the problem is that not all features are useful, and some can deteriorate the classification performance. Sparsity-based methods have been widely used to address multi-label feature selection due to their efficiency and effectiveness. However, most (if not all) existing methods do not consider the three forms of interactions simultaneously, which could hinder their ability to achieve good performance. Moreover, most existing methods are gradient-based, which are prone to getting stuck at local optima. This paper proposes a new sparsity-based feature selection approach that can simultaneously consider all three forms of interactions. Furthermore, this paper develops a novel sparse learning method based on particle swarm optimisation that can avoid local optima. The proposed method is compared against the state-of-the-art multi-label feature selection methods in terms of multi-label classification performance. The results show that our method performed significantly better in selecting high-quality feature subsets with respect to various feature subset sizes. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

11. A Novel Contact State Estimation Method for Robot Manipulation Skill Learning via Environment Dynamics and Constraints Modeling.

Author: Liu, Xing, Huang, Panfeng, and Liu, Zhengxiong
Subjects: CLASSROOM environment, ROBOT motion, ROBOTS, GEOMETRIC modeling, ROBOT control systems, HUMAN beings
Abstract: Nowadays the robot manipulation skills are usually learned by human demonstration via trajectory-level learning, which somewhat lacks robustness and generalization. In this paper, we propose a novel contact state level learning method for robot manipulation skill acquisition via human demonstration. The robot-environment contact states are described via environment dynamics modelling and geometric constraints modelling for flexible contact and rigid contact cases, respectively. During human demonstration process, the robot-environment interaction force, the robot position, and velocity data are collected. After that, the environment dynamics and geometric constraints modelling methods are presented to determine the contact state changes during the robot manipulation process. Then the robot manipulator learns the contact state information rather than specific manipulation trajectory. On this basis, the manipulation control law using active exploration method is presented to control the robot during the button pressing process and peg-hole-insertion process, respectively. Finally, the performance of the presented methodology has been verified via experimental studies. Note to Practitioners—Intelligent robots will become the right assistants of human beings in the future, especially in various areas of manipulation occasions. The important premise of realizing this vision is that the robots should have certain ability of manipulation skill learning. A lot of research has been carried out in this field, many of which are focusing on trajectory level manipulation skill learning and reproduction. Other than the trajectory level learning, human beings can learn many other higher levels of manipulation skills, such as the contact state level and semantic level learning, which makes the learning results more robust and general. In this paper, the contact state estimation and learning method via environment dynamics and geometric constraints modelling is presented to learn the robot manipulation skill based on the contact state transition conditions. In this way, the robot needs less data in the skill learning process, and the trajectory level learning is avoided. After learning the contact state level manipulation skill, the lower trajectory level command is autonomously generated. Experiments on button pressing and peg-hole-insertion tasks by KUKA iiwa robot have obtained very good results. Other than the button pressing and peg-hole-insertion tasks, the presented methodology can be applied to many other manipulation tasks, as long as there are contact state changes in the manipulation process. The work of this paper lays a foundation for the robot learning of higher-level manipulation skills. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

12. Information extraction from semi-structured and un-structured documents using probabilistic context free grammar inference.

Author: Thakur, Ramesh, Jain, Suresh, Chaudhari, Narendra S., and Singhai, Rahul
Abstract: Large number of research papers are available in the form of un-structured (text) format. Knowledge discovery in un-structured document has been recognized as promising task. These documents are typically formatted for human viewing, which varies widely from document to document. Frequent change in their formatting causes difficulties in constructing a global schema. Thus, discovery of interesting rules from it is a complex and tedious process. Recently, conditional random fields (CRFs) and hand-coded wrappers have been used to label the text (such as Title, Author Name(s), Affiliation, Email, Contact number, etc. in research papers). In this paper we propose a novel hybrid approach to infer grammar rules using alignment similarity and probabilistic context free grammar. It helps in extracting desired information from the document. [ABSTRACT FROM PUBLISHER]
Published: 2012
Full Text: View/download PDF

13. Guest Editorial: Designing Technologies to Support Professional and Workplace Learning for Situated Practice.

Author: Pammer-Schindler, Viktoria, Ley, Tobias, Kimmerle, Joachim, and Littlejohn, Allison
Abstract: The papers in this special section focus on designing technologies to support professional and workplace learning in situated practices. In an era of global, organizational, and technological change, all of which are transforming the world of work, professional and workplace learning are critical for employability and organizational competitiveness. A range of fundamental transformations is changing how people work. Digital technologies are replacing human labor and, at the same time, are accelerating the expansion of job roles and work practices. Work is becoming increasingly specialized, which means that professionals in collaborative and networked ways across discipline and organization boundaries. In parallel, labor is increasingly decentralized, making decisionmaking more distributed and raising the need for remote communication and collaboration. Subsequently, work is becoming more independent from time and place, as people connect, collaborate, and work via digital technologies. These changes come with a need for substantial and continuous workplace learning, and with the need for changes in how workplace learning happens. Of course, digital technologies are already used to provide learning and training in workplaces. However, most of these learning technologies have been developed for formal education (e.g., K- 12 and higher education) rather than in workplace contexts. There is a need to understand and evidence workplace learning needs and to further develop technologies that can support and scale workplace learning. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

14. Autonomous Generation of Service Strategy for Household Tasks: A Progressive Learning Method With A Priori Knowledge and Reinforcement Learning.

Author: Zhang, Mengyang, Tian, Guohui, Gao, Huanbing, and Zhang, Ying
Subjects: REINFORCEMENT learning, REWARD (Psychology), ARTIFICIAL neural networks
Abstract: Human beings tend to learn unknown knowledge in a gradual process, from the basic to the complex. Based on this point, we propose a progressive learning method for producing service strategies according to requests, with a hierarchical priori knowledge and reinforcement learning. Service strategy aims to guide how to perform home services and takes into consideration the relationship between actions and objects in home environment. In this paper, strategy generation is regarded as a text generation problem in question answering (QA). Firstly, a hierarchical priori knowledge with service-object correlation at the bottom and action-object correlation at the top is constructed to assist the understanding on the relationship of objects and actions in service strategies. Service-object correlation guides how to select proper objects with the correct order, while action-object correlation associates actions in strategies according to selected objects. Based on the hierarchical priori knowledge, a progressive learning method is proposed to make the model produce effective strategies with a sequential cognition, from service-object correlation (objects) to action-object correlation (actions). After that, reinforcement learning is employed to enhance the progressive guidance, by designing rewards in terms of the hierarchical priori knowledge. Finally, the proposed method is tested with both comparative experiments and ablation studies, and the experimental results demonstrate the superiority in producing comprehensive and logical strategies, indicating that the progressive learning method in our paper can further improve the QA performance. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

15. Transparent human – (non-) transparent technology? The Janus-faced call for transparency in AI-based health care technologies.

Author: Ott, Tabea and Dabrock, Peter
Subjects: MEDICAL technology, ARTIFICIAL intelligence, DIGNITY, MEDICAL care
Abstract: The use of Artificial Intelligence and Big Data in health care opens up new opportunities for the measurement of the human. Their application aims not only at gathering more and better data points but also at doing it less invasive. With this change in health care towards its extension to almost all areas of life and its increasing invisibility and opacity, new questions of transparency arise. While the complex human-machine interactions involved in deploying and using AI tend to become non-transparent, the use of these technologies makes the patient seemingly transparent. Papers on the ethical implementation of AI plead for transparency but neglect the factor of the “transparent patient” as intertwined with AI. Transparency in this regard appears to be Janus-faced: The precondition for receiving help - e.g., treatment advice regarding the own health - is to become transparent for the digitized health care system. That is, for instance, to donate data and become visible to the AI and its operators. The paper reflects on this entanglement of transparent patients and (non-) transparent technology. It argues that transparency regarding both AI and humans is not an ethical principle per se but an infraethical concept. Further, it is no sufficient basis for avoiding harm and human dignity violations. Rather, transparency must be enriched by intelligibility following Judith Butler’s use of the term. Intelligibility is understood as an epistemological presupposition for recognition and the ensuing humane treatment. Finally, the paper highlights ways to testify intelligibility in dealing with AI in health care ex ante, ex post, and continuously. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

16. Robust Monocular Pose Tracking of Less-Distinct Objects Based on Contour-Part Model.

Author: Sun, Xiaoliang, Zhou, Jiexin, Zhang, Wenlong, Wang, Zi, and Yu, Qifeng
Subjects: MONOCULARS, PROBLEM solving, SINGLE-degree-of-freedom systems, IMAGE segmentation
Abstract: Due to the lack of distinctiveness to the background, robust monocular 6DOF pose tracking of less-distinct objects remains an important open problem. In this paper, we firstly analyze the object distinctiveness in the tracking process. Then, we propose a novel contour-part model based robust monocular pose tracking method for less-distinct objects. This paper uses the traditional contour feature for pose tracking in a novel strategy called contour part model. First, the contour part model is built by segmenting the projected contour rendered from the 3D model into contour segments of a certain length adaptively according to the Shi-Tomasi cornerness scores. Then, the correspondence is detected in the input image for each contour part by gradient orientation based template matching. Finally, pose tracking is achieved by solving the PnP problem. Experiment results on semi-synthetic and real images show that the proposed method performs better than the existing methods when pose tracking less-distinct objects and shows great robustness toward interference. Additionally, we combine the edge feature and the regional feature in a simple strategy. The overall performance of the fused method is boosted according to the experimental results on the RBOT dataset. The fact shows that the fusion of the edge and regional features has great potential for improving tracking performance. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

17. On the Dynamics of Hopfield Neural Networks on Unit Quaternions.

Author: Valle, Marcos Eduardo and de Castro, Fidelis Zanetti
Subjects: ARTIFICIAL neural networks, MACHINE learning, NEURAL circuitry
Abstract: In this paper, we first address the dynamics of the elegant multivalued quaternionic Hopfield neural network (MV-QHNN) proposed by Minemoto et al. Contrary to what was expected, we show that the MV-QHNN, as well as one of its variation, does not always come to rest at an equilibrium state under the usual conditions. In fact, we provide simple examples in which the network yields a periodic sequence of quaternionic state vectors. Afterward, we turn our attention to the continuous-valued quaternionic Hopfield neural network (CV-QHNN), which can be derived from the MV-QHNN by means of a limit process. The CV-QHNN can be implemented more easily than the MV-QHNN model. Furthermore, the asynchronous CV-QHNN always settles down into an equilibrium state under the usual conditions. Theoretical issues are all illustrated by examples in this paper. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

18. Learning From Multiple Imperfect Instructors in Sensor Networks.

Author: Virani, Nurali, Phoha, Shashi, and Ray, Asok
Subjects: KERNEL (Mathematics), SEQUENTIAL learning, MACHINE learning
Abstract: This paper presents a sequential learning framework for sensors in a network, where a few sensors assume the role of an instructor to train other sensors in the network. The instructors provide estimated labels for measurements of new sensors. These labels are possibly noisy, because a classifier of the instructor may not be perfect. A recursive density estimator is proposed to obtain the true measurement model (i.e., the observation density conditioned on the label) in spite of the training with noisy labels. Specifically, this paper answers the question “Can a sensor train other sensors?”, provides necessary conditions for sensors to act as instructors, presents a sequential learning framework using recursive nonparametric kernel density estimation, and provides a convergence rate for the expected error in an observation density. The underlying concepts are illustrated and validated with simulation results. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

19. On the Work of the Institute of Control Sciences of the Russian Academy of Sciences in the Field of Pattern Recognition Theory and Applications in the 20th Century.

Author: Mandel, A. S. and Mikhalsky, A. I.
Abstract: Brief historical background on the establishment and activities of the Institute of Control Sciences (IPU RAS). The paper presents key findings of the Institute (obtained mainly in the 20th century) in the field of pattern recognition and of related analysis of complex data. It focuses on four areas of research including (a) the method of potential functions, (b) the theory of learning and self-learning systems, (c) the generalized portrait method and recovery of dependences based on empirical data, and (d) automatic classification methods and expert classification analysis. Relations between these areas are studied. The pioneers in the field are named (M.A. Aizerman, E.M. Braverman, L.I. Rozonoer, Ya.Z. Tsypkin, V.N. Vapnik, A.Ya. Chervonenkis, I.B. Muchnik, and A.A. Dorofeyuk among others) and brief biographical notes on the life and scientific work of these scientists are presented. The follow-ups of the results thus obtained are shown. The bibliography of publications by the Institute's researchers in leading journals of Russia on pattern recognition problems and related complex data analysis tasks is provided. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

20. Adaptive stochastic model predictive control via network ensemble learning.

Author: Xiong, Weiliang, He, Defeng, Mu, Jianbin, and Wang, Xiuli
Subjects: STOCHASTIC models, PREDICTION models, LINEAR systems, BAYESIAN analysis, EXPONENTIAL stability, MACHINE learning, ADAPTIVE control systems, FEEDFORWARD neural networks
Abstract: This paper proposes a novel ensemble learning-based adaptive stochastic model predictive control (SMPC) algorithm for constrained linear systems with unknown nonlinear terms and random disturbances. The ensemble network combining a feedforward neural network and a Bayesian network is used to offline learn the nonlinear dynamics and disturbance distribution parameters. Then, the mixed-tube scheme is designed to cope with input constraints and state chance constraints while decreasing computational demands and conservativeness. The reliability of the stochastic tube is guaranteed using the Hoeffding inequality-based verification mechanism, which results in a chance constraint with double probabilities. The feasibility and exponential stability of the SMPC are rigorously proven. A numerical example verifies the merits of the proposed algorithm in terms of the control performance and the feasible domain. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

21. Learning Entropy as a Learning-Based Information Concept

Author: Ivo Bukovsky, Witold Kinsner, and Noriyasu Homma
Subjects: Computer science, General Physics and Astronomy, lcsh:Astrophysics, 02 engineering and technology, 01 natural sciences, Novelty detection, 010305 fluids & plasmas, information, learning systems, lcsh:QB460-466, 0103 physical sciences, non-probabilistic entropy, 0202 electrical engineering, electronic engineering, information engineering, Entropy (information theory), Learning based, lcsh:Science, learning, business.industry, Novelty, Probabilistic logic, Cognition, Concept Paper, lcsh:QC1-999, Informatics, lcsh:Q, 020201 artificial intelligence & image processing, Artificial intelligence, Information measure, business, lcsh:Physics, novelty detection
Abstract: Recently, a novel concept of a non-probabilistic novelty detection measure, based on a multi-scale quantification of unusually large learning efforts of machine learning systems, was introduced as learning entropy (LE). The key finding with LE is that the learning effort of learning systems is quantifiable as a novelty measure for each individually observed data point of otherwise complex dynamic systems, while the model accuracy is not a necessary requirement for novelty detection. This brief paper extends the explanation of LE from the point of an informatics approach towards a cognitive (learning-based) information measure emphasizing the distinction from Shannon’s concept of probabilistic information. Fundamental derivations of learning entropy and of its practical estimations are recalled and further extended. The potentials, limitations, and, thus, the current challenges of LE are discussed.
Published: 2019

22. A Deep Transfer Learning Based Architecture for Brain Tumor Classification Using MR Images.

Author: Badjie, Bakary and Ülker, Ezgi Deniz
Subjects: DEEP learning, BRAIN tumors, TUMOR classification, MAGNETIC resonance imaging, CONVOLUTIONAL neural networks, MACHINE learning
Abstract: Deep Learning (DL) is becoming more popular in the healthcare sectors due to the exponential growth of data availability and its excellent performance in diagnosing various diseases. This paper has aimed to design a new possible brain tumor diagnostic model to improve accuracy and reliability of radiology. In this paper, an advanced deep learning algorithm is used to detect and classify brain tumors in magnetic resonance (MR) images. Diagnosing brain tumors in radiology is a significant issue, yet it is a difficult and time-consuming procedure that radiologists must pass through. The reliability of their assessment relies completely on their knowledge and personal judgements which are in most cases inaccurate. As a possible remedy to the growing concern in diagnosing brain tumors accurately, in this work a deep learning method is applied to classify the brain tumor MR images with very high performance accuracy. The research leveraged a transfer learning model known as AlexNet's convolutional neural network (CNN) to perform this operation. Our method helps to improve robustness, efficiencies and accuracy in the healthcare sector with the ability to automate the entire diagnostic process with the overall accuracy of 99.62%. Additionally, our model has the ability to detect and classify tumors at their different stages and magnitudes. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

23. ES-DQN: A Learning Method for Vehicle Intelligent Speed Control Strategy Under Uncertain Cut-In Scenario.

Author: Chen, Qingyun, Zhao, Wanzhong, Li, Lin, Wang, Chunyan, and Chen, Feng
Subjects: INTELLIGENT control systems, REINFORCEMENT learning, REWARD (Psychology), ADAPTIVE control systems, AUTONOMOUS vehicles
Abstract: Uncertain cut-in maneuver of vehicles from adjacent lanes makes it difficult for vehicle's automatic speed control strategy to make judgments and effective control decisions. In this paper, an intelligent speed control strategy for uncertain cut-in scenarios is established based on a basic autonomous driving system. This strategy judges cut-in maneuver from surrounding vehicles and outputs adaptive control action under current environment according to Q value of state-action pair based on a Q network. In addition, according to the analysis of cut-in scenarios, the Q network is trained based on a novel reinforcement learning method named as experience screening deep Q-learning network (ES-DQN). The proposed ES-DQN is an extension of double deep Q-learning network (DDQN) algorithm, and includes two parts: experience screening and policy learning. Based on the experience screened from the experience screening part, the proposed learning method can train an intelligent speed control strategy which has stronger adaptability and control effect in uncertain cut-in scenarios. According to simulation results, the proposed intelligent speed control strategy trained by ES-DQN has better performance under uncertain cut-in scenarios than DDQN method and traditional ACC strategy. Meanwhile, by adjusting weight value in reward function, the system can realize different control target. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

24. Graph Lifelong Learning: A Survey.

Author: Febrinanto, Falih Gozi, Xia, Feng, Moore, Kristen, Thapa, Chandra, and Aggarwal, Charu
Abstract: Graph learning is a popular approach for perfor ming machine learning on graph-structured data. It has revolutionized the machine learning ability to model graph data to address downstream tasks. Its application is wide due to the availability of graph data ranging from all types of networks to information systems. Most graph learning methods assume that the graph is static and its complete structure is known during training. This limits their applicability since they cannot be applied to problems where the underlying graph grows over time and/or new tasks emerge incrementally. Such applications require a lifelong learning approach that can learn the graph continuously and accommodate new information whilst retaining previously learned knowledge. Lifelong learning methods that enable continuous learning in regular domains like images and text cannot be directly applied to continuously evolving graph data, due to its irregular structure. As a result, graph lifelong learning is gaining attention from the research community. This survey paper provides a comprehensive overview of recent advancements in graph lifelong learning, including the categorization of existing methods, and the discussions of potential applications and open research problems. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

25. Doubly Nonparametric Sparse Nonnegative Matrix Factorization Based on Dependent Indian Buffet Processes.

Author: Xuan, Junyu, Lu, Jie, Zhang, Guangquan, Xu, Richard Yi Da, and Luo, Xiangfeng
Subjects: SPARSE matrices, FACTORIZATION, CLUSTER theory (Nuclear physics), GAUSSIAN function, COPULA functions
Abstract: Sparse nonnegative matrix factorization (SNMF) aims to factorize a data matrix into two optimized nonnegative sparse factor matrices, which could benefit many tasks, such as document-word co-clustering. However, the traditional SNMF typically assumes the number of latent factors (i.e., dimensionality of the factor matrices) to be fixed. This assumption makes it inflexible in practice. In this paper, we propose a doubly sparse nonparametric NMF framework to mitigate this issue by using dependent Indian buffet processes (dIBP). We apply a correlation function for the generation of two stick weights associated with each column pair of factor matrices while still maintaining their respective marginal distribution specified by IBP. As a consequence, the generation of two factor matrices will be columnwise correlated. Under this framework, two classes of correlation function are proposed: 1) using bivariate Beta distribution and 2) using Copula function. Compared with the single IBP-based NMF, this paper jointly makes two factor matrices nonparametric and sparse, which could be applied to broader scenarios, such as co-clustering. This paper is seen to be much more flexible than Gaussian process-based and hierarchial Beta process-based dIBPs in terms of allowing the two corresponding binary matrix columns to have greater variations in their nonzero entries. Our experiments on synthetic data show the merits of this paper compared with the state-of-the-art models in respect of factorization efficiency, sparsity, and flexibility. Experiments on real-world data sets demonstrate the efficiency of this paper in document-word co-clustering tasks. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

26. On solving single elevator-like problems using a learning automata-based paradigm

Author: Ghaleb, Omar and Oommen, B. John
Published: 2021
Full Text: View/download PDF

27. Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey.

Author: Jing, Longlong and Tian, Yingli
Subjects: VISUAL learning, SUPERVISED learning, DEEP learning, COMPUTER vision
Abstract: Large-scale labeled data are generally required to train deep neural networks in order to obtain better performance in visual feature learning from images or videos for computer vision applications. To avoid extensive cost of collecting and annotating large-scale datasets, as a subset of unsupervised learning methods, self-supervised learning methods are proposed to learn general image and video features from large-scale unlabeled data without using any human-annotated labels. This paper provides an extensive review of deep learning-based self-supervised general visual feature learning methods from images or videos. First, the motivation, general pipeline, and terminologies of this field are described. Then the common deep neural network architectures that used for self-supervised learning are summarized. Next, the schema and evaluation metrics of self-supervised learning methods are reviewed followed by the commonly used datasets for images, videos, audios, and 3D data, as well as the existing self-supervised visual feature learning methods. Finally, quantitative performance comparisons of the reviewed methods on benchmark datasets are summarized and discussed for both image and video feature learning. At last, this paper is concluded and lists a set of promising future directions for self-supervised visual feature learning. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

28. Guest Editorial Deep Learning Models for Industry Informatics.

Author: Agrawal, Dharma Prakash, Gupta, Brij Bhooshan, Wang, Haoxiang, Chang, Xiaojun, Yamaguchi, Shingo, and Perez, Gregorio Martinez
Abstract: This papers in this special issue mainly focus on deep learning models for industry informatics, addressing both original algorithmic development and new applications of deep learning. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

29. Analysis With Histogram of Connectivity: For Automated Evaluation of Piping Layout.

Author: Tan, Wei Chian, Chen, I-Ming, Pan, Sinno Jialin, and Tan, Hoon Kiang
Subjects: HISTOGRAMS, PIPING -- Design & construction, DESCRIPTOR systems, SUPPORT vector machines, MACHINE learning
Abstract: An autonomous framework to evaluate layout of a piping design in the form of piping and instrumentation diagram (P&ID) according to a set of standards of marine and offshore industry is proposed. The method starts with transforming a P&ID into a vector x in R^d . Transformation is done based on a concept introduced for piping known as Histogram of Connectivity. The proposed descriptor captures two essential properties of P&ID: attributes of each component and connectivity among the components. Next, linear support vector machine (SVM) is used to learn a classifier from existing compliant and noncompliant designs. Subsequently, the linear classifier can be used to check if an unseen design complies with the standards. In addition, to enable follow up on noncompliant design including correction or modification, a method to analyze the reason of noncompliance prediction by the learned SVM model is introduced. The method has demonstrated encouraging performance in two challenging data sets of designs created with advice from experienced engineers in the industry, based on International Convention for the Prevention of Pollution from Ships (MARPOL) and Rules for Classification of Ships of Lloyd’s Register. Note to Practitioners—This paper is motivated by need of marine and offshore industry for automated solution for design appraisal. This paper aims to address this issue by using a machine learning-based approach. Some compliant and noncompliant designs are provided to a developed algorithm for a machine (or computer) to learn. After learning is completed, the machine is able to classify unseen designs as compliant or noncompliant. As highlighted in this paper, the developed method has demonstrated encouraging performance in two case studies, including specific parts in MARPOL and Rules of Lloyd’s Register. For adoption by industry, necessary steps include collecting some designs (compliant and noncompliant) available in an organization and feeding these into the developed method for learning by machine before it can predict. With ability of highlighting possible connections that cause noncompliance, follow up and correction on a noncompliant design is made possible. [ABSTRACT FROM PUBLISHER]
Published: 2018
Full Text: View/download PDF

30. The Boundedness Conditions for Model-Free HDP($\lambda$).

Author: Al-Dabooni, Seaar and Wunsch, Donald
Subjects: LYAPUNOV stability, NONLINEAR systems, ARTIFICIAL neural networks, PENDULUMS, STABILITY criterion, QUALITY factor, ERROR correction (Information theory)
Abstract: This paper provides the stability analysis for a model-free action-dependent heuristic dynamic programing (HDP) approach with an eligibility trace long-term prediction parameter ($\lambda $). HDP($\lambda $) learns from more than one future reward. Eligibility traces have long been popular in Q-learning. This paper proves and demonstrates that they are worthwhile to use with HDP. In this paper, we prove its uniformly ultimately bounded (UUB) property under certain conditions. Previous works present a UUB proof for traditional HDP [HDP($\lambda =0$)], but we extend the proof with the $\lambda $ parameter. By using Lyapunov stability, we demonstrate the boundedness of the estimated error for the critic and actor neural networks as well as learning rate parameters. Three case studies demonstrate the effectiveness of HDP($\lambda $). The trajectories of the internal reinforcement signal nonlinear system are considered as the first case. We compare the results with the performance of HDP and traditional temporal difference [TD($\lambda $)] with different $\lambda $ values. The second case study is a single-link inverted pendulum. We investigate the performance of the inverted pendulum by comparing HDP($\lambda $) with regular HDP, with different levels of noise. The third case study is a 3-D maze navigation benchmark, which is compared with state action reward state action, Q($\lambda $), HDP, and HDP($\lambda $). All these simulation results illustrate that HDP($\lambda $) has a competitive performance; thus this contribution is not only UUB but also useful in comparison with traditional HDP. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

31. Dualityfree Methods for Stochastic Composition Optimization.

Author: Liu, Liu, Liu, Ji, and Tao, Dacheng
Subjects: REINFORCEMENT learning, STATISTICAL learning, MACHINE learning, CONJUGATE gradient methods, EMBEDDINGS (Mathematics), ARTIFICIAL intelligence, ALGORITHMS
Abstract: In this paper, we consider the composition optimization with two expected-value functions in the form of $({1}/{n})\sum _{i = 1}^{n} F_{i}\left({({1}/{m})\sum _{j = 1}^{m} G_{j}(x)}\right)+R(x)$ , which formulates many important problems in statistical learning and machine learning such as solving Bellman equations in reinforcement learning and nonlinear embedding. Full gradient- or classical stochastic gradient descent-based optimization algorithms are unsuitable or computationally expensive to solve this problem due to the inner expectation $({1}/{m})\sum _{j = 1}^{m} G_{j}(x)$. We propose a dualityfree-based stochastic composition method that combines the variance reduction methods to address the stochastic composition problem. We apply the stochastic variance reduction gradient- and stochastic average gradient algorithm-based methods to estimate the inner function and the dualityfree method to estimate the outer function. We prove the linear convergence rate not only for the convex composition problem but also for the case that the individual outer functions are nonconvex, while the objective function is strongly convex. We also provide the results of experiments that show the effectiveness of our proposed methods. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

32. Optimization of Distributions Differences for Classification.

Author: Bonyadi, Mohammad Reza, Tieng, Quang M., and Reutens, David C.
Subjects: MULTIDISCIPLINARY design optimization, DISTRIBUTION (Probability theory), QUASI-Newton methods
Abstract: In this paper, we introduce a new classification algorithm called the optimization of distribution differences (ODD). The algorithm aims to find a transformation from the feature space to a new space where the instances in the same class are as close as possible to one another, whereas the gravity centers of these classes are as far as possible from one another. This aim is formulated as a multiobjective optimization problem that is solved by a hybrid of an evolutionary strategy and the quasi-Newton method. The choice of the transformation function is flexible and could be any continuous space function. We experiment with a linear and a nonlinear transformation in this paper. We show that the algorithm can outperform eight other classification methods, namely naive Bayes, support vector machines, linear discriminant analysis, multilayer perceptrons, decision trees, and $k$ -nearest neighbors, and two recently proposed classification methods, in 12 standard classification data sets. Our results show that the method is less sensitive to the imbalanced number of instances compared with these methods. We also show that ODD maintains its performance better than other classification methods in these data sets and hence offers a better generalization ability. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

33. Theoretical Study of Oscillator Neurons in Recurrent Neural Networks.

Author: Zhang, Lei, Yi, Zhang, and Amari, Shun-ichi
Subjects: ARTIFICIAL neural networks, NEURONS
Abstract: Neurons in a network can be both active or inactive. Given a subset of neurons in a network, is it possible for the subset of neurons to evolve to form an active oscillator by applying some external periodic stimulus? Furthermore, can these oscillator neurons be observable, that is, is it a stable oscillator? This paper explores such possibility, finding that an important property: any subset of neurons can be intermittently co-activated to form a stable oscillator by applying some external periodic input without any condition. Thus, the existing of intermittently active oscillator neurons is an essential property possessed by the networks. Moreover, this paper shows that, under some conditions, a subset of neurons can be fully co-activated to form a stable oscillator. Such neurons are called selectable oscillator neurons. Necessary and sufficient conditions are established for a subset of neurons to be selectable oscillator neurons in linear threshold recurrent neuron networks. It is proved that a subset of neurons forms selectable oscillator neurons if and only if the real part of each eigenvalue of the associated synaptic connection weight submatrix of the network is not larger than one. This simple condition makes the concept of selectable oscillator neurons tractable. The selectable oscillator neurons can be regarded as memories stored in the synaptic connections of networks, which enables to find a new perspective of memories in neural networks, different from the equilibrium-type attractors. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

34. Modified Sparse Linear-Discriminant Analysis via Nonconvex Penalties.

Author: Cai, Jia and Huang, Xiaolin
Subjects: DISCRIMINANT analysis, ORTHOGONAL matching pursuit, SPARSE approximations
Abstract: This paper considers the linear-discriminant analysis (LDA) problem in the undersampled situation, in which the number of features is very large and the number of observations is limited. Sparsity is often incorporated in the solution of LDA to make a well interpretation of the results. However, most of the existing sparse LDA algorithms pursue sparsity by means of the $\ell _{1}$ -norm. In this paper, we give elaborate analysis for nonconvex penalties, including the $\ell _{0}$ -based and the sorted $\ell _{1}$ -based LDA methods. The latter one can be regarded as a bridge between the $\ell _{0}$ and $\ell _{1}$ penalties. These nonconvex penalty-based LDA algorithms are evaluated on the gene expression array and face database, showing high classification accuracy on real-world problems. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

35. ST-InNet: Deep Spatio-Temporal Inception Networks for Traffic Flow Prediction in Smart Cities.

Author: Dai, Fei, Huang, Penggui, Mo, Qi, Xu, Xiaolong, Bilal, Muhammad, and Song, Houbing
Abstract: Traffic flow prediction plays a critical role in reducing traffic congestion in transportation systems. However, accurate traffic flow prediction becomes challenging due to the impact of complex spatio-temporal (ST) correlations and the diversity of ST correlations. When modeling complicated ST correlations, researchers usu did not take the diversity of ST correlations into consideration, resulting in poor prediction accuracy. In this paper, we propose ST-InNet, a deep spatio-temporal Inception network for collectively predicting traffic flow in each city region. Specifically, ST-InNet employs two Inception networks to simultaneously capture various spatial and temporal correlations of traffic data, including temporal closeness, temporal periodicity, nearby spatial dependencies, and distant spatial dependencies. For the diversity of spatial correlations, ST-InNet presents an improved variant of an Inception module to explicitly capture the different contributions of spatial correlations for each region. For the diversity of temporal correlations, ST-InNet designs a fusion component to explicitly model the varying contributions of temporal correlations on prediction. The experiments are conducted on a real-world traffic dataset in Nanjing, demonstrating that ST-InNet outperforms five state-of-the-art baselines in short-term and long-term traffic flow predictions with an average accuracy improvement of 32.09% and 30.97%, respectively. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

36. Adaptive Neighborhood Metric Learning.

Author: Song, Kun, Han, Junwei, Cheng, Gong, Lu, Jiwen, and Nie, Feiping
Subjects: MACHINE learning, NEIGHBORHOODS, DEEP learning, DISTANCE education
Abstract: In this paper, we reveal that metric learning would suffer from serious inseparable problem if without informative sample mining. Since the inseparable samples are often mixed with hard samples, current informative sample mining strategies used to deal with inseparable problem may bring up some side-effects, such as instability of objective function, etc. To alleviate this problem, we propose a novel distance metric learning algorithm, named adaptive neighborhood metric learning (ANML). In ANML, we design two thresholds to adaptively identify the inseparable similar and dissimilar samples in the training procedure, thus inseparable sample removing and metric parameter learning are implemented in the same procedure. Due to the non-continuity of the proposed ANML, we develop an ingenious function, named log-exp mean function to construct a continuous formulation to surrogate it, which can be efficiently solved by the gradient descent method. Similar to Triplet loss, ANML can be used to learn both the linear and deep embeddings. By analyzing the proposed method, we find it has some interesting properties. For example, when ANML is used to learn the linear embedding, current famous metric learning algorithms such as the large margin nearest neighbor (LMNN) and neighbourhood components analysis (NCA) are the special cases of the proposed ANML by setting the parameters different values. When it is used to learn deep features, the state-of-the-art deep metric learning algorithms such as Triplet loss, Lifted structure loss, and Multi-similarity loss become the special cases of ANML. Furthermore, the log-exp mean function proposed in our method gives a new perspective to review the deep metric learning methods such as Prox-NCA and N-pairs loss. At last, promising experimental results demonstrate the effectiveness of the proposed method. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

37. Fast Matrix Factorization With Nonuniform Weights on Missing Data.

Author: He, Xiangnan, Tang, Jinhui, Du, Xiaoyu, Hong, Richang, Ren, Tongwei, and Chua, Tat-Seng
Subjects: MATRIX decomposition, SINGULAR value decomposition, STATISTICAL weighting, DATA entry, LEAST squares, RECOMMENDER systems, MACHINE learning
Abstract: Matrix factorization (MF) has been widely used to discover the low-rank structure and to predict the missing entries of data matrix. In many real-world learning systems, the data matrix can be very high dimensional but sparse. This poses an imbalanced learning problem since the scale of missing entries is usually much larger than that of the observed entries, but they cannot be ignored due to the valuable negative signal. For efficiency concern, existing work typically applies a uniform weight on missing entries to allow a fast learning algorithm. However, this simplification will decrease modeling fidelity, resulting in suboptimal performance for downstream applications. In this paper, we weight the missing data nonuniformly, and more generically, we allow any weighting strategy on the missing data. To address the efficiency challenge, we propose a fast learning method, for which the time complexity is determined by the number of observed entries in the data matrix rather than the matrix size. The key idea is twofold: 1) we apply truncated singular value decomposition on the weight matrix to get a more compact representation of the weights and 2) we learn MF parameters with elementwise alternating least squares (eALS) and memorize the key intermediate variables to avoid repeating computations that are unnecessary. We conduct extensive experiments on two recommendation benchmarks, demonstrating the correctness, efficiency, and effectiveness of our fast eALS method. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

38. Features Combined From Hundreds of Midlayers: Hierarchical Networks With Subnetwork Nodes.

Author: Yang, Yimin and Wu, Q. M. Jonathan
Subjects: LEARNING strategies, ARTIFICIAL neural networks, ITERATIVE learning control
Abstract: In this paper, we believe that the mixed selectivity of neuron in the top layer encodes distributed information produced from other neurons to offer a significant computational advantage over recognition accuracy. Thus, this paper proposes a hierarchical network framework that the learning behaviors of features combined from hundreds of midlayers. First, a subnetwork neuron, which itself could be constructed by other nodes, is functional as a subspace features extractor. The top layer of a hierarchical network needs subspace features produced by the subnetwork neurons to get rid of factors that are not relevant, but at the same time, to recast the subspace features into a mapping space so that the hierarchical network can be processed to generate more reliable cognition. Second, this paper shows that with noniterative learning strategy, the proposed method has a wider and shallower structure, providing a significant role in generalization performance improvements. Hence, compared with other state-of-the-art methods, multiple channel features with the proposed method could provide a comparable or even better performance, which dramatically boosts the learning speed. Our experimental results show that our platform can provide a much better generalization performance than 55 other state-of-the-art methods. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

39. BTWalk: Branching Tree Random Walk for Multi-Order Structured Network Embedding.

Author: Xiong, Hao and Yan, Junchi
Subjects: TREE branches, RANDOM walks, LEARNING strategies, SAMPLING (Process), TASK analysis
Abstract: Multi-order proximity is useful for effective network embedding. In contrast to many previous works that only consider order-level weights, this paper proposes to explore a more expressive node-level weighting mechanism to encode the diverse local structure, with a scalable and theoretically justified sampling strategy for its learning. Specifically, we start with a formal definition of multi-order proximity matrix which leads to our new multi-order objective based on Laplacian Eigenmaps and Skip-Gram. Then we instantiate the node-specific multi-order weights in the objective with the help of neighborhood size estimation, which indicates node-specific multi-order information. For objective learning, it is implicitly fulfilled with our proposed branching tree-like random walk strategy termed by BTWalk, which differs from the dominant chain-like walk in existing sampling techniques. BTWalk is designed by a synergetic combination of BFS (breadth-first search) and DFS (depth-first search), which is modulated according to the weights of the considered proximity orders. We theoretically analyze its cost-efficiency, and further propose the so-called Vec4Cross framework that incorporates joint node embedding and network alignment for two partially overlapped networks based on the seed matchings, whereby BTWalk is also adopted for embedding. Promising experimental results are obtained on real-world datasets across popular tasks. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

40. TDPN: Texture and Detail-Preserving Network for Single Image Super-Resolution.

Author: Cai, Qing, Li, Jinxing, Li, Huafeng, Yang, Yee-Hong, Wu, Feng, and Zhang, David
Subjects: HIGH resolution imaging, CONVOLUTIONAL neural networks, GENERATIVE adversarial networks, SIGNAL-to-noise ratio
Abstract: Single image super-resolution (SISR) using deep convolutional neural networks (CNNs) achieves the state-of-the-art performance. Most existing SISR models mainly focus on pursuing high peak signal-to-noise ratio (PSNR) and neglect textures and details. As a result, the recovered images are often perceptually unpleasant. To address this issue, in this paper, we propose a texture and detail-preserving network (TDPN), which focuses not only on local region feature recovery but also on preserving textures and details. Specifically, the high-resolution image is recovered from its corresponding low-resolution input in two branches. First, a multi-reception field based branch is designed to let the network fully learn local region features by adaptively selecting local region features in different reception fields. Then, a texture and detail-learning branch supervised by the textures and details decomposed from the ground-truth high resolution image is proposed to provide additional textures and details for the super-resolution process to improve the perceptual quality. Finally, we introduce a gradient loss into the SISR field and define a novel hybrid loss to strengthen boundary information recovery and to avoid overly smooth boundary in the final recovered high-resolution image caused by using only the MAE loss. More importantly, the proposed method is model-agnostic, which can be applied to most off-the-shelf SISR networks. The experimental results on public datasets demonstrate the superiority of our TDPN on most state-of-the-art SISR methods in PSNR, SSIM and perceptual quality. We will share our code on https://github.com/tocaiqing/TDPN. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

41. Byzantine-Resilient Decentralized Stochastic Gradient Descent.

Author: Guo, Shangwei, Zhang, Tianwei, Yu, Han, Xie, Xiaofei, Ma, Lei, Xiang, Tao, and Liu, Yang
Subjects: FAULT tolerance (Engineering), DEEP learning, INSTRUCTIONAL systems
Abstract: Decentralized learning has gained great popularity to improve learning efficiency and preserve data privacy. Each computing node makes equal contribution to collaboratively learn a Deep Learning model. The elimination of centralized Parameter Servers (PS) can effectively address many issues such as privacy, performance bottleneck and single-point-failure. However, how to achieve Byzantine Fault Tolerance in decentralized learning systems is rarely explored, although this problem has been extensively studied in centralized systems. In this paper, we present an in-depth study towards the Byzantine resilience of decentralized learning systems with two contributions. First, from the adversarial perspective, we theoretically illustrate that Byzantine attacks are more dangerous and feasible in decentralized learning systems: even one malicious participant can arbitrarily alter the models of other participants by sending carefully crafted updates to its neighbors. Second, from the defense perspective, we propose Ubar, a novel algorithm to enhance decentralized learning with Byzantine Fault Tolerance. Specifically, Ubar provides a Uniform Byzantine-resilient Aggregation Rule for benign nodes to select the useful parameter updates and filter out the malicious ones in each training iteration. It guarantees that each benign node in a decentralized system can train a correct model under very strong Byzantine attacks with an arbitrary number of faulty nodes. We conduct extensive experiments on standard image classification tasks and the results indicate that Ubar can effectively defeat both simple and sophisticated Byzantine attacks with higher performance efficiency than existing solutions. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

42. A Pairwise Proximity Learning-Based Ant Colony Algorithm for Dynamic Vehicle Routing Problems.

Author: Xiang, Xiaoshu, Tian, Ye, Zhang, Xingyi, Xiao, Jianhua, and Jin, Yaochu
Abstract: Dynamic vehicle routing problems (DVRPs) have become a hot research topic due to their significance in logistics, although it is still very challenging for existing algorithms to solve DVRPs due to the dynamically changing customer requests during the optimization. In this paper, we propose a pairwise proximity learning-based ant colony algorithm, termed PPL-ACO, for tackling DVRPs. In PPL-ACO, a pairwise proximity learning method is suggested to predict the local visiting order of customers in the optimal route after the occurrence of changes, which is on the basis of learning from the optimal routes found before the changes occur. A radial basis function network is used to learn the local visiting order of customers based on the proximity between each pair of customer nodes, by which the optimal routes can be quickly tracked after changes occur. Experimental results on 22 popular DVRP instances show that the proposed PPL-ACO significantly outperforms four state-of-the-art approaches to DVRPs. More interestingly, the results on five large-scale DVRP instances demonstrate the superiority of the proposed PPL-ACO in solving large-scale DVPRs with up to 1000 customers. The results on a real case of Nankai Strict, Tianjin, China also verifies that the proposed PPL-ACO is more effective and efficient than the four compared approaches in solving real-world DVRPs. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

43. PSO-Based Fuzzy Markup Language for Student Learning Performance Evaluation and Educational Application.

Author: Lee, Chang-Shing, Wang, Mei-Hui, Wang, Chi-Shiang, Teytaud, Olivier, Liu, Jialin, Lin, Su-Wei, and Hung, Pi-Hsia
Subjects: PARTICLE swarm optimization, FUZZY sets, XML (Extensible Markup Language), ITEM response theory, BAYESIAN analysis
Abstract: Fuzzy relationships exist between students’ learning performance with various abilities and a test item. However, the challenges in implementing adaptive assessment agents are obtaining sufficient items, efficient and accurate computerized estimation, and a substantial feedback agent. Additionally, the agent must immediately estimate students’ ability item by item, which places a considerable burden on the server, especially for a group test. Hence, the implementation of an adaptive assessment agent is more difficult in practice. This paper proposes an agent with particle swarm optimization (PSO) based on a fuzzy markup language (FML) for students’ learning performance evaluation and educational applications, and the proposed agent is according to the response data from a conventional test and an item response theory (IRT)-based three-parameter logistic model. First, we apply a Gauss–Seidel based parameter estimation mechanism to estimate the items’ parameters according to the response data, and then to compare its results with those of an IRT-based Bayesian parameter estimation mechanism. In addition, we propose a static-IRT test assembly mechanism to assemble a form for the conventional test. The presented FML-based dynamic assessment mechanism infers the probability of making a correct response to the item for a student with various abilities. Moreover, this paper also proposes a novel PSO-based FML (PFML) learning mechanism for optimizing the parameters between items and students. Finally, we adopt a K-fold cross-validation mechanism to evaluate the performance of the proposed agent. Experimental results show that the novel PFML learning mechanism for the parameter estimation and learning optimization performs favorably. We believe the proposed PFML will be a reference for education research and pedagogy and an important colearning mechanism for future human–machine educational applications. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

44. Domain Invariant and Class Discriminative Feature Learning for Visual Domain Adaptation.

Author: Li, Shuang, Song, Shiji, Huang, Gao, Ding, Zhengming, and Wu, Cheng
Subjects: INVARIANTS (Mathematics), LEARNING, MATHEMATICAL symmetry, DATA analysis, ARTIFICIAL neural networks
Abstract: Domain adaptation manages to build an effective target classifier or regression model for unlabeled target data by utilizing the well-labeled source data but lying different distributions. Intuitively, to address domain shift problem, it is crucial to learn domain invariant features across domains, and most existing approaches have concentrated on it. However, they often do not directly constrain the learned features to be class discriminative for both source and target data, which is of vital importance for the final classification. Therefore, in this paper, we put forward a novel feature learning method for domain adaptation to construct both domain invariant and class discriminative representations, referred to as DICD. Specifically, DICD is to learn a latent feature space with important data properties preserved, which reduces the domain difference by jointly matching the marginal and class-conditional distributions of both domains, and simultaneously maximizes the inter-class dispersion and minimizes the intra-class scatter as much as possible. Experiments in this paper have demonstrated that the class discriminative properties will dramatically alleviate the cross-domain distribution inconsistency, which further boosts the classification performance. Moreover, we show that exploring both domain invariance and class discriminativeness of the learned representations can be integrated into one optimization framework, and the optimal solution can be derived effectively by solving a generalized eigen-decomposition problem. Comprehensive experiments on several visual cross-domain classification tasks verify that DICD can outperform the competitors significantly. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

45. Model Learning for Multistep Backward Prediction in Dyna- ${Q}$ Learning.

Author: Hwang, Kao-Shing, Jiang, Wei-Cheng, Chen, Yu-Jen, and Hwang, Iris
Subjects: REINFORCEMENT learning, SIMULATION methods & models
Abstract: A model-based reinforcement learning (RL) method which interplays direct and indirect learning to update ${Q}$ functions is proposed. The environment is approximated by a virtual model that can predict the transition to the next state and the reward of the domain. This virtual model is used to train ${Q}$ functions to accelerate policy learning. Lookup table methods are usually used to establish such environmental models, but these methods need to collect tremendous amounts of experiences to enumerate responses of the environment. In this paper, a stochastic model learning method based on tree structures is presented. To model the transition probability, an online clustering method is applied to equip the model learning method with the abilities to evaluate the transition probability. By the virtual model, the RL method produces simulated experience in the stage of indirect learning. Since simulated transitions and backups are more usefully focused by working backward from the state-action, the pair estimated ${Q}$ value of which changes significantly, the useful one-step backups are actions that lead directly into the one state whose value has already obviously been changed. This, however, may induce a false positive; that is, a backup state may be an invalid state, such as an absorbing or terminal state, especially in cases where the changes of ${Q}$ values at the planning stage are still needed to put back for ranking even though they are based on a simulated experience and are possibly erroneous. It is obvious that when the agent is attracted to generate simulated experience around the area of these absorbing states, the learning efficiency is deteriorated. This paper proposes three detecting methods to solve this problem. Moreover, the policy learning can speed up. The effectiveness and generality of our method is further demonstrated in three numerical simulations. The simulation results demonstrate that the training rate of our method is obviously improved. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

46. Single channel informed signal separation using artificial‐stereophonic mixtures and exemplar‐guided matrix factor deconvolution.

Author: Al‐Tmeme, Ahmed, Woo, W. L., Dlay, S. S., and Gao, Bin
Subjects: SIGNAL separation, STEREOPHONIC sound systems, INTELLIGENT control systems, ADAPTIVE signal processing, INSTRUCTIONAL systems
Abstract: Summary: In this paper, a method is proposed to tackle the problem of single channel audio separation. The proposed method leverages on the exemplar source is used to emulate the targeted speech signal. A multicomponent nonnegative matrix factor 2D deconvolution (NMF2D) is proposed to model the temporal and spectral changes and the number of spectral basis of the audio signals. The paper proposes an artificial auxiliary channel to imitate a pair of stereo mixture signals, which is termed as “artificial‐stereophonic mixtures.” The artificial‐stereophonic mixtures and the exemplar source are jointly used to guide the factorization process of the NMF2D. The factorization is adapted under a hybrid framework that combines the generalized expectation–maximization algorithm with multiplicative update adaptation. The proposed algorithm leads to fast and stable convergence and ensures the nonnegativity constraints of the solution are satisfied. Adaptive sparsity has also been introduced on each sparse parameter in the multicomponent NMF2D model when the exemplar deviates from the target signal. Experimental results have shown the competence of the proposed algorithms in comparison with other algorithms. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

47. SteeringLoss: A Cost-Sensitive Loss Function for the End-to-End Steering Estimation.

Author: Yuan, Wei, Yang, Ming, Li, Hao, Wang, Chunxiang, and Wang, Bing
Abstract: Imbalanced training is a challenge in the field of autonomous driving. For the steering estimation task, imbalanced training is the core reason that an end-to-end model cannot estimate sharp steering value well. Inspired by researches on the steering estimation, this paper proposes a novel loss function to train a high-performance end-to-end model for handling the imbalanced training problem, which is named SteeringLoss. Firstly, the imbalanced distribution of the steering value for driving datasets is analyzed, which is similar to the double long-tailed distribution. Secondly, with the feature of distribution, this paper designs a cost-sensitive loss function step by step, the new loss function can improve the impact of the sharp steering value while maintaining the impact of the small steering value. Thirdly, three typical steering estimation models are established with SteeringLoss for demonstration, including the CNN model, the CNN-LSTM model and the 3DCNN-LSTM model. Finally, three experiments are designed for SteeringLoss: Experiment I demonstrates SteeringLoss can avoid imbalanced training problem; Experiment II discusses the finetuning principle of SteeringLoss and gives the basic guideline for using SteeringLoss; Experiment III shows the results on different typical end-to-end steering estimation models, which shows effectiveness of SteeringLoss for all the models and gives different solutions for the steering estimation. Moreover, the SteeringLoss is suitable for the imbalanced training with similar distributions of datasets besides end-to-end steering estimation, which indicates the potential value of the research on SteeringLoss in the future. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

48. A Discovery Method of Trend Rules from Complex Sequential Data.

Author: Sakurai, Shigeaki, Makino, Kyoko, and Matsumoto, Shigeru
Abstract: This paper proposes a method that discovers trend rules from complex sequential data. The rules represent relationships among evaluation objects, keywords, and changes of numerical values related to the evaluation objects. The data is composed of numerical sequential data and text sequential data. The method extracts frequent patterns from transaction sets based on the changes. Also, it regards combinations of the patterns and the changes as trend rules. This paper applies the method to data sets composed of stock data and news headlines. Lastly, this paper compares the method with a method based on the random selection and shows the effect of the proposed method. [ABSTRACT FROM PUBLISHER]
Published: 2012
Full Text: View/download PDF

49. Data-Driven Learning Control for Stochastic Nonlinear Systems: Multiple Communication Constraints and Limited Storage.

Author: Shen, Dong
Subjects: DATA mining, NONLINEAR systems, ARTIFICIAL neural networks
Abstract: This paper proposes a data-driven learning control method for stochastic nonlinear systems under random communication conditions, including data dropouts, communication delays, and packet transmission disordering. A renewal mechanism is added to the buffer to regulate the arrived packets, and a recognition mechanism is introduced to the controller for the selection of suitable update packets. Both intermittent and successive update schemes are proposed based on the conventional P-type iterative learning control algorithm, and are shown to converge to the desired input with probability one. The convergence and effectiveness of the proposed algorithms are verified by means of illustrative simulations. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

50. Guest Editors' Introduction: Special Section on Learning Deep Architectures.

Author: Bengio, Samy, Deng, Li, Larochelle, Hugo, Lee, Honglak, and Salakhutdinov, Ruslan
Subjects: COMPUTER architecture, MACHINE learning, SPECIAL issues of periodicals, DATA extraction, SIGNAL processing, DATA mining, ARTIFICIAL neural networks
Abstract: There has been a resurgence of research in the design of deep architecture models and learning algorithms, i.e., methods that rely on the extraction of a multilayer representation of the data. Often referred to as deep learning, this topic of research has been building on and contributing to many different research topics, such as neural networks, graphical models, feature learning, unsupervised learning, optimization, pattern recognition, and signal processing. Deep learning is also motivated and inspired by neuroscience and has had a tremendous impact on various applications such as computer vision, speech recognition, and natural language processing. The clearly multidisciplinary nature of deep learning led to a call for papers for a special issue dedicated to learning deep architectures. [ABSTRACT FROM PUBLISHER]
Published: 2013
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

2,795 results

Search Results

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources