5,041 results on '"incremental learning"'
Search Results
2. Mapping the Unknown: A New Approach to Open-World Video Recognition
- Author
-
Parga, César D., Pardo, Xosé M., Regueiro, Carlos V., Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
- Published
- 2025
- Full Text
- View/download PDF
3. Foundation Model-Powered 3D Few-Shot Class Incremental Learning via Training-Free Adaptor
- Author
-
Ahmadi, Sahar, Cheraghian, Ali, Saberi, Morteza, Abir, Md.Towsif, Dastmalchi, Hamidreza, Hussain, Farookh, Rahman, Shafin, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Cho, Minsu, editor, Laptev, Ivan, editor, Tran, Du, editor, Yao, Angela, editor, and Zha, Hongbin, editor
- Published
- 2025
- Full Text
- View/download PDF
4. Federated Class Incremental Learning: A Pseudo Feature Based Approach Without Exemplars
- Author
-
Yoo, Min Kyoon, Park, Yu Rang, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Cho, Minsu, editor, Laptev, Ivan, editor, Tran, Du, editor, Yao, Angela, editor, and Zha, Hongbin, editor
- Published
- 2025
- Full Text
- View/download PDF
5. Face to Cartoon Incremental Super-Resolution Using Knowledge Distillation
- Author
-
Devkatte, Trinetra, Dubey, Shiv Ram, Singh, Satish Kumar, Hadid, Abdenour, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
- Published
- 2025
- Full Text
- View/download PDF
6. CrackUDA: Incremental Unsupervised Domain Adaptation for Improved Crack Segmentation in Civil Structures
- Author
-
Srivastava, Kushagra, Kancharla, Damodar Datta, Tahereen, Rizvi, Ramancharla, Pradeep Kumar, Sarvadevabhatla, Ravi Kiran, Kandath, Harikumar, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
- Published
- 2025
- Full Text
- View/download PDF
7. Anticipating Future Object Compositions Without Forgetting
- Author
-
Zahran, Youssef, Burghouts, Gertjan, B. Eisma, Yke, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
- Published
- 2025
- Full Text
- View/download PDF
8. Incremental Object 6D Pose Estimation
- Author
-
Tian, Long, Sorrenti, Amelia, Pang, Yik Lung, Bellitto, Giovanni, Palazzo, Simone, Spampinato, Concetto, Oh, Changjae, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
- Published
- 2025
- Full Text
- View/download PDF
9. Task Consistent Prototype Learning for Incremental Few-Shot Semantic Segmentation
- Author
-
Xu, Wenbo, Wu, Yanan, Jiang, Haoran, Wang, Yang, Wu, Qiang, Zhang, Jian, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
- Published
- 2025
- Full Text
- View/download PDF
10. Learning Dynamic Representations in Large Language Models for Evolving Data Streams
- Author
-
Srivastava, Ashish, Bhatnagar, Shalabh, Narasimha Murty, M., Aravinda Raman, J., Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
- Published
- 2025
- Full Text
- View/download PDF
11. CL3: A Collaborative Learning Framework for the Medical Data Ensuring Data Privacy in the Hyperconnected Environment
- Author
-
Parvez, Mohammad Zavid, Islam, Rafiqul, Islam, Md Zahidul, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Barhamgi, Mahmoud, editor, Wang, Hua, editor, and Wang, Xin, editor
- Published
- 2025
- Full Text
- View/download PDF
12. Real-Time Human Activity Recognition Using Non-intrusive Sensing and Continual Learning
- Author
-
Rahman, Md Geaur, ur Rehman, Sabih, Fealy, Shanna, Vallejo, Johan Sebastian Ramirez, Fuskelay, Aayush, Moni, Mohammad Ali, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Gong, Mingming, editor, Song, Yiliao, editor, Koh, Yun Sing, editor, Xiang, Wei, editor, and Wang, Derui, editor
- Published
- 2025
- Full Text
- View/download PDF
13. Information Bottleneck Based Data Correction in Continual Learning
- Author
-
Chen, Shuai, Zhang, Mingyi, Zhang, Junge, Huang, Kaiqi, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
- Published
- 2025
- Full Text
- View/download PDF
14. Virtual Learning Machine for Tiny Devices
- Author
-
Kitagawa, Nozomi, Yamauchi, Koichiro, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Wu, Shiqing, editor, Su, Xing, editor, Xu, Xiaolong, editor, and Kang, Byeong Ho, editor
- Published
- 2025
- Full Text
- View/download PDF
15. Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection
- Author
-
Mo, Qijie, Gao, Yipeng, Fu, Shenghao, Yan, Junkai, Wu, Ancong, Zheng, Wei-Shi, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
- Published
- 2025
- Full Text
- View/download PDF
16. Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning
- Author
-
Park, Min-Yeong, Lee, Jae-Ho, Park, Gyeong-Moon, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
- Published
- 2025
- Full Text
- View/download PDF
17. FIL-FLD: Few-Shot Incremental Learning with EMD Metric for High Generalization Fingerprint Liveness Detection
- Author
-
Yuan, Chengsheng, Qiu, Wenqian, Zhou, Zhili, Li, Xinting, Chen, Xianyi, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Lin, Zhouchen, editor, Cheng, Ming-Ming, editor, He, Ran, editor, Ubul, Kurban, editor, Silamu, Wushouer, editor, Zha, Hongbin, editor, Zhou, Jie, editor, and Liu, Cheng-Lin, editor
- Published
- 2025
- Full Text
- View/download PDF
18. Incremental Learning for Object Classification in a Real and Dynamic World
- Author
-
Aburto Sánchez, Yareli, Morales, Eduardo F., Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Martínez-Villaseñor, Lourdes, editor, and Ochoa-Ruiz, Gilberto, editor
- Published
- 2025
- Full Text
- View/download PDF
19. Resilience to the Flowing Unknown: An Open Set Recognition Framework for Data Streams
- Author
-
Barcina-Blanco, Marcos, L. Lobo, Jesus, Garcia-Bringas, Pablo, Del Ser, Javier, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Quintián, Héctor, editor, Corchado, Emilio, editor, Troncoso Lora, Alicia, editor, Pérez García, Hilde, editor, Jove Pérez, Esteban, editor, Calvo Rolle, José Luis, editor, Martínez de Pisón, Francisco Javier, editor, García Bringas, Pablo, editor, Martínez Álvarez, Francisco, editor, Herrero, Álvaro, editor, and Fosci, Paolo, editor
- Published
- 2025
- Full Text
- View/download PDF
20. An incremental learning framework for pipeline weld crack damage identification and leakage rate prediction.
- Author
-
Huang, Jing, Zhang, Zhifen, Yu, Yanlong, Li, Yongjie, Zhang, Shuai, Qin, Rui, Xing, Ji, Cheng, Wei, Wen, Guangrui, and Chen, Xuefeng
- Abstract
The weld crack leakage due to stress concentration and external load is a significant safety risk in pressure pipelines. Microstructural variations and dynamic propagation lead to unpredictable changes in leakage rate over time and conditions. To address the above problems, a novel framework called OILS-TCN for weld crack pattern recognition and leakage rate prediction is proposed. Firstly, the adaptive threshold optimization algorithm is introduced into the self-organizing incremental neural network to update and increase the crack leakage pattern. Secondly, the depth first search algorithm is combined with the radial basis function neural network to perform online increment labelling of the leakage state. Then, according to the attenuation characteristics of acoustic emission signals, a portable input-attention module is designed to add different weights to the input sequence. Finally, the accurate prediction of leakage rate under different conditions is realized based on the temporal convolutional network. Compared with other advanced methods, the proposed method has obvious advantages in the adaptability and accuracy of pipeline weld crack leakage rate prediction. In addition, the validity and necessity of each part of the framework proposed are discussed based on ablation experiments. The proposed method can predict the leakage rate in real time without modifying the hyperparameters of the model, and can provide a powerful guide for the online monitoring of the leakage AE technology of pressure pipeline in complex systems. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
21. A quantitative monitoring method for the pretightening state of bolts based on nonlinear Lamb waves and incremental learning.
- Author
-
Wang, Tiantian, Tian, Longzhen, Yang, Jinsong, Xie, Jingsong, and Zhang, Zhikang
- Subjects
- *
MACHINE learning , *LAMB waves , *NONLINEAR waves , *POINT processes , *QUANTITATIVE research - Abstract
The degradation of the pretightening state of bolts is a multistage process. Utilising a single model to monitor the pretightening state in the full degradation stage and continuously updating the monitoring model is important. Therefore, a quantitative monitoring method for the pretightening state of bolts based on nonlinear Lamb waves and incremental learning is proposed. In the proposed method, phase reversal technology is first adopted to enhance the sensitivity for bolt loosening, and then a relative nonlinear coefficient based on phase reversal (RCP) is constructed. The disadvantage that linear indicators are insensitive to early loosening is overcome and the critical points of the multidegradation process are identified by this indicator, the tight contact stage (TCS) and the significant loosening stage. After the TCS is determined, a quantitative monitoring model, which fuses seven nonlinear damage indexes, is established based on incremental canonical correlation forests (ICCF). This algorithm achieves incremental learning by continuously increasing the number of decision trees. To verify the effectiveness of the method, an experimental study is carried out on four bolts. The monitoring effects of different indicators show that the method has higher accuracy and retains the ability to dynamically update data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. Open-world object detection: A solution based on reselection mechanism and feature disentanglement.
- Author
-
Lin, Tian, Hua, Li, Linxuan, Li, and Chuanao, Bai
- Subjects
- *
OBJECT recognition (Computer vision) , *MACHINE learning , *DETECTORS , *ALGORITHMS , *SIMULATED annealing - Abstract
Traditional object detection algorithms operate within a closed set, where the training data may not cover all real-world objects. Therefore, the issue of open-world object detection has attracted significant attention. Open-world object detection faces two major challenges: "neglecting unknown objects" and "misclassifying unknown objects as known ones." In our study, we address these challenges by utilizing the Region Proposal Network (RPN) outputs to identify potential unknown objects with high object scores that do not overlap with ground truth annotations. We introduce the reselection mechanism, which separates unknown objects from the background. Subsequently, we employ the simulated annealing algorithm to disentangle features of unknown and known classes, guiding the detector's learning process. Our method has improved on multiple evaluation metrics such as U-mAP, U-recall, and UDP, greatly alleviating the challenges faced by open world object detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Addressing catastrophic forgetting in payload parameter identification using incremental ensemble learning.
- Author
-
Taie, Wael, ElGeneidy, Khaled, Al-Yacoub, Ali, and Sun, Ronglei
- Subjects
MACHINE learning ,INDUSTRIAL robots ,PARAMETER identification ,INDUSTRY 4.0 ,CLASSIFICATION - Abstract
Collaborative robots (cobots) are increasingly integrated into Industry 4.0 dynamic manufacturing environments that require frequent system reconfiguration due to changes in cobot paths and payloads. This necessitates fast methods for identifying payload inertial parameters to compensate the cobot controller and ensure precise and safe operation. Our prior work used Incremental Ensemble Model (IEM) to identify payload parameters, eliminating the need for an excitation path and thus removing the separate identification step. However, this approach suffers from catastrophic forgetting. This paper introduces a novel incremental ensemble learning method that addresses the problem of catastrophic forgetting by adding a new weak learner to the ensemble model for each new training bag. Moreover, it proposes a new classification model that assists the ensemble model in identifying which weak learner provides the most accurate estimation for new input data. The proposed method incrementally updates the identification model while the cobot navigates any task path, maintaining accuracy on old weak learner even after updating with new data. Validation performed on the Franka Emika cobot showcases the model's superior accuracy and adaptability, effectively eliminating the problem of catastrophic forgetting. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. Replay-Based Incremental Learning Framework for Gesture Recognition Overcoming the Time-Varying Characteristics of sEMG Signals.
- Author
-
Zhang, Xingguo, Li, Tengfei, Sun, Maoxun, Zhang, Lei, Zhang, Cheng, and Zhang, Yue
- Subjects
- *
PATTERN recognition systems , *MACHINE learning , *CLASS actions , *GESTURE , *ELECTROMYOGRAPHY - Abstract
Gesture recognition techniques based on surface electromyography (sEMG) signals face instability problems caused by electrode displacement and the time-varying characteristics of the signals in cross-time applications. This study proposes an incremental learning framework based on densely connected convolutional networks (DenseNet) to capture non-synchronous data features and overcome catastrophic forgetting by constructing replay datasets that store data with different time spans and jointly participate in model training. The results show that, after multiple increments, the framework achieves an average recognition rate of 96.5% from eight subjects, which is significantly better than that of cross-day analysis. The density-based spatial clustering of applications with noise (DBSCAN) algorithm is utilized to select representative samples to update the replayed dataset, achieving a 93.7% recognition rate with fewer samples, which is better than the other three conventional sample selection methods. In addition, a comparison of full dataset training with incremental learning training demonstrates that the framework improves the recognition rate by nearly 1%, exhibits better recognition performance, significantly shortens the training time, reduces the cost of model updating and iteration, and is more suitable for practical applications. This study also investigates the use of the incremental learning of action classes, achieving an average recognition rate of 88.6%, which facilitates the supplementation of action types according to the demand, and further improves the application value of the action pattern recognition technology based on sEMG signals. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. Incremental learning model for sustainable agricultural land assessment using multimodal satellite data.
- Author
-
Chatrabhuj and Meshram, Kundan
- Subjects
- *
METAHEURISTIC algorithms , *SUSTAINABLE agriculture , *MACHINE learning , *CONVOLUTIONAL neural networks , *FARMS - Abstract
The identification of agricultural lands is crucial for sustainable development in rural areas. In this paper, an augmented model that utilizes multimodal satellite-based data samples for identifying agricultural lands via incremental learning. The Whale Optimization Algorithm (WOA) is used for augmenting collected images and data samples to enhance the accuracy of the model under real-time conditions. To identify agricultural lands, Deep Convolutional Neural Networks (CNNs) are trained on the augmented data samples. Additionally, the incorporation of Q-Learning for continuous optimization of the model to enhance its efficiency and effectiveness in identifying agricultural lands. The proposed model offers many edges over existing methods. Firstly, the use of multimodal satellite-based data samples allows for a comprehensive and accurate analysis of agricultural lands. Secondly, the incorporation of the Whale Optimization Algorithm enables the augmentation of collected data samples, leading to improved accuracy and reliability of the model. Thirdly, Deep CNNs allows the extraction of complex features from the data, leading to more accurate identification of agricultural lands. Finally, the use of Q-Learning ensures that the model is continuously optimized to improve its efficiency and effectiveness. The need for this work arises from the limitations of existing methods in accurately identifying agricultural lands. Traditional methods based on manual surveys and visual interpretation are time-consuming, expensive, and prone to errors. Moreover, existing automated methods often lack the ability to analyse multimodal satellite-based data samples and fail to provide accurate results. Based on these observations, the proposed augmented model offers a promising solution for identifying agricultural lands for sustainable development. The use of multimodal satellite-based data samples, WOA, Deep CNNs, and Q-Learning allows for an accurate and efficient analysis of agricultural lands, which can aid in sustainable development planning and decision-making operations. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. Incremental federated learning for traffic flow classification in heterogeneous data scenarios.
- Author
-
Pekar, Adrian, Makara, Laszlo Arpad, and Biczok, Gergely
- Subjects
- *
FEDERATED learning , *MACHINE learning , *COMPUTER network traffic , *TRAFFIC flow , *FEATURE selection - Abstract
This paper explores the comparative analysis of federated learning (FL) and centralized learning (CL) models in the context of multi-class traffic flow classification for network applications, a timely study in the context of increasing privacy preservation concerns. Unlike existing literature that often omits detailed class-wise performance evaluation, and consistent data handling and feature selection approaches, our study rectifies these gaps by implementing a feed-forward neural network and assessing FL performance under both independent and identically distributed (IID) and non-independent and identically distributed (non-IID) conditions, with a particular focus on incremental training. In our cross-silo experimental setup involving five clients per round, FL models exhibit notable adaptability. Under IID conditions, the accuracy of the FL model peaked at 96.65%, demonstrating its robustness. Moreover, despite the challenges presented by non-IID environments, our FL models demonstrated significant resilience, adapting incrementally over rounds to optimize performance; in most scenarios, our FL models performed comparably to the idealistic CL model regarding multiple well-established metrics. Through a comprehensive traffic flow classification use case, this work (i) contributes to a better understanding of the capabilities and limitations of FL, offering valuable insights for the real-world deployment of FL, and (ii) provides a novel, large, carefully curated traffic flow dataset for the research community. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. Functional incremental least square regression algorithm.
- Author
-
Shen, Yujing, Zou, Kunjin, Zou, Bin, Xu, Jie, and Zeng, Jingjing
- Abstract
Functional linear regression is one of the main modeling tools for working with functional data. Since functional data are usually stream data essentially and there are some noises in functional data. Many numerical research studies of machine learning indicate that the noise samples not only increase the amount of storage space, but also affect the performance of algorithm. Therefore, in this paper we consider a new learning strategy by introducing incremental learning, Markov sampling for functional linear regression and propose a novel functional incremental linear square regression algorithm based on Markov sampling (FILSR-MS). To have a better understanding of the proposed FILSR-MS, we not only estimate the generalization bound of the proposed algorithm and establish the fast learning rate, but also present some useful discussions. The performance of the proposed algorithm is validated by the numerical experiments for benchmark repository. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Generalization bounds of incremental SVM.
- Author
-
Zeng, Jingjing, Zou, Bin, Qin, Yimo, and Xu, Jie
- Abstract
Incremental learning is one of the effective methods of learning from the accumulated training samples and the large-scale dataset. The main advantages of incremental learning consist of making full use of historical information, reducing the training scale greatly and saving space and time consumption. Despite extensive research on incremental support vector machine (SVM) learning algorithms, most of them are based on independent and identically distributed samples (i.i.d.). Not only that, there has been no theoretical analysis of incremental SVM learning algorithms. In this paper, we mainly study the generalization bounds of this incremental SVM learning algorithm whose samples are based on uniformly geometric Markov chains, and exponentially strongly mixing sequence. As a special case, we also obtain the generalization bounds of i.i.d. samples. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. An efficient implementation to compute the pseudoinverse for the incremental broad learning system on added inputs.
- Author
-
Zhu, Hufei, Liu, Zhulin, Chen, C. L. Philip, and Liang, Yanyang
- Abstract
In this paper, we improve the broad learning system (BLS) by speeding up the incremental learning for added inputs. We propose an efficient implementation for a step that is in the pseudoinverse computation of a partitioned matrix, to reduce the computational complexity. The proposed efficient implementation has two different forms for the cases of q > k and q ≤ k , respectively, where q and k denote the number of additional training samples and the total number of nodes, respectively. The proposed implementation for q > k utilizes the inverse of a sum of matrices to compute only a k × k matrix inverse, instead of a q × q matrix inverse in the original implementation, and the corresponding speedup for the matrix inversion operation in the number of floating-point operations is 1 2 (q / k) 3 . Moreover, it also speeds up two relevant matrix multiplication operations in the original implementation. On the other hand, the proposed implementation for q ≤ k speeds up one matrix multiplication operation in the original implementation. The numerical simulations show that both the proposed and original implementations always achieve the same testing accuracy. On the Modified National Institute of Standards and Technology dataset, the speedups of the proposed efficient BLS implementation over the original BLS implementation in total training time are 1.35–1.57 when q > k and 1.09–1.13 when q < k , while on the NYU object recognition benchmark dataset, the speedups are 1.10–1.35 when q > k and 1.07–1.09 when q < k. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. Advancing Incremental Few-Shot Video Action Recognition with Cluster Compression and Generative Separation.
- Author
-
Qin, Yanfei, Chu, Renxin, and Liu, Baolin
- Subjects
- *
RECOGNITION (Psychology) , *MACHINE learning , *IMAGE recognition (Computer vision) , *DEEP learning , *SEMANTICS - Abstract
Few-Shot Class Incremental Learning (FSCIL) is a trending topic in deep learning, addressing the need for models to incrementally learn novel classes, particularly in real-world scenarios where continuously emerging classes come with limited labeled samples. However, the majority of FSCIL research has been dedicated to image classification and object recognition tasks, with limited attention given to video action classification. In this paper, we present a new Cluster Compression and Generative Separation (CCGS) method for Incremental Few-Shot Video Action Recognition (iFSVAR), which introduces contrastive learning to boost the degree of class separation in the base session. Simultaneously, it creates numerous fine-grained classes with diverse semantics, effectively filling the unallocated representation space. Experimental results on UCF101, Kinetics, and Something-Something-V2 demonstrate the effectiveness of the framework. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. 基于知识回放的即时软件缺陷预测增量模型.
- Author
-
张文静, 李勇, and 王越
- Subjects
- *
SOFTWARE reliability , *COMPUTER software quality control , *MACHINE learning , *PREDICTION models , *STATISTICAL sampling - Abstract
Just-in-time software defect prediction technology enables just-in-time defect prediction at the granularity of code changes, which is crucial for improving software code quality and ensuring software reliability. Traditional static software defect prediction models suffer from 'knowledge forgetting' when processing just-in-time software data streams, leading to poor model generalization performance. To address this, this paper proposed an incremental model method based on knowledge replay for just-in-time software defect prediction. Firstly, it used the knowledge replay mechanism stores model parameters and random samples to facilitate the learning of old knowledge. Secondly, this paper used a distributed training framework to perform incremental learning on just-in-time software data streams on local devices, achieving real-time model updates through restructuring. Lastly, this paper employed the knowledge distillation technique to construct a global incremental prediction model. Experiments show that this model performs better in terms of comprehensive performance compared to common modeling algorithms while ensuring training efficiency. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. Two-step reinforcement learning for multistage strategy card game.
- Author
-
GODLEWSKI, Konrad and SAWICKI, Bartosz
- Subjects
- *
MACHINE learning , *REINFORCEMENT learning , *CARD games , *STRATEGY games , *EDUCATIONAL games - Abstract
This study introduces a two-step reinforcement learning (RL) strategy tailored for “The Lord of the Rings: The Card Game”, a complex multistage strategy card game. The research diverges from conventional RL methods by adopting a phased learning approach, beginning with a foundational learning step in a simplified version of the game and subsequently progressing to the complete, intricate game environment. This methodology notably enhances the AI agent’s adaptability and performance in the face of the unpredictable and challenging nature of the game. The paper also explores a multi-phase system where distinct RL agents are employed for various decision-making phases of the game. This approach has demonstrated remarkable improvement, with the RL agents achieving a winrate of 78.5% at the highest difficulty level. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. From COVID-19 to monkeypox: a novel predictive model for emerging infectious diseases.
- Author
-
Xu, Deren, Chan, Weng Howe, Haron, Habibollah, Nies, Hui Wen, and Moorthy, Kohbalan
- Subjects
- *
MACHINE learning , *EMERGENCY management , *COMMUNICABLE diseases , *MONKEYPOX , *PREDICTION models , *EMERGING infectious diseases - Abstract
The outbreak of emerging infectious diseases poses significant challenges to global public health. Accurate early forecasting is crucial for effective resource allocation and emergency response planning. This study aims to develop a comprehensive predictive model for emerging infectious diseases, integrating the blending framework, transfer learning, incremental learning, and the biological feature Rt to increase prediction accuracy and practicality. By transferring features from a COVID-19 dataset to a monkeypox dataset and introducing dynamically updated incremental learning techniques, the model's predictive capability in data-scarce scenarios was significantly improved. The research findings demonstrate that the blending framework performs exceptionally well in short-term (7-day) predictions. Furthermore, the combination of transfer learning and incremental learning techniques significantly enhanced the adaptability and precision, with a 91.41% improvement in the RMSE and an 89.13% improvement in the MAE. In particular, the inclusion of the Rt feature enabled the model to more accurately reflect the dynamics of disease spread, further improving the RMSE by 1.91% and the MAE by 2.17%. This study underscores the significant application potential of multimodel fusion and real-time data updates in infectious disease prediction, offering new theoretical perspectives and technical support. This research not only enriches the theoretical foundation of infectious disease prediction models but also provides reliable technical support for public health emergency responses. Future research should continue to explore integrating data from multiple sources and enhancing model generalization capabilities to further enhance the practicality and reliability of predictive tools. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Incremental learning of humanoid robot behavior from natural interaction and large language models.
- Author
-
Bärmann, Leonard, Kartmann, Rainer, Peller-Konrad, Fabian, Niehues, Jan, Waibel, Alex, Asfour, Tamim, Ruohan Wang, Yixing Gao, and Qianli Xu
- Subjects
LANGUAGE models ,MACHINE learning ,HUMAN-robot interaction ,HUMANOID robots ,KNOWLEDGE representation (Information theory) - Abstract
Natural-language dialog is key for an intuitive human-robot interaction. It can be used not only to express humans' intents but also to communicate instructions for improvement if a robot does not understand a command correctly. It is of great importance to let robots learn from such interaction experiences in an incremental way to allow them to improve their behaviors or avoid mistakes in the future. In this paper, we propose a system to achieve such incremental learning of complex high-level behavior from natural interaction and demonstrate its implementation on a humanoid robot. Our system deploys large language models (LLMs) for high-level orchestration of the robot's behavior based on the idea of enabling the LLM to generate Python statements in an interactive console to invoke both robot perception and action. Human instructions, environment observations, and execution results are fed back to the LLM, thus informing the generation of the next statement. Since an LLM can misunderstand (potentially ambiguous) user instructions, we introduce incremental learning from the interaction, which enables the system to learn from its mistakes. For that purpose, the LLM can call another LLM responsible for code-level improvements in the current interaction based on human feedback. Subsequently, we store the improved interaction in the robot's memory so that it can later be retrieved on semantically similar requests. We integrate the system in the robot cognitive architecture of the humanoid robot ARMAR-6 and evaluate our methods both quantitatively (in simulation) and qualitatively (in simulation and real-world) by demonstrating generalized incrementally learned knowledge. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Improving the performance of GPS/INS integration during GPS outage with incremental regularized LSTM learning.
- Author
-
Alaeiyan, H., Mosavi, M.R., and Ayatollahi, A.
- Subjects
GLOBAL Positioning System ,MACHINE learning ,KALMAN filtering ,INERTIAL navigation systems ,GENERALIZATION - Abstract
Global Positioning System (GPS)/Inertial Navigation System (INS) integration is a widely used technique for navigation and positioning applications. It combines the advantages of GPS and INS to provide accurate and reliable information. However, the GPS/INS integration suffers from performance degradation during a GPS outage, which occurs when natural or artificial factors block the GPS signal. The novelty of this paper is improving GPS/INS integration performance during GPS outages using Incremental Regularized LSTM (IncRLSTM) learning. Incremental learning is a learning paradigm that can be learned from streaming data online and updating the model parameters without forgetting the previous ones. Also, regularization is a technique that prevents overfitting and improves the generalization of the network by adding some constraints or penalties to the model. IncRLSTM learning models the GPS signal as a multi-objective regression and corrects the INS output with the Faded Memory Kalman filter. The results on real-world datasets significantly show the reduction of positioning errors by an average of 72 % during GPS outages and the improvement of the accuracy and robustness of GPS/INS integration by an average of 58 % compared with existing methods. Moreover, IncRLSTM presents, on average, a 10 % improvement compared to the existing methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. Oxygen Content Control in the Electroslag Remelting Process: An Incremental Learning Strategy Based on Optimized Wasserstein Generative Adversarial Network with Gradient Penalty Data Augmentation.
- Author
-
Chen, Xi, Dong, Yanwu, Jiang, Zhouhua, Liu, Yuxiao, and Wang, Jia
- Subjects
- *
ARTIFICIAL neural networks , *GENERATIVE adversarial networks , *MACHINE learning , *DATA augmentation , *BEARING steel - Abstract
Electroslag remelting (ESR) is essential for producing high‐end special steel, but its complex process and numerous influencing factors make quality control challenging. This study addresses oxygen content control during ESR using a big data machine learning approach. An incremental learning strategy is proposed based on an optimized Wasserstein generative adversarial network with gradient penalty (WGAN‐GP) for data enhancement, focusing on G20Cr2Ni4A bearing steel. The WGAN‐GP model enhances time‐series data and metadata, utilizing long short‐term memory networks, fully connected networks, and attention mechanisms. The effectiveness of data enhancement is verified using a deep neural network classifier and statistical methods. Data is divided into historical and data streams, with an incremental learning strategy based on histogram gradient boosting regression trees to prevent catastrophic forgetting and improve efficiency through knowledge distillation and real‐time hyperparameter adjustment. Results show that the data augmentation method significantly improves model generalization and accuracy in small sample metallurgy. The incremental learning strategy enhances prediction accuracy for oxygen content, contributing to better cleanliness quality of electroslag steel. This study offers a novel approach for addressing small sample challenges in metallurgical processes. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. Meta-learning for real-world class incremental learning: a transformer-based approach.
- Author
-
Kumar, Sandeep, Sharma, Amit, Shokeen, Vikrant, Azar, Ahmad Taher, Amin, Syed Umar, and Khan, Zafar Iqbal
- Subjects
- *
NATURAL language processing , *MACHINE learning , *DEEP learning , *CLASS size , *MODERN languages - Abstract
Modern natural language processing (NLP) state-of-the-art (SoTA) deep learning (DL) models have hundreds of millions of parameters, making them extremely complex. Large datasets are required for training these models, and while pretraining has reduced this requirement, human-labelled datasets are still necessary for fine-tuning. Few-shot learning (FSL) techniques, such as meta-learning, try to train models from smaller datasets to mitigate this cost. However, the tasks used to evaluate these meta-learners frequently diverge from the problems in the real world that they are meant to resolve. This work aims to apply meta-learning to a problem that is more pertinent to the real world: class incremental learning (IL). In this scenario, after completing its training, the model learns to classify newly introduced classes. One unique quality of meta-learners is that they can generalise from a small sample size to classes that have never been seen before, which makes them especially useful for class incremental learning (IL). The method describes how to emulate class IL using proxy new classes. This method allows a meta-learner to complete the task without the need for retraining. To generate predictions, the transformer-based aggregation function in a meta-learner that modifies data from examples across all classes has been proposed. The principal contributions of the model include concurrently considering the entire support and query sets, and prioritising attention to crucial samples, such as the question, to increase the significance of its impact during inference. The outcomes demonstrate that the model surpasses prevailing benchmarks in the industry. Notably, most meta-learners demonstrate significant generalisation in the context of class IL even without specific training for this task. This paper establishes a high-performing baseline for subsequent transformer-based aggregation techniques, thereby emphasising the practical significance of meta-learners in class IL. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Fuzzy Min-Max Classifier in Cybersecurity Applications.
- Author
-
Sarin, K. S., Kolomnikov, R. E., Svetlakov, M. O., and Hodashinsky, I. A.
- Abstract
A modified fuzzy min-max classifier is presented that differs from the original in the way that the hyperbox expansion operation is performed. The classifier has been tested on the solution of cybersecurity problems, such as detecting spam, phishing sites and attacks on network connections. The results of experiments results showed an improvement in the accuracy relative to the original fuzzy min-max classifier. Comparisons with six alternative incremental learning classifiers showed competitive results on the false acceptance rate, the false reject rate, and the F1-score values. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. A Class-Incremental Learning Method for Interactive Event Detection via Interaction, Contrast and Distillation.
- Author
-
Duan, Jiashun and Zhang, Xin
- Subjects
MACHINE learning ,DATA mining ,INTERACTIVE learning ,DISTILLATION ,LEARNING - Abstract
Event detection is a crucial task in information extraction. Existing research primarily focuses on machine automatic detection tasks, which often perform poorly in certain practical applications. To address this, an interactive event-detection mode of "machine recommendation-human review–machine incremental learning" was proposed. In this mode, we study a few-shot continual class-incremental learning scenario, where the challenge is to learn new-class events with limited samples while preserving memory of old class events. To tackle these challenges, we propose a class-incremental learning method for interactive event detection via Interaction, Contrast and Distillation (ICD). We design a replay strategy based on representative and confusable samples to retain the most valuable samples under limited conditions; we introduce semantic-boundary-smoothness contrastive learning for effective learning of new-class events with few samples; and we employ hierarchical distillation to mitigate catastrophic forgetting. These methods complement each other and show strong performance. Experimental results demonstrate that, in the 5-shot 5-round class incremental-learning settings on two Chinese event-detection datasets ACE and DuEE, our method achieves final recall rates of 71.48% and 90.39%, respectively, improving by 6.86% and 3.90% over the best baseline methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. Latent side-information dynamic augmentation for incremental recommendation.
- Author
-
Zhang, Jing, Shi, Jin, Duan, Jingsheng, and Ren, Yonggong
- Subjects
MACHINE learning ,NOISE - Abstract
The incremental recommendation involves updating existing models by extracting information from interaction data at current time-step, with the aim of maintaining model accuracy while addressing limitations including parameter dependencies and inefficient training. However, real-time user interaction data is often afflicted by substantial noise and invalid samples, presenting the following key challenges for incremental model updating: (1) how to effectively extract valuable new knowledge from interaction data at the current time-step to ensure model accuracy and timeliness, and (2) how to safeguard against the catastrophic forgetting of long-term stable preference information, thus preserving the model's sensitivity during cold-starts. In response to these challenges, we propose the Incremental Recommendation with Stable Latent Side-information Updating (SIIFR). This model employs a side-information augmenter to extract valuable latent side-information from user interaction behavior at time-step T, thereby sidestepping the interference caused by noisy interaction data and acquiring stable user preference. Moreover, the model utilizes rough interaction data at time-step T + 1 , in conjunction with existing side-information enhancements to achieve incremental updates of latent preferences, thereby ensuring the model's efficacy during cold-start. Furthermore, SIIFR leverages the change rate in user latent side-information to mitigate catastrophic forgetting that results in the loss of long-term stable preference information. The effectiveness of the proposed model is validated and compared against existing models using four popular incremental datasets. The model code can be achieved at: https://github.com/LNNU-computer-research-526/FR-sii. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. Leveraging Incremental Learning for Dynamic Modulation Recognition.
- Author
-
Ma, Song, Zhang, Lin, Song, Zhangli, Yu, Wei, and Liu, Tian
- Subjects
MACHINE learning ,ARTIFICIAL intelligence ,DEEP learning ,SIGNAL sampling ,PRIOR learning - Abstract
Modulation recognition is an important technology used to correctly identify the modulation modes of wireless signals and is widely used in cooperative and confrontational scenarios. Traditional modulation-recognition algorithms require the assistance of expert experiences, which constrains their applications. With the rapid development of artificial intelligence in recent years, deep learning (DL) is widely advocated for intelligent modulation recognition. Typically, DL-based modulation-recognition algorithms implicitly assume a relatively static scenario in which the signal samples of all the modulation modes can be completely collected in advance. In practical situations, the radio environment is quite dynamic and the signal samples with new modulation modes may appear sequentially, in which the current DL-based modulation-recognition algorithms may require unacceptable time and computing resource consumption to re-train the optimal model from scratch. In this study, we leveraged incremental learning (IL) and designed a novel IL-based modulation-recognition algorithm that consists of an initial stage and multiple incremental stages. The main novelty of the proposed algorithm lies in the new loss function design in each incremental stage, which combines the distillation loss of recognizing old modulation modes and the cross-entropy loss of recognizing new modulation modes. With the proposed algorithm, the knowledge of the signal samples with new modulation modes can be efficiently learned in the current stage without forgetting the knowledge learned in the previous stages. The simulation results demonstrate that the proposed algorithm could achieve a recognition accuracy close to the upper bound with a much lower computing time and it outperformed the existing IL-based benchmarks. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. Towards Generalised and Incremental Bias Mitigation in Personality Computing.
- Author
-
Jiang, Jian, Manoranjan, Viswonathan, Salam, Hanan, and Celiktutan, Oya
- Abstract
Building systems for predicting human socio-emotional states has promising applications; however, if trained on biased data, such systems could inadvertently yield biased decisions. Bias mitigation remains an open problem, which tackles the correction of a model's disparate performance over different groups defined by particular sensitive attributes (e.g., gender, age, and race). In this work, we design a novel fairness loss function named Multi-Group Parity (MGP) to provide a generalised approach for bias mitigation in personality computing. In contrast to existing works in the literature, MGP is generalised as it features four ‘multiple’ properties (4Mul): multiple tasks, multiple modalities, multiple sensitive attributes, and multi-valued attributes. Moreover, we explore how to incrementally mitigate the biases when more sensitive attributes are taken into consideration sequentially. Towards this problem, we introduce a novel algorithm that utilises an incremental learning framework to mitigate bias against one attribute data at a time without compromising past fairness. Extensive experiments on two large-scale multi-modal personality recognition datasets validate the effectiveness of our approach in achieving superior bias mitigation under the proposed four properties and incremental debiasing settings. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
43. Semi-supervised incremental domain generalization learning based on causal invariance.
- Author
-
Wang, Ning, Wang, Huiling, Yang, Shaocong, Chu, Huan, Dong, Shi, and Viriyasitavat, Wattana
- Abstract
In recent years, semi-supervised learning (SSL) methods based on pseudo-labeling algorithms have been widely applied and achieved significant success. However, most existing deep semi-supervised learning methods suffer from the problem of distribution shift between the source and target domains, as well as the issue of "cognitive bias" in pseudo labeling algorithms, where the model's errors are difficult to rectify as they accumulate through the pseudo-labeling process. This paper introduces the concept of causal invariance and proposes an incremental repeated labeling strategy with a high confidence threshold to enhance the utilization of unlabeled samples. It effectively solves the issue of distribution discrepancy between the source and target domains in the field of semi-supervised learning, as well as the problem of pseudo label "cognitive bias", thus improving the accuracy of the model. Extensive experiments on CIFAR-10, CIFAR-100, SVHN, STL-10, PACS and VLCS demonstrate that semi-supervised incremental models based on causal invariance have a significant improvement in domain generalization ability compared with state-of-the-art methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
44. PCNet: a human pose compensation network based on incremental learning for sports actions estimation
- Author
-
Jia-Hong Jiang and Nan Xia
- Subjects
Human pose estimation ,Sports activities ,Keypoints compensation ,Occlusion handling ,Feature enhancement ,Incremental learning ,Electronic computers. Computer science ,QA75.5-76.95 ,Information technology ,T58.5-58.64 - Abstract
Abstract Human pose estimation has a wide range of applications. Existing methods perform well in conventional domains, but there are certain defects when they are applied to sports activities. The first is lack of estimation of the extremity posture, making it impossible to comprehensively evaluate the movement posture; the second is insufficient occlusion handling. Therefore, we propose a human pose compensation network based on incremental learning, which obtains shared weights to extract detailed features under the premise of limited extremity training data. We propose a higher-order feature compensator (HOF-compensator) to embed the attributes of the extremity into the torso and limbs topology structure, building a complete higher-order feature. In addition, to improve the occlusion handling performance, we propose an occlusion feature enhancement attention mechanism (OFE-attention) that can identify occluded keypoints and enhance attention to occlusion areas. We design comparative experiments on three public datasets and a self-built sports dataset, achieving the highest mean accuracy among all comparative methods. In addition, we design a series of ablation analysis and visualization displays to verify that our method performs best in sports pose estimation.
- Published
- 2024
- Full Text
- View/download PDF
45. From COVID-19 to monkeypox: a novel predictive model for emerging infectious diseases
- Author
-
Deren Xu, Weng Howe Chan, Habibollah Haron, Hui Wen Nies, and Kohbalan Moorthy
- Subjects
Emerging infectious disease prediction ,Transfer learning ,Incremental learning ,Biological feature Rt ,Blending framework ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Analysis ,QA299.6-433 - Abstract
Abstract The outbreak of emerging infectious diseases poses significant challenges to global public health. Accurate early forecasting is crucial for effective resource allocation and emergency response planning. This study aims to develop a comprehensive predictive model for emerging infectious diseases, integrating the blending framework, transfer learning, incremental learning, and the biological feature Rt to increase prediction accuracy and practicality. By transferring features from a COVID-19 dataset to a monkeypox dataset and introducing dynamically updated incremental learning techniques, the model's predictive capability in data-scarce scenarios was significantly improved. The research findings demonstrate that the blending framework performs exceptionally well in short-term (7-day) predictions. Furthermore, the combination of transfer learning and incremental learning techniques significantly enhanced the adaptability and precision, with a 91.41% improvement in the RMSE and an 89.13% improvement in the MAE. In particular, the inclusion of the Rt feature enabled the model to more accurately reflect the dynamics of disease spread, further improving the RMSE by 1.91% and the MAE by 2.17%. This study underscores the significant application potential of multimodel fusion and real-time data updates in infectious disease prediction, offering new theoretical perspectives and technical support. This research not only enriches the theoretical foundation of infectious disease prediction models but also provides reliable technical support for public health emergency responses. Future research should continue to explore integrating data from multiple sources and enhancing model generalization capabilities to further enhance the practicality and reliability of predictive tools.
- Published
- 2024
- Full Text
- View/download PDF
46. Improving the performance of GPS/INS integration during GPS outage with incremental regularized LSTM learning
- Author
-
H. Alaeiyan, M.R. Mosavi, and A. Ayatollahi
- Subjects
LSTM ,Incremental learning ,GPS/INS integration ,Multi-objective regression ,Engineering (General). Civil engineering (General) ,TA1-2040 - Abstract
Global Positioning System (GPS)/Inertial Navigation System (INS) integration is a widely used technique for navigation and positioning applications. It combines the advantages of GPS and INS to provide accurate and reliable information. However, the GPS/INS integration suffers from performance degradation during a GPS outage, which occurs when natural or artificial factors block the GPS signal. The novelty of this paper is improving GPS/INS integration performance during GPS outages using Incremental Regularized LSTM (IncRLSTM) learning. Incremental learning is a learning paradigm that can be learned from streaming data online and updating the model parameters without forgetting the previous ones. Also, regularization is a technique that prevents overfitting and improves the generalization of the network by adding some constraints or penalties to the model. IncRLSTM learning models the GPS signal as a multi-objective regression and corrects the INS output with the Faded Memory Kalman filter. The results on real-world datasets significantly show the reduction of positioning errors by an average of 72 % during GPS outages and the improvement of the accuracy and robustness of GPS/INS integration by an average of 58 % compared with existing methods. Moreover, IncRLSTM presents, on average, a 10 % improvement compared to the existing methods.
- Published
- 2024
- Full Text
- View/download PDF
47. Meta-learning for real-world class incremental learning: a transformer-based approach
- Author
-
Sandeep Kumar, Amit Sharma, Vikrant Shokeen, Ahmad Taher Azar, Syed Umar Amin, and Zafar Iqbal Khan
- Subjects
Deep learning ,Few-shot learning ,Incremental learning ,Meta-learning ,Medicine ,Science - Abstract
Abstract Modern natural language processing (NLP) state-of-the-art (SoTA) deep learning (DL) models have hundreds of millions of parameters, making them extremely complex. Large datasets are required for training these models, and while pretraining has reduced this requirement, human-labelled datasets are still necessary for fine-tuning. Few-shot learning (FSL) techniques, such as meta-learning, try to train models from smaller datasets to mitigate this cost. However, the tasks used to evaluate these meta-learners frequently diverge from the problems in the real world that they are meant to resolve. This work aims to apply meta-learning to a problem that is more pertinent to the real world: class incremental learning (IL). In this scenario, after completing its training, the model learns to classify newly introduced classes. One unique quality of meta-learners is that they can generalise from a small sample size to classes that have never been seen before, which makes them especially useful for class incremental learning (IL). The method describes how to emulate class IL using proxy new classes. This method allows a meta-learner to complete the task without the need for retraining. To generate predictions, the transformer-based aggregation function in a meta-learner that modifies data from examples across all classes has been proposed. The principal contributions of the model include concurrently considering the entire support and query sets, and prioritising attention to crucial samples, such as the question, to increase the significance of its impact during inference. The outcomes demonstrate that the model surpasses prevailing benchmarks in the industry. Notably, most meta-learners demonstrate significant generalisation in the context of class IL even without specific training for this task. This paper establishes a high-performing baseline for subsequent transformer-based aggregation techniques, thereby emphasising the practical significance of meta-learners in class IL.
- Published
- 2024
- Full Text
- View/download PDF
48. Dmaf: data-model anti-forgetting for federated incremental learning.
- Author
-
Zhu, Kongshang, Xu, Jiuyun, Zhou, Liang, Li, Xiaowen, Zhao, Yingzhi, Xu, Xiangrui, and Li, Shibao
- Abstract
Federated Learning has received much attention due to its data privacy benefits, but most existing approaches assume that client classes are fixed. Clients may remove old classes and add new ones, leading to catastrophic forgetting of the model. Existing methods have limitations, such as requiring additional client storage and distillation methods becoming less effective as new classes increase. For this reason, this paper proposes the Data-Model Anti-Forgetting (DMAF) framework. Specifically, in the proposed framework, an auxiliary client and group aggregation method to mitigate catastrophic forgetting at the data level has been proposed, which does not require clients to allocate additional storage space to store synthetic data and can balance class distributions. A multi-teacher integrated knowledge distillation method was adopted to retain old class knowledge by distilling multiple teacher models and design task fusion for further tuning of the global model. Finally, this paper conducts extensive experiments on public datasets to validate the effectiveness of DMAF. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
49. PCNet: a human pose compensation network based on incremental learning for sports actions estimation.
- Author
-
Jiang, Jia-Hong and Xia, Nan
- Abstract
Human pose estimation has a wide range of applications. Existing methods perform well in conventional domains, but there are certain defects when they are applied to sports activities. The first is lack of estimation of the extremity posture, making it impossible to comprehensively evaluate the movement posture; the second is insufficient occlusion handling. Therefore, we propose a human pose compensation network based on incremental learning, which obtains shared weights to extract detailed features under the premise of limited extremity training data. We propose a higher-order feature compensator (HOF-compensator) to embed the attributes of the extremity into the torso and limbs topology structure, building a complete higher-order feature. In addition, to improve the occlusion handling performance, we propose an occlusion feature enhancement attention mechanism (OFE-attention) that can identify occluded keypoints and enhance attention to occlusion areas. We design comparative experiments on three public datasets and a self-built sports dataset, achieving the highest mean accuracy among all comparative methods. In addition, we design a series of ablation analysis and visualization displays to verify that our method performs best in sports pose estimation. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
50. Optimal Knowledge Distillation through Non-Heuristic Control of Dark Knowledge
- Author
-
Darian Onchis, Codruta Istin, and Ioan Samuila
- Subjects
dark knowledge ,knowledge distillation ,clustering ,incremental learning ,Computer engineering. Computer hardware ,TK7885-7895 - Abstract
In this paper, a method is introduced to control the dark knowledge values also known as soft targets, with the purpose of improving the training by knowledge distillation for multi-class classification tasks. Knowledge distillation effectively transfers knowledge from a larger model to a smaller model to achieve efficient, fast, and generalizable performance while retaining much of the original accuracy. The majority of deep neural models used for classification tasks append a SoftMax layer to generate output probabilities and it is usual to take the highest score and consider it the inference of the model, while the rest of the probability values are generally ignored. The focus is on those probabilities as carriers of dark knowledge and our aim is to quantify the relevance of dark knowledge, not heuristically as provided in the literature so far, but with an inductive proof on the SoftMax operational limits. These limits are further pushed by using an incremental decision tree with information gain split. The user can set a desired precision and an accuracy level to obtain a maximal temperature setting for a continual classification process. Moreover, by fitting both the hard targets and the soft targets, one obtains an optimal knowledge distillation effect that mitigates better catastrophic forgetting. The strengths of our method come from the possibility of controlling the amount of distillation transferred non-heuristically and the agnostic application of this model-independent study.
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.