Back to Search
Start Over
Looking Back on the Past: Active Learning With Historical Evaluation Results
- Source :
- IEEE Transactions on Knowledge and Data Engineering. 34:4921-4932
- Publication Year :
- 2022
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2022.
-
Abstract
- Active learning is an effective approach for tasks with limited labeled data. It samples a small set of data to annotate actively and is widely applied in various AI tasks. It uses an iterative process, during which we utilize the current trained model to evaluate all unlabeled samples and annotate the best samples based on a specific query strategy to update the underlying model iteratively. Most existing active learning approaches rely on only the evaluation results generated by the current model and ignore the results from previous iterations. In this paper, we propose using more historical evaluation results which can provide additional information to help better select samples. First, we apply two kinds of heuristic features of the historical evaluation results, the weighted sum of and fluctuation of the historical evaluation sequence, to improve the effectiveness of sampling. Next, to use the information contained in the historical results more globally, we design a novel query strategy that learns how to select samples based on the historical sequences automatically. We also improve current state-of-the-art active learning methods by introducing historical evaluation results. Experimental results on two common NLP tasks including text classification and named entity recognition show that our methods significantly outperform current methods
- Subjects :
- Iterative and incremental development
Sequence
Heuristic
Computer science
business.industry
Active learning (machine learning)
Sampling (statistics)
02 engineering and technology
computer.software_genre
Machine learning
Small set
Computer Science Applications
Computational Theory and Mathematics
Named-entity recognition
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Labeled data
Artificial intelligence
business
computer
Information Systems
Subjects
Details
- ISSN :
- 23263865 and 10414347
- Volume :
- 34
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Knowledge and Data Engineering
- Accession number :
- edsair.doi...........6bbf68ead0e760d61d38849368afce47
- Full Text :
- https://doi.org/10.1109/tkde.2020.3045816