Back to Search
Start Over
Deep neural model with self-training for scientific keyphrase extraction
- Source :
- PLoS ONE, Vol 15, Iss 5, p e0232547 (2020), PLoS ONE
- Publication Year :
- 2020
- Publisher :
- Public Library of Science (PLoS), 2020.
-
Abstract
- Scientific information extraction is a crucial step for understanding scientific publications. In this paper, we focus on scientific keyphrase extraction, which aims to identify keyphrases from scientific articles and classify them into predefined categories. We present a neural network based approach for this task, which employs the bidirectional long short-memory (LSTM) to represent the sentences in the article. On top of the bidirectional LSTM layer in our neural model, conditional random field (CRF) is used to predict the label sequence for the whole sentence. Considering the expensive annotated data for supervised learning methods, we introduce self-training method into our neural model to leverage the unlabeled articles. Experimental results on the ScienceIE corpus and ACL keyphrase corpus show that our neural model achieves promising performance without any hand-designed features and external knowledge resources. Furthermore, it efficiently incorporates the unlabeled data and achieve competitive performance compared with previous state-of-the-art systems.
- Subjects :
- Conditional random field
Computer and Information Sciences
Neural Networks
Computer science
Science
InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL
Information Storage and Retrieval
Social Sciences
computer.software_genre
Deep Learning
Learning and Memory
Word Embedding
Leverage (statistics)
Learning
Psychology
Syntax
Recurrent Neural Networks
Natural Language Processing
Grammar
Multidisciplinary
Models, Statistical
Artificial neural network
business.industry
Deep learning
Supervised learning
Publications
Cognitive Psychology
Biology and Life Sciences
Linguistics
Semantics
Information extraction
ComputingMethodologies_PATTERNRECOGNITION
Engineering and Technology
Cognitive Science
Medicine
Artificial intelligence
Neural Networks, Computer
business
Information Technology
computer
Self training
Sentence
Natural language processing
Research Article
Neuroscience
Subjects
Details
- Language :
- English
- ISSN :
- 19326203
- Volume :
- 15
- Issue :
- 5
- Database :
- OpenAIRE
- Journal :
- PLoS ONE
- Accession number :
- edsair.doi.dedup.....5d5e12ac30f41fc00ffc56d5a7e419d7