Back to Search
Start Over
Syntactically aware neural architectures for definition extraction
- Source :
- 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), NAACL-HLT (2)
-
Abstract
- Automatically identifying definitional knowledge in text corpora (Definition Extraction or DE) is an important task with direct applications in, among others, Automatic Glossary Generation, Taxonomy Learning, Question Answering and Semantic Search. It is generally cast as a binary classification problem between definitional and non-definitional sentences. In this paper we present a set of neural architectures combining Convolutional and Recurrent Neural Networks, which are further enriched by incorporating linguistic information via syntactic dependencies. Our experimental results in the task of sentence classification, on two benchmarking DE datasets (one generic, one domain-specific), show that these models obtain consistent state of the art results. Furthermore, we demonstrate that models trained on clean Wikipedia-like definitions can successfully be applied to more noisy domain-specific corpora.
- Subjects :
- Text corpus
Computer science
business.industry
Semantic search
02 engineering and technology
computer.software_genre
Syntax
03 medical and health sciences
0302 clinical medicine
Recurrent neural network
Rule-based machine translation
Binary classification
Taxonomy (general)
030221 ophthalmology & optometry
0202 electrical engineering, electronic engineering, information engineering
Question answering
020201 artificial intelligence & image processing
Artificial intelligence
business
computer
Natural language processing
Sentence
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), NAACL-HLT (2)
- Accession number :
- edsair.doi.dedup.....bb55e6378511e30a5fbfa980056b0c1e