Back to Search
Start Over
Overview of Long-form Document Matching: Survey of Existing Models and Their Challenges
- Source :
- Journal of Physics: Conference Series. 2171:012059
- Publication Year :
- 2022
- Publisher :
- IOP Publishing, 2022.
-
Abstract
- Long-form document matching is an important direction in the field of natural language processing and can be applied to tasks such as news recommendation and text clustering. However, long-form document matching suffers from noisiness and sparsity of semantic information in long text. Using short-form document matching methods on a long-form matching problem is not satisfactory. Long-form document matching has attracted the attention of researchers, who have proposed many effective methods. Methods for matching long texts can be divided into three categories: traditional bag-of-words-based models, traditional deep learning-based models, and pre-training-based models. This study reviews typical methods of long-form document matching, analyzes their advantages and disadvantages, and discusses possible future developments.
- Subjects :
- History
Computer Science Applications
Education
Subjects
Details
- ISSN :
- 17426596 and 17426588
- Volume :
- 2171
- Database :
- OpenAIRE
- Journal :
- Journal of Physics: Conference Series
- Accession number :
- edsair.doi...........8e9d662fe510f9f862cce31b04029632
- Full Text :
- https://doi.org/10.1088/1742-6596/2171/1/012059