Back to Search
Start Over
BiLSTM-Attention-CRF model for entity extraction in internet recruitment data
- Source :
- Procedia Computer Science. 183:706-712
- Publication Year :
- 2021
- Publisher :
- Elsevier BV, 2021.
-
Abstract
- With the development of the Internet, online recruitment has gradually become the mainstream. Analyzing the recruitment data and exploring the internal rules of the data can help enterprises and job seekers realize human-career matching. An extremely important step in analyzing recruitment data is to extract structured information from unstructured data. Named entity recognition can effectively extract entity information from unstructured data. In recent years, a lot of work has focused on this task, but no related work has been applied named entity recognition to Internet recruitment information. Therefore, this paper proposes the BiLSTM-Attention-CRF model for Internet recruitment information, which can be used to extract skill entities in job description information. This model introduces the BiLSTM and Attention mechanism to improve the effect of entity recognition. To verify the performance of the model, the paper used crawler technology to capture the real Internet recruitment data and annotate it and obtain a real data set. In this paper, a series of experiments are conducted on this data set, and the experimental results show that the proposed model achieves the best performance.
- Subjects :
- Matching (statistics)
Information retrieval
ComputingMilieux_THECOMPUTINGPROFESSION
Computer science
business.industry
Job description
020206 networking & telecommunications
Unstructured data
02 engineering and technology
computer.software_genre
Data set
Task (computing)
Named-entity recognition
0202 electrical engineering, electronic engineering, information engineering
General Earth and Planetary Sciences
020201 artificial intelligence & image processing
The Internet
Web crawler
business
computer
General Environmental Science
Subjects
Details
- ISSN :
- 18770509
- Volume :
- 183
- Database :
- OpenAIRE
- Journal :
- Procedia Computer Science
- Accession number :
- edsair.doi...........7e9c7e4ae2e9af5a74e751bc6bd1469a