Back to Search Start Over

Study on Entity Extraction Method for Pharmaceutical Instructions Based on Pretrained Models

Authors :
CHEN Zhongyong, HUANG Yongsheng, ZHANG Min, JIANG Ming
Source :
Jisuanji kexue yu tansuo, Vol 18, Iss 7, Pp 1911-1922 (2024)
Publication Year :
2024
Publisher :
Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press, 2024.

Abstract

The extraction of medical entities from drug instructions provides fundamental data for the intelligent retrieval of medication information and the construction of medical knowledge graphs, with remarkable research significance and practical value. However, the heterogeneity of medical entities in drug instructions for treating different diseases poses challenges in model training, which requires a large number of annotated samples. To address this issue, a “large model + small model” design approach is used in this research. Specifically, this research proposes a part-label named entity recognition model based on a pre-trained model, which first employs a pre-trained language model fine-tuned on a small number of samples to extract partial entities from drug instructions, and then utilizes a Transformer- based part-label model to further optimize the entity extraction results. The part-label model encodes the input text, identified partial entities, and entity labels using a planar lattice structure, extracts feature representations using Transformer, and predicts entity labels through a conditional random fields (CRF) layer. To reduce the need for annotated training data, a sample data augmentation method is proposed using entity masking strategy on labeled samples to train the part-label model. Experimental results validate the feasibility of the “large model + small model” approach in medical entity extraction, with precision (P), recall (R), and F1 score of 85.0%, 86.1%, and 85.6%, respectively, demonstrating superior performance compared with other learning methods.

Details

Language :
Chinese
ISSN :
16739418
Volume :
18
Issue :
7
Database :
Directory of Open Access Journals
Journal :
Jisuanji kexue yu tansuo
Publication Type :
Academic Journal
Accession number :
edsdoj.024877385fbe4f8cbc09f6374763a153
Document Type :
article
Full Text :
https://doi.org/10.3778/j.issn.1673-9418.2304078