1. Clinical Trial Information Extraction with BERT
- Author
-
Iya Khalil, Xiong Liu, Murthy V. Devarakonda, and Greg L. Hersch
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer Science - Computation and Language ,Computer science ,business.industry ,Clinical study design ,computer.software_genre ,Quantitative Biology - Quantitative Methods ,Machine Learning (cs.LG) ,Set (abstract data type) ,Clinical trial ,Medical services ,Information extraction ,Named-entity recognition ,FOS: Biological sciences ,Artificial intelligence ,Baseline (configuration management) ,business ,Computation and Language (cs.CL) ,computer ,Quantitative Methods (q-bio.QM) ,Natural language processing - Abstract
Natural language processing (NLP) of clinical trial documents can be useful in new trial design. Here we identify entity types relevant to clinical trial design and propose a framework called CT-BERT for information extraction from clinical trial text. We trained named entity recognition (NER) models to extract eligibility criteria entities by fine-tuning a set of pre-trained BERT models. We then compared the performance of CT-BERT with recent baseline methods including attention-based BiLSTM and Criteria2Query. The results demonstrate the superiority of CT-BERT in clinical trial NLP., HealthNLP 2021, IEEE International Conference on Healthcare Informatics (ICHI 2021)
- Published
- 2021
- Full Text
- View/download PDF