Back to Search
Start Over
Using case-level context to classify cancer pathology reports
- Source :
- PLoS ONE, PLoS ONE, Vol 15, Iss 5, p e0232840 (2020)
- Publication Year :
- 2020
- Publisher :
- Public Library of Science (PLoS), 2020.
-
Abstract
- Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence-for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from individual reports, but also to capture aggregate information regarding the entire cancer case based off case-level context from all reports in the sequence. In this paper, we introduce a simple modular add-on for capturing case-level context that is designed to be compatible with most existing deep learning architectures for text classification on individual reports. We test our approach on a corpus of 431,433 cancer pathology reports, and we show that incorporating case-level context significantly boosts classification accuracy across six classification tasks-site, subsite, laterality, histology, behavior, and grade. We expect that with minimal modifications, our add-on can be applied towards a wide range of other clinical text-based tasks.
- Subjects :
- Pathology
Computer science
Social Sciences
Disease
Machine Learning
Database and Informatics Methods
Cognition
Learning and Memory
Mathematical and Statistical Techniques
0302 clinical medicine
Neoplasms
Sequence Tagged Site Analysis
Medicine and Health Sciences
Electronic Health Records
030212 general & internal medicine
Recurrent Neural Networks
0303 health sciences
Multidisciplinary
Statistics
Histological Techniques
Test (assessment)
Physical Sciences
Medicine
Anatomy
Information Technology
Sequence Analysis
Natural Language
Natural language
Research Article
Computer and Information Sciences
medicine.medical_specialty
Histology
Neural Networks
Bioinformatics
Science
MEDLINE
Context (language use)
Research and Analysis Methods
03 medical and health sciences
Deep Learning
Artificial Intelligence
Memory
medicine
Humans
Statistical Methods
Natural Language Processing
030304 developmental biology
business.industry
Deep learning
Biology and Life Sciences
Cancer
Linguistics
medicine.disease
Languages
Cognitive Science
Artificial intelligence
business
Mathematics
Neuroscience
Forecasting
SEER Program
Subjects
Details
- ISSN :
- 19326203
- Volume :
- 15
- Database :
- OpenAIRE
- Journal :
- PLOS ONE
- Accession number :
- edsair.doi.dedup.....2f23cf988c2e7450254e04c2d003b3f6
- Full Text :
- https://doi.org/10.1371/journal.pone.0232840