Back to Search
Start Over
Lexical adaptation of link grammar to the biomedical sublanguage: a comparative evaluation of three approaches.
- Source :
- BMC Bioinformatics; 2006 Supplement 3, Vol. 7, pS2-9, 9p, 5 Charts, 2 Graphs
- Publication Year :
- 2006
-
Abstract
- Background: We study the adaptation of Link Grammar Parser to the biomedical sublanguage with a focus on domain terms not found in a general parser lexicon. Using two biomedical corpora, we implement and evaluate three approaches to addressing unknown words: automatic lexicon expansion, the use of morphological clues, and disambiguation using a part-of-speech tagger. We evaluate each approach separately for its effect on parsing performance and consider combinations of these approaches. Results: In addition to a 45% increase in parsing efficiency, we find that the best approach, incorporating information from a domain part-of-speech tagger, offers a statistically significant 10% relative decrease in error. Conclusion: When available, a high-quality domain part-of-speech tagger is the best solution to unknown word issues in the domain adaptation of a general parser. In the absence of such a resource, surface clues can provide remarkably good coverage and performance when tuned to the domain. The adapted parser is available under an open-source license. [ABSTRACT FROM AUTHOR]
- Subjects :
- BIOINFORMATICS
COMPUTERS in biology
LEXICON
OPEN source software
COMPARATIVE grammar
Subjects
Details
- Language :
- English
- ISSN :
- 14712105
- Volume :
- 7
- Database :
- Complementary Index
- Journal :
- BMC Bioinformatics
- Publication Type :
- Academic Journal
- Accession number :
- 28677560
- Full Text :
- https://doi.org/10.1186/1471-2105-7-S3-S2