Back to Search Start Over

Using distributional similarity to organise biomedical terminology.

Authors :
Weeds, Julie
Dowdall, James
Schneider, Gerold
Keller, Bill
Weir, David
Source :
Terminology; 2005, Vol. 11 Issue 1, p107-141, 35p
Publication Year :
2005

Abstract

The article studies an application of distributional similarity techniques to the problem of structural organization of biomedical terminology. The application domain is the relatively small GENIA corpus. Using terms that have been accurately marked-up by hand within the corpus, the problem of automatically determining semantic proximity is considered. Terminological units are defined as normalized classes of individual terms. Syntactic analysis of the corpus data is carried out using the Pro3Gres parser and provides the data required to calculate distributional similarity using a variety of measures. Evaluation is performed against a hand-crafted gold standard for this domain in the form of the GENIA ontology. The paper shows that distributional similarity can be used to predict semantic type with a good degree of accuracy, reaching an optimal value of 63.1 percent.

Details

Language :
English
ISSN :
09299971
Volume :
11
Issue :
1
Database :
Complementary Index
Journal :
Terminology
Publication Type :
Academic Journal
Accession number :
17786336