1. NCI Thesaurus: a semantic model integrating cancer-related clinical and molecular information.
- Author
-
Sioutos N, de Coronado S, Haber MW, Hartel FW, Shaiu WL, and Wright LW
- Subjects
- Computational Biology methods, Humans, National Institutes of Health (U.S.), Neoplasm Proteins metabolism, Semantics, Systems Integration, United States, User-Computer Interface, Biomedical Research methods, Database Management Systems, Databases, Factual, Information Storage and Retrieval methods, Neoplasms classification, Neoplasms physiopathology, Vocabulary, Controlled
- Abstract
Over the last 8 years, the National Cancer Institute (NCI) has launched a major effort to integrate molecular and clinical cancer-related information within a unified biomedical informatics framework, with controlled terminology as its foundational layer. The NCI Thesaurus is the reference terminology underpinning these efforts. It is designed to meet the growing need for accurate, comprehensive, and shared terminology, covering topics including: cancers, findings, drugs, therapies, anatomy, genes, pathways, cellular and subcellular processes, proteins, and experimental organisms. The NCI Thesaurus provides a partial model of how these things relate to each other, responding to actual user needs and implemented in a deductive logic framework that can help maintain the integrity and extend the informational power of what is provided. This paper presents the semantic model for cancer diseases and its uses in integrating clinical and molecular knowledge, more briefly examines the models and uses for drug, biochemical pathway, and mouse terminology, and discusses limits of the current approach and directions for future work.
- Published
- 2007
- Full Text
- View/download PDF