Back to Search
Start Over
GAIA: Framework Annotation of Genomic Sequence
- Source :
- Genome Research. 8:234-250
- Publication Year :
- 1998
- Publisher :
- Cold Spring Harbor Laboratory, 1998.
-
Abstract
- As increasing amounts of genomic sequence from many organisms become available, and as DNA sequences become a primary reagent in biologic investigations, the role of annotation as a prospective guide for laboratory experiments will expand rapidly. Here we describe a process of high-throughput, reliable annotation, called framework annotation, which is designed to provide a foundation for initial biologic characterization of previously unexamined sequence. To examine this concept in practice, we have constructed Genome Annotation and Information Analysis (GAIA), a prototype software architecture that implements several elements important for framework annotation. The center of GAIA consists of an annotation database and the associated data management subsystem that forms the software bus along which other components communicate. The schema for this database defines three principal concepts: (1) Entries, consisting of sequence and associated historical data; (2) Features, comprising information of biologic interest; and (3) Experiments, describing the evidence that supports Features. The database permits tracking of annotation results over time, as well as assessment of the reliability of particular results. New framework annotation is produced by CARTA, a set of autonomous sensors that perform automatic analyses and assert results into the annotation database. These results are available via a Web-based query interface that uses graphical Java applets as well as text-based HTML pages to display data at different levels of resolution and permit interactive exploration of annotation. We present results for initial application of framework annotation to a set of test sequences, demonstrating its effectiveness in providing a starting point for biologic investigation, and discuss ways in which the current prototype can be improved. The prototype is available for public use and comment at http://www.cbil.upenn.edu/gaia.
- Subjects :
- Databases, Factual
Interface (Java)
Data management
Molecular Sequence Data
Biology
Bioinformatics
computer.software_genre
Online Systems
Annotation
Software
Human Genome Project
Genetics
Animals
Humans
Amino Acid Sequence
Image retrieval
Java applet
Genetics (clinical)
Information retrieval
Base Sequence
business.industry
Software bus
Computational Biology
Minimum information required in the annotation of models
Sequence Analysis, DNA
business
computer
Subjects
Details
- ISSN :
- 15495469 and 10889051
- Volume :
- 8
- Database :
- OpenAIRE
- Journal :
- Genome Research
- Accession number :
- edsair.doi.dedup.....7046eaf00cef46a98f330786c7e09999
- Full Text :
- https://doi.org/10.1101/gr.8.3.234