Back to Search
Start Over
APPRIS 2017: principal isoforms for multiple gene sets
- Source :
- Recercat. Dipósit de la Recerca de Catalunya, instname, UPCommons. Portal del coneixement obert de la UPC, Universitat Politècnica de Catalunya (UPC), Repisalud, Instituto de Salud Carlos III (ISCIII), Nucleic Acids Research
- Publisher :
- Oxford University Press (OUP)
-
Abstract
- FUNDING National Institutes of Health [U41 HG007234, 2U41 HG007234]; Spanish Ministry of Economics and Com- petitiveness [BIO2015-67580-P]; Spanish National Institute of Bioinformatics (www.inab.org) [INB-ISCIII, PRB2 toJ.M.R.]; ProteoRed [IPT13/0001-ISCIII-SGEFI/FEDERto J.V.]; Joint BSC-IRB-CRG Program in Computational Downloaded from https://academic.oup.com/nar/article-abstract/46/D1/D213/4561658 by Spanish National Cancer Research Center user on 30 October 2018 Nucleic Acids Research, 2018, Vol. 46, Database issue D217 Biology and Award Severo Ochoa [SEV 2015-0493 to A.V.]. Funding for open access charge: U.S. Depart- ment of Health and Human Services; National Institutes of Health; National Human Genome Research Institute [2U41 HG007234]. Conflict of interest statement.None declared. The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the 'principal' isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. Sí
- Subjects :
- Models, Molecular
0301 basic medicine
Gene isoform
Protein isoform
Proteome
Protein Conformation
DATABASE
PREDICTION
Computational biology
Biology
SPLICE ISOFORMS
EVOLUTIONARY INFORMATION
Enginyeria de proteïnes
03 medical and health sciences
0302 clinical medicine
Databases, Genetic
Genetics
RefSeq
Animals
Humans
Protein Isoforms
Database Issue
Ensembl
HUMAN GENOME
TOPOLOGY
Amino Acid Sequence
COMPLEXITY
GENCODE
Alternative splicing
Enginyeria biomèdica [Àrees temàtiques de la UPC]
Reproducibility of Results
PROTEIN-CODING GENES
Proteins
Molecular Sequence Annotation
FUNCTIONALLY IMPORTANT
Alternative Splicing
030104 developmental biology
Protein engineering
UniProt
GENCODE consortium
APPRIS database
Sequence Alignment
PROJECT
030217 neurology & neurosurgery
Subjects
Details
- ISSN :
- 20156758
- Database :
- OpenAIRE
- Journal :
- Recercat. Dipósit de la Recerca de Catalunya, instname, UPCommons. Portal del coneixement obert de la UPC, Universitat Politècnica de Catalunya (UPC), Repisalud, Instituto de Salud Carlos III (ISCIII), Nucleic Acids Research
- Accession number :
- edsair.doi.dedup.....0469c978428df46b25e5c6c896722bc7