1. UniProt: a hub for protein information
- Author
-
Ursula Hinz, Prudence Mutowo, Laure Verbregue, Weizhong Li, Nadine Gruaz-Gumowski, Chantal Hulo, Hermann Zellner, Shyamala Sundaram, P Lemercier, Guoying Qi, Parit Bansal, Tony Sawford, Sebastien Gehant, Delphine Baratin, Francesco Fazzini, Monica Pozzato, Séverine Duvaud, Lai-Su L. Yeh, Nicole Redaschi, Emma Hatton-Ellis, Darren A. Natale, Damien Lieberherr, Luis Figueira, Bernd Roechert, Borisas Bursteinas, Gayatri Chavali, Brigitte Boeckmann, Cristina Casal-Casas, Baris E. Suzek, Cathy H. Wu, Paul Gane, Ghislaine Argoud-Puy, Klemens Pichler, Rachael P. Huntley, Sangya Pundir, Alan Bridge, Edouard de Castro, Benoit Bely, Kristian B. Axelsen, Emmanuel Boutet, Andre Stutz, Penelope Garmiri, Christian J. A. Sigrist, John S. Garavelli, Rolf Apweiler, Peter B. McGarvey, Patrick Masson, Maria Jesus Martin, K Sonesson, Xavier Watkins, Ioannis Xenarios, Vladimir Volynkin, Hamish McWilliam, Mark Bingley, Guillaume Keller, Hongzhan Huang, Rabie Saidi, Sylvain Poux, Tunca Doğan, Yuqi Wang, Diego Poggioli, Rodrigo Lopez, Alistair MacDougall, Kati Laiho, Qinghua Wang, W Liu, Carlos Bonilla, Duncan Legge, C. R. Vinayaka, Anne Morgat, Thierry Lombardot, Jerven Bolleman, Nevila Nouspikel, Aleksandra Shypitsyna, Emanuele Alpi, Yongxing Chen, Anne Lise Veuthey, Andrew Nightingale, Béatrice A. Cuche, Alex Bateman, Ramona Britto, Alan Wilter Sousa da Silva, Jie Luo, Lionel Breuza, Marie Claude Blatter, Elena Cibrian-Uhalte, Michel Schneider, Chuming Chen, Michele Magrane, L Famiglietti, Meher Shruti Yerramalla, Lydie Bougueleret, Vivienne Baillie Gerritsen, Anne Estreicher, Dolnide Dornevil, Catherine Rivoire, Jian Zhang, S Staehli, Andrew Peter Cowley, Tony Wardell, Ivo Pedruzzi, Andrea H. Auchincloss, Salvo Paesano, Elisabeth Gasteiger, Luis Pureza, Marc Feuermann, Leslie Arminski, Xavier D. Martin, Teresa Batista Neto, Steven Rosanoff, Florence Jungo, Sandra Orchard, Claire O'Donovan, Elisabeth Coudert, Ricardo Antunes, Sandrine Pilbout, Vicente Lara, Arnaud Gos, Reija Hieta, Manuela Pruess, Joanna Arganiska, Edward Turner, Maurizio De Giorgi, M Doche, Cecilia N. Arighi, Michael Tognolli, Leyla Jael Garcia Castro, and Lucila Aimo
- Subjects
Proteome ,Computer science ,Molecular Sequence Annotation ,Computational biology ,Accession number (bioinformatics) ,DNA sequencing ,World Wide Web ,Identifier ,Annotation ,Sequence Analysis, Protein ,Genetics ,Database Issue ,natural sciences ,UniProt ,Databases, Protein - Abstract
UniProt is an important collection of protein sequences and their annotations, which has doubled in size to 80 million sequences during the past year. This growth in sequences has prompted an extension of UniProt accession number space from 6 to 10 characters. An increasing fraction of new sequences are identical to a sequence that already exists in the database with the majority of sequences coming from genome sequencing projects. We have created a new proteome identifier that uniquely identifies a particular assembly of a species and strain or subspecies to help users track the provenance of sequences. We present a new website that has been designed using a user-experience design process. We have introduced an annotation score for all entries in UniProt to represent the relative amount of knowledge known about each protein. These scores will be helpful in identifying which proteins are the best characterized and most informative for comparative analysis. All UniProt data is provided freely and is available on the web at http://www.uniprot.org/.
- Published
- 2014