1. A word in your protein
- Author
-
Gonnet, Gaston H. and Benner, Steven A.
- Subjects
Amino acid sequence -- Terminology ,Biochemistry -- Terminology ,Tree structures (Computers) -- Usage ,Data structures -- Usage ,Chemicals, plastics and rubber industries ,Chemistry - Abstract
The ability of data structures to process large amounts of data is illustrated. A problem required the determination of the longest word spelled out in the protein sequence using the one-letter code for amino acids. The use of existing literature detailing the problem proved to be unsystematic and has produced a seven-letter word. However, the Patricia tree data structure, which entailed the matching of the Oxford Unabridged English Dictionary with the SwissProt protein sequence database, produced faster results. This method yielded two nine-letter words, namely, hidalgism and ensilists.
- Published
- 1993