1. Covariance analysis of protein families: The case of the variable domains of antibodies
- Author
-
Nicolas Hugo, Danièle Altschuh, Laurence Choulier, Virginie Lafont, Choulier, Laurence, Institut Gilbert-Laustriat : Biomolécules, Biotechnologie, Innovation Thérapeutique, and Université Louis Pasteur - Strasbourg I-Centre National de la Recherche Scientifique (CNRS)
- Subjects
Protein family ,MESH: Sequence Alignment ,MESH: Immunoglobulin kappa-Chains ,Immunoglobulin Variable Region ,MESH: Immunoglobulin Variable Region ,MESH: Algorithms ,Sequence alignment ,MESH: Amino Acid Sequence ,Computational biology ,Biochemistry ,MESH: Protein Structure, Tertiary ,Immunoglobulin kappa-Chains ,Mice ,Structure-Activity Relationship ,MESH: Structure-Activity Relationship ,Structural Biology ,Animals ,Humans ,MESH: Animals ,Covariant transformation ,Amino Acid Sequence ,MESH: Mice ,Molecular Biology ,Sequence (medicine) ,Mathematics ,Genetics ,Analysis of covariance ,MESH: Humans ,[SDV.BIBS] Life Sciences [q-bio]/Quantitative Methods [q-bio.QM] ,MESH: Models, Chemical ,Covariance ,MESH: Amino Acid Substitution ,[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM] ,Protein Structure, Tertiary ,Character (mathematics) ,Amino Acid Substitution ,Models, Chemical ,Multigene Family ,MESH: Multigene Family ,Immunoglobulin Heavy Chains ,Sequence Alignment ,Algorithms ,Word (group theory) ,MESH: Immunoglobulin Heavy Chains - Abstract
A nonrestrictive method for identifying covariance in protein families is described and applied to human and mouse germline Vκ and VH sequence alignments. Amino acids that occur at each position in a sequence alignment are divided into two sets, called a word, by generating all possible combinations of alternative amino acids. Each word is associated with a pattern of changes. Words with identical patterns identify covariant positions. In antibody variable domains, the number of words generated ranged between 1103 and 2195 depending on the alignment, of which 4 to 12 % occurred in covariant pairs. Despite the nonrestrictive character of pattern generation, covariant residues did not reflect a random selection with respect to the nature of amino acid changes and/or their spatial proximity in a reference crystallographic structure. This approach allowed the identification of a covariance signal for positions with high variability, mostly located in the outer part of the common structural framework of antibody variable domains. Covariance in these regions may reflect the existence of alternative and mutually exclusive atomic arrangements that are compatible with antibody function. The method may be of general applicability to rationalize residue variability in protein families. Proteins 2000;41:475–484. © 2000 Wiley-Liss, Inc.
- Published
- 2000
- Full Text
- View/download PDF