1. Multiple Transcription Factor Binding Sites Predict AID Targeting in Non-Ig Genes
- Author
-
Steven H. Kleinstein, Ashraf Khalil, Mary M. Tomayko, David G. Schatz, Man Liu, Gur Yaari, Mark J. Shlomchik, and Jamie L. Duke
- Subjects
Genome instability ,Immunology ,B-Lymphocyte Subsets ,Somatic hypermutation ,Mice, Transgenic ,Locus (genetics) ,Biology ,Article ,E-Box Elements ,Mice ,Predictive Value of Tests ,Cytidine Deaminase ,Animals ,Immunology and Allergy ,Transcription factor ,Gene ,Mice, Knockout ,Genetics ,Mice, Inbred BALB C ,Binding Sites ,Genes, Immunoglobulin ,Germinal center ,Cytidine deaminase ,DNA binding site ,Somatic Hypermutation, Immunoglobulin ,Transcription Factors - Abstract
Aberrant targeting of the enzyme activation-induced cytidine deaminase (AID) results in the accumulation of somatic mutations in ∼25% of expressed genes in germinal center B cells. Observations in Ung−/− Msh2−/− mice suggest that many other genes efficiently repair AID-induced lesions, so that up to 45% of genes may actually be targeted by AID. It is important to understand the mechanisms that recruit AID to certain genes, because this mistargeting represents an important risk for genome instability. We hypothesize that several mechanisms combine to target AID to each locus. To resolve which mechanisms affect AID targeting, we analyzed 7.3 Mb of sequence data, along with the regulatory context, from 83 genes in Ung−/− Msh2−/− mice to identify common properties of AID targets. This analysis identifies three transcription factor binding sites (E-box motifs, along with YY1 and C/EBP-β binding sites) that may work together to recruit AID. Based on previous knowledge and these newly discovered features, a classification tree model was built to predict genome-wide AID targeting. Using this predictive model, we were able to identify a set of 101 high-interest genes that are likely targets of AID.
- Published
- 2013