34 results on '"Chuzhanova NA"'
Search Results
2. A meta-analysis of single base-pair substitutions in translational termination codons ('nonstop' mutations) that cause human inherited disease
- Author
-
Hamby Stephen E, Thomas Nick ST, Cooper David N, and Chuzhanova Nadia
- Subjects
human inherited disease ,stop codon ,3'-untranslated region ,nonstop mutation ,nonstop mRNA decay ,Medicine ,Genetics ,QH426-470 - Abstract
Abstract 'Nonstop' mutations are single base-pair substitutions that occur within translational termination (stop) codons and which can lead to the continued and inappropriate translation of the mRNA into the 3'-untranslated region. We have performed a meta-analysis of the 119 nonstop mutations (in 87 different genes) known to cause human inherited disease, examining the sequence context of the mutated stop codons and the average distance to the next alternative in-frame stop codon downstream, in comparison with their counterparts from control (non-mutated) gene sequences. A paucity of alternative in-frame stop codons was noted in the immediate vicinity (0-49 nucleotides downstream) of the mutated stop codons as compared with their control counterparts (p = 7.81 × 10-4). This implies that at least some nonstop mutations with alternative stop codons in close proximity will not have come to clinical attention, possibly because they will have given rise to stable mRNAs (not subject to nonstop mRNA decay) that are translatable into proteins of near-normal length and biological function. A significant excess of downstream in-frame stop codons was, however, noted in the range 150-199 nucleotides from the mutated stop codon (p = 8.55 × 10-4). We speculate that recruitment of an alternative stop codon at greater distance from the mutated stop codon may trigger nonstop mRNA decay, thereby decreasing the amount of protein product and yielding a readily discernible clinical phenotype. Confirmation or otherwise of this postulate must await the emergence of a clearer understanding of the mechanism of nonstop mRNA decay in mammalian cells.
- Published
- 2011
- Full Text
- View/download PDF
3. An isolated case of lissencephaly caused by the insertion of a mitochondrial genome-derived DNA sequence into the 5' untranslated region of the PAFAH1B1 (LIS1) gene
- Author
-
Millar David S, Tysoe Carolyn, Lazarou Lazarus P, Pilz Daniela T, Mohammed Shehla, Anderson Katharine, Chuzhanova Nadia, Cooper David N, and Butler Rachel
- Subjects
lissencephaly ,PAFAH1B1 gene ,mitochondrial genome ,insertion ,Medicine ,Genetics ,QH426-470 - Abstract
Abstract A 130 base pair (bp) insertion (g.-8delCins130) into the 5' untranslated region of the PAFAH1B1 (LIS1) gene, seven nucleotides upstream of the translational initiation site, was detected in an isolated case of lissencephaly. The inserted DNA sequence exhibited perfect homology to two non-contiguous regions of the mitochondrial genome (8479 to 8545 and 8775 to 8835, containing portions of two genes, ATP8 and ATP6), as well as near-perfect homology (1 bp mismatch) to a nuclear mitochondrial pseudogene (NUMT) sequence located on chromosome 1p36. This lesion was not evident on polymerase chain reaction (PCR) sequence analysis of either parent, indicating that the mutation had occurred de novo in the patient. Experiments designed to distinguish between a mitochondrial and a nuclear genomic origin for the inserted DNA sequence were, however, inconclusive. Mitochondrial genome sequences from both the patient and his parents were sequenced and found to be identical to the sequence inserted into the PAFAH1B1 gene. Analysis of parental PCR products from the chromosome 1-specific NUMT were also consistent with the interpretation that the inserted sequence had originated directly from the mitochondrial genome. The chromosome 1-specific NUMT in the patient proved to be refractory to PCR analysis, however, suggesting that this region of chromosome 1 could have been deleted or rearranged. Although it remains by far the most likely scenario, in the absence of DNA sequence information from the patient's own chromosome 1-specific NUMT, we cannot unequivocally confirm that the 130 bp insertion originated from mitochondrial genome rather than from the NUMT.
- Published
- 2010
- Full Text
- View/download PDF
4. Methylation-mediated deamination of 5-methylcytosine appears to give rise to mutations causing human inherited disease in CpNpG trinucleotides, as well as in CpG dinucleotides
- Author
-
Cooper David N, Mort Matthew, Stenson Peter D, Ball Edward V, and Chuzhanova Nadia A
- Subjects
CpG dinucleotide ,CpNpGp trinucleotide ,cytosine methylation ,5-methylcytosine deamination ,mutation hot-spots ,human inherited disease ,missense/nonsense mutations ,Medicine ,Genetics ,QH426-470 - Abstract
Abstract The cytosine-guanine (CpG) dinucleotide has long been known to be a hotspot for pathological mutation in the human genome. This hypermutability is related to its role as the major site of cytosine methylation with the attendant risk of spontaneous deamination of 5-methylcytosine (5mC) to yield thymine. Cytosine methylation, however, also occurs in the context of CpNpG sites in the human genome, an unsurprising finding since the intrinsic symmetry of CpNpG renders it capable of supporting a semi-conservative model of replication of the methylation pattern. Recently, it has become clear that significant DNA methylation occurs in a CpHpG context (where H = A, C or T) in a variety of human somatic tissues. If we assume that CpHpG methylation also occurs in the germline, and that 5mC deamination can occur within a CpHpG context, then we might surmise that methylated CpHpG sites could also constitute mutation hotspots causing human genetic disease. To test this postulate, 54,625 missense and nonsense mutations from 2,113 genes causing inherited disease were retrieved from the Human Gene Mutation Database http://www.hgmd.org. Some 18.2 per cent of these pathological lesions were found to be C → T and G → A transitions located in CpG dinucleotides (compatible with a model of methylation-mediated deamination of 5mC), an approximately ten-fold higher proportion than would have been expected by chance alone. The corresponding proportion for the CpHpG trinucleotide was 9.9 per cent, an approximately two-fold higher proportion than would have been expected by chance. We therefore estimate that ~5 per cent of missense/nonsense mutations causing human inherited disease may be attributable to methylation-mediated deamination of 5mC within a CpHpG context.
- Published
- 2010
- Full Text
- View/download PDF
5. Characterisation of a functional intronic polymorphism in the human growth hormone (GHI) gene
- Author
-
Millar David S, Horan Martin, Chuzhanova Nadia A, and Cooper David N
- Subjects
growth hormone (GHI) gene ,gene expression ,protein secretion ,intronic functional polymorphism ,alternative splicing ,Medicine ,Genetics ,QH426-470 - Abstract
Abstract The +1169A allele of the A/T single nucleotide polymorphism (SNP; rs2665802), located within intron 4 of the human growth hormone I (GHI) gene, has been associated with reduced levels of circulating GH and insulin-like growth factor I, a reduced risk of colorectal cancer and a predisposition to osteoporosis. Whether this intronic SNP is itself the functional polymorphism responsible for exerting a direct effect on GHI gene expression, however, or whether it is instead in linkage disequilibrium with the functional SNP, has been an open question. The evolutionary conservation of the +1169T allele (and the surrounding intronic sequence) in the bovine genome, as well as in primate genomes, is, however, suggestive of its functionality. Although a potential alternative splice site spans the location of the +1169 SNP, polymerase chain reaction-based assays failed to yield any evidence for alternative splicing associated with either allele. To determine whether the +1169 SNP, in different allelic combinations with SNPs at -278 (G/T), -57 (T/G) and +2103 (C/T), exerts a direct effect on gene expression and/or GH secretion, we performed a series of transfections of various GHI haplotype-expressing constructs into rat GC (somatotroph) cells. The results obtained provided evidence to support the contention that the +1169A allele contributes directly to the observed reduction in both GHI gene expression and GH secretion. Part of the apparent influence of the +1169A-bearing allele on GHI gene expression and GH secretion may still, however, be attributable to alleles of additional SNPs in cis to +1169A and located within either the promoter or the 3'-flanking region.
- Published
- 2010
- Full Text
- View/download PDF
6. Genotype-phenotype associations in neurofibromatosis type 1 (NF1): an increased risk of tumor complications in patients with NF1 splice-site mutations?
- Author
-
Alkindy Adila, Chuzhanova Nadia, Kini Usha, Cooper David N, and Upadhyaya Meena
- Subjects
Neurofibromatosis type 1 ,Genotype-phenotype correlation ,Malignant peripheral nerve sheath tumors ,Brain glioma ,Increased cancer susceptibility ,Splice-site mutations ,NF1 gene ,Gender effect ,Medicine ,Genetics ,QH426-470 - Abstract
Abstract Neurofibromatosis type 1 (NF1) is a complex neurocutaneous disorder with an increased susceptibility to develop both benign and malignant tumors but with a wide spectrum of inter and intrafamilial clinical variability. The establishment of genotype-phenotype associations in NF1 is potentially useful for targeted therapeutic intervention but has generally been unsuccessful, apart from small subsets of molecularly defined patients. The objective of this study was to evaluate the clinical phenotype associated with the specific types of NF1 mutation in a retrospectively recorded clinical dataset comprising 149 NF1 mutation-known individuals from unrelated families. Each patient was assessed for ten NF1-related clinical features, including the number of café-au-lait spots, cutaneous and subcutaneous neurofibromas and the presence/absence of intertriginous skin freckling, Lisch nodules, plexiform and spinal neurofibromas, optic gliomas, other neoplasms (in particular CNS gliomas, malignant peripheral nerve sheath tumors (MPNSTs), juvenile myelomonocytic leukemia, rhabdomyosarcoma, phaechromocytoma, gastrointestinal stromal tumors, juvenile xanthogranuloma, and lipoma) and evidence of learning difficulties. Gender and age at examination were also recorded. Patients were subcategorized according to their associated NF1 germ line mutations: frame shift deletions (52), splice-site mutations (23), nonsense mutations (36), missense mutations (32) and other types of mutation (6). A significant association was apparent between possession of a splice-site mutation and the presence of brain gliomas and MPNSTs (p = 0.006). If confirmed, these findings are likely to be clinically important since up to a third of NF1 patients harbor splice-site mutations. A significant influence of gender was also observed on the number of subcutaneous neurofibromas (females, p = 0.009) and preschool learning difficulties (females, p = 0.022).
- Published
- 2012
- Full Text
- View/download PDF
7. In Silico identification of pathogenic strains of Cronobacter from Biochemical data reveals association of inositol fermentation with pathogenicity
- Author
-
Forsythe Stephen J, Joseph Susan, Hamby Stephen E, and Chuzhanova Nadia
- Subjects
Microbiology ,QR1-502 - Abstract
Abstract Background Cronobacter, formerly known as Enterobacter sakazakii, is a food-borne pathogen known to cause neonatal meningitis, septicaemia and death. Current diagnostic tests for identification of Cronobacter do not differentiate between species, necessitating time consuming 16S rDNA gene sequencing or multilocus sequence typing (MLST). The organism is ubiquitous, being found in the environment and in a wide range of foods, although there is variation in pathogenicity between Cronobacter isolates and between species. Therefore to be able to differentiate between the pathogenic and non-pathogenic strains is of interest to the food industry and regulators. Results Here we report the use of Expectation Maximization clustering to categorise 98 strains of Cronobacter as pathogenic or non-pathogenic based on biochemical test results from standard diagnostic test kits. Pathogenicity of a strain was postulated on the basis of either pathogenic symptoms associated with strain source or corresponding MLST sequence types, allowing the clusters to be labelled as containing either pathogenic or non-pathogenic strains. The resulting clusters gave good differentiation of strains into pathogenic and non-pathogenic groups, corresponding well to isolate source and MLST sequence type. The results also revealed a potential association between pathogenicity and inositol fermentation. An investigation of the genomes of Cronobacter sakazakii and C. turicensis revealed the gene for inositol monophosphatase is associated with putative virulence factors in pathogenic strains of Cronobacter. Conclusions We demonstrated a computational approach allowing existing diagnostic kits to be used to identify pathogenic strains of Cronobacter. The resulting clusters correlated well with MLST sequence types and revealed new information about the pathogenicity of Cronobacter species.
- Published
- 2011
- Full Text
- View/download PDF
8. Evolutionary hierarchies of conserved blocks in 5'-noncoding sequences of dicot rbcS genes
- Author
-
Donnison Iain S, Chuzhanova Nadia A, Weeks Katie E, and Scott Ian M
- Subjects
Evolution ,QH359-425 - Abstract
Abstract Background Evolutionary processes in gene regulatory regions are major determinants of organismal evolution, but exceptionally challenging to study. We explored the possibilities of evolutionary analysis of phylogenetic footprints in 5'-noncoding sequences (NCS) from 27 ribulose-1,5-bisphosphate carboxylase small subunit (rbcS) genes, from three dicot families (Brassicaceae, Fabaceae and Solanaceae). Results Sequences of up to 400 bp encompassing proximal promoter and 5'-untranslated regions were analyzed. We conducted phylogenetic footprinting by several alternative methods: generalized Lempel-Ziv complexity (CLZ), multiple alignments with DIALIGN and ALIGN-M, and the MOTIF SAMPLER Gibbs sampling algorithm. These tools collectively defined 36 conserved blocks of mean length 12.8 bp. On average, 12.5 blocks were found in each 5'-NCS. The blocks occurred in arrays whose relative order was absolutely conserved, confirming the existence of 'conserved modular arrays' in promoters. Identities of half of the blocks confirmed past rbcS research, including versions of the I-box, G-box, and GT-1 sites such as Box II. Over 90% of blocks overlapped DNase-protected regions in tomato 5'-NCS. Regions characterized by low CLZ in sliding-window analyses were also frequently associated with DNase-protection. Blocks could be assigned to evolutionary hierarchies based on taxonomic distribution and estimated age. Lineage divergence dates implied that 13 blocks found in all three plant families were of Cretaceous antiquity, while other family-specific blocks were much younger. Blocks were also dated by formation of multigene families, using genome and coding sequence information. Dendrograms of evolutionary relations of the 5'-NCS were produced by several methods, including: cluster analysis using pairwise CLZ values; evolutionary trees of DIALIGN sequence alignments; and cladistic analysis of conserved blocks. Conclusion Dicot 5'-NCS contain conserved modular arrays of recurrent sequence blocks, which are coincident with functional elements. These blocks are amenable to evolutionary interpretation as hierarchies in which ancient, taxonomically widespread blocks can be distinguished from more recent, taxon-specific ones.
- Published
- 2007
- Full Text
- View/download PDF
9. Identification and molecular characterization of six novel mutations in the UDP-N-acetylglucosamine-1-phosphotransferase gamma subunit (GNPTG) gene in patients with mucolipidosis III gamma
- Author
-
Emanuele Persichetti, David Neil Cooper, Maja Di Rocco, Camillo Rosano, Andrea Dardis, Miriam Rigoldi, Nadia Chuzhanova, Barbara Tappino, Michela Sibilio, Giancarlo Parenti, Anna Lisa E. Montalvo, Nicholas Stuart Tudor Thomas, Chiara Balducci, Mirella Filocamo, Sandra Pohl, Bruno Bembi, Tommaso Beccari, Rossella Parini, Aldo Orlacchio, Silvia Dominissini, Silvia Paciotti, Persichetti, E, Chuzhanova, Na, Dardis, A, Tappino, B, Pohl, S, Thomas, N, Rosano, C, Balducci, C, Paciotti, S, Dominissini, S, Montalvo, Al, Sibilio, Michelina, Parini, R, Rigoldi, M, Di Rocco, M, Parenti, Giancarlo, Orlacchio, A, Bembi, B, Cooper, Dn, Filocamo, M, and Beccari, T.
- Subjects
Adult ,Male ,Adolescent ,Genotype ,RNA Stability ,Nonsense-mediated decay ,Molecular Sequence Data ,Mutation, Missense ,Transferases (Other Substituted Phosphate Groups) ,nonsense-mediated mRNA decay ,Biology ,medicine.disease_cause ,GNPTG ,MLIII ,Exon ,Mucolipidoses ,Genetics ,medicine ,Coding region ,Direct repeat ,Humans ,mucolipidosis type III ,RNA, Messenger ,Child ,Genetics (clinical) ,Sequence Deletion ,Mutation ,Base Sequence ,Reverse Transcriptase Polymerase Chain Reaction ,Intron ,Fibroblasts ,microdeletions ,Molecular biology ,Alternative Splicing ,Protein Subunits ,Codon, Nonsense ,RNA splicing ,Female ,RNA Splice Sites ,mucolipidosis type III, MLIII, GNPTG, microdeletions, nonsense-mediated mRNA decay - Abstract
Mucolipidosis type III (MLIII) is an autosomal recessive disorder affecting lysosomal hydrolase trafficking. In a study of 10 patients from seven families with a clinical phenotype and enzymatic diagnosis of MLIII, six novel GNPTG gene mutations were identified. These included missense (p.T286M) and nonsense (p.W111X) mutations and a transition in the obligate AG-dinucleotide of the intron 8 acceptor splice site (c.610–2A>G). Three microdeletions were also identified, two of which (c.611delG and c.640_667del28) were located within the coding region whereas one (c.609+28_610-16del) was located entirely within intron 8. RT-PCR analysis of the c.610–2A>G transition demonstrated that the change altered splicing, leading to the production of two distinct aberrantly spliced forms, viz. the skipping of exon 9 (p.G204_K247del) or the retention of introns 8 and 9 (p.G204VfsX28). RT-PCR analysis, performed on a patient homozygous for the intronic deletion (c.609+28_610-16del), failed to detect any GNPTG RNA transcripts. To determine whether c.609+28_610-16del allele-derived transcripts were subject to nonsense-mediated mRNA decay (NMD), patient fibroblasts were incubated with the protein synthesis inhibitor anisomycin. An RT-PCR fragment retaining 43 bp of intron 8 was consistently detected suggesting that the 33-bp genomic deletion had elicited NMD. Quantitative real-time PCR and GNPTG western blot analysis confirmed that the homozygous microdeletion p.G204VfsX17 had elicited NMD resulting in failure to synthesize GNPTG protein. Analysis of the sequences surrounding the microdeletion breakpoints revealed either intrinsic repetitivity of the deleted region or short direct repeats adjacent to the breakpoint junctions. This is consistent with these repeats having mediated the microdeletions via replication slippage and supports the view that the mutational spectrum of the GNPTG gene is strongly influenced by the properties of the local DNA sequence environment.
- Published
- 2009
10. Non-B DNA-forming sequences and WRN deficiency independently increase the frequency of base substitution in human cells.
- Author
-
Bacolla A, Wang G, Jain A, Chuzhanova NA, Cer RZ, Collins JR, Cooper DN, Bohr VA, and Vasquez KM
- Subjects
- Cell Line, Tumor, DNA, Z-Form genetics, Gene Knockdown Techniques, Humans, Lung Neoplasms genetics, Werner Syndrome Helicase, DNA, Z-Form metabolism, Exodeoxyribonucleases, Genomic Instability, Lung Neoplasms metabolism, RecQ Helicases, Sequence Deletion
- Abstract
Although alternative DNA secondary structures (non-B DNA) can induce genomic rearrangements, their associated mutational spectra remain largely unknown. The helicase activity of WRN, which is absent in the human progeroid Werner syndrome, is thought to counteract this genomic instability. We determined non-B DNA-induced mutation frequencies and spectra in human U2OS osteosarcoma cells and assessed the role of WRN in isogenic knockdown (WRN-KD) cells using a supF gene mutation reporter system flanked by triplex- or Z-DNA-forming sequences. Although both non-B DNA and WRN-KD served to increase the mutation frequency, the increase afforded by WRN-KD was independent of DNA structure despite the fact that purified WRN helicase was found to resolve these structures in vitro. In U2OS cells, ∼70% of mutations comprised single-base substitutions, mostly at G·C base-pairs, with the remaining ∼30% being microdeletions. The number of mutations at G·C base-pairs in the context of NGNN/NNCN sequences correlated well with predicted free energies of base stacking and ionization potentials, suggesting a possible origin via oxidation reactions involving electron loss and subsequent electron transfer (hole migration) between neighboring bases. A set of ∼40,000 somatic mutations at G·C base pairs identified in a lung cancer genome exhibited similar correlations, implying that hole migration may also be involved. We conclude that alternative DNA conformations, WRN deficiency and lung tumorigenesis may all serve to increase the mutation rate by promoting, through diverse pathways, oxidation reactions that perturb the electron orbitals of neighboring bases. It follows that such "hole migration" is likely to play a much more widespread role in mutagenesis than previously anticipated.
- Published
- 2011
- Full Text
- View/download PDF
11. Intrachromosomal mitotic nonallelic homologous recombination is the major molecular mechanism underlying type-2 NF1 deletions.
- Author
-
Roehl AC, Vogt J, Mussotter T, Zickler AN, Spöti H, Högel J, Chuzhanova NA, Wimmer K, Kluwe L, Mautner VF, Cooper DN, and Kehrer-Sawatzki H
- Subjects
- Carrier Proteins genetics, Computational Biology, Genes, Neurofibromatosis 1, Humans, Microsatellite Repeats genetics, Neoplasm Proteins, Neurofibromatosis 1 genetics, Nuclear Proteins genetics, Oligonucleotide Array Sequence Analysis methods, Polycomb Repressive Complex 2, Polymorphism, Single Nucleotide, Transcription Factors, Chromosomes, Human, Pair 17 genetics, Mitosis genetics, Neurofibromin 1 genetics, Recombination, Genetic, Sequence Deletion
- Abstract
Nonallelic homologous recombination (NAHR) is responsible for the recurrent rearrangements that give rise to genomic disorders. Although meiotic NAHR has been investigated in multiple contexts, much less is known about mitotic NAHR despite its importance for tumorigenesis. Because type-2 NF1 microdeletions frequently result from mitotic NAHR, they represent a good model in which to investigate the features of mitotic NAHR. We have used microsatellite analysis and SNP arrays to distinguish between the various alternative recombinational possibilities, thereby ascertaining that 17 of 18 type-2 NF1 deletions, with breakpoints in the SUZ12 gene and its highly homologous pseudogene, originated via intrachromosomal recombination. This high proportion of intrachromosomal NAHR causing somatic type-2 NF1 deletions contrasts with the interchromosomal origin of germline type-1 NF1 microdeletions, whose breakpoints are located within the NF1-REPs (low-copy repeats located adjacent to the SUZ12 sequences). Further, meiotic NAHR causing type-1 NF1 deletions occurs within recombination hotspots characterized by high GC-content and DNA duplex stability, whereas the type-2 breakpoints associated with the mitotic NAHR events investigated here do not cluster within hotspots and are located within regions of significantly lower GC-content and DNA stability. Our findings therefore point to fundamental mechanistic differences between the determinants of mitotic and meiotic NAHR., (Hum Mutat 31:1163-1173, 2010. © 2010 Wiley-Liss, Inc.)
- Published
- 2010
- Full Text
- View/download PDF
12. A novel third type of recurrent NF1 microdeletion mediated by nonallelic homologous recombination between LRRC37B-containing low-copy repeats in 17q11.2.
- Author
-
Bengesser K, Cooper DN, Steinmann K, Kluwe L, Chuzhanova NA, Wimmer K, Tatagiba M, Tinschert S, Mautner VF, and Kehrer-Sawatzki H
- Subjects
- Adult, Animals, Child, Chromosome Mapping, Female, Gene Deletion, Humans, Hybrid Cells, In Situ Hybridization, Fluorescence, Male, Mice, Recombination, Genetic, Segmental Duplications, Genomic genetics, Chromosome Deletion, Chromosomes, Human, Pair 17 genetics, Neurofibromatosis 1 genetics, Neurofibromin 1 genetics
- Abstract
Large microdeletions encompassing the neurofibromatosis type-1 (NF1) gene and its flanking regions at 17q11.2 belong to the group of genomic disorders caused by aberrant recombination between segmental duplications. The most common NF1 microdeletions (type-1) span 1.4-Mb and have breakpoints located within NF1-REPs A and C, low-copy repeats (LCRs) containing LRRC37-core duplicons. We have identified a novel type of recurrent NF1 deletion mediated by nonallelic homologous recombination (NAHR) between the highly homologous NF1-REPs B and C. The breakpoints of these approximately 1.0-Mb ("type-3") NF1 deletions were characterized at the DNA sequence level in three unrelated patients. Recombination regions, spanning 275, 180, and 109-bp, respectively, were identified within the LRRC37B-P paralogues of NF1-REPs B and C, and were found to contain sequences capable of non-B DNA formation. Both LCRs contain LRRC37-core duplicons, abundant and highly dynamic sequences in the human genome. NAHR between LRRC37-containing LCRs at 17q21.31 is known to have mediated the 970-kb polymorphic inversions of the MAPT-locus that occurred independently in different primate species, but also underlies the syndromes associated with recurrent 17q21.31 microdeletions and reciprocal microduplications. The novel NF1 microdeletions reported here provide further evidence for the unusually high recombinogenic potential of LRRC37-containing LCRs in the human genome.
- Published
- 2010
- Full Text
- View/download PDF
13. Molecular characterization of 22 novel UDP-N-acetylglucosamine-1-phosphate transferase alpha- and beta-subunit (GNPTAB) gene mutations causing mucolipidosis types IIalpha/beta and IIIalpha/beta in 46 patients.
- Author
-
Tappino B, Chuzhanova NA, Regis S, Dardis A, Corsolini F, Stroppiano M, Tonoli E, Beccari T, Rosano C, Mucha J, Blanco M, Szlago M, Di Rocco M, Cooper DN, and Filocamo M
- Subjects
- Adolescent, Adult, Animals, COS Cells, Child, Child, Preschool, Chlorocebus aethiops, Codon, Nonsense, DNA Mutational Analysis, Genetic Association Studies, Genotype, Humans, Infant, Mutation, Missense, Sequence Deletion, Mucolipidoses genetics, Mutation, Transferases (Other Substituted Phosphate Groups) genetics
- Abstract
Mutational analysis of the GNPTAB gene was performed in 46 apparently unrelated patients with mucolipidosis IIalpha/beta or IIIalpha/beta, characterized by the mistargeting of multiple lysosomal enzymes as a consequence of a UDP-GlcNAc-1-phosphotransferase defect. The GNPTAB mutational spectrum comprised 25 distinct mutant alleles, 22 of which were novel, including 3 nonsense mutations (p.Q314X, p.R375X, p.Q507X), 5 missense mutations (p.I403T, p.C442Y, p.C461G, p.Q926P, p.L1001P), 6 microduplications (c.749dupA, c.857dupA, c.1191_1194dupGCTG, c.1206dupT, c.1331dupG, c.2220_2221dupGA) and 8 microdeletions (c.755_759delCCTCT, c.1399delG, c.1959_1962delTAGT, c.1965delC, c.2550_2554delGAAAA, c.3443_3446delTTTG, c.3487_3490delACAG, c.3523_3529delATGTTCC). All micro-duplications/deletions were predicted to result in the premature termination of translation. A novel exonic SNP (c.303G>A; E101E) was identified which is predicted to create an SFRS1 (SF2/ASF) binding site that may be of potential functional/clinical relevance. This study of mutations in the GNPTAB gene, the largest yet reported, extends our knowledge of the mutational heterogeneity evident in MLIIalpha/beta/MLIIIalpha/beta.
- Published
- 2009
- Full Text
- View/download PDF
14. Identification and molecular characterization of six novel mutations in the UDP-N-acetylglucosamine-1-phosphotransferase gamma subunit (GNPTG) gene in patients with mucolipidosis III gamma.
- Author
-
Persichetti E, Chuzhanova NA, Dardis A, Tappino B, Pohl S, Thomas NS, Rosano C, Balducci C, Paciotti S, Dominissini S, Montalvo AL, Sibilio M, Parini R, Rigoldi M, Di Rocco M, Parenti G, Orlacchio A, Bembi B, Cooper DN, Filocamo M, and Beccari T
- Subjects
- Adolescent, Adult, Alternative Splicing genetics, Base Sequence, Child, Codon, Nonsense genetics, Female, Fibroblasts enzymology, Fibroblasts pathology, Genotype, Humans, Male, Molecular Sequence Data, Mutation, Missense genetics, RNA Splice Sites genetics, RNA Stability genetics, RNA, Messenger genetics, RNA, Messenger metabolism, Reverse Transcriptase Polymerase Chain Reaction, Sequence Deletion, Mucolipidoses enzymology, Mucolipidoses genetics, Mutation genetics, Protein Subunits genetics, Transferases (Other Substituted Phosphate Groups) genetics
- Abstract
Mucolipidosis type III (MLIII) is an autosomal recessive disorder affecting lysosomal hydrolase trafficking. In a study of 10 patients from seven families with a clinical phenotype and enzymatic diagnosis of MLIII, six novel GNPTG gene mutations were identified. These included missense (p.T286M) and nonsense (p.W111X) mutations and a transition in the obligate AG-dinucleotide of the intron 8 acceptor splice site (c.610-2A>G). Three microdeletions were also identified, two of which (c.611delG and c.640_667del28) were located within the coding region whereas one (c.609+28_610-16del) was located entirely within intron 8. RT-PCR analysis of the c.610-2A>G transition demonstrated that the change altered splicing, leading to the production of two distinct aberrantly spliced forms, viz. the skipping of exon 9 (p.G204_K247del) or the retention of introns 8 and 9 (p.G204VfsX28). RT-PCR analysis, performed on a patient homozygous for the intronic deletion (c.609+28_610-16del), failed to detect any GNPTG RNA transcripts. To determine whether c.609+28_610-16del allele-derived transcripts were subject to nonsense-mediated mRNA decay (NMD), patient fibroblasts were incubated with the protein synthesis inhibitor anisomycin. An RT-PCR fragment retaining 43 bp of intron 8 was consistently detected suggesting that the 33-bp genomic deletion had elicited NMD. Quantitative real-time PCR and GNPTG western blot analysis confirmed that the homozygous microdeletion p.G204VfsX17 had elicited NMD resulting in failure to synthesize GNPTG protein. Analysis of the sequences surrounding the microdeletion breakpoints revealed either intrinsic repetitivity of the deleted region or short direct repeats adjacent to the breakpoint junctions. This is consistent with these repeats having mediated the microdeletions via replication slippage and supports the view that the mutational spectrum of the GNPTG gene is strongly influenced by the properties of the local DNA sequence environment.
- Published
- 2009
- Full Text
- View/download PDF
15. Cruciform-forming inverted repeats appear to have mediated many of the microinversions that distinguish the human and chimpanzee genomes.
- Author
-
Kolb J, Chuzhanova NA, Högel J, Vasquez KM, Cooper DN, Bacolla A, and Kehrer-Sawatzki H
- Subjects
- Animals, Base Sequence, Chromosome Breakage, Computational Biology methods, Evolution, Molecular, Humans, Models, Genetic, Molecular Sequence Data, Recombination, Genetic, Reproducibility of Results, Sequence Analysis, DNA, Chromosome Inversion, DNA, Cruciform genetics, Genome, Human, Inverted Repeat Sequences genetics, Pan troglodytes genetics
- Abstract
Submicroscopic inversions have contributed significantly to the genomic divergence between humans and chimpanzees over evolutionary time. Those microinversions which are flanked by segmental duplications (SDs) are presumed to have originated via non-allelic homologous recombination between SDs arranged in inverted orientation. However, the nature of the mechanisms underlying those inversions which are not flanked by SDs remains unclear. We have investigated 35 such inversions, ranging in size from 51-nt to 22056-nt, with the goal of characterizing the DNA sequences in the breakpoint-flanking regions. Using the macaque genome as an outgroup, we determined the lineage specificity of these inversions and noted that the majority (N = 31; 89%) were associated with deletions (of length between 1-nt and 6754-nt) immediately adjacent to one or both inversion breakpoints. Overrepresentations of both direct and inverted repeats, >or= 6-nt in length and capable of non-B DNA structure formation, were noted in the vicinity of breakpoint junctions suggesting that these repeats could have contributed to double strand breakage. Inverted repeats capable of cruciform structure formation were also found to be a common feature of the inversion breakpoint-flanking regions, consistent with these inversions having originated through the resolution of Holliday junction-like cruciforms. Sequences capable of non-B DNA structure formation have previously been implicated in promoting gross deletions and translocations causing human genetic disease. We conclude that non-B DNA forming sequences may also have promoted the occurrence of mutations in an evolutionary context, giving rise to at least some of the inversion/deletions which now serve to distinguish the human and chimpanzee genomes.
- Published
- 2009
- Full Text
- View/download PDF
16. A meta-analysis of nonsense mutations causing human genetic disease.
- Author
-
Mort M, Ivanov D, Cooper DN, and Chuzhanova NA
- Subjects
- Codon, Humans, RNA Stability, Codon, Nonsense, Databases, Genetic, Genetic Diseases, Inborn
- Abstract
Nonsense mutations account for approximately 11% of all described gene lesions causing human inherited disease and approximately 20% of disease-associated single-basepair substitutions affecting gene coding regions. Pathological nonsense mutations resulting in TGA (38.5%), TAG (40.4%), and TAA (21.1%) occur in different proportions to naturally occurring stop codons. Of the 23 different nucleotide substitutions giving rise to nonsense mutations, the most frequent are CGA --> TGA (21%; resulting from methylation-mediated deamination) and CAG --> TAG (19%). The differing nonsense mutation frequencies are largely explicable in terms of variable nucleotide substitution rates such that it is unnecessary to invoke differential translational termination efficiency or differential codon usage. Some genes are characterized by numerous nonsense mutations but relatively few if any missense mutations (e.g., CHM) whereas other genes exhibit many missense mutations but few if any nonsense mutations (e.g., PSEN1). Genes in the latter category have a tendency to encode proteins characterized by multimer formation. Consistent with the operation of a clinical selection bias, genes exhibiting an excess of nonsense mutations are also likely to display an excess of frameshift mutations. Tumor suppressor (TS) genes exhibit a disproportionate number of nonsense mutations while most mutations in oncogenes are missense. A total of 12% of somatic nonsense mutations in TS genes were found to occur recurrently in the hypermutable CpG dinucleotide. In a comparison of somatic and germline mutational spectra for 17 TS genes, approximately 43% of somatic nonsense mutations had counterparts in the germline (rising to 98% for CpG mutations). Finally, the proportion of disease-causing nonsense mutations predicted to elicit nonsense-mediated mRNA decay (NMD) is significantly higher (P=1.56 x 10(-9)) than among nonobserved (potential) nonsense mutations, implying that nonsense mutations that elicit NMD are more likely to come to clinical attention.
- Published
- 2008
- Full Text
- View/download PDF
17. Type 2 NF1 deletions are highly unusual by virtue of the absence of nonallelic homologous recombination hotspots and an apparent preference for female mitotic recombination.
- Author
-
Steinmann K, Cooper DN, Kluwe L, Chuzhanova NA, Senger C, Serra E, Lazaro C, Gilaberte M, Wimmer K, Mautner VF, and Kehrer-Sawatzki H
- Subjects
- Adolescent, Adult, Carrier Proteins genetics, Chromosomes, Human, Pair 17, Female, Humans, Male, Mosaicism, Neoplasm Proteins, Nuclear Proteins genetics, Polycomb Repressive Complex 2, Recombination, Genetic, Sex Characteristics, Transcription Factors, Mitosis genetics, Neurofibromatosis 1 genetics, Neurofibromin 1 deficiency, Neurofibromin 1 genetics, Sequence Deletion
- Abstract
Approximately 5% of patients with neurofibromatosis type 1 (NF1) exhibit gross deletions that encompass the NF1 gene and its flanking regions. The breakpoints of the common 1.4-Mb (type 1) deletions are located within low-copy repeats (NF1-REPs) and cluster within a 3.4-kb hotspot of nonallelic homologous recombination (NAHR). Here, we present the first comprehensive breakpoint analysis of type 2 deletions, which are a second type of recurring NF1 gene deletion. Type 2 deletions span 1.2 Mb and are characterized by breakpoints located within the SUZ12 gene and its pseudogene, which closely flank the NF1-REPs. Breakpoint analysis of 13 independent type 2 deletions did not reveal any obvious hotspots of NAHR. However, an overrepresentation of polypyrimidine/polypurine tracts and triplex-forming sequences was noted in the breakpoint regions that could have facilitated NAHR. Intriguingly, all 13 type 2 deletions identified so far are characterized by somatic mosaicism, which indicates a positional preference for mitotic NAHR within the NF1 gene region. Indeed, whereas interchromosomal meiotic NAHR occurs between the NF1-REPs giving rise to type 1 deletions, NAHR during mitosis appears to occur intrachromosomally between the SUZ12 gene and its pseudogene, thereby generating type 2 deletions. Such a clear distinction between the preferred sites of mitotic versus meiotic NAHR is unprecedented in any other genomic disorder induced by the local genomic architecture. Additionally, 12 of the 13 mosaic type 2 deletions were found in females. The marked female preponderance among mosaic type 2 deletions contrasts with the equal sex distribution noted for type 1 and/or atypical NF1 deletions. Although an influence of chromatin structure was strongly suspected, no sex-specific differences in the methylation pattern exhibited by the SUZ12 gene were apparent that could explain the higher rate of mitotic recombination in females.
- Published
- 2007
- Full Text
- View/download PDF
18. Evolutionary hierarchies of conserved blocks in 5'-noncoding sequences of dicot rbcS genes.
- Author
-
Weeks KE, Chuzhanova NA, Donnison IS, and Scott IM
- Subjects
- 5' Untranslated Regions genetics, Brassicaceae enzymology, DNA Footprinting, Evolution, Molecular, Fabaceae enzymology, Phylogeny, Promoter Regions, Genetic, Solanaceae enzymology, Brassicaceae genetics, DNA, Plant genetics, Fabaceae genetics, Genes, Plant, Ribulose-Bisphosphate Carboxylase genetics, Solanaceae genetics
- Abstract
Background: Evolutionary processes in gene regulatory regions are major determinants of organismal evolution, but exceptionally challenging to study. We explored the possibilities of evolutionary analysis of phylogenetic footprints in 5'-noncoding sequences (NCS) from 27 ribulose-1,5-bisphosphate carboxylase small subunit (rbcS) genes, from three dicot families (Brassicaceae, Fabaceae and Solanaceae)., Results: Sequences of up to 400 bp encompassing proximal promoter and 5'-untranslated regions were analyzed. We conducted phylogenetic footprinting by several alternative methods: generalized Lempel-Ziv complexity (CLZ), multiple alignments with DIALIGN and ALIGN-M, and the MOTIF SAMPLER Gibbs sampling algorithm. These tools collectively defined 36 conserved blocks of mean length 12.8 bp. On average, 12.5 blocks were found in each 5'-NCS. The blocks occurred in arrays whose relative order was absolutely conserved, confirming the existence of 'conserved modular arrays' in promoters. Identities of half of the blocks confirmed past rbcS research, including versions of the I-box, G-box, and GT-1 sites such as Box II. Over 90% of blocks overlapped DNase-protected regions in tomato 5'-NCS. Regions characterized by low CLZ in sliding-window analyses were also frequently associated with DNase-protection. Blocks could be assigned to evolutionary hierarchies based on taxonomic distribution and estimated age. Lineage divergence dates implied that 13 blocks found in all three plant families were of Cretaceous antiquity, while other family-specific blocks were much younger. Blocks were also dated by formation of multigene families, using genome and coding sequence information. Dendrograms of evolutionary relations of the 5'-NCS were produced by several methods, including: cluster analysis using pairwise CLZ values; evolutionary trees of DIALIGN sequence alignments; and cladistic analysis of conserved blocks., Conclusion: Dicot 5'-NCS contain conserved modular arrays of recurrent sequence blocks, which are coincident with functional elements. These blocks are amenable to evolutionary interpretation as hierarchies in which ancient, taxonomically widespread blocks can be distinguished from more recent, taxon-specific ones.
- Published
- 2007
- Full Text
- View/download PDF
19. New screening software shows that most recent large 16S rRNA gene clone libraries contain chimeras.
- Author
-
Ashelford KE, Chuzhanova NA, Fry JC, Jones AJ, and Weightman AJ
- Subjects
- Archaea classification, Archaea genetics, Bacteria classification, Bacteria genetics, Base Sequence, Chimera genetics, Cloning, Molecular, DNA, Archaeal genetics, DNA, Bacterial genetics, DNA, Ribosomal genetics, Marine Biology, Molecular Sequence Data, RNA, Archaeal genetics, RNA, Bacterial genetics, Gene Library, RNA, Ribosomal, 16S genetics, Software
- Abstract
A new computer program, called Mallard, is presented for screening entire 16S rRNA gene libraries of up to 1,000 sequences for chimeras and other artifacts. Written in the Java computer language and capable of running on all major operating systems, the program provides a novel graphical approach for visualizing phylogenetic relationships among 16S rRNA gene sequences. To illustrate its use, we analyzed most of the large libraries of cloned bacterial 16S rRNA gene sequences submitted to the public repository during 2005. Defining a large library as one containing 100 or more sequences of 1,200 bases or greater, we screened 25 of the 28 libraries and found that all but three contained substantial anomalies. Overall, 543 anomalous sequences were found. The average anomaly content per clone library was 9.0%, 4% higher than that previously estimated for the public repository overall. In addition, 90.8% of anomalies had characteristic chimeric patterns, a rise of 25.4% over that found previously. One library alone was found to contain 54 chimeras, representing 45.8% of its content. These figures far exceed previous estimates of artifacts within public repositories and further highlight the urgent need for all researchers to adequately screen their libraries prior to submission. Mallard is freely available from our website at http://www.cardiff.ac.uk/biosi/research/biosoft/.
- Published
- 2006
- Full Text
- View/download PDF
20. In silico discrimination of single nucleotide polymorphisms and pathological mutations in human gene promoter regions by means of local DNA sequence context and regularity.
- Author
-
Khan IA, Mort M, Buckland PR, O'Donovan MC, Cooper DN, and Chuzhanova NA
- Subjects
- Base Sequence, Binding Sites, Databases, Nucleic Acid, Genome, Human, Humans, Transcription Factors genetics, Algorithms, Genetic Predisposition to Disease, Mutation, Polymorphism, Single Nucleotide, Promoter Regions, Genetic, Sequence Analysis, DNA methods
- Abstract
DNA sequence features were sought that could be used for the in silico ascertainment of the likely functional consequences of single nucleotide changes in human gene promoter regions. To identify relevant features of the local DNA sequence context, we transformed into consensus tables the nucleotide composition of sequences flanking 101 promoter SNPs of type C<-->T or A<-->G, defined empirically as being either 'functional' or 'non-functional' on the basis of a standardised reporter gene assay. The similarity of a given sequence to these consensus tables was then measured by means of the Shapiro-Senapathy score. A decision rule with the potential to discriminate between empirically ascertained functional and non-functional SNPs was proposed that potentiated discrimination between functional and non-functional SNPs with a sensitivity of 80% and a specificity of 20%. Two further datasets (viz. disease-associated SNPs of types A<-->G and C<-->T (N = 75) and pathological promoter mutations (transitions, N = 114)) were retrieved from the Human Gene Mutation Database (HGMD; http://www.hgmd.org/) and analyzed using consensus tables derived from the functional and non-functional promoter SNPs; approximately 70% were correctly recognized as being of probable functional significance. Complexity analysis was also used to quantify the regularity of the local DNA sequence environment. Functional SNPs/mutations of type C<-->T were found to occur in DNA regions characterized by lower average sequence complexity as measured with respect to symmetric elements; complexity values increased gradually from functional SNPs and pathological mutations to functional disease-associated SNPs and non-functional SNPs. This may reflect the internal axial symmetry that frequently characterizes transcription factor binding sites.
- Published
- 2006
21. Origin of the prevalent SFTPB indel g.1549C > GAA (121ins2) mutation causing surfactant protein B (SP-B) deficiency.
- Author
-
Tredano M, Cooper DN, Stuhrmann M, Christodoulou J, Chuzhanova NA, Roudot-Thoraval F, Boëlle PY, Elion J, Jeanpierre M, Feingold J, Couderc R, and Bahuau M
- Subjects
- Base Sequence, Europe, Female, Founder Effect, Gene Frequency, Genotype, Haplotypes genetics, Humans, Introns genetics, Linkage Disequilibrium, Male, Molecular Sequence Data, Mutation, Polymorphism, Single Nucleotide, Mutagenesis, Insertional, Pulmonary Surfactant-Associated Protein B deficiency, Pulmonary Surfactant-Associated Protein B genetics
- Abstract
The SFTPB gene indel g.1549C > GAA (121ins2) accounts for about 2/3 of the mutant alleles underlying complete surfactant protein B deficiency. It is unclear, however, whether its prevalence is due to recurrent mutation or a founder effect. The underlying mutational mechanism was therefore sought through the analysis of local DNA sequence complexity. A relatively complex two-step process was proposed: the first step involving slipped mispairing mediated by a direct repeat and generating an AGAA micro-insertion, the second step involving hairpin loop resolution resulting in a CA micro-deletion. The possibility of a founder effect was then assessed by typing 8 intragenic SNPs in 17 independent 121ins2 chromosomes from 10 probands, with parental non-121ins2 chromosomes serving as controls. The 121ins2 chromosomes were assigned to three discrete haplotypes, whilst control chromosomes were distributed between 10 of the 11 observed parental haplotypes. The 121ins2 mutation was in strong and significant linkage disequilibrium (LD) with the tightly linked marker g.1580T/C (|D'| = 1; P approximately 0.024), although only moderate LD was found with the rest of the locus (|D'| approximately 0.54; P approximately 0.136). Data on haplotype structure and the locus LD pattern, obtained from 81 independent Western-European chromosomes, were consistent with the three mutation-bearing haplotypes having originated from a common ancestor by recombination. Interestingly, all families harboring the 121ins2 indel had ancestors from a region of Northwestern Europe populated by Frankish/Saxon migration. Taken together, these data are consistent with the view that an indel mutation occurred on a relatively common SFTPB haplotype and now accounts for the majority of (and possibly all) extant 121ins2 chromosomes., ((c) 2005 Wiley-Liss, Inc.)
- Published
- 2006
- Full Text
- View/download PDF
22. The Human Gene Mutation Database (HGMD) and its exploitation in the study of mutational mechanisms.
- Author
-
Cooper DN, Stenson PD, and Chuzhanova NA
- Subjects
- Base Sequence, Computer Graphics, Humans, Information Storage and Retrieval, Molecular Sequence Data, User-Computer Interface, DNA Mutational Analysis methods, Database Management Systems, Databases, Genetic, Genome, Human genetics, Mutation genetics, Polymorphism, Single Nucleotide genetics, Sequence Analysis, DNA methods
- Abstract
The Human Gene Mutation Database (HGMD) constitutes a comprehensive core collection of data on germ-line mutations in nuclear genes underlying or associated with human inherited disease (http://www.hgmd.org). Data cataloged include single base-pair substitutions in coding, regulatory, and splicing-relevant regions, microdeletions and microinsertions, indels, and triplet repeat expansions, as well as gross gene deletions, insertions, duplications, and complex rearrangements. Each mutation is entered into HGMD only once, in order to avoid confusion between recurrent and identical-by-descent lesions. By June 2005, the database contained in excess of 53,000 different lesions detected in 2029 different nuclear genes, with new entries currently accumulating at a rate in excess of 5000 per annum. HGMD includes cDNA reference sequences, now provided for more than 90% of the listed genes, splice junction data, disease-associated and functional polymorphisms, and links to data present in publicly available online locus-specific mutation databases.
- Published
- 2006
- Full Text
- View/download PDF
23. At least 1 in 20 16S rRNA sequence records currently held in public repositories is estimated to contain substantial anomalies.
- Author
-
Ashelford KE, Chuzhanova NA, Fry JC, Jones AJ, and Weightman AJ
- Subjects
- Bacteroides classification, Base Sequence, RNA, Ribosomal, 16S genetics, Reproducibility of Results, Sequence Deletion, Software, Bacteroides genetics, Databases, Nucleic Acid standards, RNA, Ribosomal, 16S chemistry
- Abstract
A new method for detecting chimeras and other anomalies within 16S rRNA sequence records is presented. Using this method, we screened 1,399 sequences from 19 phyla, as defined by the Ribosomal Database Project, release 9, update 22, and found 5.0% to harbor substantial errors. Of these, 64.3% were obvious chimeras, 14.3% were unidentified sequencing errors, and 21.4% were highly degenerate. In all, 11 phyla contained obvious chimeras, accounting for 0.8 to 11% of the records for these phyla. Many chimeras (43.1%) were formed from parental sequences belonging to different phyla. While most comprised two fragments, 13.7% were composed of at least three fragments, often from three different sources. A separate analysis of the Bacteroidetes phylum (2,739 sequences) also revealed 5.8% records to be anomalous, of which 65.4% were apparently chimeric. Overall, we conclude that, as a conservative estimate, 1 in every 20 public database records is likely to be corrupt. Our results support concerns recently expressed over the quality of the public repositories. With 16S rRNA sequence data increasingly playing a dominant role in bacterial systematics and environmental biodiversity studies, it is vital that steps be taken to improve screening of sequences prior to submission. To this end, we have implemented our method as a program with a simple-to-use graphic user interface that is capable of running on a range of computer platforms. The program is called Pintail, is released under the terms of the GNU General Public License open source license, and is freely available from our website at http://www.cardiff.ac.uk/biosi/research/biosoft/.
- Published
- 2005
- Full Text
- View/download PDF
24. Microdeletions and microinsertions causing human genetic disease: common mechanisms of mutagenesis and the role of local DNA sequence complexity.
- Author
-
Ball EV, Stenson PD, Abeysinghe SS, Krawczak M, Cooper DN, and Chuzhanova NA
- Subjects
- Computational Biology methods, DNA-Directed DNA Polymerase genetics, Databases, Genetic, Gene Deletion, Genetic Variation, Humans, Mutation, Repetitive Sequences, Nucleic Acid, Genetic Diseases, Inborn genetics, Mutagenesis, Sequence Analysis, DNA methods
- Abstract
In the Human Gene Mutation Database (www.hgmd.org), microdeletions and microinsertions causing inherited disease (both defined as involving < or = 20 bp of DNA) account for 8,399 (17%) and 3,345 (7%) logged mutations, in 940 and 668 genes, respectively. A positive correlation was noted between the microdeletion and microinsertion frequencies for 564 genes for which both microdeletions and microinsertions are reported in HGMD, consistent with the view that the propensity of a given gene/sequence to undergo microdeletion is related to its propensity to undergo microinsertion. While microdeletions and microinsertions of 1 bp constitute respectively 48% and 66% of the corresponding totals, the relative frequency of the remaining lesions correlates negatively with the length of the DNA sequence deleted or inserted. Many of the microdeletions and microinsertions of more than 1 bp are potentially explicable in terms of slippage mutagenesis, involving the addition or removal of one copy of a mono-, di-, or trinucleotide tandem repeat. The frequency of in-frame 3-bp and 6-bp microinsertions and microdeletions was, however, found to be significantly lower than that of mutations of other lengths, suggesting that some of these in-frame lesions may not have come to clinical attention. Various sequence motifs were found to be over-represented in the vicinity of both microinsertions and microdeletions, including the heptanucleotide CCCCCTG that shares homology with the complement of the 8-bp human minisatellite conserved sequence/chi-like element (GCWGGWGG). The previously reported indel hotspot GTAAGT and its complement ACTTAC were also found to be overrepresented in the vicinity of both microinsertions and microdeletions, thereby providing a first example of a mutational hotspot that is common to different types of gene lesion. Other motifs overrepresented in the vicinity of microdeletions and microinsertions included DNA polymerase pause sites and topoisomerase cleavage sites. Several novel microdeletion/microinsertion hotspots were noted and some of these exhibited sufficient similarity to one another to justify terming them "super-hotspot" motifs. Analysis of sequence complexity also demonstrated that a combination of slipped mispairing mediated by direct repeats, and secondary structure formation promoted by symmetric elements, can account for the majority of microdeletions and microinsertions. Thus, microinsertions and microdeletions exhibit strong similarities in terms of the characteristics of their flanking DNA sequences, implying that they are generated by very similar underlying mechanisms.
- Published
- 2005
- Full Text
- View/download PDF
25. Two novel severe mutations in the pancreatic secretory trypsin inhibitor gene (SPINK1) cause familial and/or hereditary pancreatitis.
- Author
-
Le Maréchal C, Chen JM, Le Gall C, Plessis G, Chipponi J, Chuzhanova NA, Raguénès O, and Férec C
- Subjects
- Adult, Child, Cohort Studies, Exons genetics, Female, Gene Deletion, Heterozygote, Humans, Male, Middle Aged, Pedigree, RNA Splice Sites genetics, Mutation genetics, Pancreatitis genetics, Trypsin Inhibitor, Kazal Pancreatic genetics
- Abstract
Mutations in the serine protease inhibitor Kazal type 1 gene (SPINK1) encoding pancreatic secretory trypsin inhibitor (PSTI) have recently been found to be associated with chronic pancreatitis. Nevertheless, knowledge of severe mutations is particularly scarce, both in terms of number and in the extent of clinical information. The aim of this study was to expand the known spectrum of such mutations. 46 unrelated families, each including at least two pancreatitis patients and carrying neither cationic trypsinogen (PRSS1) mutations nor the frequent SPINK1 N34S mutation, participated in this study. The four exons and their flanking sequences of the SPINK1 gene were screened by denaturing high performance liquid chromatography analysis (DHPLC); and mutations were identified by direct sequencing. A heterozygous microdeletion mutation (c.27delC), which occurs within a symmetric element, was identified in two families. In one family, c.27delC showed segregation with the disease across two generations, with a penetrance of up to 75%. But in the other family, however, the same mutation manifested as a low-penetrance susceptibility factor. In addition, a novel heterozygous splicing mutation, c.87+1G>A (G>A substitution at nucleotide +1 of intron 2) was found in one family with familial pancreatitis. Our results also helped to resolve the sharply differing views about PSTI's role in pancreatitis., (Copyright 2003 Wiley-Liss, Inc.)
- Published
- 2004
- Full Text
- View/download PDF
26. Human genetic disease caused by de novo mitochondrial-nuclear DNA transfer.
- Author
-
Turner C, Killoran C, Thomas NS, Rosenberg M, Chuzhanova NA, Johnston J, Kemel Y, Cooper DN, and Biesecker LG
- Subjects
- Adolescent, Alleles, Amino Acid Sequence, Base Sequence, DNA, Complementary analysis, Hamartoma genetics, Humans, Hypothalamic Diseases genetics, Kruppel-Like Transcription Factors, Male, Molecular Sequence Data, Pedigree, Polydactyly genetics, Polymerase Chain Reaction, Sequence Analysis, DNA, Syndrome, Zinc Finger Protein Gli3, Cell Nucleus genetics, DNA genetics, DNA, Mitochondrial genetics, DNA-Binding Proteins genetics, Genetic Diseases, Inborn genetics, Nerve Tissue Proteins genetics, Repressor Proteins, Transcription Factors genetics, Xenopus Proteins
- Abstract
Transfer of nucleic acid from cytoplasmic organelles to the nuclear genome is a well-established mechanism of evolutionary change in eukaryotes. Such transfers have occurred throughout evolution, but so far, none has been shown unequivocally to occur de novo to cause a heritable human disease. We have characterized a patient with a de novo nucleic acid transfer from the mitochondrial to the nuclear genome, a transfer that is responsible for a sporadic case of Pallister-Hall syndrome, a condition usually inherited in an autosomal dominant fashion. This mutation, a 72-bp insertion into exon 14 of the GLI3 gene, creates a premature stop codon and predicts a truncated protein product. Both the mechanism and the cause of the mitochondrial-nuclear transfer are unknown. Although the conception of this patient was temporally and geographically associated with high-level radioactive contamination following the Chernobyl accident, this case cannot, on its own, be used to establish a causal relationship between radiation exposure and this rare type of mutation. Thus, for the time being, it must be considered as an intriguing coincidence. Nevertheless, these data serve to demonstrate that de novo mitochondrial-nuclear transfer of nucleic acid is a novel mechanism of human inherited disease.
- Published
- 2003
- Full Text
- View/download PDF
27. Meta-analysis of indels causing human genetic disease: mechanisms of mutagenesis and the role of local DNA sequence complexity.
- Author
-
Chuzhanova NA, Anassis EJ, Ball EV, Krawczak M, and Cooper DN
- Subjects
- Base Sequence, DNA Mutational Analysis, Humans, Models, Genetic, Mutagenesis, Insertional, Nucleic Acid Conformation, Repetitive Sequences, Nucleic Acid, Sequence Deletion, Genetic Diseases, Inborn genetics, Genetic Predisposition to Disease, Mutagenesis, Mutation
- Abstract
A relatively rare type of mutation causing human genetic disease is the indel, a complex lesion that appears to represent a combination of micro-deletion and micro-insertion. In the absence of meta-analytical studies of indels, the mutational mechanisms underlying indel formation remain unclear. Data from the Human Gene Mutation Database (HGMD) were therefore used to compare and contrast 211 different indels underlying genetic disease in an attempt to deduce the processes responsible for their genesis. Each indel was treated as if it were the result of a two-step insertion/deletion process and was assessed in the context of 10 base-pairs DNA sequence flanking the lesion on either side. Several indel hotspots were noted and a GTAAGT motif was found to be significantly over-represented in the vicinity of the indels studied. Previously postulated mechanisms underlying micro-deletions and micro-insertions were initially explored in terms of local DNA sequence regularity as measured by its complexity. The change in complexity consequent to a mutation was found to be indicative of the type of repeat sequence involved in mediating the event, thereby providing clues as to the underlying mutational mechanism. Complexity analysis was then employed to examine the possible intermediates through which each indel could have occurred and to propose likely mechanisms and pathways for indel generation on an individual basis. Manual analysis served to confirm that the majority of indels (>90%) are explicable in terms of a two-step process involving established mutational mechanisms. Indels equivalent to double base-pair substitutions (22% of the total) were found to be mechanistically indistinguishable from the remainder and may therefore be regarded as a special type of indel. The observed correspondence between changes in local DNA sequence complexity and the involvement of specific mutational mechanisms in the insertion/deletion process, and the ability of generated models to account for both the number and identity of the bases deleted and/or inserted, makes this approach invaluable not only for the analysis of indel formation, but also for the study of other types of complex lesion., (Copyright 2002 Wiley-Liss, Inc.)
- Published
- 2003
- Full Text
- View/download PDF
28. The evolution of the vertebrate beta-globin gene promoter.
- Author
-
Chuzhanova NA, Krawczak M, Thomas N, Nemytikova LA, Gusev VD, and Cooper DN
- Subjects
- Amino Acid Sequence, Animals, Base Sequence, Humans, Mammals genetics, Molecular Sequence Data, Peptide Fragments chemistry, Sequence Alignment, Sequence Homology, Nucleic Acid, Species Specificity, TATA Box, Globins genetics, Promoter Regions, Genetic, Vertebrates classification, Vertebrates genetics
- Abstract
Complexity analysis is capable of highlighting those gross evolutionary changes in gene promoter regions (loosely termed "promoter shuffling") that are undetectable by conventional DNA sequence alignment. Complexity analysis was therefore used here to identify the modular components (blocks) of the orthologous beta-globin gene promoter sequences of 22 vertebrate species, from zebrafish to humans. Considerable variation between the beta-globin gene promoters was apparent in terms of block presence/absence, copy number, and relative location. Some sequence blocks appear to be ubiquitous, whereas others are restricted to a specific taxon. Block similarities were also evident between the promoters of the paralogous human beta-like globin genes. It may be inferred that a wide variety of different mutational mechanisms have operated upon the beta-globin gene promoter over evolutionary time. Because these include gross changes such as deletion, duplication, amplification, elongation, contraction, and fusion, as well as the steady accumulation of single base-pair substitutions, it is clear that some redefinition of the term "promoter shuffling" is required. This notwithstanding, and as previously described for the vertebrate growth hormone gene promoter, the modular structure of the beta-globin promoter region and those of its paralogous counterparts have continually been rearranged into new combinations through the alteration, or shuffling, of preexisting blocks. Some of these changes may have had no influence on promoter function, but others could have altered either the level of gene expression or the responsiveness of the promoter to external stimuli. The comparative study of vertebrate beta-globin gene promoter regions described here confirms the generality of the phenomenon of sequence block shuffling and thus supports the view that it could have played an important role in the evolution of differential gene expression.
- Published
- 2002
- Full Text
- View/download PDF
29. [A rapid method for detecting interconnections between functionally and/or evolutionary close biological sequences].
- Author
-
Gusev VD, Nemytikova LA, and Chuzhanova NA
- Subjects
- Databases, Nucleic Acid, Repetitive Sequences, Nucleic Acid, Evolution, Molecular
- Abstract
It is now common practice to retrieve, by key words, highly specialized selections of sequences from general-purpose databases such as EMBL, GenBank, etc. The sequences included in a selection are often interconnected, which means that there are duplications, embeddings, intersections, homology, common structural elements. Knowledge of these interconnections is necessary for further processing of the sequences. We propose a rapid (single scan) method for identification of such interconnections by means of complexity analysis that generalizes the Lempel-Ziv approach. Analysis of a selection of 5'-flanking regions of vertebrate growth hormone genes from EMBL is presented as an example.
- Published
- 2001
30. Identification of an intronic regulatory element in the human protein C (PROC) gene.
- Author
-
Shamsher MK, Chuzhanova NA, Friedman B, Scopes DA, Alhaq A, Millar DS, Cooper DN, and Berg LP
- Subjects
- Base Sequence, Carcinoma, Hepatocellular, Conserved Sequence, DNA chemistry, DNA genetics, Genes, Reporter, Hepatocyte Nuclear Factor 1, Hepatocyte Nuclear Factor 1-alpha, Hepatocyte Nuclear Factor 1-beta, Humans, Liver Neoplasms, Luciferases genetics, Mutagenesis, Site-Directed, Transcription Factors metabolism, Transfection, Tumor Cells, Cultured, DNA-Binding Proteins, Gene Expression Regulation, Introns, Nuclear Proteins, Promoter Regions, Genetic, Protein C genetics, Regulatory Sequences, Nucleic Acid
- Abstract
Regulatory DNA elements responsible for human protein C (PROC) gene expression have previously been identified in the upstream promoter region and first (untranslated) exon of the gene. Here we show that an additional sequence element located more than 500 bp downstream of the core promoter within intron 1 further enhances PROC promoter-driven reporter gene expression in human hepatoma cells. In common with core promoter constructs used in previous studies, the activity of this 3'-extended regulatory region is diminished by a naturally occurring promoter mutation. However, in contrast to constructs lacking intronic sequence, the promoter/intron regulatory region is repressed rather than activated by the transcription factor HNF-1. Using both conventional alignment procedures and complexity analysis to study the human and canine PROC sequences, we identified two conserved intronic regions, which were tested for their involvement in gene regulation. High-level gene expression from the intron-coupled promoter was dependent upon the integrity of a 142 bp sequence element, a duplicate copy of which is located in an upstream region of the PROC gene that possesses enhancer activity. These findings emphasise the potential importance of intragenic sequences for gene regulation and serve to illustrate that the results of PROC promoter/reporter gene experiments are critically dependent upon the sequence context. The identification of such intragenic elements is relevant to the analysis of human genetic disease since it will facilitate the detection and functional evaluation of regulatory mutations and polymorphisms.
- Published
- 2000
- Full Text
- View/download PDF
31. Changes in primary DNA sequence complexity influence the phenotypic consequences of mutations in human gene regulatory regions.
- Author
-
Krawczak M, Chuzhanova NA, Stenson PD, Johansen BN, Ball EV, and Cooper DN
- Subjects
- Base Sequence, DNA Mutational Analysis, Humans, Odds Ratio, Phenotype, Sequence Analysis, DNA, DNA genetics, Genes, Regulator, Mutation
- Abstract
No general rules have been proposed to account for the functional consequences of gene regulatory mutations. In a first attempt to establish the nature of such rules, an analysis was performed of the DNA sequence context of 153 different single base-pair substitutions in the regulatory regions of 65 different human genes underlying inherited disease. Use of a recently proposed measure of DNA sequence complexity (taking into account the level of structural repetitiveness of a DNA sequence, rather than simply the oligonucleotide composition) has served to demonstrate that the concomitant change in local DNA sequence complexity surrounding a substituted nucleotide is related to the likelihood of a regulatory mutation coming to clinical attention. Mutations that led to an increase in complexity exhibited higher odds ratios in favour of pathological consequences than mutations that led to a decrease or left complexity unchanged. This relationship, however, was discernible only for pyrimidine-to-purine transversions. Odds ratios for other types of substitution were not found to be significantly associated with local changes in sequence complexity, even though a trend similar to that observed for Y-->R transversions was also apparent for transitions. These findings suggest that the maintenance of a defined level of DNA sequence complexity, or at least the avoidance of an increase in sequence complexity, is a critical prerequisite for the function of gene regulatory regions.
- Published
- 2000
- Full Text
- View/download PDF
32. Promoter shuffling has occurred during the evolution of the vertebrate growth hormone gene.
- Author
-
Chuzhanova NA, Krawczak M, Nemytikova LA, Gusev VD, and Cooper DN
- Subjects
- Animals, Base Sequence, Binding Sites, Conserved Sequence, DNA genetics, DNA metabolism, Databases, Factual, Gene Rearrangement, Humans, Molecular Sequence Data, Sequence Alignment, Species Specificity, Transcription Factors metabolism, Transcription, Genetic, Vertebrates, Evolution, Molecular, Growth Hormone genetics, Promoter Regions, Genetic genetics
- Abstract
Comparative studies of vertebrate gene promoter regions seldom detect gross rearrangements ('promoter shuffling') since such analyses usually employ relatively similar DNA sequences. Conversely, attempts to compare evolutionarily more divergent promoter sequences have been largely unsuccessful owing to the inability of conventional alignment procedures to deal with gross rearrangements. These limitations have been circumvented in the present study by using the novel technique of complexity analysis to identify modular components ('blocks') in the growth hormone (GH) gene promoter sequences of some 22 vertebrate species, from salmon to human. Significant rearrangement of blocks was found to have occurred, indicating that they have evolved as independent units. Some blocks appear to be ubiquitous, whereas others are restricted to a specific taxon. Considerable variation between orthologous GH gene promoters was apparent in terms of block length, copy number and relative location. It may be inferred that a wide variety of different mutational mechanisms have operated upon the GH gene promoter over evolutionary time. These include gross changes such as deletion, duplication, amplification, elongation, contraction, transposition, inversion and fusion, as well as the slow, steady accumulation of single base-pair substitutions. Thus the patchwork structure of the modular GH promoter region, and those of its paralogous GH2 and prolactin (PRL) counterparts, have continually been shuffled into new combinations through the rearrangement of pre-existing blocks. Although some of these changes may have had no influence on promoter function, others could have served to alter either the level of gene expression or the responsiveness of the promoter to external stimuli.
- Published
- 2000
- Full Text
- View/download PDF
33. On the complexity measures of genetic sequences.
- Author
-
Gusev VD, Nemytikova LA, and Chuzhanova NA
- Subjects
- Base Sequence, Growth Hormone genetics, Humans, Information Storage and Retrieval methods, Repetitive Sequences, Nucleic Acid, Sequence Homology, Amino Acid, Algorithms, Models, Genetic, Sequence Analysis, DNA methods
- Abstract
Motivation: It is well known that the regulatory regions of genomes are highly repetitive. They are rich in direct, symmetric and complemented repeats, and there is no doubt about the functional significance of these repeats. Among known measures of complexity, the Ziv-Lempel complexity measure reflects most adequately repeats occurring in the text. But this measure does not take into account isomorphic repeats. By isomorphic repeats we mean fragments that are identical (or symmetric) modulo some permutation of the alphabet letters., Results: In this paper, two complexity measures of symbolic sequences are proposed that generalize the Ziv-Lempel complexity measure by taking into account any isomorphic repeats in the text (rather than just direct repeats as in Ziv-Lempel). The first of them, the complexity vector, is designed for small alphabets such as the alphabet of nucleotides. The second is based on a search for the longest isomorphic fragment in the history of sequence synthesis and can be used for alphabets of arbitrary cardinality. These measures have been used for recognition of structural regularities in DNA sequences. Some interesting structures related to the regulatory region of the human growth hormone are reported.
- Published
- 1999
- Full Text
- View/download PDF
34. Evolution of the proximal promoter region of the mammalian growth hormone gene.
- Author
-
Krawczak M, Chuzhanova NA, and Cooper DN
- Subjects
- Animals, Base Sequence, Human Growth Hormone genetics, Humans, Molecular Sequence Data, Polymorphism, Genetic, Rabbits, Rats, Sequence Alignment, Sequence Homology, Nucleic Acid, Evolution, Molecular, Growth Hormone genetics, Mammals genetics, Promoter Regions, Genetic
- Abstract
The evolutionary relationship between the proximal growth hormone (GH) gene promoter sequences of 12 mammalian species was explored by comparison of their trinucleotide composition and by multiple sequence alignment. Both approaches yielded results that were consistent with the known fossil record-based phylogeny of the analysed sequences, suggesting that the two methods of tree reconstruction might be equally efficient and reliable. The pattern of evolution inferred for the mammalian GH gene promoters was found to vary both temporally and spatially. Thus, two distinct regions devoid of any evolutionary changes exist in primates, but only one of these 'gaps' is also observed in rodents, and neither is seen in ruminants. Furthermore, different evolutionary rates must have prevailed during different periods of evolutionary time and in different lineages, with a dramatic increase in evolutionary rate apparent in primates. Since a similar pattern of discontinuity has been previously noted for the evolution of the GH-coding regions, it may reflect the action of positive selection operating upon the GH gene as a single cohesive unit. Strong evidence for the action of gene conversion between primate GH gene promoters is provided by the fact that the human GH1 and GH2 sequences, which are thought to have diverged before the divergence of Old World monkeys from great apes, are more similar to one another than either is to the rhesus monkey GH2 promoter. Finally, it was noted that a number of nucleotide positions in the GH1 gene promoter that are polymorphic in humans appear to be highly conserved in mammals. This apparent conundrum, which could represent a caveat for the interpretation of phylogenetic footprinting studies, is potentially explicable in terms either of reduced genetic diversity in highly inbred animal species or insufficient population data from non-human species.
- Published
- 1999
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.