73 results on '"Hedges DJ"'
Search Results
2. Copy number variation in pediatric multiple sclerosis
- Author
-
McElroy, JP, Krupp, LB, Johnson, BA, McCauley, JL, Qi, Z, Caillier, SJ, Gourraud, PA, Yu, J, Nathanson, L, Belman, AL, Hauser, SL, Waubant, E, Hedges, DJ, and Oksenberg
- Subjects
Biomedical and Clinical Sciences ,Neurosciences ,Clinical Sciences ,Brain Disorders ,Biotechnology ,Clinical Research ,Neurodegenerative ,Pediatric ,Human Genome ,Prevention ,Autoimmune Disease ,Multiple Sclerosis ,Genetics ,2.1 Biological and endogenous factors ,Aetiology ,Neurological ,Adolescent ,Age of Onset ,Child ,Comparative Genomic Hybridization ,Female ,Gene Dosage ,Heat-Shock Proteins ,Humans ,In Situ Hybridization ,Fluorescence ,Male ,Muscle Spasticity ,Spinocerebellar Ataxias ,Multiple sclerosis ,copy number variation ,pediatric ,Neurology & Neurosurgery ,Clinical sciences ,Biological psychology - Abstract
BackgroundPediatric onset multiple sclerosis (MS) accounts for 2-4% of all MS. It is unknown whether the disease shares the same underlying pathophysiology found in adult patients or an extreme early onset phenotype triggered by distinct biological mechanisms. It has been hypothesized that copy number variations (CNVs) may result in extreme early onset diseases because CNVs can have major effects on many genes in large genomic regions.Objectives and methodsThe objective of the current research was to identify CNVs, with a specific focus on de novo CNVs, potentially causing early onset MS by competitively hybridizing 30 white non-Hispanic pediatric MS patients with each of their parents via comparative genomic hybridization (CGH) analysis on the Agilent 1M CGH array.Results and discussionWe identified 10 CNVs not overlapping with any CNV regions currently reported in the Database of Genomic Variants (DGV). Fifty-five putatively de novo CNVs were also identified: all but one common in the DGV. We found the single rare CNV was a private variation harboring the SACS gene. SACS mutations cause autosomal-recessive spastic ataxia of Charlevoix-Saguenay (ARSACS) disease. Additional clinical review revealed that the patient with the SACS gene CNV shared some features of both MS and ARSACS.ConclusionsThis is the first reported study analyzing pediatric MS CNVs. While not yielding causal variation in our initial pediatric dataset, our approach confirmed diagnosis of an ARSACS-like disease in addition to MS in the affected individual, which led to a more complete understanding of the patient's disease course and prognosis.
- Published
- 2013
3. Evidence of novel fine-scale structural variation at autism spectrum disorder candidate loci
- Author
-
Hedges, DJ, Hamilton-Nelson, KL, Sacharow, SJ, Nations, L, Beecham, GW, Kozhekbaeva, ZM, Butler, BL, Cukier, HN, Whitehead, PL, Ma, D, Jaworski, JM, Nathanson, L, Lee, JM, Hauser, SL, Oksenberg, JR, Cuccaro, ML, Haines, JL, Gilbert, JR, and Pericak-Vance, MA
- Abstract
Background: Autism spectrum disorders (ASD) represent a group of neurodevelopmental disorders characterized by a core set of social-communicative and behavioral impairments. Gamma-aminobutyric acid (GABA) is the major inhibitory neurotransmitter in the brain, acting primarily via the GABA receptors (GABR). Multiple lines of evidence, including altered GABA and GABA receptor expression in autistic patients, indicate that the GABAergic system may be involved in the etiology of autism. Methods. As copy number variations (CNVs), particularly rare and de novo CNVs, have now been implicated in ASD risk, we examined the GABA receptors and genes in related pathways for structural variation that may be associated with autism. We further extended our candidate gene set to include 19 genes and regions that had either been directly implicated in the autism literature or were directly related (via function or ancestry) to these primary candidates. For the high resolution CNV screen we employed custom-designed 244 k comparative genomic hybridization (CGH) arrays. Collectively, our probes spanned a total of 11 Mb of GABA-related and additional candidate regions with a density of approximately one probe every 200 nucleotides, allowing a theoretical resolution for detection of CNVs of approximately 1 kb or greater on average. One hundred and sixty-eight autism cases and 149 control individuals were screened for structural variants. Prioritized CNV events were confirmed using quantitative PCR, and confirmed loci were evaluated on an additional set of 170 cases and 170 control individuals that were not included in the original discovery set. Loci that remained interesting were subsequently screened via quantitative PCR on an additional set of 755 cases and 1,809 unaffected family members. Results: Results include rare deletions in autistic individuals at JAKMIP1, NRXN1, Neuroligin4Y, OXTR, and ABAT. Common insertion/deletion polymorphisms were detected at several loci, including GABBR2 and NRXN3. Overall, statistically significant enrichment in affected vs. unaffected individuals was observed for NRXN1 deletions. Conclusions: These results provide additional support for the role of rare structural variation in ASD. © 2012 Hedges et al; licensee BioMed Central Ltd.
- Published
- 2012
4. Copy number variation in pediatric multiple sclerosis
- Author
-
McElroy, JP, primary, Krupp, LB, additional, Johnson, BA, additional, McCauley, JL, additional, Qi, Z, additional, Caillier, SJ, additional, Gourraud, PA, additional, Yu, J, additional, Nathanson, L, additional, Belman, AL, additional, Hauser, SL, additional, Waubant, E, additional, Hedges, DJ, additional, and Oksenberg, JR, additional
- Published
- 2012
- Full Text
- View/download PDF
5. Genomic landscape of Down syndrome-associated acute lymphoblastic leukemia.
- Author
-
Li Z, Chang TC, Junco JJ, Devidas M, Li Y, Yang W, Huang X, Hedges DJ, Cheng Z, Shago M, Carroll AJ, Heerema NA, Gastier-Foster J, Wood BL, Borowitz MJ, Sanclemente L, Raetz EA, Hunger SP, Feingold E, Rosser TC, Sherman SL, Loh ML, Mullighan CG, Yu J, Wu G, Lupo PJ, Rabin KR, and Yang JJ
- Subjects
- Animals, Mice, Mutation, Risk Factors, Genomics, Chromosome Aberrations, Down Syndrome complications, Down Syndrome genetics, Precursor Cell Lymphoblastic Leukemia-Lymphoma complications, Precursor Cell Lymphoblastic Leukemia-Lymphoma genetics
- Abstract
Trisomy 21, the genetic cause of Down syndrome (DS), is the most common congenital chromosomal anomaly. It is associated with a 20-fold increased risk of acute lymphoblastic leukemia (ALL) during childhood and results in distinctive leukemia biology. To comprehensively define the genomic landscape of DS-ALL, we performed whole-genome sequencing and whole-transcriptome sequencing (RNA-Seq) on 295 cases. Our integrated genomic analyses identified 15 molecular subtypes of DS-ALL, with marked enrichment of CRLF2-r, IGH::IGF2BP1, and C/EBP altered (C/EBPalt) subtypes compared with 2257 non-DS-ALL cases. We observed abnormal activation of the CEBPD, CEBPA, and CEBPE genes in 10.5% of DS-ALL cases via a variety of genomic mechanisms, including chromosomal rearrangements and noncoding mutations leading to enhancer hijacking. A total of 42.3% of C/EBP-activated DS-ALL also have concomitant FLT3 point mutations or insertions/deletions, compared with 4.1% in other subtypes. CEBPD overexpression enhanced the differentiation of mouse hematopoietic progenitor cells into pro-B cells in vitro, particularly in a DS genetic background. Notably, recombination-activating gene-mediated somatic genomic abnormalities were common in DS-ALL, accounting for a median of 27.5% of structural alterations, compared with 7.7% in non-DS-ALL. Unsupervised hierarchical clustering analyses of CRLF2-rearranged DS-ALL identified substantial heterogeneity within this group, with the BCR::ABL1-like subset linked to an inferior event-free survival, even after adjusting for known clinical risk factors. These results provide important insights into the biology of DS-ALL and point to opportunities for targeted therapy and treatment individualization., (© 2023 by The American Society of Hematology.)
- Published
- 2023
- Full Text
- View/download PDF
6. Whole genome sequencing provides comprehensive genetic testing in childhood B-cell acute lymphoblastic leukaemia.
- Author
-
Ryan SL, Peden JF, Kingsbury Z, Schwab CJ, James T, Polonen P, Mijuskovic M, Becq J, Yim R, Cranston RE, Hedges DJ, Roberts KG, Mullighan CG, Vora A, Russell LJ, Bain R, Moorman AV, Bentley DR, Harrison CJ, and Ross MT
- Subjects
- Humans, Computational Biology, Genetic Testing, Whole Genome Sequencing, Precursor Cell Lymphoblastic Leukemia-Lymphoma drug therapy, Precursor B-Cell Lymphoblastic Leukemia-Lymphoma diagnosis, Precursor B-Cell Lymphoblastic Leukemia-Lymphoma genetics
- Abstract
Childhood B-cell acute lymphoblastic leukaemia (B-ALL) is characterised by recurrent genetic abnormalities that drive risk-directed treatment strategies. Using current techniques, accurate detection of such aberrations can be challenging, due to the rapidly expanding list of key genetic abnormalities. Whole genome sequencing (WGS) has the potential to improve genetic testing, but requires comprehensive validation. We performed WGS on 210 childhood B-ALL samples annotated with clinical and genetic data. We devised a molecular classification system to subtype these patients based on identification of key genetic changes in tumour-normal and tumour-only analyses. This approach detected 294 subtype-defining genetic abnormalities in 96% (202/210) patients. Novel genetic variants, including fusions involving genes in the MAP kinase pathway, were identified. WGS results were concordant with standard-of-care methods and whole transcriptome sequencing (WTS). We expanded the catalogue of genetic profiles that reliably classify PAX5alt and ETV6::RUNX1-like subtypes. Our novel bioinformatic pipeline improved detection of DUX4 rearrangements (DUX4-r): a good-risk B-ALL subtype with high survival rates. Overall, we have validated that WGS provides a standalone, reliable genetic test to detect all subtype-defining genetic abnormalities in B-ALL, accurately classifying patients for the risk-directed treatment stratification, while simultaneously performing as a research tool to identify novel disease biomarkers., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
7. The genomic landscape of pediatric acute lymphoblastic leukemia.
- Author
-
Brady SW, Roberts KG, Gu Z, Shi L, Pounds S, Pei D, Cheng C, Dai Y, Devidas M, Qu C, Hill AN, Payne-Turner D, Ma X, Iacobucci I, Baviskar P, Wei L, Arunachalam S, Hagiwara K, Liu Y, Flasch DA, Liu Y, Parker M, Chen X, Elsayed AH, Pathak O, Li Y, Fan Y, Michael JR, Rusch M, Wilkinson MR, Foy S, Hedges DJ, Newman S, Zhou X, Wang J, Reilly C, Sioson E, Rice SV, Pastor Loyola V, Wu G, Rampersaud E, Reshmi SC, Gastier-Foster J, Guidry Auvil JM, Gesuwan P, Smith MA, Winick N, Carroll AJ, Heerema NA, Harvey RC, Willman CL, Larsen E, Raetz EA, Borowitz MJ, Wood BL, Carroll WL, Zweidler-McKay PA, Rabin KR, Mattano LA, Maloney KW, Winter SS, Burke MJ, Salzer W, Dunsmore KP, Angiolillo AL, Crews KR, Downing JR, Jeha S, Pui CH, Evans WE, Yang JJ, Relling MV, Gerhard DS, Loh ML, Hunger SP, Zhang J, and Mullighan CG
- Subjects
- Child, Chromosome Aberrations, Exome genetics, Genomics, Humans, Mutation, Precursor Cell Lymphoblastic Leukemia-Lymphoma genetics
- Abstract
Acute lymphoblastic leukemia (ALL) is the most common childhood cancer. Here, using whole-genome, exome and transcriptome sequencing of 2,754 childhood patients with ALL, we find that, despite a generally low mutation burden, ALL cases harbor a median of four putative somatic driver alterations per sample, with 376 putative driver genes identified varying in prevalence across ALL subtypes. Most samples harbor at least one rare gene alteration, including 70 putative cancer driver genes associated with ubiquitination, SUMOylation, noncoding transcripts and other functions. In hyperdiploid B-ALL, chromosomal gains are acquired early and synchronously before ultraviolet-induced mutation. By contrast, ultraviolet-induced mutations precede chromosomal gains in B-ALL cases with intrachromosomal amplification of chromosome 21. We also demonstrate the prognostic significance of genetic alterations within subtypes. Intriguingly, DUX4- and KMT2A-rearranged subtypes separate into CEBPA/FLT3- or NFATC4-expressing subgroups with potential clinical implications. Together, these results deepen understanding of the ALL genomic landscape and associated outcomes., (© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.)
- Published
- 2022
- Full Text
- View/download PDF
8. RNA-seq Fusion Detection in Clinical Oncology.
- Author
-
Hedges DJ
- Subjects
- Ecosystem, Gene Fusion, High-Throughput Nucleotide Sequencing, Humans, Medical Oncology, Precision Medicine, RNA-Seq, Sequence Analysis, RNA methods, Software, Exome Sequencing, Neoplasms diagnosis, Neoplasms genetics
- Abstract
Gene fusions play a prominent role in the oncogenesis of many cancers and have been extensively targeted as biomarkers for diagnostic, prognostic, and therapeutic purposes. Detection methods span a number of platforms, including cytogenetics (e.g., FISH), targeted qPCR, and sequencing-based assays. Before the advent of next-generation sequencing (NGS), fusion testing was primarily targeted to specific genome loci, with assays tailored for previously characterized fusion events. The availability of whole genome sequencing (WGS) and whole transcriptome sequencing (RNA-seq) allows for genome-wide screening for the simultaneous detection of both known and novel fusions. RNA-seq, in particular, offers the possibility of rapid turn-around testing with less dedicated sequencing than WGS. This makes it an attractive target for clinical oncology testing, particularly when transcriptome data can be multi-purposed for tumor classification and additional analyses. Despite considerable efforts and substantial progress, however, genome-wide screening for fusions solely based on RNA-seq data remains an ongoing challenge. A host of technical artifacts adversely impact the sensitivity and specificity of existing software tools. In this chapter, the general strategies employed by current fusion software are discussed, and a selection of available fusion detection tools are surveyed. Despite its current limitations, RNA-seq-based fusion detection offers a more comprehensive and efficient strategy as compared to multiple targeted fusion assays. When thoughtfully employed within a wider ecosystem of diagnostic assays and clinical information, RNA-seq fusion detection represents a powerful tool for precision oncology., (© 2022. Springer Nature Switzerland AG.)
- Published
- 2022
- Full Text
- View/download PDF
9. Genomes for Kids: The Scope of Pathogenic Mutations in Pediatric Cancer Revealed by Comprehensive DNA and RNA Sequencing.
- Author
-
Newman S, Nakitandwe J, Kesserwan CA, Azzato EM, Wheeler DA, Rusch M, Shurtleff S, Hedges DJ, Hamilton KV, Foy SG, Edmonson MN, Thrasher A, Bahrami A, Orr BA, Klco JM, Gu J, Harrison LW, Wang L, Clay MR, Ouma A, Silkov A, Liu Y, Zhang Z, Liu Y, Brady SW, Zhou X, Chang TC, Pande M, Davis E, Becksfort J, Patel A, Wilkinson MR, Rahbarinia D, Kubal M, Maciaszek JL, Pastor V, Knight J, Gout AM, Wang J, Gu Z, Mullighan CG, McGee RB, Quinn EA, Nuccio R, Mostafavi R, Gerhardt EL, Taylor LM, Valdez JM, Hines-Dowell SJ, Pappo AS, Robinson G, Johnson LM, Pui CH, Ellison DW, Downing JR, Zhang J, and Nichols KE
- Subjects
- Child, DNA, Humans, Mutation, Sequence Analysis, RNA, Exome Sequencing, Neoplasms genetics
- Abstract
Genomic studies of pediatric cancer have primarily focused on specific tumor types or high-risk disease. Here, we used a three-platform sequencing approach, including whole-genome sequencing (WGS), whole-exome sequencing (WES), and RNA sequencing (RNA-seq), to examine tumor and germline genomes from 309 prospectively identified children with newly diagnosed (85%) or relapsed/refractory (15%) cancers, unselected for tumor type. Eighty-six percent of patients harbored diagnostic (53%), prognostic (57%), therapeutically relevant (25%), and/or cancer-predisposing (18%) variants. Inclusion of WGS enabled detection of activating gene fusions and enhancer hijacks (36% and 8% of tumors, respectively), small intragenic deletions (15% of tumors), and mutational signatures revealing of pathogenic variant effects. Evaluation of paired tumor-normal data revealed relevance to tumor development for 55% of pathogenic germline variants. This study demonstrates the power of a three-platform approach that incorporates WGS to interrogate and interpret the full range of genomic variants across newly diagnosed as well as relapsed/refractory pediatric cancers., Significance: Pediatric cancers are driven by diverse genomic lesions, and sequencing has proven useful in evaluating high-risk and relapsed/refractory cases. We show that combined WGS, WES, and RNA-seq of tumor and paired normal tissues enables identification and characterization of genetic drivers across the full spectrum of pediatric cancers. This article is highlighted in the In This Issue feature, p. 2945., (©2021 The Authors; Published by the American Association for Cancer Research.)
- Published
- 2021
- Full Text
- View/download PDF
10. Methylation profiling reveals novel molecular classes of rhabdomyosarcoma.
- Author
-
Clay MR, Patel A, Tran Q, Hedges DJ, Chang TC, Stewart E, Charville G, Cline C, Dyer MA, and Orr BA
- Subjects
- Adolescent, Adult, Aged, Child, Child, Preschool, Computational Biology methods, DNA Copy Number Variations, Diagnosis, Differential, Disease Susceptibility, Female, Humans, Immunohistochemistry, In Situ Hybridization, Infant, Male, Middle Aged, Mutation, Rhabdomyosarcoma therapy, Whole Genome Sequencing, Young Adult, Biomarkers, Tumor, DNA Methylation, Gene Expression Profiling, Gene Expression Regulation, Neoplastic, Rhabdomyosarcoma diagnosis, Rhabdomyosarcoma etiology
- Abstract
Rhabdomyosarcomas (RMS) represent a family of aggressive soft tissue sarcomas that present in both children and adults. Pathologic risk stratification for RMS has been based on histologic subtype, with poor outcomes observed in alveolar rhabdomyosarcoma (ARMS) and the adult-type pleomorphic rhabdomyosarcoma (PRMS) compared to embryonal rhabdomyosarcoma (ERMS). Genomic sequencing studies have expanded the spectrum of RMS, with several new molecularly defined entities, including fusion-driven spindle cell/sclerosing rhabdomyosarcoma (SC/SRMS) and MYOD1-mutant SC/SRMS. Comprehensive genomic analysis has previously defined the mutational and copy number spectrum for the more common ERMS and ARMS and revealed corresponding methylation signatures. Comparatively, less is known about epigenetic correlates for the rare SC/SRMS or PRMS histologic subtypes. Herein, we present exome and RNA sequencing, copy number analysis, and methylation profiling of the largest cohort of molecularly characterized RMS samples to date. In addition to ARMS and ERMS, we identify two novel methylation subtypes, one having SC/SRMS histology and defined by MYOD1 p. L122R mutations and the other matching adult-type PRMS. Selected tumors from adolescent patients grouped with the PRMS methylation class, expanding the age range of these rare tumors. Limited follow-up data suggest that pediatric tumors with MYOD1-mutations are associated with an aggressive clinical course., (© 2021. The Author(s).)
- Published
- 2021
- Full Text
- View/download PDF
11. Primary bone sarcoma with BCOR internal tandem duplication.
- Author
-
Malik F, Zreik RT, Hedges DJ, Nakitandwe J, Lee S, Ward RA, McCarville MB, Pappo A, and Bahrami A
- Subjects
- Adolescent, Bone Neoplasms diagnostic imaging, Bone Neoplasms drug therapy, Bone Neoplasms pathology, Gene Duplication, Humans, Male, Sarcoma diagnostic imaging, Sarcoma drug therapy, Sarcoma pathology, Tibia diagnostic imaging, Tibia pathology, Biomarkers, Tumor genetics, Bone Neoplasms genetics, Proto-Oncogene Proteins genetics, Repressor Proteins genetics, Sarcoma genetics
- Abstract
BCOR internal tandem duplications (ITDs) and rearrangements are implicated in the oncogenesis of a subset of undifferentiated sarcomas. To date, BCOR ITD sarcomas have been exclusively found in non-appendicular infantile soft tissues, whereas BCOR-rearranged sarcomas occur in both bones and soft tissues affecting a wider patient age range. Little is known about patient outcome in BCOR ITD sarcomas. We present a BCOR-expressing, primary bone, undifferentiated sarcoma case involving an adolescent male's left tibia that, unexpectedly, harbored a BCOR ITD instead of a BCOR rearrangement. Furthermore, the patient achieved a partial histologic response after receiving a Ewing sarcoma chemotherapy regimen. Our case expands the clinical spectrum of BCOR ITD sarcomas and suggests that childhood and adult BCOR-expressing sarcomas with an undifferentiated histology should be considered for both BCOR rearrangement and ITD screening. Accurate BCOR mutation identification in undifferentiated sarcomas is essential to define their clinical spectrum and to develop effective management strategies.
- Published
- 2020
- Full Text
- View/download PDF
12. Estimated number of adult survivors of childhood cancer in United States with cancer-predisposing germline variants.
- Author
-
Wilson CL, Wang Z, Liu Q, Ehrhardt MJ, Mostafavi R, Easton J, Mulder H, Hedges DJ, Wang S, Rusch M, Edmonson M, Levy S, Lanctot JQ, Currie K, Lear M, Patel A, Sapkota Y, Brooke RJ, Moon W, Chang TC, Chen W, Kesserwan CA, Wu G, Nichols KE, Hudson MM, Zhang J, Robison LL, and Yasui Y
- Subjects
- Adolescent, Adult, Aged, Child, Child, Preschool, Cohort Studies, Female, Follow-Up Studies, Humans, Incidence, Infant, Infant, Newborn, Male, Middle Aged, Neoplasms epidemiology, Neoplasms genetics, Prognosis, Risk Factors, Survival Rate, United States epidemiology, Young Adult, Cancer Survivors statistics & numerical data, Genetic Predisposition to Disease, Germ-Line Mutation, Neoplasm Proteins genetics, Neoplasms mortality
- Abstract
Purpose: To estimate the absolute number of adult survivors of childhood cancer in the U.S. population who carry a pathogenic or likely pathogenic variant in a cancer predisposition gene., Methods: Using the Surveillance, Epidemiology, and End Results (SEER) Program, we estimated the number of childhood cancer survivors on December 31, 2016 for each childhood cancer diagnosis, multiplied this by the proportion of carriers of pathogenic/likely pathogenic variants in the St. Jude Lifetime Cohort (SJLIFE) study, and projected the resulting number onto the U.S., Results: Based on genome sequence data, 11.8% of 2450 SJLIFE participants carry a pathogenic/likely pathogenic variant in one of 156 cancer predisposition genes. Given this information, we estimate that 21 800 adult survivors of childhood cancer in the United States carry a pathogenic/likely pathogenic variant in one of these genes. The highest estimated absolute number of variant carriers are among survivors of central nervous system tumors (n = 4300), particularly astrocytoma (n = 1800) and other gliomas (n = 1700), acute lymphoblastic leukemia (n = 4300), and retinoblastoma (n = 3500). The most frequently mutated genes are RB1 (n = 3000), NF1 (n = 2300), and BRCA2 (n = 800)., Conclusion: Given the increasing number of childhood cancer survivors in the United States, clinicians should counsel survivors regarding their potential genetic risk, consider referral for genetic counseling and testing, and, as appropriate, implement syndrome-specific cancer surveillance or risk-reducing measures., (© 2019 Wiley Periodicals, Inc.)
- Published
- 2020
- Full Text
- View/download PDF
13. Pediatric Cancer Variant Pathogenicity Information Exchange (PeCanPIE): a cloud-based platform for curating and classifying germline variants.
- Author
-
Edmonson MN, Patel AN, Hedges DJ, Wang Z, Rampersaud E, Kesserwan CA, Zhou X, Liu Y, Newman S, Rusch MC, McLeod CL, Wilkinson MR, Rice SV, Soussi T, Taylor JP, Benatar M, Becksfort JB, Nichols KE, Robison LL, Downing JR, and Zhang J
- Subjects
- Child, Cloud Computing, Databases, Genetic, Genetic Predisposition to Disease, High-Throughput Nucleotide Sequencing, Humans, User-Computer Interface, Computational Biology methods, Germ-Line Mutation, Neoplasms genetics
- Abstract
Variant interpretation in the era of massively parallel sequencing is challenging. Although many resources and guidelines are available to assist with this task, few integrated end-to-end tools exist. Here, we present the Pe diatric Can cer Variant P athogenicity I nformation E xchange (PeCanPIE), a web- and cloud-based platform for annotation, identification, and classification of variations in known or putative disease genes. Starting from a set of variants in variant call format (VCF), variants are annotated, ranked by putative pathogenicity, and presented for formal classification using a decision-support interface based on published guidelines from the American College of Medical Genetics and Genomics (ACMG). The system can accept files containing millions of variants and handle single-nucleotide variants (SNVs), simple insertions/deletions (indels), multiple-nucleotide variants (MNVs), and complex substitutions. PeCanPIE has been applied to classify variant pathogenicity in cancer predisposition genes in two large-scale investigations involving >4000 pediatric cancer patients and serves as a repository for the expert-reviewed results. PeCanPIE was originally developed for pediatric cancer but can be easily extended for use for nonpediatric cancers and noncancer genetic diseases. Although PeCanPIE's web-based interface was designed to be accessible to non-bioinformaticians, its back-end pipelines may also be run independently on the cloud, facilitating direct integration and broader adoption. PeCanPIE is publicly available and free for research use., (© 2019 Edmonson et al.; Published by Cold Spring Harbor Laboratory Press.)
- Published
- 2019
- Full Text
- View/download PDF
14. Genetic Risk for Subsequent Neoplasms Among Long-Term Survivors of Childhood Cancer.
- Author
-
Wang Z, Wilson CL, Easton J, Thrasher A, Mulder H, Liu Q, Hedges DJ, Wang S, Rusch MC, Edmonson MN, Levy S, Lanctot JQ, Caron E, Shelton K, Currie K, Lear M, Patel A, Rosencrance C, Shao Y, Vadodaria B, Yergeau D, Sapkota Y, Brooke RJ, Moon W, Rampersaud E, Ma X, Chang TC, Rice SV, Pepper C, Zhou X, Chen X, Chen W, Jones A, Boone B, Ehrhardt MJ, Krasin MJ, Howell RM, Phillips NS, Lewis C, Srivastava D, Pui CH, Kesserwan CA, Wu G, Nichols KE, Downing JR, Hudson MM, Yasui Y, Robison LL, and Zhang J
- Subjects
- Adolescent, Adult, Aged, Child, Cohort Studies, Female, Genetic Predisposition to Disease, Germ-Line Mutation, Humans, Male, Middle Aged, Neoplasms epidemiology, Neoplasms, Second Primary epidemiology, Retrospective Studies, Risk, United States epidemiology, Whole Genome Sequencing, Young Adult, Cancer Survivors statistics & numerical data, Neoplasms genetics, Neoplasms, Second Primary genetics
- Abstract
Purpose Childhood cancer survivors are at increased risk of subsequent neoplasms (SNs), but the germline genetic contribution is largely unknown. We assessed the contribution of pathogenic/likely pathogenic (P/LP) mutations in cancer predisposition genes to their SN risk. Patients and Methods Whole-genome sequencing (30-fold) was performed on samples from childhood cancer survivors who were ≥ 5 years since initial cancer diagnosis and participants in the St Jude Lifetime Cohort Study, a retrospective hospital-based study with prospective clinical follow-up. Germline mutations in 60 genes known to be associated with autosomal dominant cancer predisposition syndromes with moderate to high penetrance were classified by their pathogenicity according to the American College of Medical Genetics and Genomics guidelines. Relative rates (RRs) and 95% CIs of SN occurrence by mutation status were estimated using multivariable piecewise exponential regression stratified by radiation exposure. Results Participants were 3,006 survivors (53% male; median age, 35.8 years [range, 7.1 to 69.8 years]; 56% received radiotherapy), 1,120 SNs were diagnosed among 439 survivors (14.6%), and 175 P/LP mutations were identified in 5.8% (95% CI, 5.0% to 6.7%) of survivors. Mutations were associated with significantly increased rates of breast cancer (RR, 13.9; 95% CI, 6.0 to 32.2) and sarcoma (RR, 10.6; 95% CI, 4.3 to 26.3) among irradiated survivors and with increased rates of developing any SN (RR, 4.7; 95% CI, 2.4 to 9.3), breast cancer (RR, 7.7; 95% CI, 2.4 to 24.4), nonmelanoma skin cancer (RR, 11.0; 95% CI, 2.9 to 41.4), and two or more histologically distinct SNs (RR, 18.6; 95% CI, 3.5 to 99.3) among nonirradiated survivors. Conclusion The findings support referral of all survivors for genetic counseling for potential clinical genetic testing, which should be prioritized for nonirradiated survivors with any SN and for those with breast cancer or sarcoma in the field of prior irradiation.
- Published
- 2018
- Full Text
- View/download PDF
15. A comprehensive approach to expression of L1 loci.
- Author
-
Deininger P, Morales ME, White TB, Baddoo M, Hedges DJ, Servant G, Srivastav S, Smither ME, Concha M, DeHaro DL, Flemington EK, and Belancio VP
- Subjects
- Animals, Chromosome Mapping, Chromosomes, Human metabolism, DNA, Complementary genetics, DNA, Complementary metabolism, Genomic Instability, HeLa Cells, Humans, Mice, NIH 3T3 Cells, Nucleic Acid Amplification Techniques, Promoter Regions, Genetic, RNA, Messenger metabolism, Sequence Analysis, RNA, Chromosomes, Human chemistry, Genetic Loci, Genome, Human, Long Interspersed Nucleotide Elements, RNA, Messenger genetics, Transcription, Genetic
- Abstract
L1 elements represent the only currently active, autonomous retrotransposon in the human genome, and they make major contributions to human genetic instability. The vast majority of the 500 000 L1 elements in the genome are defective, and only a relatively few can contribute to the retrotransposition process. However, there is currently no comprehensive approach to identify the specific loci that are actively transcribed separate from the excess of L1-related sequences that are co-transcribed within genes. We have developed RNA-Seq procedures, as well as a 1200 bp 5΄ RACE product coupled with PACBio sequencing that can identify the specific L1 loci that contribute most of the L1-related RNA reads. At least 99% of L1-related sequences found in RNA do not arise from the L1 promoter, instead representing pieces of L1 incorporated in other cellular RNAs. In any given cell type a relatively few active L1 loci contribute to the 'authentic' L1 transcripts that arise from the L1 promoter, with significantly different loci seen expressed in different tissues., (© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2017
- Full Text
- View/download PDF
16. Sequencing, identification and mapping of primed L1 elements (SIMPLE) reveals significant variation in full length L1 elements between individuals.
- Author
-
Streva VA, Jordan VE, Linker S, Hedges DJ, Batzer MA, and Deininger PL
- Subjects
- Alleles, Cell Line, Chromosome Mapping, Fibroblasts metabolism, Gene Frequency, Genetic Association Studies, Genome, Human, High-Throughput Nucleotide Sequencing, Humans, Long Interspersed Nucleotide Elements, Polymorphism, Genetic, Sequence Analysis, DNA methods
- Abstract
Background: There are over a half a million copies of L1 retroelements in the human genome which are responsible for as much as 0.5% of new human genetic diseases. Most new L1 inserts arise from young source elements that are polymorphic in the human genome. Highly active polymorphic "hot" L1 source elements have been shown to be capable of extremely high levels of mobilization and result in numerous instances of disease. Additionally, hot polymorphic L1s have been described to be highly active within numerous cancer genomes. These hot L1s result in mutagenesis by insertion of new L1 copies elsewhere in the genome, but also have been shown to generate additional full length L1 insertions which are also hot and able to further retrotranspose. Through this mechanism, hot L1s may amplify within a tumor and result in a continued cycle of mutagenesis., Results and Conclusions: We have developed a method to detect full-length, polymorphic L1 elements using a targeted next generation sequencing approach, Sequencing Identification and Mapping of Primed L1 Elements (SIMPLE). SIMPLE has 94% sensitivity and detects nearly all full-length L1 elements in a genome. SIMPLE will allow researchers to identify hot mutagenic full-length L1s as potential drivers of genome instability. Using SIMPLE we find that the typical individual has approximately 100 non-reference, polymorphic L1 elements in their genome. These elements are at relatively low population frequencies relative to previously identified polymorphic L1 elements and demonstrate the tremendous diversity in potentially active L1 elements in the human population.
- Published
- 2015
- Full Text
- View/download PDF
17. The contribution of alu elements to mutagenic DNA double-strand break repair.
- Author
-
Morales ME, White TB, Streva VA, DeFreece CB, Hedges DJ, and Deininger PL
- Subjects
- Animals, DNA Breaks, Double-Stranded, DNA Damage genetics, Genome, Human, Humans, Alu Elements genetics, DNA End-Joining Repair genetics, Recombination, Genetic
- Abstract
Alu elements make up the largest family of human mobile elements, numbering 1.1 million copies and comprising 11% of the human genome. As a consequence of evolution and genetic drift, Alu elements of various sequence divergence exist throughout the human genome. Alu/Alu recombination has been shown to cause approximately 0.5% of new human genetic diseases and contribute to extensive genomic structural variation. To begin understanding the molecular mechanisms leading to these rearrangements in mammalian cells, we constructed Alu/Alu recombination reporter cell lines containing Alu elements ranging in sequence divergence from 0%-30% that allow detection of both Alu/Alu recombination and large non-homologous end joining (NHEJ) deletions that range from 1.0 to 1.9 kb in size. Introduction of as little as 0.7% sequence divergence between Alu elements resulted in a significant reduction in recombination, which indicates even small degrees of sequence divergence reduce the efficiency of homology-directed DNA double-strand break (DSB) repair. Further reduction in recombination was observed in a sequence divergence-dependent manner for diverged Alu/Alu recombination constructs with up to 10% sequence divergence. With greater levels of sequence divergence (15%-30%), we observed a significant increase in DSB repair due to a shift from Alu/Alu recombination to variable-length NHEJ which removes sequence between the two Alu elements. This increase in NHEJ deletions depends on the presence of Alu sequence homeology (similar but not identical sequences). Analysis of recombination products revealed that Alu/Alu recombination junctions occur more frequently in the first 100 bp of the Alu element within our reporter assay, just as they do in genomic Alu/Alu recombination events. This is the first extensive study characterizing the influence of Alu element sequence divergence on DNA repair, which will inform predictions regarding the effect of Alu element sequence divergence on both the rate and nature of DNA repair events.
- Published
- 2015
- Full Text
- View/download PDF
18. Correction: Linear Decay of Retrotransposon Antisense Bias across Genes Is Contingent upon Tissue Specificity.
- Author
-
Linker S and Hedges DJ
- Abstract
[This corrects the article DOI: 10.1371/journal.pone.0079402.].
- Published
- 2014
- Full Text
- View/download PDF
19. Linear decay of retrotransposon antisense bias across genes is contingent upon tissue specificity.
- Author
-
Linker S and Hedges DJ
- Subjects
- 5' Untranslated Regions, Alu Elements genetics, Evolution, Molecular, Exons, Gene Frequency, Humans, Long Interspersed Nucleotide Elements genetics, Nucleotide Motifs, Organ Specificity genetics, Polymorphism, Genetic, Quantitative Trait Loci, RNA, Antisense, Retroelements genetics
- Abstract
Retrotransposons comprise approximately half of the human genome and contribute to chromatin structure, regulatory motifs, and protein-coding sequences. Since retrotransposon insertions can disrupt functional genetic elements as well as introduce new sequence motifs to a region, they have the potential to affect the function of genes that harbour insertions as well as those nearby. Partly as a result of these effects, the distribution of retrotransposons across the genome is non-uniform and there are observed imbalances in the orientation of insertions with respect to the transcriptional direction of the containing gene. Although some of the factors underlying the observed distributions are understood, much of the variability remains unexplained. Detailed characterization of retrotransposon density in genes could help inform predictions of the functional consequence of de novo as well as polymorphic insertions. In order to characterize the relationship between genes and inserted elements, we have examined the distribution of retrotransposons and their internal motifs within tissue-specific and housekeeping genes. We have identified that the previously established retrotransposon antisense bias decays at a linear rate across genes, resulting in an equal density of sense and antisense retrotransposons near the 3'-UTR. In addition, the decay of antisense bias across genes is less pronounced among tissue-specific genes. Our results provide support for the scenario in which this linear decay in antisense bias is established by natural selection shortly after retrotransposon integration, and that total antisense bias observed is above and beyond any bias introduced by the integration process itself. Finally, we provide an example of a retrotransposon acting as an eQTL on a coincident gene, highlighting one of several possible avenues through which insertions may modulate gene function.
- Published
- 2013
- Full Text
- View/download PDF
20. Reconstructing the population genetic history of the Caribbean.
- Author
-
Moreno-Estrada A, Gravel S, Zakharia F, McCauley JL, Byrnes JK, Gignoux CR, Ortiz-Tello PA, Martínez RJ, Hedges DJ, Morris RW, Eng C, Sandoval K, Acevedo-Acevedo S, Norman PJ, Layrisse Z, Parham P, Martínez-Cruzado JC, Burchard EG, Cuccaro ML, Martin ER, and Bustamante CD
- Subjects
- Caribbean Region, DNA, Mitochondrial genetics, Demography, Genomics, Haplotypes, Hispanic or Latino genetics, Humans, Black People genetics, Gene Flow, Genetics, Population, Indians, North American genetics, White People genetics
- Abstract
The Caribbean basin is home to some of the most complex interactions in recent history among previously diverged human populations. Here, we investigate the population genetic history of this region by characterizing patterns of genome-wide variation among 330 individuals from three of the Greater Antilles (Cuba, Puerto Rico, Hispaniola), two mainland (Honduras, Colombia), and three Native South American (Yukpa, Bari, and Warao) populations. We combine these data with a unique database of genomic variation in over 3,000 individuals from diverse European, African, and Native American populations. We use local ancestry inference and tract length distributions to test different demographic scenarios for the pre- and post-colonial history of the region. We develop a novel ancestry-specific PCA (ASPCA) method to reconstruct the sub-continental origin of Native American, European, and African haplotypes from admixed genomes. We find that the most likely source of the indigenous ancestry in Caribbean islanders is a Native South American component shared among inland Amazonian tribes, Central America, and the Yucatan peninsula, suggesting extensive gene flow across the Caribbean in pre-Columbian times. We find evidence of two pulses of African migration. The first pulse--which today is reflected by shorter, older ancestry tracts--consists of a genetic component more similar to coastal West African regions involved in early stages of the trans-Atlantic slave trade. The second pulse--reflected by longer, younger tracts--is more similar to present-day West-Central African populations, supporting historical records of later transatlantic deportation. Surprisingly, we also identify a Latino-specific European component that has significantly diverged from its parental Iberian source populations, presumably as a result of small European founder population size. We demonstrate that the ancestral components in admixed genomes can be traced back to distinct sub-continental source populations with far greater resolution than previously thought, even when limited pre-Columbian Caribbean haplotypes have survived., Competing Interests: JKB is an employee of Ancestry.com. CDB is on the Scientific Advisory Board of Ancestry.com, 23andMe's “Roots into the Future” project, and Personalis, Inc. He is on the medical advisory board of Invitae and Med-tek. None of these entities played any role in the project or research results reported here.
- Published
- 2013
- Full Text
- View/download PDF
21. Dilated cardiomyopathy: the complexity of a diverse genetic architecture.
- Author
-
Hershberger RE, Hedges DJ, and Morales A
- Subjects
- Animals, Arrhythmogenic Right Ventricular Dysplasia genetics, Arrhythmogenic Right Ventricular Dysplasia physiopathology, Cardiomyopathy, Dilated classification, Cardiomyopathy, Dilated physiopathology, Cardiomyopathy, Hypertrophic, Familial genetics, Cardiomyopathy, Hypertrophic, Familial physiopathology, Genetic Predisposition to Disease, Genomics methods, Heredity, Humans, Pedigree, Phenotype, Risk Factors, Terminology as Topic, Cardiomyopathy, Dilated genetics, Mutation
- Abstract
Remarkable progress has been made in understanding the genetic basis of dilated cardiomyopathy (DCM). Rare variants in >30 genes, some also involved in other cardiomyopathies, muscular dystrophy, or syndromic disease, perturb a diverse set of important myocardial proteins to produce a final DCM phenotype. Large, publicly available datasets have provided the opportunity to evaluate previously identified DCM-causing mutations, and to examine the population frequency of sequence variants similar to those that have been observed to cause DCM. The frequency of these variants, whether associated with dilated or hypertrophic cardiomyopathy, is greater than estimates of disease prevalence. This mismatch might be explained by one or more of the following possibilities: that the penetrance of DCM-causing mutations is lower than previously thought, that some variants are noncausal, that DCM prevalence is higher than previously estimated, or that other more-complex genomics underlie DCM. Reassessment of our assumptions about the complexity of the genomic and phenomic architecture of DCM is warranted. Much about the genomic basis of DCM remains to be investigated, which will require comprehensive genomic studies in much larger cohorts of rigorously phenotyped probands and family members than previously examined.
- Published
- 2013
- Full Text
- View/download PDF
22. Copy number variation in pediatric multiple sclerosis.
- Author
-
McElroy JP, Krupp LB, Johnson BA, McCauley JL, Qi Z, Caillier SJ, Gourraud PA, Yu J, Nathanson L, Belman AL, Hauser SL, Waubant E, Hedges DJ, and Oksenberg JR
- Subjects
- Adolescent, Age of Onset, Child, Comparative Genomic Hybridization, Female, Heat-Shock Proteins genetics, Humans, In Situ Hybridization, Fluorescence, Male, Muscle Spasticity genetics, Spinocerebellar Ataxias congenital, Spinocerebellar Ataxias genetics, Gene Dosage, Multiple Sclerosis genetics
- Abstract
Background: Pediatric onset multiple sclerosis (MS) accounts for 2-4% of all MS. It is unknown whether the disease shares the same underlying pathophysiology found in adult patients or an extreme early onset phenotype triggered by distinct biological mechanisms. It has been hypothesized that copy number variations (CNVs) may result in extreme early onset diseases because CNVs can have major effects on many genes in large genomic regions., Objectives and Methods: The objective of the current research was to identify CNVs, with a specific focus on de novo CNVs, potentially causing early onset MS by competitively hybridizing 30 white non-Hispanic pediatric MS patients with each of their parents via comparative genomic hybridization (CGH) analysis on the Agilent 1M CGH array., Results and Discussion: We identified 10 CNVs not overlapping with any CNV regions currently reported in the Database of Genomic Variants (DGV). Fifty-five putatively de novo CNVs were also identified: all but one common in the DGV. We found the single rare CNV was a private variation harboring the SACS gene. SACS mutations cause autosomal-recessive spastic ataxia of Charlevoix-Saguenay (ARSACS) disease. Additional clinical review revealed that the patient with the SACS gene CNV shared some features of both MS and ARSACS., Conclusions: This is the first reported study analyzing pediatric MS CNVs. While not yielding causal variation in our initial pediatric dataset, our approach confirmed diagnosis of an ARSACS-like disease in addition to MS in the affected individual, which led to a more complete understanding of the patient's disease course and prognosis.
- Published
- 2013
- Full Text
- View/download PDF
23. Exome sequencing and genome-wide linkage analysis in 17 families illustrate the complex contribution of TTN truncating variants to dilated cardiomyopathy.
- Author
-
Norton N, Li D, Rampersaud E, Morales A, Martin ER, Zuchner S, Guo S, Gonzalez M, Hedges DJ, Robertson PD, Krumm N, Nickerson DA, and Hershberger RE
- Subjects
- Adolescent, Adult, Aged, Cardiomyopathy, Dilated metabolism, Cardiomyopathy, Dilated pathology, Chromosomes, Human, Pair 9, Connectin, Female, Genetic Heterogeneity, Genetic Linkage, Genetic Loci, Humans, Male, Middle Aged, Mutation, Missense, Odds Ratio, Pedigree, Sequence Analysis, DNA, Young Adult, Cardiomyopathy, Dilated genetics, Exome genetics, Genome, Human, Muscle Proteins genetics, Protein Kinases genetics
- Abstract
BACKGROUND- Familial dilated cardiomyopathy (DCM) is a genetically heterogeneous disease with >30 known genes. TTN truncating variants were recently implicated in a candidate gene study to cause 25% of familial and 18% of sporadic DCM cases. METHODS AND RESULTS- We used an unbiased genome-wide approach using both linkage analysis and variant filtering across the exome sequences of 48 individuals affected with DCM from 17 families to identify genetic cause. Linkage analysis ranked the TTN region as falling under the second highest genome-wide multipoint linkage peak, multipoint logarithm of odds, 1.59. We identified 6 TTN truncating variants carried by individuals affected with DCM in 7 of 17 DCM families (logarithm of odds, 2.99); 2 of these 7 families also had novel missense variants that segregated with disease. Two additional novel truncating TTN variants did not segregate with DCM. Nucleotide diversity at the TTN locus, including missense variants, was comparable with 5 other known DCM genes. The average number of missense variants in the exome sequences from the DCM cases or the ≈5400 cases from the Exome Sequencing Project was ≈23 per individual. The average number of TTN truncating variants in the Exome Sequencing Project was 0.014 per individual. We also identified a region (chr9q21.11-q22.31) with no known DCM genes with a maximum heterogeneity logarithm of odds score of 1.74. CONCLUSIONS- These data suggest that TTN truncating variants contribute to DCM cause. However, the lack of segregation of all identified TTN truncating variants illustrates the challenge of determining variant pathogenicity even with full exome sequencing.
- Published
- 2013
- Full Text
- View/download PDF
24. Evaluating mitochondrial DNA variation in autism spectrum disorders.
- Author
-
Hadjixenofontos A, Schmidt MA, Whitehead PL, Konidari I, Hedges DJ, Wright HH, Abramson RK, Menon R, Williams SM, Cuccaro ML, Haines JL, Gilbert JR, Pericak-Vance MA, Martin ER, and McCauley JL
- Subjects
- Adolescent, Adult, Child, Child, Preschool, Genome-Wide Association Study, Haplotypes, Humans, Mutation, Polymorphism, Single Nucleotide, Young Adult, Child Development Disorders, Pervasive genetics, DNA, Mitochondrial genetics, Genetic Variation
- Abstract
Despite the increasing speculation that oxidative stress and abnormal energy metabolism may play a role in Autism Spectrum Disorders (ASD), and the observation that patients with mitochondrial defects have symptoms consistent with ASD, there are no comprehensive published studies examining the role of mitochondrial variation in autism. Therefore, we have sought to comprehensively examine the role of mitochondrial DNA (mtDNA) variation with regard to ASD risk, employing a multi-phase approach. In phase 1 of our experiment, we examined 132 mtDNA single-nucleotide polymorphisms (SNPs) genotyped as part of our genome-wide association studies of ASD. In phase 2 we genotyped the major European mitochondrial haplogroup-defining variants within an expanded set of autism probands and controls. Finally in phase 3, we resequenced the entire mtDNA in a subset of our Caucasian samples (∼400 proband-father pairs). In each phase we tested whether mitochondrial variation showed evidence of association to ASD. Despite a thorough interrogation of mtDNA variation, we found no evidence to suggest a major role for mtDNA variation in ASD susceptibility. Accordingly, while there may be attractive biological hints suggesting the role of mitochondria in ASD our data indicate that mtDNA variation is not a major contributing factor to the development of ASD., (© 2012 Blackwell Publishing Ltd/University College London.)
- Published
- 2013
- Full Text
- View/download PDF
25. Rescuing Alu: recovery of new inserts shows LINE-1 preserves Alu activity through A-tail expansion.
- Author
-
Wagstaff BJ, Hedges DJ, Derbes RS, Campos Sanchez R, Chiaromonte F, Makova KD, and Roy-Engel AM
- Subjects
- 3' Flanking Region, 5' Flanking Region, Base Sequence, Endonucleases genetics, Endonucleases metabolism, Evolution, Molecular, Exons, Genome, Human, HeLa Cells, Humans, Introns, Molecular Sequence Data, RNA-Directed DNA Polymerase genetics, RNA-Directed DNA Polymerase metabolism, Reverse Transcription, Alu Elements genetics, Long Interspersed Nucleotide Elements genetics, Mutagenesis, Insertional, Terminal Repeat Sequences genetics
- Abstract
Alu elements are trans-mobilized by the autonomous non-LTR retroelement, LINE-1 (L1). Alu-induced insertion mutagenesis contributes to about 0.1% human genetic disease and is responsible for the majority of the documented instances of human retroelement insertion-induced disease. Here we introduce a SINE recovery method that provides a complementary approach for comprehensive analysis of the impact and biological mechanisms of Alu retrotransposition. Using this approach, we recovered 226 de novo tagged Alu inserts in HeLa cells. Our analysis reveals that in human cells marked Alu inserts driven by either exogenously supplied full length L1 or ORF2 protein are indistinguishable. Four percent of de novo Alu inserts were associated with genomic deletions and rearrangements and lacked the hallmarks of retrotransposition. In contrast to L1 inserts, 5' truncations of Alu inserts are rare, as most of the recovered inserts (96.5%) are full length. De novo Alus show a random pattern of insertion across chromosomes, but further characterization revealed an Alu insertion bias exists favoring insertion near other SINEs, highly conserved elements, with almost 60% landing within genes. De novo Alu inserts show no evidence of RNA editing. Priming for reverse transcription rarely occurred within the first 20 bp (most 5') of the A-tail. The A-tails of recovered inserts show significant expansion, with many at least doubling in length. Sequence manipulation of the construct led to the demonstration that the A-tail expansion likely occurs during insertion due to slippage by the L1 ORF2 protein. We postulate that the A-tail expansion directly impacts Alu evolution by reintroducing new active source elements to counteract the natural loss of active Alus and minimizing Alu extinction., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2012
- Full Text
- View/download PDF
26. An X chromosome-wide association study in autism families identifies TBL1X as a novel autism spectrum disorder candidate gene in males.
- Author
-
Chung RH, Ma D, Wang K, Hedges DJ, Jaworski JM, Gilbert JR, Cuccaro ML, Wright HH, Abramson RK, Konidari I, Whitehead PL, Schellenberg GD, Hakonarson H, Haines JL, Pericak-Vance MA, and Martin ER
- Abstract
Background: Autism spectrum disorder (ASD) is a complex neurodevelopmental disorder with a strong genetic component. The skewed prevalence toward males and evidence suggestive of linkage to the X chromosome in some studies suggest the presence of X-linked susceptibility genes in people with ASD., Methods: We analyzed genome-wide association study (GWAS) data on the X chromosome in three independent autism GWAS data sets: two family data sets and one case-control data set. We performed meta- and joint analyses on the combined family and case-control data sets. In addition to the meta- and joint analyses, we performed replication analysis by using the two family data sets as a discovery data set and the case-control data set as a validation data set., Results: One SNP, rs17321050, in the transducin β-like 1X-linked (TBL1X) gene [OMIM:300196] showed chromosome-wide significance in the meta-analysis (P value = 4.86 × 10-6) and joint analysis (P value = 4.53 × 10-6) in males. The SNP was also close to the replication threshold of 0.0025 in the discovery data set (P = 5.89 × 10-3) and passed the replication threshold in the validation data set (P = 2.56 × 10-4). Two other SNPs in the same gene in linkage disequilibrium with rs17321050 also showed significance close to the chromosome-wide threshold in the meta-analysis., Conclusions: TBL1X is in the Wnt signaling pathway, which has previously been implicated as having a role in autism. Deletions in the Xp22.2 to Xp22.3 region containing TBL1X and surrounding genes are associated with several genetic syndromes that include intellectual disability and autistic features. Our results, based on meta-analysis, joint analysis and replication analysis, suggest that TBL1X may play a role in ASD risk.
- Published
- 2011
- Full Text
- View/download PDF
27. Comparison of three targeted enrichment strategies on the SOLiD sequencing platform.
- Author
-
Hedges DJ, Guettouche T, Yang S, Bademci G, Diaz A, Andersen A, Hulme WF, Linker S, Mehta A, Edwards YJ, Beecham GW, Martin ER, Pericak-Vance MA, Zuchner S, Vance JM, and Gilbert JR
- Subjects
- Alleles, Base Sequence, Computational Biology methods, DNA Primers chemistry, Female, Genotype, Heterozygote, Humans, Male, Molecular Sequence Data, Nucleic Acid Hybridization, Polymerase Chain Reaction methods, Reproducibility of Results, Software, Sequence Analysis, DNA methods
- Abstract
Despite the ever-increasing throughput and steadily decreasing cost of next generation sequencing (NGS), whole genome sequencing of humans is still not a viable option for the majority of genetics laboratories. This is particularly true in the case of complex disease studies, where large sample sets are often required to achieve adequate statistical power. To fully leverage the potential of NGS technology on large sample sets, several methods have been developed to selectively enrich for regions of interest. Enrichment reduces both monetary and computational costs compared to whole genome sequencing, while allowing researchers to take advantage of NGS throughput. Several targeted enrichment approaches are currently available, including molecular inversion probe ligation sequencing (MIPS), oligonucleotide hybridization based approaches, and PCR-based strategies. To assess how these methods performed when used in conjunction with the ABI SOLID3+, we investigated three enrichment techniques: Nimblegen oligonucleotide hybridization array-based capture; Agilent SureSelect oligonucleotide hybridization solution-based capture; and Raindance Technologies' multiplexed PCR-based approach. Target regions were selected from exons and evolutionarily conserved areas throughout the human genome. Probe and primer pair design was carried out for all three methods using their respective informatics pipelines. In all, approximately 0.8 Mb of target space was identical for all 3 methods. SOLiD sequencing results were analyzed for several metrics, including consistency of coverage depth across samples, on-target versus off-target efficiency, allelic bias, and genotype concordance with array-based genotyping data. Agilent SureSelect exhibited superior on-target efficiency and correlation of read depths across samples. Nimblegen performance was similar at read depths at 20× and below. Both Raindance and Nimblegen SeqCap exhibited tighter distributions of read depth around the mean, but both suffered from lower on-target efficiency in our experiments. Raindance demonstrated the highest versatility in assay design.
- Published
- 2011
- Full Text
- View/download PDF
28. Restless genomes humans as a model organism for understanding host-retrotransposable element dynamics.
- Author
-
Hedges DJ and Belancio VP
- Subjects
- Animals, Biological Evolution, Endogenous Retroviruses metabolism, Gene Expression, Genetic Fitness, Genomic Instability, Host-Parasite Interactions, Humans, Mice, Mice, Transgenic, Models, Genetic, Polymorphism, Genetic, DNA Transposable Elements, Endogenous Retroviruses genetics, Genome, Human, Retroelements
- Abstract
Since their initial discovery in maize, there have been various attempts to categorize the relationship between transposable elements (TEs) and their host organisms. These have ranged from TEs being selfish parasites to their role as essential, functional components of organismal biology. Research over the past several decades has, in many respects, only served to complicate the issue even further. On the one hand, investigators have amassed substantial evidence concerning the negative effects that TE-mutagenic activity can have on host genomes and organismal fitness. On the other hand, we find an increasing number of examples, across several taxa, of TEs being incorporated into functional biological roles for their host organism. Some 45% of our own genomes are comprised of TE copies. While many of these copies are dormant, having lost their ability to mobilize, several lineages continue to actively proliferate in modern human populations. With its complement of ancestral and active TEs, the human genome exhibits key aspects of the host-TE dynamic that has played out since early on in organismal evolution. In this review, we examine what insights the particularly well-characterized human system can provide regarding the nature of the host-TE interaction., (Copyright © 2011 Elsevier Inc. All rights reserved.)
- Published
- 2011
- Full Text
- View/download PDF
29. Exome sequencing of a multigenerational human pedigree.
- Author
-
Hedges DJ, Burges D, Powell E, Almonte C, Huang J, Young S, Boese B, Schmidt M, Pericak-Vance MA, Martin E, Zhang X, Harkins TT, and Züchner S
- Subjects
- Female, Genotype, Humans, Inheritance Patterns genetics, Male, Polymorphism, Single Nucleotide genetics, Exons genetics, Family Characteristics, Genome, Human genetics, Pedigree, Sequence Analysis, DNA
- Abstract
Over the next few years, the efficient use of next-generation sequencing (NGS) in human genetics research will depend heavily upon the effective mechanisms for the selective enrichment of genomic regions of interest. Recently, comprehensive exome capture arrays have become available for targeting approximately 33 Mb or approximately 180,000 coding exons across the human genome. Selective genomic enrichment of the human exome offers an attractive option for new experimental designs aiming to quickly identify potential disease-associated genetic variants, especially in family-based studies. We have evaluated a 2.1 M feature human exome capture array on eight individuals from a three-generation family pedigree. We were able to cover up to 98% of the targeted bases at a long-read sequence read depth of > or = 3, 86% at a read depth of > or = 10, and over 50% of all targets were covered with > or = 20 reads. We identified up to 14,284 SNPs and small indels per individual exome, with up to 1,679 of these representing putative novel polymorphisms. Applying the conservative genotype calling approach HCDiff, the average rate of detection of a variant allele based on Illumina 1 M BeadChips genotypes was 95.2% at > or = 10x sequence. Further, we propose an advantageous genotype calling strategy for low covered targets that empirically determines cut-off thresholds at a given coverage depth based on existing genotype data. Application of this method was able to detect >99% of SNPs covered > or = 8x. Our results offer guidance for "real-world" applications in human genetics and provide further evidence that microarray-based exome capture is an efficient and reliable method to enrich for chromosomal regions of interest in next-generation sequencing experiments.
- Published
- 2009
- Full Text
- View/download PDF
30. Alu repeats increase local recombination rates.
- Author
-
Witherspoon DJ, Watkins WS, Zhang Y, Xing J, Tolpinrud WL, Hedges DJ, Batzer MA, and Jorde LB
- Subjects
- Base Sequence, Child, Consensus Sequence, Evolution, Molecular, Gene Dosage, Genome, Human, Haplotypes, Humans, Mutagenesis, Insertional, Polymorphism, Single Nucleotide, Racial Groups genetics, Time Factors, Alu Elements genetics, Recombination, Genetic
- Abstract
Background: Recombination rates vary widely across the human genome, but little of that variation is correlated with known DNA sequence features. The genome contains more than one million Alu mobile element insertions, and these insertions have been implicated in non-homologous recombination, modulation of DNA methylation, and transcriptional regulation. If individual Alu insertions have even modest effects on local recombination rates, they could collectively have a significant impact on the pattern of linkage disequilibrium in the human genome and on the evolution of the Alu family itself., Results: We carried out sequencing, SNP identification, and SNP genotyping around 19 AluY insertion loci in 347 individuals sampled from diverse populations, then used the SNP genotypes to estimate local recombination rates around the AluY loci. The loci and SNPs were chosen so as to minimize other factors (such as SNP ascertainment bias and SNP density) that could influence recombination rate estimates. We detected a significant increase in recombination rate within approximately 2 kb of the AluY insertions in our African population sample. To test this observation against a larger set of AluY insertions, we applied our locus- and SNP-selection design and analyses to the HapMap Phase II data. In that data set, we observed a significantly increased recombination rate near AluY insertions in both the CEU and YRI populations., Conclusion: We show that the presence of a fixed AluY insertion is significantly predictive of an elevated local recombination rate within 2 kb of the insertion, independent of other known predictors. The magnitude of this effect, approximately a 6% increase, is comparable to the effects of some recombinogenic DNA sequence motifs identified via their association with recombination hot spots.
- Published
- 2009
- Full Text
- View/download PDF
31. A genome-wide association study of autism reveals a common novel risk locus at 5p14.1.
- Author
-
Ma D, Salyakina D, Jaworski JM, Konidari I, Whitehead PL, Andersen AN, Hoffman JD, Slifer SH, Hedges DJ, Cukier HN, Griswold AJ, McCauley JL, Beecham GW, Wright HH, Abramson RK, Martin ER, Hussman JP, Gilbert JR, Cuccaro ML, Haines JL, and Pericak-Vance MA
- Subjects
- Adolescent, Child, Child, Preschool, Female, Genetic Predisposition to Disease, Humans, Male, Pedigree, Polymorphism, Single Nucleotide, White People genetics, Young Adult, Autistic Disorder genetics, Chromosomes, Human, Pair 5 genetics, Genome-Wide Association Study
- Abstract
Although autism is one of the most heritable neuropsychiatric disorders, its underlying genetic architecture has largely eluded description. To comprehensively examine the hypothesis that common variation is important in autism, we performed a genome-wide association study (GWAS) using a discovery dataset of 438 autistic Caucasian families and the Illumina Human 1M beadchip. 96 single nucleotide polymorphisms (SNPs) demonstrated strong association with autism risk (p-value < 0.0001). The validation of the top 96 SNPs was performed using an independent dataset of 487 Caucasian autism families genotyped on the 550K Illumina BeadChip. A novel region on chromosome 5p14.1 showed significance in both the discovery and validation datasets. Joint analysis of all SNPs in this region identified 8 SNPs having improved p-values (3.24E-04 to 3.40E-06) than in either dataset alone. Our findings demonstrate that in addition to multiple rare variations, part of the complex genetic architecture of autism involves common variation.
- Published
- 2009
- Full Text
- View/download PDF
32. Diverse cis factors controlling Alu retrotransposition: what causes Alu elements to die?
- Author
-
Comeaux MS, Roy-Engel AM, Hedges DJ, and Deininger PL
- Subjects
- Base Sequence, Blotting, Northern, Cystic Fibrosis Transmembrane Conductance Regulator genetics, HeLa Cells, Humans, Molecular Sequence Data, Mutation genetics, Poly A genetics, RNA, Messenger genetics, RNA, Messenger metabolism, Reverse Transcriptase Polymerase Chain Reaction, Sequence Homology, Nucleic Acid, Alu Elements genetics, Genome, Human, Polymorphism, Genetic, Regulatory Sequences, Nucleic Acid genetics, Retroelements genetics
- Abstract
The human genome contains nearly 1.1 million Alu elements comprising roughly 11% of its total DNA content. Alu elements use a copy and paste retrotransposition mechanism that can result in de novo disease insertion alleles. There are nearly 900,000 old Alu elements from subfamilies S and J that appear to be almost completely inactive, and about 200,000 from subfamily Y or younger, which include a few thousand copies of the Ya5 subfamily which makes up the majority of current activity. Given the much higher copy number of the older Alu subfamilies, it is not known why all of the active Alu elements belong to the younger subfamilies. We present a systematic analysis evaluating the observed sequence variation in the different sections of an Alu element on retrotransposition. The length of the longest number of uninterrupted adenines in the A-tail, the degree of A-tail heterogeneity, the length of the 3' unique end after the A-tail and before the RNA polymerase III terminator, and random mutations found in the right monomer all modulate the retrotransposition efficiency. These changes occur over different evolutionary time frames. The combined impact of sequence changes in all of these regions explains why young Alus are currently causing disease through retrotransposition, and the old Alus have lost their ability to retrotranspose. We present a predictive model to evaluate the retrotransposition capability of individual Alu elements and successfully applied it to identify the first putative source element for a disease-causing Alu insertion in a patient with cystic fibrosis.
- Published
- 2009
- Full Text
- View/download PDF
33. Sample degradation leads to false-positive copy number variation calls in multiplex real-time polymerase chain reaction assays.
- Author
-
Cukier HN, Pericak-Vance MA, Gilbert JR, and Hedges DJ
- Subjects
- False Positive Reactions, Gene Dosage, Humans, Sensitivity and Specificity, Genetic Variation, Genome, Human, Polymerase Chain Reaction
- Abstract
The recent implication of genomic copy number variations (CNVs) in multiple human genetic disorders has led to increased interest in CNV discovery technologies. There is a growing consensus that, in addition to the method used for detection, at least one additional technology should be employed for validation. Real-time quantitative polymerase chain reaction (qPCR) analysis, incorporating a normal (2N) copy number standard, is commonly used as a means of validating CNVs. Whereas it has previously been reported that formalin-fixed paraffin-embedded (FFPE) DNA samples can yield spurious CNV calls in real-time qPCR assays, here we report that sample degradation under standard laboratory storage conditions generates a significant increase in false-positive CNV results. Results suggest the possibility of biased degradation among genomic regions and emphasize the need to assess sample integrity immediately prior to real-time qPCR experiments.
- Published
- 2009
- Full Text
- View/download PDF
34. Rare mutations of FGFR2 causing apert syndrome: identification of the first partial gene deletion, and an Alu element insertion from a new subfamily.
- Author
-
Bochukova EG, Roscioli T, Hedges DJ, Taylor IB, Johnson D, David DJ, Deininger PL, and Wilkie AO
- Subjects
- Acrocephalosyndactylia diagnosis, Adolescent, Adult, Base Pairing, Base Sequence, Child, Preschool, DNA Mutational Analysis, Exons genetics, Fathers, Genome, Human genetics, Humans, Infant, Male, Molecular Sequence Data, Acrocephalosyndactylia genetics, Alu Elements genetics, Gene Deletion, Mutagenesis, Insertional genetics, Receptor, Fibroblast Growth Factor, Type 2 genetics
- Abstract
Apert syndrome (AS) is a severe disorder, characterized by craniosynostosis and complex syndactyly of the hands and feet. Two heterozygous gain-of-function substitutions (Ser252Trp and Pro253Arg) in exon IIIa of fibroblast growth factor receptor 2 (FGFR2) are responsible for >98% of cases. Here we describe two novel mutations in FGFR2 in the two patients in whom a mutation had not previously been found in our cohort of 227 AS cases. The first is a 1.93-kb deletion, removing exon IIIc and substantial portions of the flanking introns. This is the first large FGFR2 deletion described in any individual with craniosynostosis. The other mutation is a 5' truncated Alu insertion into exon IIIc. This is the third Alu insertion identified in AS; all have occurred within an interval of only 104 bp, representing an enrichment of over a million-fold compared to the background genomic rate. We show that the inserted Alu element belongs to a small subfamily, not previously known to be mobile, which we term Alu Yk13. Both the deletion and insertion are likely to act by a similar gain-of-function mechanism in which disruption of exon IIIc leads to illegitimate mesenchymal expression of an FGFR2 spliceform containing the alternatively spliced exon IIIb. All the AS-associated Alu insertions have arisen in the paternal germline; we propose that their enrichment in FGFR2 is driven by positive selection of the mutant spermatogonial progenitors, a mechanism analogous to that explaining why the canonical AS nucleotide substitutions also reach exceptionally high levels in sperm., ((c) 2008 Wiley-Liss, Inc.)
- Published
- 2009
- Full Text
- View/download PDF
35. Identification of repeat structure in large genomes using repeat probability clouds.
- Author
-
Gu W, Castoe TA, Hedges DJ, Batzer MA, and Pollock DD
- Subjects
- Alu Elements genetics, Chromosomes, Human, Pair 1 genetics, False Positive Reactions, Humans, Oligonucleotides genetics, Sensitivity and Specificity, Time Factors, Algorithms, Genome, Human genetics, Probability, Repetitive Sequences, Nucleic Acid genetics
- Abstract
The identification of repeat structure in eukaryotic genomes can be time-consuming and difficult because of the large amount of information ( approximately 3 x 10(9) bp) that needs to be processed and compared. We introduce a new approach based on exact word counts to evaluate, de novo, the repeat structure present within large eukaryotic genomes. This approach avoids sequence alignment and similarity search, two of the most time-consuming components of traditional methods for repeat identification. Algorithms were implemented to efficiently calculate exact counts for any length oligonucleotide in large genomes. Based on these oligonucleotide counts, oligonucleotide excess probability clouds, or "P-clouds," were constructed. P-clouds are composed of clusters of related oligonucleotides that occur, as a group, more often than expected by chance. After construction, P-clouds were mapped back onto the genome, and regions of high P-cloud density were identified as repetitive regions based on a sliding window approach. This efficient method is capable of analyzing the repeat content of the entire human genome on a single desktop computer in less than half a day, at least 10-fold faster than current approaches. The predicted repetitive regions strongly overlap with known repeat elements as well as other repetitive regions such as gene families, pseudogenes, and segmental duplicons. This method should be extremely useful as a tool for use in de novo identification of repeat structure in large newly sequenced genomes.
- Published
- 2008
- Full Text
- View/download PDF
36. Mammalian non-LTR retrotransposons: for better or worse, in sickness and in health.
- Author
-
Belancio VP, Hedges DJ, and Deininger P
- Subjects
- Animals, Evolution, Molecular, Genetic Diseases, Inborn genetics, Genomics, Humans, Long Interspersed Nucleotide Elements, Mutagenesis, Insertional, Short Interspersed Nucleotide Elements, Retroelements
- Abstract
Transposable elements (TEs) have shared an exceptionally long coexistence with their host organisms and have come to occupy a significant fraction of eukaryotic genomes. The bulk of the expansion occurring within mammalian genomes has arisen from the activity of type I retrotransposons, which amplify in a "copy-and-paste" fashion through an RNA intermediate. For better or worse, the sequences of these retrotransposons are now wedded to the genomes of their mammalian hosts. Although there are several reported instances of the positive contribution of mobile elements to their host genomes, these discoveries have occurred alongside growing evidence of the role of TEs in human disease and genetic instability. Here we examine, with a particular emphasis on human retrotransposon activity, several newly discovered aspects of mammalian retrotransposon biology. We consider their potential impact on host biology as well as their ultimate implications for the nature of the TE-host relationship.
- Published
- 2008
- Full Text
- View/download PDF
37. Characterization of pre-insertion loci of de novo L1 insertions.
- Author
-
Gasior SL, Preston G, Hedges DJ, Gilbert N, Moran JV, and Deininger PL
- Subjects
- Alu Elements, Base Composition, Base Sequence, Chromosome Mapping, DNA chemistry, DNA genetics, Genome, Human, HeLa Cells, Humans, Models, Genetic, Short Interspersed Nucleotide Elements, Long Interspersed Nucleotide Elements
- Abstract
The human Long Interspersed Element-1 (LINE-1) and the Short Interspersed Element (SINE) Alu comprise 28% of the human genome. They share the same L1-encoded endonuclease for insertion, which recognizes an A+T-rich sequence. Under a simple model of insertion distribution, this nucleotide preference would lead to the prediction that the populations of both elements would be biased towards A+T-rich regions. Genomic L1 elements do show an A+T-rich bias. In contrast, Alu is biased towards G+C-rich regions when compared to the genome average. Several analyses have demonstrated that relatively recent insertions of both elements show less G+C content bias relative to older elements. We have analyzed the repetitive element and G+C composition of more than 100 pre-insertion loci derived from de novo L1 insertions in cultured human cancer cells, which should represent an evolutionarily unbiased set of insertions. An A+T-rich bias is observed in the 50 bp flanking the endonuclease target site, consistent with the known target site for the L1 endonuclease. The L1, Alu, and G+C content of 20 kb of the de novo pre-insertion loci shows a different set of biases than that observed for fixed L1s in the human genome. In contrast to the insertion sites of genomic L1s, the de novo L1 pre-insertion loci are relatively L1-poor, Alu-rich and G+C neutral. Finally, a statistically significant cluster of de novo L1 insertions was localized in the vicinity of the c-myc gene. These results suggest that the initial insertion preference of L1, while A+T-rich in the initial vicinity of the break site, can be influenced by the broader content of the flanking genomic region and have implications for understanding the dynamics of L1 and Alu distributions in the human genome.
- Published
- 2007
- Full Text
- View/download PDF
38. Different evolutionary fates of recently integrated human and chimpanzee LINE-1 retrotransposons.
- Author
-
Lee J, Cordaux R, Han K, Wang J, Hedges DJ, Liang P, and Batzer MA
- Subjects
- 5' Untranslated Regions, Animals, Evolution, Molecular, Gorilla gorilla genetics, Humans, Molecular Sequence Data, Phylogeny, Polymorphism, Genetic, Pongo pygmaeus genetics, Species Specificity, Time Factors, Long Interspersed Nucleotide Elements, Pan troglodytes genetics
- Abstract
The long interspersed element-1 (LINE-1 or L1) is a highly successful retrotransposon in mammals. L1 elements have continued to actively propagate subsequent to the human-chimpanzee divergence, approximately 6 million years ago, resulting in species-specific inserts. Here, we report a detailed characterization of chimpanzee-specific L1 subfamily diversity and a comparison with their human-specific counterparts. Our results indicate that L1 elements have experienced different evolutionary fates in humans and chimpanzees within the past approximately 6 million years. Although the species-specific L1 copy numbers are on the same order in both species (1200-2000 copies), the number of retrotransposition-competent elements appears to be much higher in the human genome than in the chimpanzee genome. Also, while human L1 subfamilies belong to the same lineage, we identified two lineages of recently integrated L1 subfamilies in the chimpanzee genome. The two lineages seem to have coexisted for several million years, but only one shows evidence of expansion within the past three million years. These differential evolutionary paths may be the result of random variation, or the product of competition between L1 subfamily lineages. Our results suggest that the coexistence of several L1 subfamily lineages within a species may be resolved in a very short evolutionary period of time, perhaps in just a few million years. Therefore, the chimpanzee genome constitutes an excellent model in which to analyze the evolutionary dynamics of L1 retrotransposons.
- Published
- 2007
- Full Text
- View/download PDF
39. Inviting instability: Transposable elements, double-strand breaks, and the maintenance of genome integrity.
- Author
-
Hedges DJ and Deininger PL
- Subjects
- Animals, Base Sequence, DNA Repair, Endonucleases genetics, Humans, Interspersed Repetitive Sequences, Models, Genetic, Molecular Sequence Data, Mutagens, Recombination, Genetic, Sequence Deletion, Transcription, Genetic, DNA Breaks, Double-Stranded, DNA Transposable Elements, Genomic Instability
- Abstract
The ubiquity of mobile elements in mammalian genomes poses considerable challenges for the maintenance of genome integrity. The predisposition of mobile elements towards participation in genomic rearrangements is largely a consequence of their interspersed homologous nature. As tracts of nonallelic sequence homology, they have the potential to interact in a disruptive manner during both meiotic recombination and DNA repair processes, resulting in genomic alterations ranging from deletions and duplications to large-scale chromosomal rearrangements. Although the deleterious effects of transposable element (TE) insertion events have been extensively documented, it is arguably through post-insertion genomic instability that they pose the greatest hazard to their host genomes. Despite the periodic generation of important evolutionary innovations, genomic alterations involving TE sequences are far more frequently neutral or deleterious in nature. The potentially negative consequences of this instability are perhaps best illustrated by the >25 human genetic diseases that are attributable to TE-mediated rearrangements. Some of these rearrangements, such as those involving the MLL locus in leukemia and the LDL receptor in familial hypercholesterolemia, represent recurrent mutations that have independently arisen multiple times in human populations. While TE-instability has been a potent force in shaping eukaryotic genomes and a significant source of genetic disease, much concerning the mechanisms governing the frequency and variety of these events remains to be clarified. Here we survey the current state of knowledge regarding the mechanisms underlying mobile element-based genetic instability in mammals. Compared to simpler eukaryotic systems, mammalian cells appear to have several modifications to their DNA-repair ensemble that allow them to better cope with the large amount of interspersed homology that has been generated by TEs. In addition to the disruptive potential of nonallelic sequence homology, we also consider recent evidence suggesting that the endonuclease products of TEs may also play a key role in instigating mammalian genomic instability.
- Published
- 2007
- Full Text
- View/download PDF
40. Estimating the retrotransposition rate of human Alu elements.
- Author
-
Cordaux R, Hedges DJ, Herke SW, and Batzer MA
- Subjects
- Genetic Techniques, Genome, Human, Humans, Mutation, Short Interspersed Nucleotide Elements, Alu Elements, Evolution, Molecular, Retroelements
- Abstract
Mobile elements such as Alu repeats have substantially altered the architecture of the human genome, and de novo mobile element insertions sometimes cause genetic disorders. Previous estimates for the retrotransposition rate (RR) of Alu elements in humans of one new insertion every approximately 100-125 births were developed prior to the sequencing of the human and chimpanzee genomes. Here, we used two independent methods (based on the new genomic data and on disease-causing de novo Alu insertions) to generate refined Alu RR estimates in humans. Both methods consistently yielded RR on the order of one new Alu insertion every approximately 20 births, despite the fact that the evolutionary-based method represents an average RR over the past approximately 6 million years while the mutation-based method better reflects the current-day RR. These results suggest that Alu elements retrotranspose at a faster rate in humans than previously thought, and support the potential of Alu elements as mutagenic factors in the human genome.
- Published
- 2006
- Full Text
- View/download PDF
41. LINE-1 RNA splicing and influences on mammalian gene expression.
- Author
-
Belancio VP, Hedges DJ, and Deininger P
- Subjects
- Animals, Cell Line, Gene Expression Regulation, HeLa Cells, Humans, Mice, NIH 3T3 Cells, Polyadenylation, RNA Splice Sites, RNA, Messenger metabolism, Alternative Splicing, Long Interspersed Nucleotide Elements
- Abstract
Long interspersed element-1 elements compose on average one-fifth of mammalian genomes. The expression and retrotransposition of L1 is restricted by a number of cellular mechanisms in order to limit their damage in both germ-line and somatic cells. L1 transcription is largely suppressed in most tissues, but L1 mRNA and/or proteins are still detectable in testes, a number of specific somatic cell types, and malignancies. Down-regulation of L1 expression via premature polyadenylation has been found to be a secondary mechanism of limiting L1 expression. We demonstrate that mammalian L1 elements contain numerous functional splice donor and acceptor sites. Efficient usage of some of these sites results in extensive and complex splicing of L1. Several splice variants of both the human and mouse L1 elements undergo retrotransposition. Some of the spliced L1 mRNAs can potentially contribute to expression of open reading frame 2-related products and therefore have implications for the mobility of SINEs even if they are incompetent for L1 retrotransposition. Analysis of the human EST database revealed that L1 elements also participate in splicing events with other genes. Such contribution of functional splice sites by L1 may result in disruption of normal gene expression or formation of alternative mRNA transcripts.
- Published
- 2006
- Full Text
- View/download PDF
42. SVA elements: a hominid-specific retroposon family.
- Author
-
Wang H, Xing J, Grover D, Hedges DJ, Han K, Walker JA, and Batzer MA
- Subjects
- Animals, Evolution, Molecular, Genome, Human, Hominidae, Humans, Species Specificity, Regulatory Sequences, Nucleic Acid genetics, Retroelements
- Abstract
SVA is a composite repetitive element named after its main components, SINE, VNTR and Alu. We have identified 2762 SVA elements from the human genome draft sequence. Genomic distribution analysis indicates that the SVA elements are enriched in G+C-rich regions but have no preferences for inter- or intragenic regions. A phylogenetic analysis of the elements resulted in the recovery of six subfamilies that were named SVA_A to SVA_F. The composition, age and genomic distribution of the subfamilies have been examined. Subfamily age estimates based upon nucleotide divergence indicate that the expansion of four SVA subfamilies (SVA_A, SVA_B, SVA_C and SVA_D) began before the divergence of human, chimpanzee and gorilla, while subfamilies SVA_E and SVA_F are restricted to the human lineage. A survey of human genomic diversity associated with SVA_E and SVA_F subfamily members showed insertion polymorphism frequencies of 37.5% and 27.6%, respectively. In addition, we examined the amplification dynamics of SVA elements throughout the primate order and traced their origin back to the beginnings of hominid primate evolution, approximately 18 to 25 million years ago. This makes SVA elements the youngest family of retroposons in the primate order.
- Published
- 2005
- Full Text
- View/download PDF
43. Chompy: an infestation of MITE-like repetitive elements in the crocodilian genome.
- Author
-
Ray DA, Hedges DJ, Herke SW, Fowlkes JD, Barnes EW, LaVie DK, Goodwin LM, Densmore LD, and Batzer MA
- Subjects
- Animals, Gene Dosage, Multigene Family, Alligators and Crocodiles genetics, DNA Transposable Elements, Interspersed Repetitive Sequences
- Abstract
Interspersed repeats are a major component of most eukaryotic genomes and have an impact on genome size and stability, but the repetitive element landscape of crocodilian genomes has not yet been fully investigated. In this report, we provide the first detailed characterization of an interspersed repeat element in any crocodilian genome. Chompy is a putative miniature inverted-repeat transposable element (MITE) family initially recovered from the genome of Alligator mississippiensis (American alligator) but also present in the genomes of Crocodylus moreletii (Morelet's crocodile) and Gavialis gangeticus (Indian gharial). The element has all of the hallmarks of MITEs including terminal inverted repeats, possible target site duplications, and a tendency to form secondary structures. We estimate the copy number in the alligator genome to be approximately 46,000 copies. As a result of their size and unique properties, Chompy elements may provide a useful source of genomic variation for crocodilian comparative genomics.
- Published
- 2005
- Full Text
- View/download PDF
44. Modeling the amplification dynamics of human Alu retrotransposons.
- Author
-
Hedges DJ, Cordaux R, Xing J, Witherspoon DJ, Rogers AR, Jorde LB, and Batzer MA
- Subjects
- Computational Biology, Computer Simulation, Genome, Human genetics, Humans, Polymorphism, Genetic, Time Factors, Alu Elements genetics, Gene Amplification genetics, Models, Genetic, Retroelements genetics
- Abstract
Retrotransposons have had a considerable impact on the overall architecture of the human genome. Currently, there are three lineages of retrotransposons (Alu, L1, and SVA) that are believed to be actively replicating in humans. While estimates of their copy number, sequence diversity, and levels of insertion polymorphism can readily be obtained from existing genomic sequence data and population sampling, a detailed understanding of the temporal pattern of retrotransposon amplification remains elusive. Here we pose the question of whether, using genomic sequence and population frequency data from extant taxa, one can adequately reconstruct historical amplification patterns. To this end, we developed a computer simulation that incorporates several known aspects of primate Alu retrotransposon biology and accommodates sampling effects resulting from the methods by which mobile elements are typically discovered and characterized. By modeling a number of amplification scenarios and comparing simulation-generated expectations to empirical data gathered from existing Alu subfamilies, we were able to statistically reject a number of amplification scenarios for individual subfamilies, including that of a rapid expansion or explosion of Alu amplification at the time of human-chimpanzee divergence.
- Published
- 2005
- Full Text
- View/download PDF
45. From the margins of the genome: mobile elements shape primate evolution.
- Author
-
Hedges DJ and Batzer MA
- Subjects
- Alternative Splicing, Animals, CpG Islands, DNA Methylation, Evolution, Molecular, Exons, Genome, Human, Germ-Line Mutation, Humans, Long Interspersed Nucleotide Elements, Models, Genetic, Open Reading Frames, Phylogeny, Primates, Repetitive Sequences, Nucleic Acid, Retroviridae genetics, Short Interspersed Nucleotide Elements, Biological Evolution, Genome
- Abstract
As is the case with mammals in general, primate genomes are inundated with repetitive sequence. Although much of this repetitive content consists of "molecular fossils" inherited from early mammalian ancestors, a significant portion of this material comprises active mobile element lineages. Despite indications that these elements played a major role in shaping the architecture of the genome, there remain many unanswered questions surrounding the nature of the host-element relationship. Here we review advances in our understanding of the host-mobile element dynamic and its overall impact on primate evolution., ((c) 2005 Wiley Periodicals, Inc.)
- Published
- 2005
- Full Text
- View/download PDF
46. Under the genomic radar: the stealth model of Alu amplification.
- Author
-
Han K, Xing J, Wang H, Hedges DJ, Garber RK, Cordaux R, and Batzer MA
- Subjects
- Animals, Base Sequence, Computational Biology, DNA Primers, Electrophoresis, Agar Gel, Humans, Molecular Sequence Data, Oligonucleotides, Primates genetics, Sequence Analysis, DNA, Species Specificity, Alu Elements genetics, Evolution, Molecular, Genome, Human, Models, Genetic, Phylogeny
- Abstract
Alu elements are the most successful SINEs (Short INterspersed Elements) in primate genomes and have reached more than 1,000,000 copies in the human genome. The amplification of most Alu elements is thought to occur through a limited number of hyperactive "master" genes that produce a high number of copies during long evolutionary periods of time. However, the existence of long-lived, low-activity Alu lineages in the human genome suggests a more complex propagation mechanism. Using both computational and wet-bench approaches, we reconstructed the evolutionary history of the AluYb lineage, one of the most active Alu lineages in the human genome. We show that the major AluYb lineage expansion in humans is a species-specific event, as nonhuman primates possess only a handful of AluYb elements. However, the oldest existing AluYb element resided in an orthologous position in all hominoid primate genomes examined, demonstrating that the AluYb lineage originated 18-25 million years ago. Thus, the history of the AluYb lineage is characterized by approximately 20 million years of retrotranspositional quiescence preceding a major expansion in the human genome within the past few million years. We suggest that the evolutionary success of the Alu family may be driven at least in part by "stealth-driver" elements that maintain low retrotranspositional activity over extended periods of time and occasionally produce short-lived hyperactive copies responsible for the formation and remarkable expansion of Alu elements within the genome.
- Published
- 2005
- Full Text
- View/download PDF
47. Alu insertion loci and platyrrhine primate phylogeny.
- Author
-
Ray DA, Xing J, Hedges DJ, Hall MA, Laborde ME, Anders BA, White BR, Stoilova N, Fowlkes JD, Landry KE, Chemnick LG, Ryder OA, and Batzer MA
- Subjects
- Animals, Base Sequence, DNA Primers, Humans, Polymerase Chain Reaction, Alu Elements, Cebidae genetics, Phylogeny
- Abstract
Short INterspersed Elements (SINEs) make very useful phylogenetic markers because the integration of a particular element at a location in the genome is irreversible and of known polarity. These attributes make analysis of SINEs as phylogenetic characters an essentially homoplasy-free affair. Alu elements are primate-specific SINEs that make up a large portion of the human genome and are also widespread in other primates. Using a combination wet-bench and computational approach we recovered 190 Alu insertions, 183 of which are specific to the genomes of nine New World primates. We used these loci to investigate branching order and have produced a cladogram that supports a sister relationship between Atelidae (spider, woolly, and howler monkeys) and Cebidae (marmosets, tamarins, and owl monkeys) and then the joining of this two family clade to Pitheciidae (titi and saki monkeys). The data support these relationships with a homoplasy index of 0.00. In this study, we report one of the largest applications of SINE elements to phylogenetic analysis to date, and the results provide a robust molecular phylogeny for platyrrhine primates.
- Published
- 2005
- Full Text
- View/download PDF
48. Analysis of the human Alu Ye lineage.
- Author
-
Salem AH, Ray DA, Hedges DJ, Jurka J, and Batzer MA
- Subjects
- Animals, Biological Evolution, Chromosome Mapping, Evolution, Molecular, Gene Deletion, Genetic Variation, Genome, Genome, Human, Hominidae, Humans, Molecular Sequence Data, Phylogeny, Polymerase Chain Reaction, Sequence Analysis, DNA, Software, Alu Elements
- Abstract
Background: Alu elements are short (approximately 300 bp) interspersed elements that amplify in primate genomes through a process termed retroposition. The expansion of these elements has had a significant impact on the structure and function of primate genomes. Approximately 10 % of the mass of the human genome is comprised of Alu elements, making them the most abundant short interspersed element (SINE) in our genome. The majority of Alu amplification occurred early in primate evolution, and the current rate of Alu retroposition is at least 100 fold slower than the peak of amplification that occurred 30-50 million years ago. Alu elements are therefore a rich source of inter- and intra-species primate genomic variation., Results: A total of 153 Alu elements from the Ye subfamily were extracted from the draft sequence of the human genome. Analysis of these elements resulted in the discovery of two new Alu subfamilies, Ye4 and Ye6, complementing the previously described Ye5 subfamily. DNA sequence analysis of each of the Alu Ye subfamilies yielded average age estimates of approximately 14, approximately 13 and approximately 9.5 million years old for the Alu Ye4, Ye5 and Ye6 subfamilies, respectively. In addition, 120 Alu Ye4, Ye5 and Ye6 loci were screened using polymerase chain reaction (PCR) assays to determine their phylogenetic origin and levels of human genomic diversity., Conclusion: The Alu Ye lineage appears to have started amplifying relatively early in primate evolution and continued propagating at a low level as many of its members are found in a variety of hominoid (humans, greater and lesser ape) genomes. Detailed sequence analysis of several Alu pre-integration sites indicated that multiple types of events had occurred, including gene conversions, near-parallel independent insertions of different Alu elements and Alu-mediated genomic deletions. A potential hotspot for Alu insertion in the Fer1L3 gene on chromosome 10 was also identified.
- Published
- 2005
- Full Text
- View/download PDF
49. Multiplex polymerase chain reaction for simultaneous quantitation of human nuclear, mitochondrial, and male Y-chromosome DNA: application in human identification.
- Author
-
Walker JA, Hedges DJ, Perodeau BP, Landry KE, Stoilova N, Laborde ME, Shewale J, Sinha SK, and Batzer MA
- Subjects
- Cell Nucleus genetics, Chromosomes, Human, X, Chromosomes, Human, Y genetics, DNA standards, DNA Probes, DNA, Mitochondrial genetics, Female, Humans, Male, DNA analysis, DNA Fingerprinting methods, Polymerase Chain Reaction methods
- Abstract
Human forensic casework requires sensitive quantitation of human nuclear (nDNA), mitochondrial (mtDNA), and male Y-chromosome DNA from complex biomaterials. Although many such systems are commercially available, no system is capable of simultaneously quantifying all three targets in a single reaction. Most available methods either are not multiplex compatible or lack human specificity. Here, we report the development of a comprehensive set of human-specific, target-specific multiplex polymerase chain reaction (PCR) assays for DNA quantitation. Using TaqMan-MGB probes, our duplex qPCR for nDNA/mtDNA had a linear quantitation range of 100 ng to 1 pg, and our triplex qPCR assay for nDNA/mtDNA/male Y DNA had a linear range of 100-0.1 ng. Human specificity was demonstrated by the accurate detection of 0.05 and 5% human DNA from a complex source of starting templates. Target specificity was confirmed by the lack of cross-amplification among targets. A high-throughput alternative for human gender determination was also developed by multiplexing the male Y primer/probe set with an X-chromosome-based system. Background cross-amplification with DNA templates derived from 14 other species was negligible aside from the male Y assay which produced spurious amplifications from other nonhuman primate templates. Mainstream application of these assays will undoubtedly benefit forensic genomics.
- Published
- 2005
- Full Text
- View/download PDF
50. The Alu Yc1 subfamily: sorting the wheat from the chaff.
- Author
-
Garber RK, Hedges DJ, Herke SW, Hazard NW, and Batzer MA
- Subjects
- Alu Elements, Base Sequence, Consensus Sequence, DNA genetics, DNA isolation & purification, Ethnicity genetics, Humans, Molecular Sequence Data, Racial Groups genetics
- Abstract
Members of the Alu Yc1 subfamily are distinguished from the older Alu Y subfamily by a signature G-->A substitution at base 148 of their 281-bp consensus sequence. Members of the much older and larger Alu Y subfamily could have by chance accumulated this signature G-->A substitution and be misclassified as belonging to the Alu Yc1 subfamily. Using a Mahanalobis classification method, it was estimated that the "authentic" Alu Yc1 subfamily consists of approximately 262 members in the human genome. PCR amplification and further analysis was successfully completed on 225 of the Yc1 Alu family members. One hundred and seventy-seven Yc1 Alu elements were determined to be monomorphic (fixed for presence) in a panel of diverse human genomes. Forty-eight of the Yc1 Alu elements were polymorphic for insertion presence/absence in diverse human genomes. The insertion polymorphism rate of 21% in the human genome is similar to rates reported previously for other "young" Alu subfamilies. The polymorphic Yc1 Alu elements will be useful genetic loci for the study of human population genetics.
- Published
- 2005
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.