1. Detailed analysis of a contiguous 22-Mb region of the maize genome
- Author
-
Cheng Ting Yeh, Lucinda Fulton, Phillip San Miguel, Srinivas Aluru, Shiran Pasternak, Stephanie Adams, Lori Spiegel, Fusheng Wei, Regina S. Baucom, Melissa Kramer, Patrick S. Schnable, Blake C. Meyers, Lixing Yang, Rod A. Wing, Pamela J. Green, Catrina Fronick, Robert S. Fulton, Kristi Collura, Cristian Chaparro, Scott Kruchowski, Gabriel Scara, Susan M. Rock, David Kudrna, Joshua C. Stein, Richard K. Wilson, Yeisoo Yu, Jianwei Zhang, Dawn H. Nagel, Hyeran Kim, Jinke Lin, Emanuele De Paoli, William Courtney, Marina Wissotski, Sandra W. Clifton, Lifang Zhang, Angelina Angelova, Lydia Nascimento, Apurva Narechania, Laura Courtney, Robert A. Martienssen, Wolfgang Golser, Kai Ying, Ananth Kalyanaraman, Chengzhi Liang, Doreen Ware, Ning Jiang, Shiguo Zhou, David C. Schwartz, Susan R. Wessler, Jennifer Currie, Jeffrey L. Bennetzen, W. Richard McCombie, Yujun Han, Tina Graves, Jean-Marc Deragon, Ecology and Evolutionary Biology [Tucson] (EEB), University of Arizona, Cold Spring Harbor Laboratory (CSHL), Department of Genetics [Saint-Louis], Washington University in Saint Louis (WUSTL), Department of Genetics, University of Georgia [USA], Delaware Biotechnology Institute, University of Delaware [Newark], Laboratory for Molecular and Computational Genomics [Madison], University of Wisconsin-Madison, Department of Plant Biology [Athens], Center for Plant Genomics, Iowa State University (ISU), School of Electrical Engineering and Computer Science (EECS), Washington State University (WSU), Laboratoire Génome et développement des plantes (LGDP), Université de Perpignan Via Domitia (UPVD)-Centre National de la Recherche Scientifique (CNRS), Department of Horticulture and Landscape Architecture, Purdue University [West Lafayette], Department of Horticulture, Michigan State University [East Lansing], Michigan State University System-Michigan State University System, Department of Electrical and Computer Engineering, and Ecker, Joseph R
- Subjects
0106 biological sciences ,MESH: Zea mays ,Sequence Homology ,MESH: Base Sequence ,MESH: RNA, Plant ,[SDV.BID.SPT]Life Sciences [q-bio]/Biodiversity/Systematics, Phylogenetics and taxonomy ,01 natural sciences ,Gene Duplication ,MESH: Genes, Plant ,MESH: Chromosomes, Plant ,Base Pairing ,0303 health sciences ,[SDV.BBM.BS]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Structural Biology [q-bio.BM] ,[SDV.BID.EVO]Life Sciences [q-bio]/Biodiversity/Populations and Evolution [q-bio.PE] ,MESH: Gene Duplication ,food and beverages ,[SDV.BBM.MN]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Molecular Networks [q-bio.MN] ,Physical Chromosome Mapping ,[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM] ,MESH: DNA Transposable Elements ,RNA, Plant ,Genetics and Genomics/Comparative Genomics ,Transposable element ,Evolution ,MESH: Gene Rearrangement ,Molecular Sequence Data ,Gene redundancy ,MESH: Physical Chromosome Mapping ,Evolution, Molecular ,[SDV.GEN.GPL]Life Sciences [q-bio]/Genetics/Plants genetics ,03 medical and health sciences ,Open Reading Frames ,[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN] ,Genetics ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics ,Genetics and Genomics/Plant Genomes and Evolution ,Synteny ,[SDV.GEN]Life Sciences [q-bio]/Genetics ,MESH: Molecular Sequence Data ,Molecular ,Oryza ,[SDV.BBM.BM]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Molecular biology ,Gene rearrangement ,Plant ,MESH: Open Reading Frames ,Genes ,Genetic Loci ,Mutation ,[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM] ,Developmental Biology ,MESH: Genome, Plant ,Cancer Research ,Sequence assembly ,Retrotransposon ,Genome ,MESH: Sorghum ,Genetics (clinical) ,MESH: Evolution, Molecular ,2. Zero hunger ,Gene Rearrangement ,MESH: Synteny ,Genetics and Genomics/Bioinformatics ,MESH: Oryza sativa ,Genome, Plant ,Research Article ,Biotechnology ,MESH: Mutation ,lcsh:QH426-470 ,MESH: Base Pairing ,Computational biology ,[SDV.BC]Life Sciences [q-bio]/Cellular Biology ,Biology ,Genes, Plant ,Zea mays ,MESH: Sequence Homology, Nucleic Acid ,MESH: Genetic Loci ,Chromosomes, Plant ,Chromosomes ,Sequence Homology, Nucleic Acid ,[SDV.BV]Life Sciences [q-bio]/Vegetal Biology ,[SDV.BBM]Life Sciences [q-bio]/Biochemistry, Molecular Biology ,Genetics and Genomics/Genomics ,[SDV.BBM.BC]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Biochemistry [q-bio.BM] ,Gene ,Sorghum ,030304 developmental biology ,Base Sequence ,Nucleic Acid ,Human Genome ,lcsh:Genetics ,Genetics and Genomics/Genome Projects ,[SDV.BV.AP]Life Sciences [q-bio]/Vegetal Biology/Plant breeding ,DNA Transposable Elements ,RNA ,010606 plant biology & botany - Abstract
Most of our understanding of plant genome structure and evolution has come from the careful annotation of small (e.g., 100 kb) sequenced genomic regions or from automated annotation of complete genome sequences. Here, we sequenced and carefully annotated a contiguous 22 Mb region of maize chromosome 4 using an improved pseudomolecule for annotation. The sequence segment was comprehensively ordered, oriented, and confirmed using the maize optical map. Nearly 84% of the sequence is composed of transposable elements (TEs) that are mostly nested within each other, of which most families are low-copy. We identified 544 gene models using multiple levels of evidence, as well as five miRNA genes. Gene fragments, many captured by TEs, are prevalent within this region. Elimination of gene redundancy from a tetraploid maize ancestor that originated a few million years ago is responsible in this region for most disruptions of synteny with sorghum and rice. Consistent with other sub-genomic analyses in maize, small RNA mapping showed that many small RNAs match TEs and that most TEs match small RNAs. These results, performed on ∼1% of the maize genome, demonstrate the feasibility of refining the B73 RefGen_v1 genome assembly by incorporating optical map, high-resolution genetic map, and comparative genomic data sets. Such improvements, along with those of gene and repeat annotation, will serve to promote future functional genomic and phylogenomic research in maize and other grasses., Author Summary Maize is a major cereal crop and key experimental system for eukaryotic biology. Previous investigations of the maize genome at the sequence level have primarily focused on analyses of genome survey sequences and BAC contigs. Here we used a comprehensive set of resources to construct an ordered and oriented 22-Mb sequence from chromosome 4 that represents 1% of the maize genome. Genome annotation revealed the presence of 544 genes that are interspersed with transposable elements (TEs), which occupy 83.8% of the sequence. Fifty-one genes were involved in 14 tandem gene clusters and most appear to have arisen after lineage divergence. TEs, especially helitrons, were found to contain gene fragments and were widely distributed in gene-rich regions. Large inversions and unequal gene deletion between the two homoeologous maize regions were the main contributors to synteny disruption among maize, sorghum, and rice. We also show that small RNAs are primarily associated with TEs across the region. Comparison of this ordered and oriented sequence with the corresponding uncurated region in the whole genome sequence of maize resulted in improvements in TE annotation that will ultimately enhance detection sensitivity and characterization of TEs. Doing so is likely to improve the specificity of gene annotations.
- Published
- 2009
- Full Text
- View/download PDF