Building two indica rice reference genomes with PacBio long-read and Illumina paired-end sequencing data
- Source :
- Scientific Data
- Publication Year :
- 2016
- Publisher :
- Springer Science and Business Media LLC, 2016.
Abstract
- Asian cultivated rice consists of two subspecies: Oryza sativa subsp. indica and O. sativa subsp. japonica. Despite the fact that indica rice accounts for over 70% of total rice production worldwide and is genetically much more diverse, a high-quality reference genome for indica rice has yet to be published. We conducted map-based sequencing of two indica rice lines, Zhenshan 97 (ZS97) and Minghui 63 (MH63), which represent the two major varietal groups of the indica subspecies and are the parents of an elite Chinese hybrid. The genome sequences were assembled into 237 (ZS97) and 181 (MH63) contigs, with an accuracy >99.99%, and covered 90.6% and 93.2% of their estimated genome sizes, respectively. Comparative analyses of these two indica genomes uncovered surprising structural differences, especially with respect to inversions, translocations, presence/absence variations, and segmental duplications. Approximately 42% of nontransposable-element-related genes were identical between the two genomes. Transcriptome analysis of three tissues showed that 1,059 to 2,217 more genes were expressed in the hybrid than in the parents and that the expressed genes in the hybrid were much more diverse because of the divergence between the parental genomes. The public availability of two high-quality reference genomes for the indica subspecies of rice will have large-ranging implications for plant biology and crop genetic improvement.
- Subjects :
- 0301 basic medicine
Statistics and Probability
Data Descriptor
Plant genetics
Sequence assembly
Library and Information Sciences
Biology
Genes, Plant
Polymorphism, Single Nucleotide
Genome
Chromosomes, Plant
Education
03 medical and health sciences
Annotation
0302 clinical medicine
Data sequences
INDEL Mutation
Species Specificity
Gene Expression Regulation, Plant
DNA sequencing
Paired-end tag
Illumina dye sequencing
Genetics
Oryza sativa
Gene Expression Profiling
food and beverages
Chromosome Mapping
Genetic Variation
Oryza
Genomics
Computer Science Applications
030104 developmental biology
PNAS Plus
Statistics, Probability and Uncertainty
Genome, Plant
030217 neurology & neurosurgery
Information Systems
Reference genome
Details
- ISSN :
- 20524463
- Volume :
- 3
- Database :
- OpenAIRE
- Journal :
- Scientific Data
- Accession number :
- edsair.doi.dedup.....54e5fb4bded02986935de71c5d3b5196