Back to Search
Start Over
Genome assembly and annotation ofArabidopsis halleri, a model for heavy metal hyperaccumulation and evolutionary ecology
- Source :
- Molecular Ecology Resources. 17:1025-1036
- Publication Year :
- 2016
- Publisher :
- Wiley, 2016.
-
Abstract
- The self-incompatible species Arabidopsis halleri is a close relative of the self-compatible model plant Arabidopsis thaliana. The broad European and Asian distribution and heavy metal hyperaccumulation ability make A. halleri a useful model for ecological genomics studies. We used long-insert mate-pair libraries to improve the genome assembly of the A. halleri ssp. gemmifera Tada mine genotype (W302) collected from a site with high contamination by heavy metals in Japan. After five rounds of forced selfing, heterozygosity was reduced to 0.04%, which facilitated subsequent genome assembly. Our assembly now covers 196 Mb or 78% of the estimated genome size and achieved scaffold N50 length of 712 kb. To validate assembly and annotation, we used synteny of A. halleri Tada mine with a previously published high-quality reference assembly of a closely related species, Arabidopsis lyrata. Further validation of the assembly quality comes from synteny and phylogenetic analysis of the HEAVY METAL ATPASE4 (HMA4) and METAL TOLERANCE PROTEIN1 (MTP1) regions using published sequences from European A. halleri for comparison. Three tandemly duplicated copies of HMA4, key gene involved in cadmium and zinc hyperaccumulation, were assembled on a single scaffold. The assembly will enhance the genomewide studies of A. halleri as well as the allopolyploid Arabidopsis kamchatica derived from A. lyrata and A. halleri.
- Subjects :
- 0301 basic medicine
Arabidopsis
Sequence Homology
Sequence assembly
Genomics
Biology
Synteny
03 medical and health sciences
Japan
Metals, Heavy
Genetics
Arabidopsis thaliana
Cation Transport Proteins
Gene
Genome size
Arabidopsis lyrata
Phylogeny
Ecology, Evolution, Behavior and Systematics
Adenosine Triphosphatases
2. Zero hunger
Phylogenetic tree
Arabidopsis Proteins
Molecular Sequence Annotation
Sequence Analysis, DNA
15. Life on land
biology.organism_classification
030104 developmental biology
Evolutionary biology
Environmental Pollution
Genome, Plant
Biotechnology
Subjects
Details
- ISSN :
- 1755098X
- Volume :
- 17
- Database :
- OpenAIRE
- Journal :
- Molecular Ecology Resources
- Accession number :
- edsair.doi.dedup.....2643686ec737e13c09950e0498c9b0d4
- Full Text :
- https://doi.org/10.1111/1755-0998.12604