Back to Search
Start Over
Targeted capture of complete coding regions across divergent species
- Source :
- Genome Biology and Evolution
- Publication Year :
- 2017
- Publisher :
- Cold Spring Harbor Laboratory, 2017.
-
Abstract
- Despite continued advances in sequencing technologies, there is a need for methods that can efficiently sequence large numbers of genes from diverse species. One approach to accomplish this is targeted capture (hybrid enrichment). While these methods are well established for genome resequencing projects, cross-species capture strategies are still being developed and generally focus on the capture of conserved regions, rather than complete coding regions from specific genes of interest. The resulting data is thus useful for phylogenetic studies, but the wealth of comparative data that could be used for evolutionary and functional studies is lost. Here we design and implement a targeted capture method that enables recovery of complete coding regions across broad taxonomic scales. Capture probes were designed from multiple reference species and extensively tiled in order to facilitate cross-species capture. Using novel bioinformatics pipelines we were able to recover nearly all of the targeted genes with high completeness from species that were up to 200 myr divergent. Increased probe diversity and tiling for a subset of genes had a large positive effect on both recovery and completeness. The resulting data produced an accurate species tree, but importantly this same data can also be applied to studies of molecular evolution and function that will allow researchers to ask larger questions in broader phylogenetic contexts. Our method demonstrates the utility of cross-species approaches for the capture of full length coding sequences, and will substantially improve the ability for researchers to conduct large-scale comparative studies of molecular evolution and function.
- Subjects :
- 0301 basic medicine
0106 biological sciences
media_common.quotation_subject
comparative studies of protein-coding genes
Computational biology
cross-species sequence capture
Biology
010603 evolutionary biology
01 natural sciences
Birds
Evolution, Molecular
03 medical and health sciences
Open Reading Frames
Molecular evolution
hybrid enrichment of complete coding sequences
Completeness (order theory)
Genetics
Coding region
Animals
Function (engineering)
Ecology, Evolution, Behavior and Systematics
030304 developmental biology
media_common
0303 health sciences
Sequence
Phylogenetic tree
Reptiles
Sequence Analysis, DNA
Tree (data structure)
030104 developmental biology
Sequence Alignment
Algorithms
Coding (social sciences)
Research Article
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Genome Biology and Evolution
- Accession number :
- edsair.doi.dedup.....a5bd52138af5befbbefd679335f5d79c
- Full Text :
- https://doi.org/10.1101/099325