Back to Search
Start Over
Method for quick DNA barcode reference library construction
- Source :
- Ecology and Evolution, Vol 11, Iss 17, Pp 11627-11638 (2021), Ecology and Evolution
- Publication Year :
- 2021
- Publisher :
- Wiley, 2021.
-
Abstract
- DNA barcoding has become one of the most important techniques in plant species identification. Successful application of this technology is dependent on the availability of reference database of high species coverage. Unfortunately, there are experimental and data processing challenges to construct such a library within a short time. Here, we present our solutions to these challenges. We sequenced six conventional DNA barcode fragments (ITS1, ITS2, matK1, matK2, rbcL1, and rbcL2) of 380 flowering plants on next‐generation sequencing (NGS) platforms (Illumina Hiseq 2500 and Ion Torrent S5) and the Sanger sequencing platform. After comparing the sequencing depths, read lengths, base qualities, and base accuracies, we conclude that Illumina Hiseq2500 PE250 run is suitable for conventional DNA barcoding. We developed a new “Cotu” method to create consensus sequences from NGS reads for longer output sequences and more reliable bases than the other three methods. Step‐by‐step instructions to our method are provided. By using high‐throughput machines (PCR and NGS), labeling PCR, and the Cotu method, it is possible to significantly reduce the cost and labor investments for DNA barcoding. A regional or even global DNA barcoding reference library with high species coverage is likely to be constructed in a few years.<br />In this study, two next‐generation sequencing platform and four data processing methods were compared for further DNA barcoding reference library construction. After comparing the sequencing depths, read lengths, base qualities, and base accuracies, we concluded that Illumina Hiseq2500 PE250 run is suitable for conventional DNA barcoding, and after comparing our new “Cotu” to other three NGS data processing methods, we recommend Cotu method for its most sequence recovery, the longest sequence length, and the highest sequence accuracy. By using herbarium materials, high‐throughput machines (PCR and NGS), and the Cotu method, it is possible to construct a regional or even global DNA barcoding reference library with high species coverage with limited cost and labor investments in a few years.
- Subjects :
- Sanger sequencing
Ecology
Cotu
Computer science
next‐generation sequencing
Plant species identification
Ion semiconductor sequencing
Computational biology
DNA barcoding
DNA sequencing
symbols.namesake
DNA barcode
symbols
Reference database
QH540-549.5
Ecology, Evolution, Behavior and Systematics
Original Research
data processing method
Nature and Landscape Conservation
Subjects
Details
- ISSN :
- 20457758
- Volume :
- 11
- Database :
- OpenAIRE
- Journal :
- Ecology and Evolution
- Accession number :
- edsair.doi.dedup.....0669bfb641bca97fb2b9bb704efedc37
- Full Text :
- https://doi.org/10.1002/ece3.7788