Back to Search
Start Over
Illumina and PacBio DNA sequencing data, de novo assembly and annotation of the genome of Aurantiochytrium limacinum strain CCAP_4062/1
- Source :
- Data in Brief, Vol 31, Iss, Pp 105729-(2020), Data in Brief, Data in Brief, Elsevier, 2020, 31, pp.105729. ⟨10.1016/j.dib.2020.105729⟩, Data in Brief, 2020, 31, pp.105729. ⟨10.1016/j.dib.2020.105729⟩
- Publication Year :
- 2020
- Publisher :
- Elsevier, 2020.
-
Abstract
- The complete genome of the thraustochytrid Aurantiochytrium limacinum strain CCAP_4062/1 was sequenced using both Illumina Novaseq 6000 and third generation sequencing technology PacBio RSII in order to obtain trustworthy assembly and annotation. The reads from both platforms were combined at multiple levels in order to obtain a reliable assembly, then compared to the A. limacinum ATCCⓇ MYA1381™ reference genome. The final assembly was annotated with the help of strain CCAP_4062/1 RNAseq data. A. limacinum strain CCAP_4062/1 is an industrial strain used for the production of very long chain polyunsaturated fatty acids, like the docosahexaenoic acid that is an essential fatty acid synthesised only at very low pace in humans and vertebrates . Thraustochytrids in general and Aurantiochytrium more specifically, are used for carotenoid and squalene production as well. Beside their biotechnological interest, thraustochytrids play a crucial role in both inshore and oceanic basins ecosystems. Genome sequences will foster biotechnological as well as ecological studies.
- Subjects :
- [SDV]Life Sciences [q-bio]
Sequence assembly
Genomics
Computational biology
Biology
lcsh:Computer applications to medicine. Medical informatics
Genome
DNA sequencing
03 medical and health sciences
Annotation
0302 clinical medicine
Next generation sequencing
lcsh:Science (General)
ComputingMilieux_MISCELLANEOUS
030304 developmental biology
0303 health sciences
Multidisciplinary
Strain (biology)
Genome project
Thraustochytrid
lcsh:R858-859.7
Structural annotation
Third generation sequencing
030217 neurology & neurosurgery
Reference genome
Biotechnology
lcsh:Q1-390
Subjects
Details
- Language :
- English
- ISSN :
- 23523409
- Volume :
- 31
- Database :
- OpenAIRE
- Journal :
- Data in Brief
- Accession number :
- edsair.doi.dedup.....9aebf31710729e2e53b401f91c8a63a6
- Full Text :
- https://doi.org/10.1016/j.dib.2020.105729⟩