1. Constructing a de novo transcriptome and a reference proteome for the bivalve Scrobicularia plana: Comparative analysis of different assembly strategies and proteomic analysis
- Author
-
Julián Blasco, Casimiro Baena-Angulo, José Manuel Jiménez-Pastor, José Alhama, Ana María Herruzo-Ruiz, Carmen Michán, Carlos A. Fuentes-Almagro, Francisco Amil-Ruiz, Ministerio de Economía y Competitividad (España), and Universidad de Córdoba (España)
- Subjects
Proteomics ,0106 biological sciences ,Proteome ,Bivalve mollusk ,Computational biology ,01 natural sciences ,Transcriptome ,03 medical and health sciences ,Shot-gun proteomics ,Genetics ,Animals ,Scrobicularia plana ,030304 developmental biology ,0303 health sciences ,Transcriptome assembly ,biology ,High-Throughput Nucleotide Sequencing ,Assembly quality assessment ,biology.organism_classification ,Bivalvia ,Molecular Databases ,010606 plant biology & botany - Abstract
Scrobicularia plana is a coastal and estuarine bivalve widely used in ecotoxicological studies. However, the underlying molecular mechanisms for S. plana pollutant responses are hardly known due to the lack of molecular databases. Thus, in this study we present a holistic approach to assess a robust reference transcriptome and proteome of this clam. A mixture of control and metal-exposed individuals was used for mRNA isolation. Four sets of high quality filtered preprocessed reads were generated (two quality scores and two sequenced lengths) and assembled with Mira, Ray and Trinity algorithms. The sixty-four generated assemblies were refined, filtered and evaluated for their proteomic quality. Eight assemblies presented top Detonate scores but one was selected due to its compactness and biological representation, which was generated: (i) from the highest quality dataset (Q20L100), (ii) using Trinity algorithm with all k-mers (AtKa), (iii) removing redundancy by CD-HIT (RR80), and (iv) filtering out poor contigs (F), that was subsequently named Q20L100AtKaRR80F. S. plana proteomic analysis revealed 10,017 peptide groups that corresponded to 2066 proteins with a wide coverage of molecular functions and biological processes, confirming the strength of the database generated., This study was supported by the Ministry of Economy and Competitiveness, Spain (CTM2016-75908-R). We would like to thank Dr. Mercedes Cousinou and Ms. Laura Redondo (Genomic Unit, SCAI, UCO) for their technical help in transcriptome sequencing. A.M. Herruzo-Ruiz received a predoctoral contract of the University of Cordoba (“Plan Propio”).
- Published
- 2021
- Full Text
- View/download PDF