713 results on '"PacBio sequencing"'
Search Results
2. Removal of bacterial pathogens and antibiotic resistance bacteria by anaerobic sludge digestion with thermal hydrolysis pre-treatment and alkaline stabilization post-treatment
- Author
-
Cuetero-Martínez, Yovany, Flores-Ramírez, Aarón, De los Cobos-Vasconcelos, Daniel, Aguirre-Garrido, José Félix, López-Vidal, Yolanda, and Noyola, Adalberto
- Published
- 2023
- Full Text
- View/download PDF
3. 217 closed Salmonella reference genomes using PacBio sequencing.
- Author
-
Luo, Yan, Jang, Jae Hee, Balkey, Maria, and Hoffmann, Maria
- Abstract
Objectives: Whole Genome Sequencing (WGS) is widely used in food safety for the detection, investigation, and control of foodborne bacterial pathogens. However, the WGS data in most public databases, such as the National Center for Biotechnology Information (NCBI), primarily consist of Illumina short reads which lack some important information for repetitive regions, structural variations, and mobile genetic elements, and the genomic location of certain important genes like antimicrobial resistance genes (AMR) and virulence genes. To address this limitation, we have contributed 217 closed circular Salmonella enterica genomes that were generated using PacBio sequencing to the NCBI Pathogen Detection (PD) database and GenBank. This dataset provides a higher level of accuracy to genome representations in the database. Data description: High-quality complete reference genomes generated from PacBio long reads can provide essential details that are not available in draft genomes from short reads. A complete reference genome allows for more accurate data analysis and researchers to establish connections between genome variations and known genes, regulatory elements, and other genomic features. The addition of 217 complete genomes from 78 different Salmonella serovars, each representing either a distinct SNP cluster within the NCBI PD database or a unique strain, significantly enriches the diversity of the reference genome database. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
4. Chromosome-Level Genome Assembly of the Meishan Pig and Insights into Its Domestication Mechanisms.
- Author
-
Du, Huipeng, Hu, Jianchao, Zhang, Zhiyan, and Wu, Zhongzi
- Abstract
Simple Summary: This study assembled a high-quality chromosome-level genome of the Meishan pig using Illumina and PacBio sequencing technologies. By integrating this new genome with existing pig genome assemblies, we contribute to developing a pig pan-genome, facilitating research on structural variations. Selective sweep analysis between the Chinese wild boar and the Meishan pig revealed key candidate genes, such as TBX19 and PGR, linked to domestication traits. These findings provide valuable genetic resources for studying Meishan pig biology and offer a foundation for improving breeding strategies and conservation efforts. Pigs are essential agricultural animals, and among the various breeds, the Meishan pig, a native breed of China, is renowned for its high reproductive performance. This breed has been introduced to many countries to enhance local pig breeding programs. However, there have been limited genomic and population genetics studies focusing on Meishan pigs. We created a chromosomal-level genomic assembly using high-depth PacBio sequencing and Illumina sequencing data collected from a Meishan pig. Additionally, we analyzed whole-genome sequencing (WGS) data from Chinese boars and Meishan pigs to identify domestication selection signals within the Meishan breed. The assembled genome of the Meishan pig (MSjxau) was found to be 2.45 Gb in size, with a scaffold length of 139.17 Mb. The quality value was 37.06, and the BUSCO score was 96.2%, indicating good completeness, continuity, and accuracy. We annotated transposable elements, segmental duplication, and genes in the MSjxau genome. By combining these data with 28 publicly available genomes, we provide a high-quality structural variants resource for pigs. Furthermore, we identified 716 selective sweep intervals between Chinese wild pigs and Meishan pigs, where the selected gene PGR may be linked to the high fertility observed in Meishan pigs. Our study offers valuable genomic and variation resources for pig breeding and identifies several genes associated with the domestication of the Meishan pig. This lays the groundwork for further investigation into the genetic mechanisms behind complex traits in pigs. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
5. Impact of DNA extraction, PCR amplification, sequencing, and bioinformatic analysis on food-associated mock communities using PacBio long-read amplicon sequencing
- Author
-
Mareike Baer, Lisa Höppe, Waldemar Seel, and André Lipski
- Subjects
PacBio sequencing ,Full-length 16S rRNA gene ,Long-read sequencing ,Amplicon sequencing ,Mock community ,Food microbiota ,Microbiology ,QR1-502 - Abstract
Abstract Background Long-read 16S rRNA gene amplicon sequencing has a high potential for characterizing food-associated microbiomes. The advantage results from sequencing the full-length (1,500 bp) gene, enabling taxonomic resolution at species level. Here we present a benchmarking study using mock communities representative of milking machine biofilms and raw meat, revealing challenges relevant to food-associated habitats. These were varying species abundances, reliable intra-genus differentiation of species, and detection of novel species with
- Published
- 2024
- Full Text
- View/download PDF
6. Impact of DNA extraction, PCR amplification, sequencing, and bioinformatic analysis on food-associated mock communities using PacBio long-read amplicon sequencing.
- Author
-
Baer, Mareike, Höppe, Lisa, Seel, Waldemar, and Lipski, André
- Subjects
LIFE sciences ,MILKING machines ,DATABASES ,RIBOSOMAL RNA ,DNA ,BACTERIAL diversity - Abstract
Background: Long-read 16S rRNA gene amplicon sequencing has a high potential for characterizing food-associated microbiomes. The advantage results from sequencing the full-length (1,500 bp) gene, enabling taxonomic resolution at species level. Here we present a benchmarking study using mock communities representative of milking machine biofilms and raw meat, revealing challenges relevant to food-associated habitats. These were varying species abundances, reliable intra-genus differentiation of species, and detection of novel species with < 98.7% sequence identity to type strains. By using mock communities at different levels of preparation − as mixed whole cells, mixed extracted DNA, and mixed PCR products − we systematically investigated the influence of DNA extraction using two different kits, PCR amplification of 16S rRNA genes, sequencing, and bioinformatics analysis including reference database and gene copy number normalization on bacterial composition and alpha diversity. Results: We demonstrated that PacBio ccs-reads allowed for correct taxonomic assignment of all species present within the mock communities using a custom Refseq database. However, choice of percent identity values for taxonomic assignment had a strong influence on identification and processing of reads from novel species. PCR amplification of 16S rRNA genes produced the strongest bias on the observed community composition, while sequencing alone reproduced the preset composition well. The PCR bias can in part be attributed to differences in mol% G + C content of 16S rRNA genes resulting in preferred amplification of low mol% G + C-containing taxa. Conclusions: This study underlines the importance of benchmarking studies with mock communities representing the habitat of interest to evaluate the methodology prior to analyzing real samples of unknown composition. It demonstrates the advantage of long-read sequencing over short-read sequencing, as species level identification enables in-depth characterization of the habitat. One benefit is improved risk assessment by enabling differentiation between pathogenic and apathogenic species of the same genus. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Near-complete genome sequence of Lipomyces tetrasporous NRRL Y-64009, an oleaginous yeast capable of growing on lignocellulosic hydrolysates.
- Author
-
Jagtap, Sujit, Liu, Jing-Jing, Walukiewicz, Hanna, Pangilinan, Jasmyn, Lipzen, Anna, Ahrendt, Steven, Koriabine, Maxim, Cobaugh, Kelly, Salamov, Asaf, Yoshinaga, Yuko, Ng, Vivian, Daum, Chris, Grigoriev, Igor, Slininger, Patricia, Dien, Bruce, Jin, Yong-Su, and Rao, Christopher
- Subjects
Lipomyces ,PacBio sequencing ,genome assembly ,genome sequence ,lipid - Abstract
Lipomyces tetrasporous is an oleaginous yeast that can utilize a variety of plant-based sugars. It accumulates lipids during growth on lignocellulosic biomass hydrolysates. We present the annotated genome sequence of L. tetrasporous NRRL Y-64009 to aid in its development as a platform organism for producing lipids and lipid-based bioproducts.
- Published
- 2023
8. Relationship between dynamic changes of microorganisms in Qupi and the quality formation of Fengxiangxing Huairang Daqu.
- Author
-
Dan Cao, Jiali Lv, Jingying Chu, Shuangshuang Xu, Chengyong Jin, Yongli Zhang, Yuhang Zhang, Wen Zhang, and Jie Kang
- Subjects
POINT processes ,BACILLUS licheniformis ,MICROBIAL communities ,AROMATIC compounds ,MICROORGANISMS ,ETHANOL - Abstract
Introduction: Fengxiangxing Huairang Daqu (FHD) is one of the major types of Daqu in China. However, the relationship between the microbial community structure at different stages, the changes in the sensory characteristics, fermentation characteristics, volatiles, the most critical process point, and the quality formation of FHD is not clear. Methods: Based on microscopic characterization, PacBio SMRT sequencing, and HS-SPME-GC-MS volatile metabolite analysis revealed the relationship between FHD quality formation and the dynamics of Qupi. Results: The results showed that the 12th day of the culture was the most critical process point, highlighting the most significant differences in microbial community structure, sensory characteristics, fermentation characteristics, and flavor substances. Bacillus licheniformis (43.25%), Saccharopolyspora rectivirgula (35.05%), Thermoascus aurantiacus (76.51%), Aspergillus amstelodami (10.81%), and Saccharomycopsis fibuligera (8.88%) were the dominant species in FHD. S. fibuligera, A. amstelodami, and T. aurantiacus were associated with the snow-white color of the FHD epidermis, the yellow color of the interior, and the gray-white color, respectively. The abundance of T. aurantiacus, A. amstelodami, B. licheniformis, and S. rectivirgula was positively associated with the esterifying power and liquefying power of FHD. The abundance of T. aurantiacus and A. amstelodami was positively correlated with the saccharifying power of FHD. The abundance of S. fibuligera was positively related to the fermenting power of FHD. A total of 248 volatiles were detected in Qupi, mainly including alcohols, esters, aldehydes, and ketones. Of them, eleven volatiles had a significant effect on the flavor of Qupi, such as 1-butanol-3-methyl-, hydrazinecarboxamide, ethanol, phenylethyl alcohol, ethyl acetate, 2-octanone, 1-octen-3-ol, formic acid-hexyl ester, (E)-2-octen-1-ol, ethyl hexanoate, and 2(3H)-furanone-dihydro-5-pentyl-. The abundance of B. licheniformis, S. rectivirgula, T. aurantiacus, and S. fibuligera was positively correlated with the alcohols, aromatic compounds, and phenols in FHD. The abundance of S. fibuligera was positively correlated with the acids, esters, and hydrocarbons in FHD. Discussion: These results indicate important theoretical basis and technical support for controllable adjustment of FHD microbial community structure, stable control of FHD quality, and precise, effective, and large-scale guidance of FHD production. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. A chromosome-level genome assembly and annotation of the medicinal plant Lepidium apetalum
- Author
-
Hang Yan, Yunhao Zhu, Haoyu Jia, Yuanjun Li, Yongguang Han, Xiaoke Zheng, Xiule Yue, Le Zhao, and Weisheng Feng
- Subjects
Lepidium apetalum ,Genome assembly ,PacBio sequencing ,Hi-C ,Transcriptome ,Genetics ,QH426-470 - Abstract
Abstract Objectives As a traditional Chinese medicine, Lepidium apetalum is commonly used for purging the lung, relieving dyspnea, alleviating edema, and has the significant pharmacological effects on cardiovascular disease, hyperlipidemia, etc. In addition, the seeds of L. apetalum are rich in unsaturated fatty acids, sterols, glucosinolates and have a variety of biological activity compounds. To facilitate genomics, phylogenetic and secondary metabolite biosynthesis studies of L. apetalum, we assembled the high-resolution genome of L. apetalum. Data description We completed chromosome-level genome assembly of the L. apetalum genome (2n = 32), using Illumina HiSeq and PacBio Sequel sequencing platform as well as high-throughput chromosome conformation capture (Hi-C) technique. The assembled genome was 296.80 Mb in size, 34.41% in GC content, and 23.89% in repeated sequence content, including 316 contigs with a contig N50 of 16.31 Mb. Hi-C scaffolding resulted in 16 chromosomes occupying 99.79% of the assembled genome sequences. A total of 46 584 genes and 105 pseudogenes were predicted, 98.37% of which can be annotated to Nr, GO, KEGG, TrEMBL, SwissPort, Pfam and KOG databases. The high-quality reference genome generated by this study will provide accurate genetic information for the molecular biology research of L. apetalum.
- Published
- 2024
- Full Text
- View/download PDF
10. A chromosome-level genome assembly and annotation of the medicinal plant Lepidium apetalum.
- Author
-
Yan, Hang, Zhu, Yunhao, Jia, Haoyu, Li, Yuanjun, Han, Yongguang, Zheng, Xiaoke, Yue, Xiule, Zhao, Le, and Feng, Weisheng
- Subjects
LEPIDIUM ,MEDICINAL plants ,UNSATURATED fatty acids ,CHINESE medicine ,MOLECULAR biology ,CHROMOSOMES - Abstract
Objectives: As a traditional Chinese medicine, Lepidium apetalum is commonly used for purging the lung, relieving dyspnea, alleviating edema, and has the significant pharmacological effects on cardiovascular disease, hyperlipidemia, etc. In addition, the seeds of L. apetalum are rich in unsaturated fatty acids, sterols, glucosinolates and have a variety of biological activity compounds. To facilitate genomics, phylogenetic and secondary metabolite biosynthesis studies of L. apetalum, we assembled the high-resolution genome of L. apetalum. Data description: We completed chromosome-level genome assembly of the L. apetalum genome (2n = 32), using Illumina HiSeq and PacBio Sequel sequencing platform as well as high-throughput chromosome conformation capture (Hi-C) technique. The assembled genome was 296.80 Mb in size, 34.41% in GC content, and 23.89% in repeated sequence content, including 316 contigs with a contig N50 of 16.31 Mb. Hi-C scaffolding resulted in 16 chromosomes occupying 99.79% of the assembled genome sequences. A total of 46 584 genes and 105 pseudogenes were predicted, 98.37% of which can be annotated to Nr, GO, KEGG, TrEMBL, SwissPort, Pfam and KOG databases. The high-quality reference genome generated by this study will provide accurate genetic information for the molecular biology research of L. apetalum. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. De novo assembly and annotation of the Patagonian toothfish (Dissostichus eleginoides) genome
- Author
-
David Ryder, David Stone, Diana Minardi, Ainsley Riley, Justin Avant, Lisa Cross, Marta Soeffker, Deborah Davidson, Andrew Newman, Peter Thomson, Chris Darby, and Ronny van Aerle
- Subjects
Dissostichus eleginoides ,Nototheniidae ,Illumina sequencing ,PacBio sequencing ,Anti-freeze glycoprotein ,Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Patagonian toothfish (Dissostichus eleginoides) is an economically and ecologically important fish species in the family Nototheniidae. Juveniles occupy progressively deeper waters as they mature and grow, and adults have been caught as deep as 2500 m, living on or in just above the southern shelves and slopes around the sub-Antarctic islands of the Southern Ocean. As apex predators, they are a key part of the food web, feeding on a variety of prey, including krill, squid, and other fish. Despite its importance, genomic sequence data, which could be used for more accurate dating of the divergence between Patagonian and Antarctic toothfish, or establish whether it shares adaptations to temperature with fish living in more polar or equatorial climes, has so far been limited. Results A high-quality D. eleginoides genome was generated using a combination of Illumina, PacBio and Omni-C sequencing technologies. To aid the genome annotation, the transcriptome derived from a variety of toothfish tissues was also generated using both short and long read sequencing methods. The final genome assembly was 797.8 Mb with a N50 scaffold length of 3.5 Mb. Approximately 31.7% of the genome consisted of repetitive elements. A total of 35,543 putative protein-coding regions were identified, of which 50% have been functionally annotated. Transcriptomics analysis showed that approximately 64% of the predicted genes (22,617 genes) were found to be expressed in the tissues sampled. Comparative genomics analysis revealed that the anti-freeze glycoprotein (AFGP) locus of D. eleginoides does not contain any AFGP proteins compared to the same locus in the Antarctic toothfish (Dissostichus mawsoni). This is in agreement with previously published results looking at hybridization signals and confirms that Patagonian toothfish do not possess AFGP coding sequences in their genome. Conclusions We have assembled and annotated the Patagonian toothfish genome, which will provide a valuable genetic resource for ecological and evolutionary studies on this and other closely related species.
- Published
- 2024
- Full Text
- View/download PDF
12. A new Plasmodium vivax reference genome for South American isolates
- Author
-
Katlijn De Meulenaere, Bart Cuypers, Dionicia Gamboa, Kris Laukens, and Anna Rosanas-Urgell
- Subjects
Plasmodium vivax ,Genome assembly ,PacBio sequencing ,Reference genome ,Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Plasmodium vivax is the second most important cause of human malaria worldwide, and accounts for the majority of malaria cases in South America. A high-quality reference genome exists for Papua Indonesia (PvP01) and Thailand (PvW1), but is lacking for South America. A reference genome specifically for South America would be beneficial though, as P. vivax is a genetically diverse parasite with geographical clustering. Results This study presents a new high-quality assembly of a South American P. vivax isolate, referred to as PvPAM (P. vivax Peruvian AMazon). The genome was obtained from a low input patient sample from the Peruvian Amazon and sequenced using PacBio technology, resulting in a highly complete assembly with 6497 functional genes. Telomeric ends were present in 17 out of 28 chromosomal ends, and additional (sub)telomeric regions are present in 12 unassigned contigs. A comparison of multigene families between PvPAM and the PvP01 genome revealed remarkable variation in vir genes, and the presence of merozoite surface proteins (MSP) 3.6 and 3.7. Three dhfr and dhps drug resistance associated mutations are present in PvPAM, similar to those found in other Peruvian isolates. Mapping of publicly available South American whole genome sequencing (WGS) data to PvPAM resulted in significantly fewer variants and truncated reads compared to the use of PvP01 or PvW1 as reference genomes. To minimize the number of core genome variants in non-South American samples, PvW1 is most suited for Southeast Asian isolates, both PvPAM and PvW1 are suited for South Asian isolates, and PvPAM is recommended for African isolates. Interestingly, non-South American samples still contained the least subtelomeric variants when mapped to PvPAM, indicating high quality of the PvPAM subtelomeric regions. Conclusions Our findings show that the PvPAM reference genome more accurately represents South American P. vivax isolates in comparison to PvP01 and PvW1. In addition, PvPAM has a high level of completeness, and contains a similar number of annotated genes as PvP01 or PvW1. The PvPAM genome therefore will be a valuable resource to improve future genomic analyses on P. vivax isolates from the South American continent.
- Published
- 2023
- Full Text
- View/download PDF
13. De novo assembly and annotation of the Patagonian toothfish (Dissostichus eleginoides) genome.
- Author
-
Ryder, David, Stone, David, Minardi, Diana, Riley, Ainsley, Avant, Justin, Cross, Lisa, Soeffker, Marta, Davidson, Deborah, Newman, Andrew, Thomson, Peter, Darby, Chris, and van Aerle, Ronny
- Subjects
TOP predators ,COMPARATIVE genomics ,BODY temperature regulation ,GERMPLASM ,FISH adaptation ,FISH diversity ,GENOMES - Abstract
Background: Patagonian toothfish (Dissostichus eleginoides) is an economically and ecologically important fish species in the family Nototheniidae. Juveniles occupy progressively deeper waters as they mature and grow, and adults have been caught as deep as 2500 m, living on or in just above the southern shelves and slopes around the sub-Antarctic islands of the Southern Ocean. As apex predators, they are a key part of the food web, feeding on a variety of prey, including krill, squid, and other fish. Despite its importance, genomic sequence data, which could be used for more accurate dating of the divergence between Patagonian and Antarctic toothfish, or establish whether it shares adaptations to temperature with fish living in more polar or equatorial climes, has so far been limited. Results: A high-quality D. eleginoides genome was generated using a combination of Illumina, PacBio and Omni-C sequencing technologies. To aid the genome annotation, the transcriptome derived from a variety of toothfish tissues was also generated using both short and long read sequencing methods. The final genome assembly was 797.8 Mb with a N50 scaffold length of 3.5 Mb. Approximately 31.7% of the genome consisted of repetitive elements. A total of 35,543 putative protein-coding regions were identified, of which 50% have been functionally annotated. Transcriptomics analysis showed that approximately 64% of the predicted genes (22,617 genes) were found to be expressed in the tissues sampled. Comparative genomics analysis revealed that the anti-freeze glycoprotein (AFGP) locus of D. eleginoides does not contain any AFGP proteins compared to the same locus in the Antarctic toothfish (Dissostichus mawsoni). This is in agreement with previously published results looking at hybridization signals and confirms that Patagonian toothfish do not possess AFGP coding sequences in their genome. Conclusions: We have assembled and annotated the Patagonian toothfish genome, which will provide a valuable genetic resource for ecological and evolutionary studies on this and other closely related species. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. Properties and Fungal Communities of Different Soils for Growth of the Medicinal Asian Water Plantain, Alisma orientale , in Fujian, China.
- Author
-
Xu, Xiaomei, Lin, Wenjin, Keyhani, Nemat O., Liu, Sen, Li, Lisha, Zhang, Yamin, Lu, Xuehua, Wei, Qiuran, Wei, Daozhi, Huang, Shuaishuai, Cao, Pengxi, Tian, Lin, and Qiu, Junzhi
- Subjects
- *
FUNGAL communities , *FARMS , *AGRICULTURE , *PESTICIDE residues in food , *MULBERRY , *CHINESE medicine , *HEAVY metals - Abstract
The Asian water plantain, Alisma orientale (Sam.) Juzep, is a traditional Chinese medicinal plant. The dried tubers of the Alisma orientale, commonly referred to as Alismatis rhizome (AR), have long been used in traditional Chinese medicine to treat a variety of diseases. Soil properties and the soil microbial composition are known to affect the quality and bioactivity of plants. Here, we sought to identify variations in soil fungal communities and soil properties to determine which would be optimal for cultivation of A. orietale. Soil properties, heavy metal content, and pesticide residues were determined from soils derived from four different agricultural regions around Shaowu City, Fujian, China, that had previously been cultivated with various crops, namely, Shui Dao Tu (SDT, rice), Guo Shu Tu (GST, pecan), Cha Shu Tu (CST, tea trees), and Sang Shen Tu (SST, mulberry). As fungi can either positively or negatively impact plant growth, the fungal communities in the different soils were characterized using long-read PacBio sequencing. Finally, we examined the quality of A. orientale grown in the different soils. Our results show that fungal community diversity of the GST soil was the highest with saprotrophs the main functional modes in these and SDT soils. Our data show that GST and SDT soils were most suitable for A. orientale growth, with the quality of the AR tubers harvested from GST soil being the highest. These data provide a systematic approach at soil properties of agricultural lands in need of replacement and/or rotating crops. Based on our findings, GST was identified as the optimal soil for planting A. orientale, providing a new resource for local farmers. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Exploring the cobia (Rachycentron canadum) genome: unveiling putative male heterogametic regions and identification of sex-specific markers.
- Author
-
Shen, Xueyan, Hu, Jie, Yáñez, José M, Bastos Gomes, Giana, Poon, Zhi Weng Josiah, Foster, Derick, Alarcon, Jorge F, Shao, Libin, Guo, Xinyu, Shao, Yunchang, Huerlimann, Roger, Li, Chengze, Goulden, Evan, Anderson, Kelli, Fan, Guangyi, and Domingos, Jose A
- Subjects
- *
COBIA , *SEX determination , *TRANSCRIPTION factors , *GENOME-wide association studies , *EPOXIDE hydrolase , *GENETIC sex determination - Abstract
Background Cobia (Rachycentron canadum) is the only member of the Rachycentridae family and exhibits considerable sexual dimorphism in growth rate. Sex determination in teleosts has been a long-standing basic biological question, and the molecular mechanisms of sex determination/differentiation in cobia are completely unknown. Results Here, we reported 2 high-quality, chromosome-level annotated male and female cobia genomes with assembly sizes of 586.51 Mb (contig/scaffold N50: 86.0 kb/24.3 Mb) and 583.88 Mb (79.9 kb/22.5 Mb), respectively. Synteny inference among perciform genomes revealed that cobia and the remora Echeneis naucrates were sister groups. Further, whole-genome resequencing of 31 males and 60 females, genome-wide association study, and sequencing depth analysis identified 3 short male-specific regions within a 10.7-kb continuous genomic region on male chromosome 18, which hinted at an undifferentiated sex chromosome system with a putative XX/XY mode of sex determination in cobia. Importantly, the only 2 genes within/between the male-specific regions, epoxide hydrolase 1 (ephx1 , renamed cephx1y) and transcription factor 24 (tcf24 , renamed ctcf24y), showed testis-specific/biased gene expression, whereas their counterparts cephx1x and ctf24x , located in female chromosome 18, were similarly expressed in both sexes. In addition, male-specific PCR targeting the cephx1y gene revealed that this genomic feature is conserved in cobia populations from Panama, Brazil, Australia, and Japan. Conclusion The first comprehensive genomic survey presented here is a valuable resource for future studies on cobia population structure and dynamics, conservation, and evolutionary history. Furthermore, it establishes evidence of putative male heterogametic regions with 2 genes playing a potential role in the sex determination of the species, and it provides further support for the rapid evolution of sex-determining mechanisms in teleost fish. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Bioturbation effect of artificial inoculation on the flavor metabolites and bacterial communities in the Chinese Mao-tofu fermentation
- Author
-
Tongwei Guan, Shiyu Fu, Xiaotian Wu, Hao Yu, and Ying Liu
- Subjects
Mao-tofu ,Amino acids ,Volatile profile ,Bacterial community ,PacBio sequencing ,Nutrition. Foods and food supply ,TX341-641 ,Food processing and manufacture ,TP368-456 - Abstract
A comparison between artificially inoculated Mao-tofu (CC) and naturally fermented Mao-tofu (MM) indicated that artificially adding Mucor plasmaticus to Mao-tofu dramatically enhanced the essential amino acid (EAA) content, as well as umami and sweet amino acids. Gas chromatography-tandem mass spectrometry (GC–MS/MS) analysis revealed that phenol (3.226 μg/g), 1-octen-3-ol (5.031 μg/g), ethyl heptanoate (1.646 μg/g), and indole (3.422 μg/g) were the key flavor components in Mao-tofu. Unlike MM, CC displayed a substantial increase in esters and a considerable decrease in foul odor substances, including sulfur-containing compounds and indole. Lactococcus raffinolactis, Enterobacter sp. 638, and Streptococcus parauberis KCTC 11537 represented the key bacterial species altering the amino acids and flavor of Mao-tofu according to PacBio single-molecule real-time (SMRT) sequencing and correlation analysis. This study presents the technical feasibility of artificially inoculating Mao-tofu to regulate the core bacterial communities and control the quality of fermented soybean products.
- Published
- 2024
- Full Text
- View/download PDF
17. Characterization of oxidosqualene cyclases involved in pentacyclic triterpene biosynthesis in Korean chestnut (Castanea crenata).
- Author
-
Han, Jung Yeon, Ahn, Chang-Ho, Choi, Han Suk, and Choi, Yong Eui
- Subjects
- *
CYCLASES , *CASTANEA , *CHESTNUT , *BIOSYNTHESIS , *TRITERPENES , *METABOLITES , *BARK - Abstract
Plant triterpenoids are secondary metabolites with high chemical diversity and interesting biological properties. Chestnuts are deciduous trees in the genus Castanea, and their nuts have been used as an important food. In this work, we identified various types of triterpenes, such as α-amyrin, β-amyrin, lupeol, and friedelin, in the leaves and/or stem bark of Korean chestnut (Castanea crenata). Triterpene biosynthesis occurs by the cyclization of 2,3-oxidosqualene to triterpenes, catalyzed by oxidosqualene cyclases (OSCs). A total of 65 putative OSC sequences were obtained from the leaf transcriptome data of C. crenata plants using PacBio sequencing. We selected 5 putative OSC unigenes, named CcOSC1-5, for functional characterization of genes involved in triterpene biosynthesis. Functional characterization of the CcOSC1-5 genes by heterologous expression in erg7 mutant yeast revealed that both CcOSC1 and CcOSC2 had a similar function, encoding multifunctional triterpene synthases producing mainly β-amyrin and a small amount of α-amyrin and lupeol. CcOSC3 encodes mixed amyrin synthase, which mainly produces β-amyrin and a small amount of α-amyrin. CcOSC4 produced mainly α-amyrin and lupeol and a small amount of β-amyrin. CcOSC5 encodes an enzyme for lupeol production as a single product. In conclusion, we identified various triterpenes and functionally characterized the triterpene synthase genes that participate in β-amyrin, α-amyrin, and lupeol biosynthesis in C. crenata. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. A new Plasmodium vivax reference genome for South American isolates.
- Author
-
De Meulenaere, Katlijn, Cuypers, Bart, Gamboa, Dionicia, Laukens, Kris, and Rosanas-Urgell, Anna
- Subjects
PLASMODIUM vivax ,WHOLE genome sequencing ,GENOMICS - Abstract
Background: Plasmodium vivax is the second most important cause of human malaria worldwide, and accounts for the majority of malaria cases in South America. A high-quality reference genome exists for Papua Indonesia (PvP01) and Thailand (PvW1), but is lacking for South America. A reference genome specifically for South America would be beneficial though, as P. vivax is a genetically diverse parasite with geographical clustering. Results: This study presents a new high-quality assembly of a South American P. vivax isolate, referred to as PvPAM (P. vivax Peruvian AMazon). The genome was obtained from a low input patient sample from the Peruvian Amazon and sequenced using PacBio technology, resulting in a highly complete assembly with 6497 functional genes. Telomeric ends were present in 17 out of 28 chromosomal ends, and additional (sub)telomeric regions are present in 12 unassigned contigs. A comparison of multigene families between PvPAM and the PvP01 genome revealed remarkable variation in vir genes, and the presence of merozoite surface proteins (MSP) 3.6 and 3.7. Three dhfr and dhps drug resistance associated mutations are present in PvPAM, similar to those found in other Peruvian isolates. Mapping of publicly available South American whole genome sequencing (WGS) data to PvPAM resulted in significantly fewer variants and truncated reads compared to the use of PvP01 or PvW1 as reference genomes. To minimize the number of core genome variants in non-South American samples, PvW1 is most suited for Southeast Asian isolates, both PvPAM and PvW1 are suited for South Asian isolates, and PvPAM is recommended for African isolates. Interestingly, non-South American samples still contained the least subtelomeric variants when mapped to PvPAM, indicating high quality of the PvPAM subtelomeric regions. Conclusions: Our findings show that the PvPAM reference genome more accurately represents South American P. vivax isolates in comparison to PvP01 and PvW1. In addition, PvPAM has a high level of completeness, and contains a similar number of annotated genes as PvP01 or PvW1. The PvPAM genome therefore will be a valuable resource to improve future genomic analyses on P. vivax isolates from the South American continent. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
19. The minor chicken class I gene BF1 is deleted between short imperfect direct repeats in the B14 and typical B15 major histocompatibility complex (MHC) haplotypes.
- Author
-
Rocos, Nicolas I. E., Coulter, Felicity J., Tan, Thomas C. J., and Kaufman, Jim
- Subjects
- *
MAJOR histocompatibility complex , *CHICKENS , *CYTOTOXIC T cells , *HAPLOTYPES , *NUCLEOTIDE sequencing - Abstract
The chicken major histocompatibility complex (MHC, also known as the BF-BL region of the B locus) is notably small and simple with few genes, most of which are involved in antigen processing and presentation. There are two classical class I genes, of which only BF2 is well and systemically expressed as the major ligand for cytotoxic T lymphocytes (CTLs). The other class I gene, BF1, is believed to be primarily a natural killer (NK) cell ligand. Among most standard chicken MHC haplotypes examined in detail, BF1 is expressed tenfold less than BF2 at the RNA level due to defects in the promoter or in a splice site. However, in the B14 and typical B15 haplotypes, BF1 RNA was not detected, and here, we show that a deletion between imperfect 32 nucleotide direct repeats has removed the BF1 gene entirely. The phenotypic effects of not having a BF1 gene (particularly on resistance to infectious pathogens) have not been systematically explored, but such deletions between short direct repeats are also found in some BF1 promoters and in the 5′ untranslated region (5′UTR) of some BG genes found in the BG region of the B locus. Despite the opposite transcriptional orientation of homologous genes in the chicken MHC, which might prevent the loss of key genes from a minimal essential MHC, it appears that small direct repeats can still lead to deletion. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
20. Analysis on the salt tolerance of Nitraria sibirica Pall. based on Pacbio full-length transcriptome sequencing.
- Author
-
Zhang, Panpan, Zhang, Fengxiang, Wu, Zhiheng, Cahaeraduqin, Sunaer, Liu, Wei, and Yan, Yongqing
- Subjects
- *
HALOPHYTES , *GENE expression , *TRANSCRIPTOMES , *SALT , *REACTIVE oxygen species , *GENE regulatory networks - Abstract
Key message: Nitraria sibirica Pall. regulates its tolerance to salt stress mainly by adjusting ion balance, modifying cell wall structure, and activating signal transduction pathways. N. sibirica, as a typical halophyte, can not only effectively restore saline-alkali land, but also has high economic value. However, studies on its salt tolerance at combining molecular and physiological levels were limited. In this study, the salt tolerance of N. sibirica was analyzed based on Pacbio full-length transcriptome sequencing, and the salt tolerance in the physiological level was verified by key genes. The results showed that 89,017 full-length transcripts were obtained, of which 84,632 sequences were annotated. A total of 86,482 coding sequences (CDS) were predicted and 6561 differentially expressed genes (DEGs) were identified. DEGs were significantly enriched in "sodium ion homeostasis", "response to osmotic stress", "reactive oxygen species metabolic process", "defense response by cell wall thickening", "signal transduction", etc. The expression levels for most of these DEGs increased under salt stress. A total of 69 key genes were screened based on weighted gene co-expression network analysis (WGCNA), of which 33 were first reported on salt tolerance. Moreover, NsRabE1c gene with the highest expression level was selected to verify its salt tolerance. Over-expression of NsRabE1c gene enhanced the germination potential and root length of transgenic Arabidopsis thaliana plants without salt treatment as compared to those of Col-0 and AtRabE1c mutant. The expression levels of NsRabE1c decreased in the growth stagnation phase, while significantly increased in the growth recovery phase under salt stress. We predicted that NsRabE1c gene help N. sibirica resist salt stress through the regulation of plant growth. The results of this study deepen the understanding of salinity resistance in N. sibirica. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
21. Development of Twenty Novel Polymorphic Microsatellite Markers and Their Application in Population Genetic Studies of Konosirus punctatus
- Author
-
Miao, Zengliang, Jin, Xun, Chen, Shiyi, Zhang, Kun, Li, Jiasheng, Peng, Ying, Huang, Wenhua, Liang, Xudong, Shen, Haodi, Liu, Yifan, and Liu, Bingjian
- Published
- 2024
- Full Text
- View/download PDF
22. Chromosome-level genome assembly of Cylas formicarius provides insights into its adaptation and invasion mechanisms
- Author
-
Jin-feng HUA, Lei ZHANG, Yong-hua HAN, Xiao-wan GOU, Tian-yuan CHEN, Yong-mei HUANG, Yan-qing LI, Dai-fu MA, and Zong-yun LI
- Subjects
Cylas formicarius ,PacBio sequencing ,high-through chromosome conformation capture ,chromosome-level genome ,chemosensory genes ,fluorescence competitive binding ,Agriculture (General) ,S1-972 - Abstract
Cylas formicarius is one of the most important pests of sweet potato worldwide, causing considerable ecological and economic damage. This study improved the effect of comprehensive management and understanding of genetic mechanisms by examining the functional genomics of C. formicarius. Using Illumina and PacBio sequencing, this study obtained a chromosome-level genome assembly of adult weevils from lines inbred for 15 generations. The high-quality assembly obtained was 338.84 Mb, with contig and scaffold N50 values of 14.97 and 34.23 Mb, respectively. In total, 157.51 Mb of repeat sequences and 11 907 protein-coding genes were predicted. A total of 337.06 Mb of genomic sequences was located on the 11 chromosomes, accounting for 99.03% of the total length of the associated chromosome. Comparative genomic analysis showed that C. formicarius was sister to Dendroctonus ponderosae, and C. formicarius diverged from D. ponderosae approximately 138.89 million years ago (Mya). Many important gene families expanded in the C. formicarius genome were involved in the detoxification of pesticides, tolerance to cold stress and chemosensory system. To further study the role of odorant-binding proteins (OBPs) in olfactory recognition of C. formicarius, the binding assay results indicated that CforOBP4–6 had strong binding affinities for sex pheromones and other ligands. The high-quality C. formicarius genome provides a valuable resource to reveal the molecular ecological basis, genetic mechanism, and evolutionary process of major agricultural pests; it also offers new ideas and new technologies for ecologically sustainable pest control.
- Published
- 2023
- Full Text
- View/download PDF
23. A new genome assembly of an African weakly electric fish (Campylomormyrus compressirostris, Mormyridae) indicates rapid gene family evolution in Osteoglossomorpha
- Author
-
Feng Cheng, Alice B. Dennis, Josephine Ijeoma Osuoha, Julia Canitz, Frank Kirschbaum, and Ralph Tiedemann
- Subjects
Campylomormyrus ,Pacbio sequencing ,Gene family ,Osteoglossomorpha ,Kv1 ,Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Teleost fishes comprise more than half of the vertebrate species. Within teleosts, most phylogenies consider the split between Osteoglossomorpha and Euteleosteomorpha/Otomorpha as basal, preceded only by the derivation of the most primitive group of teleosts, the Elopomorpha. While Osteoglossomorpha are generally species poor, the taxon contains the African weakly electric fish (Mormyroidei), which have radiated into numerous species. Within the mormyrids, the genus Campylomormyrus is mostly endemic to the Congo Basin. Campylomormyrus serves as a model to understand mechanisms of adaptive radiation and ecological speciation, especially with regard to its highly diverse species-specific electric organ discharges (EOD). Currently, there are few well-annotated genomes available for electric fish in general and mormyrids in particular. Our study aims at producing a high-quality genome assembly and to use this to examine genome evolution in relation to other teleosts. This will facilitate further understanding of the evolution of the osteoglossomorpha fish in general and of electric fish in particular. Results A high-quality weakly electric fish (C. compressirostris) genome was produced from a single individual with a genome size of 862 Mb, consisting of 1,497 contigs with an N50 of 1,399 kb and a GC-content of 43.69%. Gene predictions identified 34,492 protein-coding genes, which is a higher number than in the two other available Osteoglossomorpha genomes of Paramormyrops kingsleyae and Scleropages formosus. A Computational Analysis of gene Family Evolution (CAFE5) comparing 33 teleost fish genomes suggests an overall faster gene family turnover rate in Osteoglossomorpha than in Otomorpha and Euteleosteomorpha. Moreover, the ratios of expanded/contracted gene family numbers in Osteoglossomorpha are significantly higher than in the other two taxa, except for species that had undergone an additional genome duplication (Cyprinus carpio and Oncorhynchus mykiss). As potassium channel proteins are hypothesized to play a key role in EOD diversity among species, we put a special focus on them, and manually curated 16 Kv1 genes. We identified a tandem duplication in the KCNA7a gene in the genome of C. compressirostris. Conclusions We present the fourth genome of an electric fish and the third well-annotated genome for Osteoglossomorpha, enabling us to compare gene family evolution among major teleost lineages. Osteoglossomorpha appear to exhibit rapid gene family evolution, with more gene family expansions than contractions. The curated Kv1 gene family showed seven gene clusters, which is more than in other analyzed fish genomes outside Osteoglossomorpha. The KCNA7a, encoding for a potassium channel central for EOD production and modulation, is tandemly duplicated which may related to the diverse EOD observed among Campylomormyrus species.
- Published
- 2023
- Full Text
- View/download PDF
24. Chromosome-level genome assembly of a high-altitude-adapted frog (Rana kukunoris) from the Tibetan plateau provides insight into amphibian genome evolution and adaptation
- Author
-
Wei Chen, Hongzhou Chen, Jiahong Liao, Min Tang, Haifen Qin, Zhenkun Zhao, Xueyan Liu, Yanfang Wu, Lichun Jiang, Lixia Zhang, Bohao Fang, Xueyun Feng, Baowei Zhang, Kerry Reid, and Juha Merilä
- Subjects
Chromosome ,High-altitude adaptation ,Genome ,Hi-C sequencing ,Illumina ,PacBio sequencing ,Zoology ,QL1-991 - Abstract
Abstract Background The high-altitude-adapted frog Rana kukunoris, occurring on the Tibetan plateau, is an excellent model to study life history evolution and adaptation to harsh high-altitude environments. However, genomic resources for this species are still underdeveloped constraining attempts to investigate the underpinnings of adaptation. Results The R. kukunoris genome was assembled to a size of 4.83 Gb and the contig N50 was 1.80 Mb. The 6555 contigs were clustered and ordered into 12 pseudo-chromosomes covering ~ 93.07% of the assembled genome. In total, 32,304 genes were functionally annotated. Synteny analysis between the genomes of R. kukunoris and a low latitude species Rana temporaria showed a high degree of chromosome level synteny with one fusion event between chr11 and chr13 forming pseudo-chromosome 11 in R. kukunoris. Characterization of features of the R. kukunoris genome identified that 61.5% consisted of transposable elements and expansions of gene families related to cell nucleus structure and taste sense were identified. Ninety-five single-copy orthologous genes were identified as being under positive selection and had functions associated with the positive regulation of proteins in the catabolic process and negative regulation of developmental growth. These gene family expansions and positively selected genes indicate regions for further interrogation to understand adaptation to high altitude. Conclusions Here, we reported a high-quality chromosome-level genome assembly of a high-altitude amphibian species using a combination of Illumina, PacBio and Hi-C sequencing technologies. This genome assembly provides a valuable resource for subsequent research on R. kukunoris genomics and amphibian genome evolution in general.
- Published
- 2023
- Full Text
- View/download PDF
25. Long‐Read Sequencing Reveals Extensive DNA Methylations in Human Gut Phagenome Contributed by Prevalently Phage‐Encoded Methyltransferases.
- Author
-
Sun, Chuqing, Chen, Jingchao, Jin, Menglu, Zhao, Xueyang, Li, Yun, Dong, Yanqi, Gao, Na, Liu, Zhi, Bork, Peer, Zhao, Xing‐Ming, and Chen, Wei‐Hua
- Subjects
- *
DNA methylation , *HUMAN DNA , *METHYLATION , *BACTERIOPHAGES , *DNA methyltransferases - Abstract
DNA methylation plays a crucial role in the survival of bacteriophages (phages), yet the understanding of their genome methylation remains limited. In this study, DNA methylation patterns are analyzed in 8848 metagenome‐assembled high‐quality phages from 104 fecal samples using single‐molecule real‐time sequencing. The results demonstrate that 97.60% of gut phages exhibit methylation, with certain factors correlating with methylation densities. Phages with higher methylation densities appear to have potential viability advantages. Strikingly, more than one‐third of the phages possess their own DNA methyltransferases (MTases). Increased MTase copies are associated with higher genome methylation densities, specific methylation motifs, and elevated prevalence of certain phage groups. Notably, the majority of these MTases share close homology with those encoded by gut bacteria, suggesting their exchange during phage–bacterium interactions. Furthermore, these MTases can be employed to accurately predict phage–host relationships. Overall, the findings indicate the widespread utilization of DNA methylation by gut DNA phages as an evasion mechanism against host defense systems, with a substantial contribution from phage‐encoded MTases. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
26. Genomics in the long-read sequencing era.
- Author
-
van Dijk, Erwin L., Naquin, Delphine, Gorrichon, Kévin, Jaszczyszyn, Yan, Ouazahrou, Rania, Thermes, Claude, and Hernandez, Céline
- Subjects
- *
NUCLEOTIDE sequencing , *RNA modification & restriction , *HUMAN genetic variation , *GENOMICS , *GENOMES , *EPIGENETICS - Abstract
Long-read sequencing (LRS) methods can now produce highly accurate (ultra)long reads and thus enable for the first time the production of truly complete telomere-to-telomere (T2T) assemblies of complex genomes. While the recently produced T2T assemblies were labor-intensive and required a combination of various techniques, in the near future T2T assemblies based on LRS alone will be produced. We have entered the era of population scale long-read sequencing, where graph-based pangenomes much better representing genomic variation will increasingly be used in the near future. New applications of LRS appear at a rapid pace, with examples ranging from higher order chromatin interaction studies to the quality control of mRNA vaccines. Long-read sequencing (LRS) technologies have provided extremely powerful tools to explore genomes. While in the early years these methods suffered technical limitations, they have recently made significant progress in terms of read length, throughput, and accuracy and bioinformatics tools have strongly improved. Here, we aim to review the current status of LRS technologies, the development of novel methods, and the impact on genomics research. We will explore the most impactful recent findings made possible by these technologies focusing on high-resolution sequencing of genomes and transcriptomes and the direct detection of DNA and RNA modifications. We will also discuss how LRS methods promise a more comprehensive understanding of human genetic variation, transcriptomics, and epigenetics for the coming years. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
27. Beyond the exome: What's next in diagnostic testing for Mendelian conditions.
- Author
-
Wojcik, Monica H., Reuter, Chloe M., Marwaha, Shruti, Mahmoud, Medhat, Duyzend, Michael H., Barseghyan, Hayk, Yuan, Bo, Boone, Philip M., Groopman, Emily E., Délot, Emmanuèle C., Jain, Deepti, Sanchis-Juan, Alba, Starita, Lea M., Talkowski, Michael, Montgomery, Stephen B., Bamshad, Michael J., Chong, Jessica X., Wheeler, Matthew T., Berger, Seth I., and O'Donnell-Luria, Anne
- Subjects
- *
MEDICAL genetics , *DNA sequencing , *GENOTYPE-environment interaction , *GENETIC testing , *GENE mapping , *NUCLEOTIDE sequencing - Abstract
Despite advances in clinical genetic testing, including the introduction of exome sequencing (ES), more than 50% of individuals with a suspected Mendelian condition lack a precise molecular diagnosis. Clinical evaluation is increasingly undertaken by specialists outside of clinical genetics, often occurring in a tiered fashion and typically ending after ES. The current diagnostic rate reflects multiple factors, including technical limitations, incomplete understanding of variant pathogenicity, missing genotype-phenotype associations, complex gene-environment interactions, and reporting differences between clinical labs. Maintaining a clear understanding of the rapidly evolving landscape of diagnostic tests beyond ES, and their limitations, presents a challenge for non-genetics professionals. Newer tests, such as short-read genome or RNA sequencing, can be challenging to order, and emerging technologies, such as optical genome mapping and long-read DNA sequencing, are not available clinically. Furthermore, there is no clear guidance on the next best steps after inconclusive evaluation. Here, we review why a clinical genetic evaluation may be negative, discuss questions to be asked in this setting, and provide a framework for further investigation, including the advantages and disadvantages of new approaches that are nascent in the clinical sphere. We present a guide for the next best steps after inconclusive molecular testing based upon phenotype and prior evaluation, including when to consider referral to research consortia focused on elucidating the underlying cause of rare unsolved genetic disorders. Over 50% of individuals with suspected genetic conditions remain undiagnosed after clinical genetic testing. To enhance understanding of the options available beyond exome sequencing, this article offers professionals from any specialty a framework for further investigation and guidance on the next best steps after an inclusive evaluation. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
28. The Viscum album Gene Space database.
- Author
-
Schröder, Lucie, Rupp, Oliver, Senkler, Michael, Rugen, Nils, Hohnjec, Natalija, Goesmann, Alexander, Küster, Helge, and Braun, Hans-Peter
- Subjects
DATABASES ,PLANT life cycles ,LIFE cycles (Biology) ,HUMAN genome ,PLANT genomes - Abstract
The hemiparasitic flowering plant Viscum album (European mistletoe) is known for its very special life cycle, extraordinary biochemical properties, and extremely large genome. The size of its genome is estimated to be 30 times larger than the human genome and 600 times larger than the genome of the model plant Arabidopsis thaliana. To achieve insights into the Gene Space of the genome, which is defined as the space including and surrounding protein-coding regions, a transcriptome project based on PacBio sequencing has recently been conducted. A database resulting from this project contains sequences of 39,092 different open reading frames encoding 32,064 distinct proteins. Based on 'Benchmarking Universal Single-Copy Orthologs' (BUSCO) analysis, the completeness of the database was estimated to be in the range of 78%. To further develop this database, we performed a transcriptome project of V. album organs harvested in summer and winter based on Illumina sequencing. Data from both sequencing strategies were combined. The new V. album Gene Space database II (VaGs II) contains 90,039 sequences and has a completeness of 93% as revealed by BUSCO analysis. Sequences from other organisms, particularly fungi, which are known to colonize mistletoe leaves, have been removed. To evaluate the quality of the new database, proteome data of a mitochondrial fraction of V. album were re-analyzed. Compared to the original evaluation published five years ago, nearly 1000 additional proteins could be identified in the mitochondrial fraction, providing new insights into the Oxidative Phosphorylation System of V. album. The VaGs II database is available at https://viscumalbum.pflanzenproteomik.de/. Furthermore, all V. album sequences have been uploaded at the European Nucleotide Archive (ENA). [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
29. PacBio Sequencing Unravels Soil Bacterial Assembly Processes along a Gradient of Organic Fertilizer Application.
- Author
-
Wang, Wenhui, Gao, Yuan, Li, Na, Lu, Hongmei, Lan, Ranxiang, and Gu, Xungang
- Subjects
- *
FERTILIZER application , *ORGANIC fertilizers , *BACTERIAL communities , *SLOPE stability , *SOILS , *AGRICULTURE , *SOIL classification - Abstract
The application of organic fertilizer is an important agricultural practice for improving soil health and the soil microflora, and the microbial community assembly process relating to this application is also closely associated with soil health. However, the effects of organic fertilizer intensification on the bacterial community assembly processes of farmland soil are often overlooked. In this study, bacterial community structure, ecological networks, and bacterial community assembly processes were evaluated using the investment soil-cultivation test and PacBio sequencing. The PCoA, Mantel test, and Procrustes analysis showed that overfertilization changed soil physicochemical properties and caused significant succession of soil bacterial communities (p < 0.05). The neutral community model indicated that the spread of bacteria in the low-fertilization group was greater than that in the high-fertilization group. Under conditions of overfertilization via organic fertilizer (organic matter ≥ 50% and N-P2O5-K2O ≥ 5%), the bacterial network topology and stability of nutrient-rich loess brown (H) soil were improved compared with those of red (R) soils, and the slope of the robustness analysis displayed a 10.9% decrease in H soil and a 37.2% decrease in R soil. The inference of community assembly mechanisms via phylogenetic-bin-based null model analysis (iCAMP) confirmed that with increasing fertilization, the relative importance of ecological drift gradually increased, and the importance of homogeneous selection was reduced (p < 0.01, permutational ANOVA). A total of 103 bins (in the selected top 200 bins) of the dominant process were different between the H and R soils. The results clarified that homogeneous selection and drift were the dominant processes driving the assembly of bacterial communities in different soil types along the gradient of organic fertilizer application and confirmed that excessive fertilization enhanced the relative importance of drift among the construction mechanisms. Changes in soil construction mechanisms due to overfertilization are related not only to soil type but also to different microbial lineages. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
30. Low-copy nuclear sequence data confirm complex patterns of farina evolution in notholaenid ferns (Pteridaceae)
- Author
-
Kao, Tzu-Tong, Pryer, Kathleen M, Freund, Forrest D, Windham, Michael D, and Rothfels, Carl J
- Subjects
Biological Sciences ,Ecology ,Evolutionary Biology ,Genetics ,Base Sequence ,Bayes Theorem ,Biological Evolution ,Cell Nucleus ,Chromosomes ,Plant ,DNA ,Plant ,Gene Dosage ,Genetic Markers ,Mexico ,Phylogeny ,Plastids ,Ploidies ,Pteridaceae ,Southwestern United States ,Allopolyploidy ,Cheilanthoids ,Missing diploids ,Notholaena ,PacBio sequencing ,Phylogenetics ,P-URC bioinformatics pipeline ,P(URC) bioinformatics pipeline ,Zoology ,Evolutionary biology - Abstract
Notholaenids are an unusual group of ferns that have adapted to, and diversified within, the deserts of Mexico and the southwestern United States. With approximately 40 species, this group is noted for being desiccation-tolerant and having "farina"-powdery exudates of lipophilic flavonoid aglycones-that occur on both the gametophytic and sporophytic phases of their life cycle. The most recent circumscription of notholaenids based on plastid markers surprisingly suggests that several morphological characters, including the expression of farina, are homoplasious. In a striking case of convergence, Notholaena standleyi appears to be distantly related to core Notholaena, with several taxa not before associated with Notholaena nested between them. Such conflicts can be due to morphological homoplasy resulting from adaptive convergence or, alternatively, the plastid phylogeny itself might be misleading, diverging from the true species tree due to incomplete lineage sorting, hybridization, or other factors. In this study, we present a species phylogeny for notholaenid ferns, using four low-copy nuclear loci and concatenated data from three plastid loci. A total of 61 individuals (49 notholaenids and 12 outgroup taxa) were sampled, including 31 out of 37 recognized notholaenid species. The homeologous/allelic nuclear sequences were retrieved using PacBio sequencing and the PURC bioinformatics pipeline. Each dataset was first analyzed individually using maximum likelihood and Bayesian inference, and the species phylogeny was inferred using *BEAST. Although we observed several incongruences between the nuclear and plastid phylogenies, our principal results are broadly congruent with previous inferences based on plastid data. By mapping the presence of farina and their biochemical constitutions on our consensus phylogenetic tree, we confirmed that the characters are indeed homoplastic and have complex evolutionary histories. Hybridization among recognized species of the notholaenid clade appears to be relatively rare compared to that observed in other well-studied fern genera.
- Published
- 2019
31. Identification of the Novel HLA‐DPA1*02:146 Allele by PacBio Sequencing.
- Author
-
Chen, Bin, An, Tongcui, Lin, Yani, Ru, Kun, and Li, Chengping
- Subjects
- *
NUCLEOTIDE sequencing , *ALLELES - Abstract
The novel HLA‐DPA1*02:146 allele differs from HLA‐DPA1*02:20 by one nucleotide substitution in exon 2. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. Narrative Review: Update on the Molecular Diagnosis of Fragile X Syndrome.
- Author
-
Ciobanu, Cristian-Gabriel, Nucă, Irina, Popescu, Roxana, Antoci, Lucian-Mihai, Caba, Lavinia, Ivanov, Anca Viorica, Cojocaru, Karina-Alexandra, Rusu, Cristina, Mihai, Cosmin-Teodor, and Pânzaru, Monica-Cristina
- Subjects
- *
MOLECULAR diagnosis , *FRAGILE X syndrome , *SOUTHERN blot , *TECHNOLOGICAL innovations , *GENE silencing , *GENE mapping - Abstract
The diagnosis and management of fragile X syndrome (FXS) have significantly improved in the last three decades, although the current diagnostic techniques are not yet able to precisely identify the number of repeats, methylation status, level of mosaicism, and/or the presence of AGG interruptions. A high number of repeats (>200) in the fragile X messenger ribonucleoprotein 1 gene (FMR1) results in hypermethylation of promoter and gene silencing. The actual molecular diagnosis is performed using a Southern blot, TP-PCR (Triplet-Repeat PCR), MS-PCR (Methylation-Specific PCR), and MS-MLPA (Methylation-Specific MLPA) with some limitations, with multiple assays being necessary to completely characterise a patient with FXS. The actual gold standard diagnosis uses Southern blot; however, it cannot accurately characterise all cases. Optical genome mapping is a new technology that has also been developed to approach the diagnosis of fragile X syndrome. Long-range sequencing represented by PacBio and Oxford Nanopore has the potential to replace the actual diagnosis and offers a complete characterization of molecular profiles in a single test. The new technologies have improved the diagnosis of fragile X syndrome and revealed unknown aberrations, but they are a long way from being used routinely in clinical practice. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
33. Genome resource of Xanthomonas oryzae pv. oryzae Chinese strain NE-8 causing bacterial blight of rice.
- Author
-
Shafique, Muhammad Sohaib, Guo, Wei, Chen, Xifeng, Zhao, Kaijun, Liu, Yapei, Wang, Chunlian, and Ji, Zhiyuan
- Published
- 2023
- Full Text
- View/download PDF
34. Metabolomic and transcriptomic analyses of the flavonoid biosynthetic pathway in blueberry (Vaccinium spp.).
- Author
-
Yinping Li, Haifei Li, Shiyao Wang, Jing Li, Bacha, Syed Asim Shah, and Guofeng Xu
- Subjects
FLAVONOIDS ,VACCINIUM ,METABOLOMICS ,BLUEBERRIES ,TRANSCRIPTOMES ,GENE regulatory networks - Abstract
As a highly economic small fruit crop, blueberry is enjoyed by most people in terms of color, taste, and rich nutrition. To better understand its coloring mechanism on the process of ripening, an integrative analysis of the metabolome and transcriptome profiles was performed in three blueberry varieties at three developmental stages. In this study, 41 flavonoid metabolites closely related to the coloring in blueberry samples were analyzed. It turned out that the most differential metabolites in the ripening processes were delphinidin-3-O-arabinoside (dpara), peonidin-3-O-glucoside (pnglu), and delphinidin-3-Ogalactoside (dpgal), while the most differential metabolites among different varieties were flavonols. Furthermore, to obtain more accurate and comprehensive transcripts of blueberry during the developmental stages, PacBio and Illumina sequencing technology were combined to obtain the transcriptome of the blueberry variety Misty, for the very first time. Finally, by applying the gene coexpression network analysis, the darkviolet and bisque4 modules related to flavonoid synthesis were determined, and the key genes related to two flavonoid 3', 5'-hydroxylase (F3'5'H) genes in the darkviolet module and one bHLH transcription factor in the bisque4 module were predicted. It is believed that our findings could provide valuable information for the future study on the molecular mechanism of flavonoid metabolites and flavonoid synthesis pathways in blueberries. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
35. Full-length transcriptome analysis of Zanthoxylum nitidum (Roxb.) DC.
- Author
-
Yanxia Zhu, Yanfen Huang, Kunhua Wei, Junnan Yu, and Jianping Jiang
- Subjects
LINCRNA ,POPULATION genetics ,ZANTHOXYLUM ,MICROSATELLITE repeats ,TRANSCRIPTOMES ,HERBS - Abstract
Zanthoxylum nitidum (Roxb.) DC. (Z. nitidum) is a type of Chinese Dao-di herb, also called Liangmianzhen, which is widely used to treat arthralgia, rheumatic arthralgia, and stomach pain. However, genomic resources for Z. nitidum are still scarce. This study provides transcriptomic resources for Z. nitidum by applying single-molecule real-time (SMRT) sequencing technology. In total, 456,109 circular consensus sequencing (CCS) reads were generated with a mean length of 2,216 bp from Z. nitidum roots, old stems, young branches, leaves, flowers, and fruits. Of these total reads, 353,932 were full-length nonchimeric (FLNC) reads with an average length of 1,996 bp. A total of 16,163 transcripts with a mean length of 1,171 bp were acquired. Of these transcripts, 14,231 (88%) were successfully annotated using public databases. Across all the 16,163 transcripts, we identified 6,255 long non-coding RNAs (lncRNAs) and 22,780 simple sequence repeats (SSRs). Furthermore, 3,482 transcription factors were identified. Among the SSR loci, 1-3 nucleotide repeats were dominant, occupying 99.36% of the total SSR loci, with mono-, di-, and tri-nucleotide repeats accounting for 61.80%, 19.89%, and 5.02% of the total SSR loci, respectively. A total of 36 out of 100 randomly selected primer pairs were verified to be positive, 20 of which showed polymorphism. These findings enrich the genetic resources available for facilitating future studies and research on relevant topics such as population genetics in Z. nitidum. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
36. Highly accurate genome assembly of an improved high-yielding silkworm strain, Nichi01.
- Author
-
Ryusei Waizumi, Takuya Tsubota, Akiya Jouraku, Seigo Kuwazaki, Kakeru Yokoi, Tetsuya Iizuka, Kimiko Yamamoto, and Hideki Sezutsu
- Subjects
- *
SILKWORMS , *DOMESTIC animals , *SILK production , *GENE mapping , *INSECTS - Abstract
The silkworm (Bombyx mori) is an important lepidopteran model insect and an industrial domestic animal traditionally used for silk production. Here, we report the genome assembly of an improved Japanese strain Nichi01, in which the cocoon yield is comparable to that of commercial silkworm strains. The integration of PacBio Sequel II long-read and ddRAD-seq-based high-density genetic linkage map achieved the highest quality genome assembly of silkworms to date; 22 of the 28 pseudomolecules contained telomeric repeats at both ends, and only four gaps were present in the assembly. A total of 452 Mbp of the assembly with an N50 of 16.614 Mbp covered 99.3% of the complete orthologs of the lepidopteran core genes. Although the genome sequence of Nichi01 and that of the previously reported low-yielding tropical strain p50T assured their accuracy in most regions, we corrected several regions, misassembled in p50T, in our assembly. A total of 18,397 proteins were predicted using over 95 Gb of mRNA-seq derived from 10 different organs, covering 96.9% of the complete orthologs of the lepidopteran core genes. The final assembly and annotation files are available in KAIKObase (https://kaikobase.dna.affrc.go.jp/index.html) along with a genome browser and BLAST searching service, which would facilitate further studies and the breeding of silkworms and other insects. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
37. The Viscum album Gene Space database
- Author
-
Lucie Schröder, Oliver Rupp, Michael Senkler, Nils Rugen, Natalija Hohnjec, Alexander Goesmann, Helge Küster, and Hans-Peter Braun
- Subjects
database development ,PacBio sequencing ,Illumina sequencing ,Complexome profiling ,mitochondria ,oxidative phosphorylation (OXPHOS) ,Plant culture ,SB1-1110 - Abstract
The hemiparasitic flowering plant Viscum album (European mistletoe) is known for its very special life cycle, extraordinary biochemical properties, and extremely large genome. The size of its genome is estimated to be 30 times larger than the human genome and 600 times larger than the genome of the model plant Arabidopsis thaliana. To achieve insights into the Gene Space of the genome, which is defined as the space including and surrounding protein-coding regions, a transcriptome project based on PacBio sequencing has recently been conducted. A database resulting from this project contains sequences of 39,092 different open reading frames encoding 32,064 distinct proteins. Based on ‘Benchmarking Universal Single-Copy Orthologs’ (BUSCO) analysis, the completeness of the database was estimated to be in the range of 78%. To further develop this database, we performed a transcriptome project of V. album organs harvested in summer and winter based on Illumina sequencing. Data from both sequencing strategies were combined. The new V. album Gene Space database II (VaGs II) contains 90,039 sequences and has a completeness of 93% as revealed by BUSCO analysis. Sequences from other organisms, particularly fungi, which are known to colonize mistletoe leaves, have been removed. To evaluate the quality of the new database, proteome data of a mitochondrial fraction of V. album were re-analyzed. Compared to the original evaluation published five years ago, nearly 1000 additional proteins could be identified in the mitochondrial fraction, providing new insights into the Oxidative Phosphorylation System of V. album. The VaGs II database is available at https://viscumalbum.pflanzenproteomik.de/. Furthermore, all V. album sequences have been uploaded at the European Nucleotide Archive (ENA).
- Published
- 2023
- Full Text
- View/download PDF
38. Revealing the transcriptomic complexity of switchgrass by PacBio long-read sequencing
- Author
-
Zuo, Chunman, Blow, Matthew, Sreedasyam, Avinash, Kuo, Rita C, Ramamoorthy, Govindarajan Kunde, Torres-Jerez, Ivone, Li, Guifen, Wang, Mei, Dilworth, David, Barry, Kerrie, Udvardi, Michael, Schmutz, Jeremy, Tang, Yuhong, and Xu, Ying
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,Biotechnology ,1.1 Normal biological development and functioning ,Generic health relevance ,Switchgrass ,PacBio sequencing ,Transcriptomic analysis ,Alternative splicing ,Plant cell wall ,Chemical Engineering ,Industrial Biotechnology ,Biochemistry and cell biology ,Industrial biotechnology - Abstract
BackgroundSwitchgrass (Panicum virgatum L.) is an important bioenergy crop widely used for lignocellulosic research. While extensive transcriptomic analyses have been conducted on this species using short read-based sequencing techniques, very little has been reliably derived regarding alternatively spliced (AS) transcripts.ResultsWe present an analysis of transcriptomes of six switchgrass tissue types pooled together, sequenced using Pacific Biosciences (PacBio) single-molecular long-read technology. Our analysis identified 105,419 unique transcripts covering 43,570 known genes and 8795 previously unknown genes. 45,168 are novel transcripts of known genes. A total of 60,096 AS transcripts are identified, 45,628 being novel. We have also predicted 1549 transcripts of genes involved in cell wall construction and remodeling, 639 being novel transcripts of known cell wall genes. Most of the predicted transcripts are validated against Illumina-based short reads. Specifically, 96% of the splice junction sites in all the unique transcripts are validated by at least five Illumina reads. Comparisons between genes derived from our identified transcripts and the current genome annotation revealed that among the gene set predicted by both analyses, 16,640 have different exon-intron structures.ConclusionsOverall, substantial amount of new information is derived from the PacBio RNA data regarding both the transcriptome and the genome of switchgrass.
- Published
- 2018
39. A new genome assembly of an African weakly electric fish (Campylomormyrus compressirostris, Mormyridae) indicates rapid gene family evolution in Osteoglossomorpha.
- Author
-
Cheng, Feng, Dennis, Alice B., Osuoha, Josephine Ijeoma, Canitz, Julia, Kirschbaum, Frank, and Tiedemann, Ralph
- Subjects
ELECTRIC fishes ,ADAPTIVE radiation ,GENOMES ,GENOME size ,GENE families ,ELECTRIC discharges - Abstract
Background: Teleost fishes comprise more than half of the vertebrate species. Within teleosts, most phylogenies consider the split between Osteoglossomorpha and Euteleosteomorpha/Otomorpha as basal, preceded only by the derivation of the most primitive group of teleosts, the Elopomorpha. While Osteoglossomorpha are generally species poor, the taxon contains the African weakly electric fish (Mormyroidei), which have radiated into numerous species. Within the mormyrids, the genus Campylomormyrus is mostly endemic to the Congo Basin. Campylomormyrus serves as a model to understand mechanisms of adaptive radiation and ecological speciation, especially with regard to its highly diverse species-specific electric organ discharges (EOD). Currently, there are few well-annotated genomes available for electric fish in general and mormyrids in particular. Our study aims at producing a high-quality genome assembly and to use this to examine genome evolution in relation to other teleosts. This will facilitate further understanding of the evolution of the osteoglossomorpha fish in general and of electric fish in particular. Results: A high-quality weakly electric fish (C. compressirostris) genome was produced from a single individual with a genome size of 862 Mb, consisting of 1,497 contigs with an N50 of 1,399 kb and a GC-content of 43.69%. Gene predictions identified 34,492 protein-coding genes, which is a higher number than in the two other available Osteoglossomorpha genomes of Paramormyrops kingsleyae and Scleropages formosus. A Computational Analysis of gene Family Evolution (CAFE5) comparing 33 teleost fish genomes suggests an overall faster gene family turnover rate in Osteoglossomorpha than in Otomorpha and Euteleosteomorpha. Moreover, the ratios of expanded/contracted gene family numbers in Osteoglossomorpha are significantly higher than in the other two taxa, except for species that had undergone an additional genome duplication (Cyprinus carpio and Oncorhynchus mykiss). As potassium channel proteins are hypothesized to play a key role in EOD diversity among species, we put a special focus on them, and manually curated 16 Kv1 genes. We identified a tandem duplication in the KCNA7a gene in the genome of C. compressirostris. Conclusions: We present the fourth genome of an electric fish and the third well-annotated genome for Osteoglossomorpha, enabling us to compare gene family evolution among major teleost lineages. Osteoglossomorpha appear to exhibit rapid gene family evolution, with more gene family expansions than contractions. The curated Kv1 gene family showed seven gene clusters, which is more than in other analyzed fish genomes outside Osteoglossomorpha. The KCNA7a, encoding for a potassium channel central for EOD production and modulation, is tandemly duplicated which may related to the diverse EOD observed among Campylomormyrus species. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
40. Incipiently social carpenter bees (Xylocopa) host distinctive gut bacterial communities and display geographical structure as revealed by full‐length PacBio 16S rRNA sequencing.
- Author
-
Handy, Madeline Y., Sbardellati, Dino L., Yu, Michael, Saleh, Nicholas W., Ostwald, Madeleine M., and Vannette, Rachel L.
- Subjects
- *
RIBOSOMAL RNA , *BEES , *GUT microbiome , *CROP rotation , *BACTERIAL communities , *CARPENTERS , *BEE colonies - Abstract
The gut microbiota of bees affects nutrition, immunity and host fitness, yet the roles of diet, sociality and geographical variation in determining microbiome structure, including variant‐level diversity and relatedness, remain poorly understood. Here, we use full‐length 16S rRNA amplicon sequencing to compare the crop and gut microbiomes of two incipiently social carpenter bee species, Xylocopa sonorina and Xylocopa tabaniformis, from multiple geographical sites within each species' range. We found that Xylocopa species share a set of core taxa consisting of Bombilactobacillus, Bombiscardovia and Lactobacillus, found in >95% of all individual bees sampled, and Gilliamella and Apibacter were also detected in the gut of both species with high frequency. The crop bacterial community of X. sonorina comprised nearly entirely Apilactobacillus with occasionally abundant nectar bacteria. Despite sharing core taxa, Xylocopa species' microbiomes were distinguished by multiple bacterial lineages, including species‐specific variants of core taxa. The use of long‐read amplicons revealed otherwise cryptic species and population‐level differentiation in core microbiome members, which was masked when a shorter fragment of the 16S rRNA (V4) was considered. Of the core taxa, Bombilactobacillus and Bombiscardovia exhibited differentiation in amplicon sequence variants among bee populations, but this was lacking in Lactobacillus, suggesting that some bacterial genera in the gut may be structured by different processes. We conclude that these Xylocopa species host a distinctive microbiome, similar to that of previously characterized social corbiculate apids, which suggests that further investigation to understand the evolution of the bee microbiome and its drivers is warranted. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
41. A Global Survey of the Full-Length Transcriptome of Apis mellifera by Single-Molecule Long-Read Sequencing.
- Author
-
Zheng, Shuang-Yan, Pan, Lu-Xia, Cheng, Fu-Ping, Jin, Meng-Jie, and Wang, Zi-Long
- Subjects
- *
ALTERNATIVE RNA splicing , *HONEYBEES , *TRANSCRIPTOMES , *LINCRNA , *POLLINATORS , *INFORMATION needs - Abstract
As important pollinators, honey bees play a crucial role in both maintaining the ecological balance and providing products for humans. Although several versions of the western honey bee genome have already been published, its transcriptome information still needs to be refined. In this study, PacBio single-molecule sequencing technology was used to sequence the full-length transcriptome of mixed samples from many developmental time points and tissues of A. mellifera queens, workers and drones. A total of 116,535 transcripts corresponding to 30,045 genes were obtained. Of these, 92,477 transcripts were annotated. Compared to the annotated genes and transcripts on the reference genome, 18,915 gene loci and 96,176 transcripts were newly identified. From these transcripts, 136,554 alternative splicing (AS) events, 23,376 alternative polyadenylation (APA) sites and 21,813 lncRNAs were detected. In addition, based on the full-length transcripts, we identified many differentially expressed transcripts (DETs) between queen, worker and drone. Our results provide a complete set of reference transcripts for A. mellifera that dramatically expand our understanding of the complexity and diversity of the honey bee transcriptome. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
42. The Complete Mitochondrial Genome of the Chinese White Wax Scale Insect, Ericerus pela Chavannes (Hemiptera: Coccidae), with Novel Gene Arrangement and Truncated tRNA Genes.
- Author
-
An, Jia-Qi, Yu, Shu-Hui, Wei, Shu-Jun, Zhang, Hong-Ping, Shi, Yuan-Chong, Zhao, Qiu-Yu, Fu, Zuo-Yi, and Yang, Pu
- Subjects
- *
MITOCHONDRIAL DNA , *TRANSFER RNA , *SCALE insects , *ZINC oxide , *HEMIPTERA , *GENE rearrangement - Abstract
Simple Summary: Ericerus pela Chavannes (Hemiptera: Coccoidea) is one of the most economically valuable resource insects in China. Its mitochondrial genome provides essential information for the molecular identification and genetic study of this species. In this paper, we assembled the mitochondrial genome of E. pela. Gene rearrangement analysis showed that compared to other scale insects, E. pela's atp6, atp8, and a few tRNAs were obviously rearranged. Moreover, there were nine tRNAs identified to have obvious truncated structures. Synteny analysis based on the whole mitochondrial genomes showed that a significant rearrangement of homologous blocks had occurred within this Coccoidea species. The results revealed great variations in the intergenic regions among Coccoidea species and novel gene arrangement and truncated tRNA genes in the mitochondrial genome of E. pela. The Chinese white wax scale insect, Ericerus pela Chavannes (Hemiptera: Coccidae), is one of the scale insects with great economic value and has been dispersed and reared in China for over one thousand years. Its mitochondrial genome provides essential information for the molecular identification and genetic study of this species. We assembled the complete mitochondrial genome of E. pela based on PacBio sequencing and analyzed its genomic features. The genome was 17,766 bp in length with 13 protein-coding genes, 22 tRNAs, and two rRNA genes. The analysis results showed E. pela had significant gene rearrangements involving tRNAs compared with other Coccoidea species. Furthermore, E. pela's nine tRNAs were identified to have obvious truncated structures. The phylogenetic tree compiled of the species showed a long branch of the Coccoidea lineage, which indicated the high evolutionary rate in this group. Our study revealed the mitochondrial characteristics of E. pela and enriched the mitochondrial genetic information on Coccoidea species. It also determined the occurrence of gene rearrangement for the species in this superfamily. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
43. Long-read-based Genome Assembly of Drosophila gunungcola Reveals Fewer Chemosensory Genes in Flower-breeding Species.
- Author
-
Negi, Ateesha, Liao, Ben-Yang, and Yeh, Shu-Dan
- Subjects
- *
DROSOPHILA , *DROSOPHILA melanogaster , *GENES , *SPECIES , *SHOTGUN sequencing , *HUMAN sexuality , *GENOMES - Abstract
Drosophila gunungcola exhibits reproductive activities on the fresh flowers of several plant species and is an emerging model to study the co-option of morphological and behavioral traits in male courtship display. Here, we report a near-chromosome-level genome assembly that was constructed based on long-read PacBio sequencing data (with ∼66× coverage) and annotated with the assistant from RNA-seq transcriptome data of whole organisms at various developmental stages. A nuclear genome of 189 Mb with 13,950 protein-coding genes and a mitogenome of 17.5 kb were acquired. Few interchromosomal rearrangements were found in the comparisons of synteny with Drosophila elegans , its sister species, and Drosophila melanogaster , suggesting that the gene compositions on each Muller element are evolutionarily conserved. Loss events of several OR and IR genes in D. gunungcola and D. elegans were revealed when orthologous genomic regions were compared across species in the D. melanogaster species group. This high-quality reference genome will facilitate further comparative studies on traits related to the evolution of sexual behavior and diet specialization. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
44. Chromosome‐level assembly of Culex pipiens molestus and improved reference genome of Culex pipiens pallens (Culicidae, Diptera).
- Author
-
Liu, Wenjuan, Cheng, Peng, An, Sha, Zhang, Kexin, Gong, Maoqing, Zhang, Zhong, and Zhang, Ruiling
- Subjects
- *
CULEX pipiens , *DIPTERA , *CHEMOSENSORY proteins , *GENE families , *CHROMOSOMES , *MOSQUITOES - Abstract
Culex pipiens molestus and Culex pipiens pallens are two distinct bioforms in the Culex pipiens complex that are important vectors of several pathogens and are widely distributed around the world. In the current study, we present a high‐quality chromosome‐level genome of Cx. pipiens f. molestus and describe the genetic characteristics of this genome. The assembly genome was 559.749 Mb with contig and scaffold N50 values of 200.952 Mb and 0.370 Mb, and more than 94.78% of the assembled bases were located on 3 chromosomes. A total of 19,399 protein‐coding genes were predicted. Many gene families were expanded in the genome of Cx. pipiens f. molestus, particularly those of the chemosensory protein (CSP) and gustatory receptor (GR) gene families. In addition, utilizing Hi‐C data, we improved the previously assembled draft genome of Cx. pipiens f. pallens, with scaffold N50 of 186.195 Mb and contig N50 of 0.749 Mb, and more than 97.02% of the assembled bases were located on three chromosomes. This reference genome provides a foundation for genome‐based investigations of the unique ecological and evolutionary characteristics of Cx. pipiens f. molestus, and the findings in this study will help to elucidate the mechanisms involved in species divergence in the Culex pipiens complex. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
45. Analysis of Whole-Genome Sequences of Pathogenic Gram-Positive and Gram-Negative Isolates from the Same Hospital Environment to Investigate Common Evolutionary Trends Associated with Horizontal Gene Exchange, Mutations and DNA Methylation Patterning.
- Author
-
Korotetskiy, Ilya S., Shilov, Sergey V., Kuznetsova, Tatyana, Kerimzhanova, Bahkytzhan, Korotetskaya, Nadezhda, Ivanova, Lyudmila, Zubenko, Natalya, Parenova, Raikhan, and Reva, Oleg N.
- Subjects
GRAM-negative bacteria ,DNA methylation ,ANTIBIOTICS ,SEQUENCE analysis ,NOSOCOMIAL infections ,DRUG resistance in bacteria ,MUPIROCIN ,KLEBSIELLA pneumoniae ,STREPTOCOCCUS pneumoniae - Abstract
Hospital-acquired infections are a generally recognized problem for healthcare professionals. Clinical variants of Gram-negative and Gram-positive pathogens are characterized with enhanced antibiotic resistance and virulence due to mutations and the horizontal acquisition of respective genetic determinants. In this study, two Escherichia coli, two Klebsiella pneumoniae, three Pseudomonas aeruginosa, two Staphylococcus aureus, one Staphylococcus epidermidis and one Streptococcus pneumoniae showing broad spectra of antibiotic resistance were isolated from patients suffering from nosocomial infections in a local hospital in Almaty, Kazakhstan. The aim of the study was to compare general and species-specific pathways of the development of virulence and antibiotic resistance through opportunistic pathogens causing hospital-acquired infections. The whole-genome PacBio sequencing of the isolates allowed for the genotyping and identification of antibiotic resistance and virulence genetic determinants located in the chromosomes, plasmids and genomic islands. It was concluded that long-read sequencing is a useful tool for monitoring the epidemiological situation in hospitals. Marker antibiotic resistance mutations common for different microorganisms were identified, which were acquired due to antibiotic-selective pressure in the same clinical environment. The genotyping and identification of strain-specific DNA methylation motifs were found to be promising in estimating the risks associated with hospital infection outbreaks and monitoring the distribution and evolution of nosocomial pathogens. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
46. Bacteria‐induced amino acid metabolism involved in appearance characteristics of high‐temperature Daqu.
- Author
-
Zhang, Yuandi, Shen, Yi, Niu, Jiao, Ding, Fang, Ren, Ying, Chen, Xiaoxue, and Han, Bei‐Zhong
- Subjects
- *
AMINO acid metabolism , *GLUTAMIC acid , *INDOLEACETIC acid , *PANTOTHENIC acid , *BACILLUS (Bacteria) , *RALSTONIA , *TRYPTOPHAN , *AMINO acids - Abstract
BACKGROUND: Significant changes occurd in Daqu bricks on the 15th day of incubation, and brick color (yellow, brown, or dark) is generally used as a standard for quality evaluation by experienced workers. This study aimed to explore the basis behind the phenomenon through multi‐omics studies. The physicochemical properties of different high‐temperature Daqu were compared. Furthermore, PacBio sequencing and the ultra‐high‐performance liquid chromatographic–Q‐exactive–mass spectrometric approach were employed to analyze the differences in the microbiome and metabolome among different Daqu samples. RESULTS: Bacillus was the biomarker of yellow Daqu, Thermoactinomyces and Thermoascus were the key genera in brown Daqu, and Burkholderiales, Sphingomonas, and Ralstonia were biomarkers in dark Daqu. The physicochemical characteristics (especially the color values) of different high‐temperature Daqu showed strong correlations with the bacterial alpha diversity and the relative abundance of dominant bacterial genera. Amino acid metabolism pathways including tryptophan metabolism, β‐alanine metabolism, and arginine biosynthesis were the key factors resulting in the characteristic differences where Bacillus, Burkholderia, Ralstonia, and Sphingomonas were pivotal bacterial genera. The relative abundance of Bacillus had a positive correlation with the content of 3‐hydroxykynurenamine, l‐glutamic acid, and pantothenic acid, while it showed a negative correlation with indoleacetic acid, l‐tryptophan, N‐acetylserotonin, l‐histidine, l‐aspartic acid, phosphatidylserine, 5‐methoxyindoleacetate, and L‐serine. Burkholderia, Ralstonia, and Sphingomonas had the opposite effects. CONCLUSION: Microbes play different roles in amino acid metabolism pathways, producing different metabolites, contributing to the differences in Daqu appearance and quality. © 2022 Society of Chemical Industry. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
47. Chromosome-level genome assembly of a high-altitude-adapted frog (Rana kukunoris) from the Tibetan plateau provides insight into amphibian genome evolution and adaptation.
- Author
-
Chen, Wei, Chen, Hongzhou, Liao, Jiahong, Tang, Min, Qin, Haifen, Zhao, Zhenkun, Liu, Xueyan, Wu, Yanfang, Jiang, Lichun, Zhang, Lixia, Fang, Bohao, Feng, Xueyun, Zhang, Baowei, Reid, Kerry, and Merilä, Juha
- Subjects
RANA temporaria ,AMPHIBIANS ,RANA ,GENOMES ,GENE families ,FROGS ,CELL nuclei - Abstract
Background: The high-altitude-adapted frog Rana kukunoris, occurring on the Tibetan plateau, is an excellent model to study life history evolution and adaptation to harsh high-altitude environments. However, genomic resources for this species are still underdeveloped constraining attempts to investigate the underpinnings of adaptation. Results: The R. kukunoris genome was assembled to a size of 4.83 Gb and the contig N50 was 1.80 Mb. The 6555 contigs were clustered and ordered into 12 pseudo-chromosomes covering ~ 93.07% of the assembled genome. In total, 32,304 genes were functionally annotated. Synteny analysis between the genomes of R. kukunoris and a low latitude species Rana temporaria showed a high degree of chromosome level synteny with one fusion event between chr11 and chr13 forming pseudo-chromosome 11 in R. kukunoris. Characterization of features of the R. kukunoris genome identified that 61.5% consisted of transposable elements and expansions of gene families related to cell nucleus structure and taste sense were identified. Ninety-five single-copy orthologous genes were identified as being under positive selection and had functions associated with the positive regulation of proteins in the catabolic process and negative regulation of developmental growth. These gene family expansions and positively selected genes indicate regions for further interrogation to understand adaptation to high altitude. Conclusions: Here, we reported a high-quality chromosome-level genome assembly of a high-altitude amphibian species using a combination of Illumina, PacBio and Hi-C sequencing technologies. This genome assembly provides a valuable resource for subsequent research on R. kukunoris genomics and amphibian genome evolution in general. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
48. Single-molecule Sequencing of an Animal Mitochondrial Genome Reveals Chloroplast-like Architecture and Repeat-mediated Recombination.
- Author
-
Sharbrough, Joel, Bankers, Laura, Cook, Emily, Fields, Peter D, Jalinsky, Joseph, McElroy, Kyle E, Neiman, Maurine, Logsdon, John M, and Boore, Jeffrey L
- Subjects
MITOCHONDRIAL DNA ,CHLOROPLAST DNA ,INVERTED repeats (Genetics) ,FRESHWATER snails ,SINGLE nucleotide polymorphisms ,PLANT genomes ,MITOCHONDRIA - Abstract
Recent advances in long-read sequencing technology have allowed for single-molecule sequencing of entire mitochondrial genomes, opening the door for direct investigation of the mitochondrial genome architecture and recombination. We used PacBio sequencing to reassemble mitochondrial genomes from two species of New Zealand freshwater snails, Potamopyrgus antipodarum and Potamopyrgus estuarinus. These assemblies revealed a ∼1.7 kb structure within the mitochondrial genomes of both species that was previously undetected by an assembly of short reads and likely corresponding to a large noncoding region commonly present in the mitochondrial genomes. The overall architecture of these Potamopyrgus mitochondrial genomes is reminiscent of the chloroplast genomes of land plants, harboring a large single-copy (LSC) region and a small single-copy (SSC) region separated by a pair of inverted repeats (IRa and IRb). Individual sequencing reads that spanned across the Potamopyrgus IRa-SSC-IRb structure revealed the occurrence of a "flip-flop" recombination. We also detected evidence for two distinct IR haplotypes and recombination between them in wild-caught P. estuarinus , as well as extensive intermolecular recombination between single-nucleotide polymorphisms in the LSC region. The chloroplast-like architecture and repeat-mediated mitochondrial recombination we describe here raise fundamental questions regarding the origins and commonness of inverted repeats in cytoplasmic genomes and their role in mitochondrial genome evolution. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
49. Gene identification and tissue expression analysis inform the floral organization and color in the basal angiosperm Magnolia polytepala (Magnoliaceae)
- Author
-
Sun, Liyong, Nie, Tangjie, Chen, Yao, Li, Jia, Yang, AiXiang, and Yin, Zengfang
- Abstract
Main conclusion: In Magnolia polytepala, the formation of floral organization and color was attributed to tissue-dependent differential expression levels of MADS-box genes and anthocyanin biosynthetic genes. In angiosperms, the diversity of floral morphology and organization suggests its value in exploring plant evolution. Magnolia polytepala, an endemic basal angiosperm species in China, possesses three green sepal-like tepals in the outermost whorl and pink petal-like tepals in the inner three whorls, forming unique floral morphology and organization. However, we know little about its underlying molecular regulatory mechanism. Here, we first reported the full-length transcriptome of M. polytepala using PacBio sequencing. A total of 16 MADS-box transcripts were obtained from the transcriptome data, including floral homeotic genes (e.g., MpAPETALA3) and other non-floral homeotic genes (MpAGL6, etc.). Phylogenetic analysis and spatial expression pattern reflected their putative biological function as their homologues in Arabidopsis. In addition, nine structural genes involved in anthocyanin biosynthesis pathway had been screened out, and tepal color difference was significantly associated with their tissue-dependent differential expression levels. This study provides a relatively comprehensive investigation of the MADS-box family and anthocyanin biosynthetic genes in M. polytepala, and will facilitate our understanding of the regulatory mechanism underlying floral organization and color in basal angiosperms. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
50. DNA barcoding, an effective tool for species identification: a review.
- Author
-
Antil, Sandeep, Abraham, Jeeva Susan, Sripoorna, S., Maurya, Swati, Dagar, Jyoti, Makhija, Seema, Bhagat, Pooja, Gupta, Renu, Sood, Utkarsh, Lal, Rup, and Toteja, Ravi
- Abstract
DNA barcoding is a powerful taxonomic tool to identify and discover species. DNA barcoding utilizes one or more standardized short DNA regions for taxon identification. With the emergence of new sequencing techniques, such as Next-generation sequencing (NGS), ONT MinION nanopore sequencing, and Pac Bio sequencing, DNA barcoding has become more accurate, fast, and reliable. Rapid species identification by DNA barcodes has been used in a variety of fields, including forensic science, control of the food supply chain, and disease understanding. The Consortium for Barcode of Life (CBOL) presents various working groups to identify the universal barcode gene, such as COI in metazoans; rbcL, matK, and ITS in plants; ITS in fungi; 16S rRNA gene in bacteria and archaea, and creating a reference DNA barcode library. In this article, an attempt has been made to analyze the various proposed DNA barcode for different organisms, strengths & limitations, recent advancements in DNA barcoding, and methods to speed up the DNA barcode reference library construction. This study concludes that constructing a reference library with high species coverage would be a major step toward identifying species by DNA barcodes. This can be achieved in a short period of time by using advanced sequencing and data analysis methods. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.