Protein domain analysis of genomic sequence data reveals regulation of LRR related domains in plant transpiration in Ficus.

Authors :
Tiange Lang
Kangquan Yin
Jinyu Liu
Kunfang Cao
Charles H Cannon
Fang K Du
Source :
PLoS ONE, Vol 9, Iss 9, p e108719 (2014)
Publication Year :
2014
Publisher :
Public Library of Science (PLoS), 2014.

Abstract

Predicting protein domains is essential for understanding a protein's function at the molecular level. Until now, however, there has been no direct, straightforward method for predicting protein domains in species that lack a reference genome sequence. In this study, we developed a set of programs (referred to here as a functionality) that predicts protein domains directly from genomic sequence data without a reference genome. Starting from whole genome sequence data, the functionality comprises three main steps: DNA assembly, which combines next-generation sequencing (NGS) assembly methods with traditional methods; peptide prediction; and protein domain prediction. It avoids problems associated with de novo assembly that arise from micro reads and small single repeats. We applied the functionality to predict leucine-rich repeat (LRR) domains in four species of Ficus with no reference genome, based on NGS genomic data. We found that the LRRNT_2 and LRR_8 domains are related to plant transpiration efficiency, as indicated by the stomata index, in the four Ficus species. The functionality established in this study offers new insights into protein domain prediction, which is particularly timely in the current age of NGS data expansion.

Subjects

Subjects :
Medicine
Science

Details

Language :
English
ISSN :
1932-6203
Volume :
9
Issue :
9
Database :
Directory of Open Access Journals
Journal :
PLoS ONE
Publication Type :
Academic Journal
Accession number :
edsdoj.304137a657f46acb5714719873a1b62
Document Type :
article
Full Text :
https://doi.org/10.1371/journal.pone.0108719