33 results on '"Shin-Han Shiu"'
Search Results
2. Putative cis-Regulatory Elements Predict Iron Deficiency Responses in Arabidopsis Roots.
- Author
-
Schwarz, Birte, Azodi, Christina B., Shin-Han Shiu, and Bauer, Petra
- Published
- 2020
- Full Text
- View/download PDF
3. Within- and cross-species predictions of plant specialized metabolism genes using transfer learning.
- Author
-
Moore, Bethany M., Peipei Wang, Pengxiang Fan, Aaron Lee, Bryan Leong, Yann-Ru Lou, Schenck, Craig A., Koichi Sugimoto, Last, Robert, Lehti-Shiu, Melissa D., Barry, Cornelius S., and Shin-Han Shiu
- Published
- 2020
- Full Text
- View/download PDF
4. Transcriptome-Based Prediction of Complex Traits in Maize[OPEN].
- Author
-
Azodi, Christina B., Pardo, Jeremy, VanBuren, Robert, de los Campos, Gustavo, and 2, Shin-Han Shiu
- Published
- 2020
- Full Text
- View/download PDF
5. Cis-Regulatory Code for Predicting Plant Cell-Type Transcriptional Response to High Salinity.
- Author
-
Uygun, Sahra, Azodi, Christina B., and Shin-Han Shiu
- Published
- 2019
- Full Text
- View/download PDF
6. Recovery from N Deprivation Is a Transcriptionally and Functionally Distinct State in Chlamydomonas.
- Author
-
Chia-Hong Tsai, Uygun, Sahra, Roston, Rebecca, Shin-Han Shiu, and Benning, Christoph
- Published
- 2018
- Full Text
- View/download PDF
7. Defining Functional Genic Regions in the Human Genome through Integration of Biochemical, Evolutionary, and Genetic Evidence.
- Author
-
Zing Tsung-Yeh Tsai, Lloyd, John P., and Shin-Han Shiu
- Abstract
The human genome is dominated by large tracts of DNA with extensive biochemical activity but no known function. In particular, it is well established that transcriptional activities are not restricted to known genes. However, whether this intergenic transcription represents activity with functional significance or noise is under debate, highlighting the need for an effective method of defining functional genomic regions. Moreover, these discoveries raise the question whether genomic regions can be defined as functional based solely on the presence of biochemical activities, without considering evolutionary (conservation) and genetic (effects of mutations) evidence. Here, computational models integrating genetic, evolutionary, and biochemical evidence are established that provide reliable predictions of human protein-coding and RNA genes. Importantly, in addition to sequence conservation, biochemical features allow accurate predictions of genic sequences with phenotypic evidence under strong purifying selection, suggesting that they can be used as an alternative measure of selection. Moreover, 18.5% of annotated noncoding RNAs exhibit higher degrees of similarity to phenotype genes and, thus, are likely functional. However, 64.5% of noncoding RNAs appear to belong to a sequence class of their own, and the remaining 17% are more similar to pseudogenes and random intergenic sequences that may represent noisy transcription. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
8. Predictive Models of Spatial Transcriptional Response to High Salinity.
- Author
-
Uygun, Sahra, Seddon, Alexander E., Azodi, Christina B., and Shin-Han Shiu
- Published
- 2017
- Full Text
- View/download PDF
9. Evolution of Gene Duplication in Plants.
- Author
-
Panchy, Nicholas, Lehti-Shiu, Melissa, and Shin-Han Shiu
- Published
- 2016
- Full Text
- View/download PDF
10. A novel method for identifying polymorphic transposable elements via scanning of high-throughput short reads.
- Author
-
Houxiang Kang, Dan Zhu, Runmao Lin, Opiyo, Stephen Obol, Ning Jiang, Shin-Han Shiu, and Guo-Liang Wang
- Abstract
Identification of polymorphic transposable elements (TEs) is important because TE polymorphism creates genetic diversity and influences the function of genes in the host genome. However, de novo scanning of polymorphic TEs remains a challenge. Here, we report a novel computational method, called PTEMD (polymorphic TEs and their movement detection), for de novo discovery of genome-wide polymorphic TEs. PTEMD searches highly identical sequences using reads supported breakpoint evidences. Using PTEMD, we identified 14 polymorphic TE families (905 sequences) in rice blast fungus Magnaporthe oryzae, and 68 (10,618 sequences) in maize. We validated one polymorphic TE family experimentally, MoTE-1; all MoTE-1 family members are located in different genomic loci in the three tested isolates. We found that 57.1% (8 of 14) of the PTEMD-detected polymorphic TE families in M. oryzae are active. Furthermore, our data indicate that there are more polymorphic DNA transposons in maize than their counterparts of retrotransposons despite the fact that retrotransposons occupy largest fraction of genomic mass. We demonstrated that PTEMD is an effective tool for identifying polymorphic TEs in M. oryzae and maize genomes. PTEMD and the genome-wide polymorphic TEs in M. oryzae and maize are publically available at http://www.kanglab.cn/blast/PTEMD_V1.02.htm. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
11. A G-Box-Like Motif Is Necessary for Transcriptional Regulation by Circadian Pseudo-Response Regulators in Arabidopsis.
- Author
-
Liu, Tiffany L., Newton, Linsey, Ming-Jung Liu, Shin-Han Shiu, and Farré, Eva M.
- Abstract
PSEUDO-RESPONSE REGULATORs (PRRs) play overlapping and distinct roles in maintaining circadian rhythms and regulating diverse biological processes, including the photoperiodic control of flowering, growth, and abiotic stress responses. PRRs act as transcriptional repressors and associate with chromatin via their conserved C-terminal CCT (CONSTANS, CONSTANS-like, and TIMING OF CAB EXPRESSION 1 [TOC1/PRR1]) domains by a still-poorly understood mechanism. Here, we identified genome-wide targets of PRR9 using chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) and compared them with PRR7, PRR5, and TOC1/PRR1 ChIP-seq data. We found that PRR binding sites are located within genomic regions of low nucleosome occupancy and high DNase I hypersensitivity. Moreover, conserved noncoding regions among Brassicaceae species are enriched around PRR binding sites, indicating that PRRs associate with functionally relevant cis-regulatory regions. The PRRs shared a significant number of binding regions, and our results indicate that they coordinately restrict the expression of target genes to around dawn. A G-box-like motif was overrepresented at PRR binding regions, and we showed that this motif is necessary for mediating transcriptional regulation of CIRCADIAN CLOCK ASSOCIATED 1 and PRR9 by the PRRs. Our results further our understanding of how PRRs target specific promoters and provide an extensive resource for studying circadian regulatory networks in plants. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
12. The Impact of the Branched-Chain Ketoacid Dehydrogenase Complex on Amino Acid Homeostasis in Arabidopsis.
- Author
-
Cheng Peng, Uygun, Sahra, Shin-Han Shiu, and Last, Robert L.
- Subjects
KETONIC acids ,DEHYDROGENASES ,AMINO acids ,HOMEOSTASIS ,ARABIDOPSIS - Abstract
The branched-chain amino acids (BCAAs) Leu, Ile, and Val are among nine essential amino acids that must be obtained from the diet of humans and other animals, and can be nutritionally limiting in plant foods. Despite genetic evidence of its importance in regulating seed amino acid levels, the full BCAA catabolic network is not completely understood in plants, and limited information is available regarding its regulation. In this study, transcript coexpression analyses revealed positive correlations among BCAA catabolism genes in stress, development, diurnal/circadian, and light data sets. A core subset of BCAA catabolism genes, including those encoding putative branched-chain ketoacid dehydrogenase subunits, is highly expressed during the night in plants on a diel cycle and in prolonged darkness. Mutants defective in these subunits accumulate higher levels of BCAAs in mature seeds, providing genetic evidence for their function in BCAA catabolism. In addition, prolonged dark treatment caused the mutants to undergo senescence early and overaccumulate leaf BCAAs. These results extend the previous evidence that BCAAs can be catabolized and serve as respiratory substrates at multiple steps. Moreover, comparison of amino acid profiles between mature seeds and darktreated leaves revealed differences in amino acid accumulation when BCAA catabolism is perturbed. Together, these results demonstrate the consequences of blocking BCAA catabolism during both normal growth conditions and under energy-limited conditions. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
13. Molecular Evidence for Functional Divergence and Decay of a Transcription Factor Derived from Whole-Genome Duplication in Arabidopsis thaliana.
- Author
-
Lehti-Shiu, Melissa D., Uygun, Sahra, Moghe, Gaurav D., Panchy, Nicholas, Liang Fang, Hufnagel, David E., Jasicki, Hannah L., Feig, Michael, and Shin-Han Shiu
- Subjects
TRANSCRIPTION factors ,ARABIDOPSIS thaliana ,CHROMOSOME duplication ,GENE expression in plants ,DNA-binding proteins - Abstract
Functional divergence between duplicate transcription factors (TFs) has been linked to critical events in the evolution of land plants and can result from changes in patterns of expression, binding site divergence, and/or interactions with other proteins. Although plant TFs tend to be retained post polyploidization, many are lost within tens to hundreds of million years. Thus, it can be hypothesized that some TFs in plant genomes are in the process of becoming pseudogenes. Here, we use a pair of salt tolerance-conferring transcription factors, DWARF AND DELAYED FLOWERING1 (DDF1) and DDF2, that duplicated through paleopolyploidy 50 to 65 million years ago, as examples to illustrate potential mechanisms leading to duplicate retention and loss. We found that the expression patterns of Arabidopsis thaliana (At)DDF1 and AtDDF2 have diverged in a highly asymmetric manner, and AtDDF2 has lost most inferred ancestral stress responses. Consistent with promoter disablement, the AtDDF2 promoter has fewer predicted cis-elements and a methylated repetitive element. Through comparisons of AtDDF1, AtDDF2, and their Arabidopsis lyrata orthologs, we identified significant differences in binding affinities and binding site preference. In particular, an AtDDF2-specific substitution within the DNA-binding domain significantly reduces binding affinity. Crossspecies analyses indicate that both AtDDF1 and AtDDF2 are under selective constraint, but among A. thaliana accessions, AtDDF2 has a higher level of nonsynonymous nucleotide diversity compared with AtDDF1. This may be the result of selection in different environments or may point toward the possibility of ongoing functional decay despite retention for millions of years after gene duplication. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
14. Automated Update, Revision, and Quality Control of the Maize Genome Annotations Using MAKER-P Improves the B73 RefGen_v3 Gene Models and Identifies New Genes[OPEN].
- Author
-
MeiYee Law, Childs, Kevin L., Campbell, Michael S., Stein, Joshua C., Olson, Andrew J., Holt, Carson, Panchy, Nicholas, Jikai Lei, Dian Jiao, Andorf, Carson M., Lawrence, Carolyn J., Ware, Doreen, Shin-Han Shiu, Yanni Sun, Ning Jiang, and Yandell, Mark
- Subjects
PLANT genes ,PLANT genomes ,CORN research ,GENETIC research ,PLANT genetics ,PLANT cells & tissues - Abstract
The large size and relative complexity of many plant genomes make creation, quality control, and dissemination of high-quality gene structure annotations challenging. In response, we have developed MAKER-P, a fast and easy-to-use genome annotation engine for plants. Here, we report the use of MAKER-P to update and revise the maize (Zea mays) B73 RefGen_v3 annotation build (5b+) in less than 3 h using the iPlant Cyberinfrastructure. MAKER-P identified and annotated 4,466 additional, well-supported protein-coding genes not present in the 5b+ annotation build, added additional untranslated regions to 1,393 5b+ gene models, identified 2,647 5b+ gene models that lack any supporting evidence (despite the use of large and diverse evidence data sets), identified 104,215 pseudogene fragments, and created an additional 2,522 noncoding gene annotations. We also describe a method for de novo training of MAKER-P for the annotation of newly sequenced grass genomes. Collectively, these results lead to the 6a maize genome annotation and demonstrate the utility of MAKER-P for rapid annotation, management, and quality control of grasses and other difficult-to-annotate plant genomes. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
15. MAKER-P: A Tool Kit for the Rapid Creation, Management, and Quality Control of Plant Genome Annotations.
- Author
-
Campbell, Michael S., Law, MeiYee, Holt, Carson, Stein, Joshua C., Moghe, Gaurav D., Hufnagel, David E., Jikai Lei, Achawanantakun, Rujira, Jiao, Dian, Lawrence, Carolyn J., Ware, Doreen, Shin-Han Shiu, Childs, Kevin L., Yanni Sun, Ning Jiang, and Yandell, Mark
- Subjects
PLANT genomes ,ARABIDOPSIS ,QUALITY standards ,PROCESS control systems ,QUALITY circles - Abstract
We have optimized and extended the widely used annotation engine MAKER in order to better support plant genome annotation efforts. New features include better parallelization for large repeat-rich plant genomes, noncoding RNA annotation capabilities, and support for pseudogene identification. We have benchmarked the resulting software tool kit, MAKER-P, using the Arabidopsis (Arabidopsis thaliana) and maize (Zea mays) genomes. Here, we demonstrate the ability of the MAKER-P tool kit to automatically update, extend, and revise the Arabidopsis annotations in light of newly available data and to annotate pseudogenes and noncoding RNAs absent from The Arabidopsis Informatics Resource 10 build. Our results demonstrate that MAKER-P can be used to manage and improve the annotations of even Arabidopsis, perhaps the best-annotated plant genome. We have also installed and benchmarked MAKER-P on the Texas Advanced Computing Center. We show that this public resource can de novo annotate the entire Arabidopsis and maize genomes in less than 3 h and produce annotations of comparable quality to those of the current The Arabidopsis Information Resource 10 and maize V2 annotation builds. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
16. Characteristics and Significance of Intergenic Polyadenylated RNA Transcription in Arabidopsis.
- Author
-
Moghe, Gaurav D., Lehti-Shiu, Melissa D., Seddon, Alex E., Shan Yin, Chen, Yani, Juntawong, Piyada, Brandizzi, Federica, Bailey-Serres, Julia, and Shin-Han Shiu
- Subjects
ARABIDOPSIS thaliana ,PLANT genomes ,GENETIC research ,PLANT genetics ,CATALYSIS research ,PROTEIN research - Abstract
The Arabidopsis (Arabidopsis thaliana) genome is the most well-annotated plant genome. However, transcriptome sequencing in Arabidopsis continues to suggest the presence of polyadenylated (polyA) transcripts originating from presumed intergenic regions. It is not clear whether these transcripts represent novel noncoding or protein-coding genes. To understand the nature of intergenic polyA transcription, we first assessed its abundance using multiple messenger RNA sequencing data sets. We found 6,545 intergenic transcribed fragments (ITFs) occupying 3.6% of Arabidopsis intergenic space. In contrast to transcribed fragments that map to protein-coding and RNA genes, most ITFs are significantly shorter, are expressed at significantly lower levels, and tend to be more data set specific. A surprisingly large number of ITFs (32.1%) may be protein coding based on evidence of translation. However, our results indicate that these "translated" ITFs tend to be close to and are likely associated with known genes. To investigate if ITFs are under selection and are functional, we assessed ITF conservation through cross-species as well as within-species comparisons. Our analysis reveals that 237 ITFs, including 49 with translation evidence, are under strong selective constraint and relatively distant from annotated features. These ITFs are likely parts of novel genes. However, the selective pressure imposed on most ITFs is similar to that of randomly selected, untranscribed intergenic sequences. Our findings indicate that despite the prevalence of ITFs, apart from the possibility of genomic contamination, many may be background or noisy transcripts derived from "junk" DNA, whose production may be inherent to the process of transcription and which, on rare occasions, may act as catalysts for the creation of novel genes [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
17. Characterization of Genes Involved in Cytokinin Signaling and Metabolism from Rice.
- Author
-
Yu-Chang Tsai, Weir, Nicholas R., Hill, Kristine, Wenjing Zhang, Hyo Jung Kim, Shin-Han Shiu, Schaller, G. Eric, and Kieber, Joseph J.
- Subjects
CYTOKININS ,RICE ,ORYZA ,HISTIDINE kinases ,ARABIDOPSIS thaliana - Abstract
Two-component signaling elements play important roles in plants, including a central role in cytokinin signaling. We characterized two-component elements from the monocot rice (Oryza sativa) using several complementary approaches. Phylogenetic analysis reveals relatively simple orthologous relationships among the histidine kinases in rice and Arabidopsis (Arabidopsis thaliana). In contrast, the histidine-containing phosphotransfer proteins (OsHPs) and response regulators (OsRRs) display a higher degree of lineage-specific expansion. The intracellular localizations of several OsHPs and OsRRs were examined in rice and generally found to correspond to the localizations of their dicot counterparts. The functionality of rice type-B OsRRs was tested in Arabidopsis; one from a clade composed of both monocot and dicot type-B OsRRs complemented an Arabidopsis type-B response regulator mutant, but a type-B OsRR from a monocot-specific subfamily generally did not. The expression of genes encoding two-component elements and proteins involved in cytokinin biosynthesis and degradation was analyzed in rice roots and shoots and in response to phytohormones. Nearly all type-A OsRRs and OsHK4 were up-regulated in response to cytokinin, but other cytokinin signaling elements were not appreciably affected. Furthermore, multiple cytokinin oxidase (OsCKX) genes were up-regulated by cytokinin. Abscisic acid treatment decreased the expression of several genes involved in cytokinin biosynthesis and degradation. Auxin affected the expression of a few genes; brassinosteroid and gibberellin had only modest effects. Our results support a shared role for two-component elements in mediating cytokinin signaling in monocots and dicots and reveal how phytohormones can impact cytokinin function through modulating gene expression. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
18. A comparison of the low temperature transcriptomes and CBF regulons of three plant species that differ in freezing tolerance: Solanum commersonii, Solanum tuberosum, and Arabidopsis thaliana.
- Author
-
Carvallo, Marcela A., Pino, María-Teresa, Jeknić, Zoran, Cheng Zou, Doherty, Colleen J., Shin-Han Shiu, Chen, Tony H. H., and Thomashow, Michael F.
- Abstract
Solanum commersonii and Solanum tuberosum are closely related plant species that differ in their abilities to cold acclimate; whereas S. commersonii increases in freezing tolerance in response to low temperature, S. tuberosum does not. In Arabidopsis thaliana, cold-regulated genes have been shown to contribute to freezing tolerance, including those that comprise the CBF regulon, genes that are controlled by the CBF transcription factors. The low temperature transcriptomes and CBF regulons of S. commersonii and S. tuberosum were therefore compared to determine whether there might be differences that contribute to their differences in ability to cold acclimate. The results indicated that both plants alter gene expression in response to low temperature to similar degrees with similar kinetics and that both plants have CBF regulons composed of hundreds of genes. However, there were considerable differences in the sets of genes that comprised the low temperature transcriptomes and CBF regulons of the two species. Thus differences in cold regulatory programmes may contribute to the differences in freezing tolerance of these two species. However, 53 groups of putative orthologous genes that are cold-regulated in S. commersonii, S. tuberosum, and A. thaliana were identified. Given that the evolutionary distance between the two Solanum species and A. thaliana is 112–156 million years, it seems likely that these conserved cold-regulated genes--many of which encode transcription factors and proteins of unknown function--have fundamental roles in plant growth and development at low temperature. [ABSTRACT FROM AUTHOR]
- Published
- 2011
- Full Text
- View/download PDF
19. Changes in Transcript Abundance in Chlamydomonas reinhardtii following Nitrogen Deprivation Predict Diversion of Metabolism.
- Author
-
Miller, Rachel, Guangxi Wu, Deshpande, Rahul R., Vieler, Astrid, Gärtner, Katrin, Xiaobo Li, Moellering, Eric R., Zäuner, Simone, Cornish, Adam J., Bensheng Liu, Bullard, Blair, Sears, Barbara B., Min-Hao Kuo, Hegg, Eric L., Shachar-Hill, Yair, Shin-Han Shiu, and Benning, Christoph
- Subjects
GENETIC transcription ,NITROGEN ,METABOLISM ,CHLAMYDOMONAS reinhardtii ,MICROALGAE - Abstract
Like many microalgae, Chlamydomonas reinhardtii forms lipid droplets rich in triacylglycerols when nutrient deprived. To begin studying the mechanisms underlying this process, nitrogen (N) deprivation was used to induce triacylglycerol accumulation and changes in developmental programs such as gametogenesis. Comparative global analysis of transcripts under induced and noninduced conditions was applied as a first approach to studying molecular changes that promote or accompany triacylglycerol accumulation in cells encountering a new nutrient environment. Towards this goal, high-throughput sequencing technology was employed to generate large numbers of expressed sequence tags of eight biologically independent libraries, four for each condition, N replete and N deprived, allowing a statistically sound comparison of expression levels under the two tested conditions. As expected, N deprivation activated a subset of control genes involved in gametogenesis while downregulating protein biosynthesis. Genes for components of photosynthesis were also down-regulated, with the exception of the PSBS gene. N deprivation led to a marked redirection of metabolism: the primary carbon source, acetate, was no longer converted to cell building blocks by the glyoxylate cycle and gluconeogenesis but funneled directly into fatty acid biosynthesis. Additional fatty acids may be produced by membrane remodeling, a process that is suggested by the changes observed in transcript abundance of putative lipase genes. Inferences on metabolism based on transcriptional analysis are indirect, but biochemical experiments supported some of these deductions. The data provided here represent a rich source for the exploration of the mechanism of oil accumulation in microalgae. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
20. Evolutionary History and Stress Regulation of Plant Receptor-Like Kinase/Pelle Genes.
- Author
-
Lehti-Shiu, Melissa D., Cheng Zou, Hanada, Kousuke, and Shin-Han Shiu
- Subjects
PLANT regulators ,PLANT physiology ,ARABIDOPSIS ,ARABIDOPSIS thaliana ,PLANT growth - Abstract
Receptor-Like Kinase (RLK)/Pelle genes play roles ranging from growth regulation to defense response, and the dramatic expansion of this family has been postulated to be crucial for plant-specific adaptations. Despite this, little is known about the history of or the factors that contributed to the dramatic expansion of this gene family. In this study, we show that expansion coincided with the establishment of land plants and that RLK/Pelle subfamilies were established early in land plant evolution. The RLK/Pelle family expanded at a significantly higher rate than other kinases, due in large part to expansion of a few subfamilies by tandem duplication. Interestingly, these subfamilies tend to have members with known roles in defense response, suggesting that their rapid expansion was likely a consequence of adaptation to fast-evolving pathogens. Arabidopsis (Arabidopsis thaliana) expression data support the importance of RLK/Pelles in biotic stress response. We found that hundreds of RLK/Pelles are up-regulated by biotic stress. Furthermore, stress responsiveness is correlated with the degree of tandem duplication in RLK/Pelle subfamilies. Our findings suggest a link between stress response and tandem duplication and provide an explanation for why a large proportion of the RLK/Pelle gene family is found in tandem repeats. En addition, our findings provide a useful framework for potentially predicting RLK/Pelle stress functions based on knowledge of expansion pattern and duplication mechanism. Finally, we propose that the detection of highly variable molecular patterns associated with specific pathogens/parasites is the main reason for the up-regulation of hundreds of RLK/Pelles under biotic stress. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
21. The Functional Role of Pack-MULEs in Rice Inferred from Purifying Selection and Expression Profile.
- Author
-
Hanada, Kousuke, Vallejo, Veronica, Nobuta, Kan, Slotkin, R. Keith, Lisch, Damon, Meyers, Blake C., Shin-Han Shiu, and Ning Jiang
- Subjects
RICE ,GENE expression in plants ,GENES ,PLANT genetics ,RNA ,GENETIC code - Abstract
Gene duplication is an important mechanism for evolution of new genes. In plants, a special group of transposable elements, called Pack-MULEs or transduplicates, is able to duplicate and amplify genes or gene fragments on a large scale. Despite the abundance of Pack-MULEs, the functionality of these duplicates is not clear. Here, we present a comprehensive analysis of expression and purifying selection on 2809 Pack-MULEs in rice (Oryza sativa), which are derived from 1501 parental genes. At least 22% of the Pack-MULEs ,are transcribed, and 28 Pack-MULEs have direct evidence of translation. Chimeric Pack-MULEs, which contain gene fragments from multiple genes, are much more frequently expressed than those derived only from a single gene. In addition, Pack-MULEs are frequently associated with small RNAs. The presence of these small RNAs is associated with a reduction in expression of both the Pack-MULEs and their parental genes. Furthermore, an assessment of the selection pressure on the Pack-MULEs using the ratio of nonsynonymous (Ka) and synonymous (Ks) substitution rates indicates that a considerable number of Pack-MULEs likely have been under selective constraint. The Ka/Ks values of Pack-MULE and parental gene pairs are lower among Pack-MULEs that are expressed in sense orientations. Taken together, our analysis suggests that a significant number of Pack-MULEs are expressed and subjected to purifying selection, and some are associated with small RNAs. Therefore, at least a subset of Pack-MULEs are likely/functional and have great potential in regulating gene expression as well as providing novel coding capacities. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
22. Importance of Lineage-Specific Expansion of Plant Tandem Duplicates in the Adaptive Response to Environmental Stimuli.
- Author
-
Hanada, Kousuke, Cheng Zou, Lehti-Shiu, Melissa D., Shinozaki, Kazuo, and Shin-Han Shiu
- Subjects
PLANT genetics ,GENE expression in plants ,GENOMES ,PLANT physiology ,ARABIDOPSIS - Abstract
Plants have substantially higher gene duplication rates compared with most other eukaryotes. These plant gene duplicates are mostly derived from whole genome and/or tandem duplications. Earlier studies have shown that a large number of duplicate genes are retained over a long evolutionary time, and there is a clear functional bias in retention. However, the influence of duplication mechanism, particularly tandem duplication, on duplicate retention has not been thoroughly investigated. We have defined orthologous groups (OGs) between Arabidopsis (Arabidopsis thaliana) and three other land plants to examine the functional bias of retained duplicate genes during vascular plant evolution. Based on analysis of Gene Ontology categories, it is clear that genes in OGs that expanded via tandem duplication tend to be involved in responses to environmental stimuli, while those that expanded via nontandem mechanisms tend to have intracellular regulatory roles. Using Arabidopsis stress expression data, we further demonstrated that tandem duplicates in expanded OGs are significantly enriched in genes that are up-regulated by biotic stress conditions. In addition, tandem duplication of genes in an OG tends to be highly asymmetric. That is, expansion of OGs with tandem genes in one organismal lineage tends to be coupled with losses in the other. This is consistent with the notion that these tandem genes have experienced lineage-specific selection. In contrast, OGs with genes duplicated via nontandem mechanisms tend to experience convergent expansion, in which similar numbers of genes are gained in parallel. Our study demonstrates that the expansion of gene families and the retention of duplicates in plants exhibit substantial functional biases that are strongly influenced by the mechanism of duplication. In particular, genes involved in stress responses have an elevated probability of retention in a single-lineage fashion following tandem duplication, suggesting that these tandem duplicates are likely important for adaptive evolution to rapidly changing environments. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
23. Large-Scale, Lineage-Specific Expansion of a Bric-a-Brac/Tramtrack/Broad Complex Ubiquitin-Ligase Gene Family in Rice.
- Author
-
Gingerich, Derek J., Hanada, Kousuke, Shin-Han Shiu, and Vierstra, Richard D.
- Subjects
UBIQUITIN ,AMINO acid sequence ,LIGASES ,PLANTS ,MONOCOTYLEDONS ,DICOTYLEDONS ,RICE - Abstract
Selective ubiquitination of proteins is directed by diverse families of ubiquitin-protein ligases (or E3s) in plants. One important type uses Cullin-3 as a scaffold to assemble multisubunit E3 complexes containing one of a multitude of bric-a-brac/tramtrack/ broad complex (BTB) proteins that function as substrate recognition factors. We previously described the 80-member BTB gene superfamily in Arabidopsis thaliana. Here, we describe the complete BTB superfamily in rice (Oryza sativa sppjaponica cv Nipponbare) that contains 149 BTB domain--encoding genes and 43 putative pseudogenes. Amino acid sequence comparisons of the rice and Arabidopsis superfamilies revealed a near equal repertoire of putative substrate recognition module types. However, phylogenetic comparisons detected numerous gene duplication and/or loss events since the rice and Arabidopsis BTB lineages split, suggesting possible functional specialization within individual BTB families. In particular, a major expansion and diversification of a subset of BTB proteins containing Meprin and TRAF homology (MATH) substrate recognition sites was evident in rice and other monocots that likely occurred following the monocot/dicot split. The MATH domain of a subset appears to have evolved significantly faster than those in a smaller core subset that predates flowering plants, suggesting that the substrate recognition module in many monocot MATH-BTB E3s are diversifying to ubiquitinate a set of substrates that are themselves rapidly changing. Intriguing possibilities include pathogen proteins attempting to avoid inactivation by the monocot host. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
24. Identification of Important Regions for Ethylene Binding and Signaling in the Transmembrane Domain of the ETR1 Ethylene Receptor of Arabidopsis.
- Author
-
Wuyi Wang, Esch, Jeff J., Shin-Han Shiu, Agula, Hasi, Binder, Brad M., Chang, Caren, Patterson, Sara E., and Bleecker, Anthony B.
- Subjects
BINDING sites ,ETHYLENE ,ARABIDOPSIS thaliana ,CELL receptors ,AMINO acids ,GENETIC mutation - Abstract
The ethylene binding domain (EBD) of the Arabidopsis thaliana ETR1 receptor is modeled as three membrane-spanning helices. We surveyed ethylene binding activity in different kingdoms and performed a bioinformatic analysis of the EBD. Ethylene binding is confined to land plants, Chara, and a group of cyanobacteria but is largely absent in other organisms, consistent with our finding that EBD-like sequences are overrepresented among plant and cyanobacterial species. We made amino acid substitutions in 37 partially or completely conserved residues of the EBD and assayed their effects on ethylene binding and signaling. Mutations primarily in residues in Helices I and II midregions eliminated ethylene binding and conferred constitutive signaling, consistent with the inverse-agonist model of ethylene receptor signaling and indicating that these residues define the ethylene binding pocket. The largest class of mutations, clustered near the cytoplasmic ends of Helices I and III, gave normal ethylene binding activity yet still conferred constitutive signaling. Therefore, these residues may play a role in turning off the signal transmitter domain of the receptor. By contrast, only two mutations were loss of function with respect to signaling. These findings yield insight into the structure and function of the EBD and suggest a conserved role of the EBD as a negative regulator of the signal transmitter domain. [ABSTRACT FROM AUTHOR]
- Published
- 2006
- Full Text
- View/download PDF
25. Transcription Factor Families Have Much Higher Expansion Rates in Plants than in Animals.
- Author
-
Shin-Han Shiu, Ming-Che Shih, and Wen-Hsiung Li
- Subjects
- *
TRANSCRIPTION factors , *GENE expression , *GENOMES , *ARABIDOPSIS thaliana , *ARABIDOPSIS , *NUCLEOTIDE sequence - Abstract
Transcription factors (TFs), which are central to the regulation of gene expression, are usually members of multigene families. In plants, they are involved in diverse processes such as developmental control and elicitation of defense and stress responses. To investigate if differences exist in the expansion patterns of TF gene families between plants and other eukaryotes, we first used Arabidopsis (Arabidopsis thaliana) TFs to identify TF DNA-binding domains. These DNA-binding domains were then used to identify related sequences in 25 other eukaryotic genomes. Interestingly, among 19 families that are shared between animals and plants, more than 14 are larger in plants than in animals. After examining the lineage-specific expansion of TF families in two plants, eight animals, and two fungi, we found that TF families shared among these organisms have undergone much more dramatic expansion in plants than in other eukaryotes. Moreover, this elevated expansion rate of plant TF is not simply due to higher duplication rates of plant genomes but also to a higher degree of expansion compared to other plant genes. Further, in many Arabidopsis-rice (Oryza sativa) TF orthologous groups, the degree of lineage-specific expansion in Arabidopsis is correlated with that in rice. This pattern of parallel expansion is much more pronounced than the whole-genome trend in rice and Arabidopsis. The high rate of expansion among plant TF genes and their propensity for parallel expansion suggest frequent adaptive responses to selection pressure common among higher plants. [ABSTRACT FROM AUTHOR]
- Published
- 2005
- Full Text
- View/download PDF
26. A Comprehensive Expression Analysis of the Arabidopsis Proline-rich Extensin-like Receptor Kinase Gene Family using Bioinformatic and Experimental Approaches.
- Author
-
Nakhamchik, Alina, Zhiying Zhao, Provart, Nicholas J., Shin-Han Shiu, Keatley, Sarah K., Cameron, Robin K., and Goring, Daphne R.
- Subjects
PLANT genetics ,PROLINE ,GENE expression ,ARABIDOPSIS ,GENE expression in plants ,PLANT molecular genetics - Abstract
The Arabidopsis proline-rich extensin-like receptor kinase (PERK) family consists of 15 predicted receptor kinases. A comprehensive expression analysis was undertaken to identify overlapping and unique expression patterns within this family relative to their phylogeny. Three different approaches were used to study AtPERK gene family expression, and included analyses of the EST, MPSS and NASCArrays databases as well as experimental RNA blot analyses. Some of the AtPERK members were identified as tissue-specific genes while others were more broadly expressed. While in some cases there was a good association between these different expression patterns and the position of the AtPERK members in the kinase phylogeny, in other cases divergence of expression patterns was seen. The PERK expression data identified by the bioinformatics and experimental approaches were found generally to show similar trends and supported the use of data from large-scale expression studies for obtaining preliminary expression data. Thus, the bioinformatics survey for ESTs and microarrays is a powerful comprehensive approach for obtaining a genome-wide view of genes in a multigene family. [ABSTRACT FROM PUBLISHER]
- Published
- 2004
- Full Text
- View/download PDF
27. Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice.
- Author
-
Shin-Han Shiu, Karlowski, Wojciech M., Pan, Runsun, Yun-Huei Tzeng, Mayer, Klaus F. X., and Wen-Hsiung Li
- Subjects
- *
ARABIDOPSIS thaliana , *ARABIDOPSIS , *BRASSICACEAE , *CELL receptors , *PLANT cells & tissues , *PLANT anatomy , *CORN , *CROP genetics , *PLANT genetics - Abstract
Receptor-like kinases (RLKs) belong to the large RLK/Pelle gene family, and it is known that the Arabidopsis thaliana genome contains >600 such members, which play important roles in plant growth, development, and defense responses. Surprisingly, we found that rice (Oryza sativa) has nearly twice as many RLK/Pelle members as Arabidopsis does, and it is not simply a consequence of a larger predicted gene number in rice. From the inferred phylogeny of all Arabidopsis and rice RLK/Pelle members, we estimated that the common ancestor of Arabidopsis and rice had >440 RLK/Pelles and that large-scale expansions of certain RLK/Pelle members and fusions of novel domains have occurred in both the Arabidopsis and rice lineages since their divergence. In addition, the extracellular domains have higher nonsynonymous substitution rates than the intracellular domains, consistent with the role of extracellular domains in sensing diverse signals. The lineage-specific expansions in Arabidopsis can be attributed to both tandem and large-scale duplications, whereas tandem duplication seems to be the major mechanism for recent expansions in rice. Interestingly, although the RLKs that are involved in development seem to have rarely been duplicated after the Arabidopsis-rice split, those that are involved in defense/disease resistance apparently have undergone many duplication events. These findings led us to hypothesize that most of the recent expansions of the RLK/Pelle family have involved defense/resistance-related genes. [ABSTRACT FROM AUTHOR]
- Published
- 2004
- Full Text
- View/download PDF
28. Origins, Lineage-Specific Expansions, and Multiple Losses of Tyrosine Kinases in Eukaryotes.
- Author
-
Shin-Han Shiu and Wen-Hsiung Li
- Published
- 2004
- Full Text
- View/download PDF
29. Expansion of the Receptor-Like Kinase/Pelle Gene Family and Receptor-Like Proteins in Arabidopsis.
- Author
-
Shin-Han Shiu and Bleecker, Anthony B.
- Subjects
- *
PROTEIN kinases , *ARABIDOPSIS , *LIPIDS , *EUKARYOTIC cells - Abstract
Examines the expansion of the receptor-like kinase or pelle gene family and receptor-like proteins in Arabidopsis. Identification of putative protein and lipid ligands; Abundance of gene family in different eukaryotic lineages; Evaluation of the assertion of lineage-specific expansion in plants.
- Published
- 2003
- Full Text
- View/download PDF
30. Nomenclature for Two-Component Signaling Elements of Rice.
- Author
-
Schaller, G. Eric, Doi, Kazuyuki, Hwang, Ildoo, Kieber, Joseph J., Khurana, Jitendra P., Kurata, Nori, Mizuno, Takeshi, Pareek, Ashwani, Shin-Han Shiu, Ping Wu, and Wing Kin Yip
- Subjects
LETTERS to the editor ,RICE ,PHYSIOLOGY - Abstract
A letter to the editor is presented in response to several studies on the two-component signaling elements of rice.
- Published
- 2007
- Full Text
- View/download PDF
31. sORF finder: a program package to identify small open reading frames with high coding potential.
- Author
-
Hanada, Kousuke, Akiyama, Kenji, Sakurai, Tetsuya, Toyoda, Tetsuro, Shinozaki, Kazuo, and Shin-Han Shiu
- Subjects
COMPUTER software ,PROTEINS ,NUCLEOTIDES ,AMINO acids ,GENETIC code - Abstract
Summary: sORF finder is a program package for identifying small open reading frames (sORFs) with high-coding potential. This application allows the identification of coding sORFs according to the nucleotide composition bias among coding sequences and the potential functional constraint at the amino acid level through evaluation of synonymous and non-synonymous substitution rates. [ABSTRACT FROM PUBLISHER]
- Published
- 2010
- Full Text
- View/download PDF
32. Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits.
- Author
-
Azodi, Christina B., Bolger, Emily, McCarren, Andrew, Roantree, Mark, de los Campos, Gustavo, and Shin-Han Shiu
- Subjects
- *
ARTIFICIAL neural networks , *MACHINE learning , *FEATURE selection , *LIVESTOCK breeding , *PLANT breeding , *PREDICTION models , *MULTITRAIT multimethod techniques - Abstract
The usefulness of genomic prediction in crop and livestock breeding programs has prompted efforts to develop new and improved genomic prediction algorithms, such as artificial neural networks and gradient tree boosting. However, the performance of these algorithms has not been compared in a systematic manner using a wide range of datasets and models. Using data of 18 traits across six plant species with different marker densities and training population sizes, we compared the performance of six linear and six non-linear algorithms. First, we found that hyperparameter selection was necessary for all non-linear algorithms and that feature selection prior to model training was critical for artificial neural networks when the markers greatly outnumbered the number of training lines. Across all species and trait combinations, no one algorithm performed best, however predictions based on a combination of results from multiple algorithms (i.e., ensemble predictions) performed consistently well. While linear and non-linear algorithms performed best for a similar number of traits, the performance of non-linear algorithms vary more between traits. Although artificial neural networks did not perform best for any trait, we identified strategies (i.e., feature selection, seeded starting weights) that boosted their performance to near the level of other algorithms. Our results highlight the importance of algorithm selection for the prediction of trait values. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
33. Automated Update, Revision, and Quality Control of the Maize Genome Annotations Using MAKER-P Improves the B73 RefGen_v3 Gene Models and Identifies New Genes[OPEN].
- Author
-
MeiYee Law, Childs, Kevin L., Campbell, Michael S., Stein, Joshua C., Olson, Andrew J., Holt, Carson, Panchy, Nicholas, Jikai Lei, Dian Jiao, Andorf, Carson M., Lawrence, Carolyn J., Ware, Doreen, Shin-Han Shiu, Yanni Sun, Ning Jiang, and Yandell, Mark
- Subjects
- *
PLANT genes , *PLANT genomes , *CORN research , *GENETIC research , *PLANT genetics , *PLANT cells & tissues - Abstract
The large size and relative complexity of many plant genomes make creation, quality control, and dissemination of high-quality gene structure annotations challenging. In response, we have developed MAKER-P, a fast and easy-to-use genome annotation engine for plants. Here, we report the use of MAKER-P to update and revise the maize (Zea mays) B73 RefGen_v3 annotation build (5b+) in less than 3 h using the iPlant Cyberinfrastructure. MAKER-P identified and annotated 4,466 additional, well-supported protein-coding genes not present in the 5b+ annotation build, added additional untranslated regions to 1,393 5b+ gene models, identified 2,647 5b+ gene models that lack any supporting evidence (despite the use of large and diverse evidence data sets), identified 104,215 pseudogene fragments, and created an additional 2,522 noncoding gene annotations. We also describe a method for de novo training of MAKER-P for the annotation of newly sequenced grass genomes. Collectively, these results lead to the 6a maize genome annotation and demonstrate the utility of MAKER-P for rapid annotation, management, and quality control of grasses and other difficult-to-annotate plant genomes. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.