16 results on '"Fold-change"'
Search Results
2. Statistical Issues and Group Classification in Plasma MicroRNA Studies With Data Application.
- Author
-
Rai, Shesh N, Qian, Chen, Pan, Jianmin, McClain, Marion, Eichenberger, Maurice R, McClain, Craig J, and Galandiuk, Susan
- Subjects
- *
MICRORNA , *ANALYSIS of variance , *CLASSIFICATION , *STATISTICS , *DATA extraction , *ANALYSIS of covariance - Abstract
The analysis of plasma microRNAs (miRNAs) has been widely used as a method for finding potential biomarkers for human diseases, especially those with a link to cancer. Methods of analyzing plasma miRNA have been thoroughly discussed from sample extraction to data modeling. However, some issues exist within the process that have rarely been talked about. Rice et al. discussed some issues in plasma miRNA studies, such as the lack of standard methodology including the use of different cycle threshold, time to plasma extraction, among others. These issues can lead to inconsistent data, and thus impact the result and assay reproducibility. Other external issues, such as batch effect and operator effect, may also indirectly impact the statistical analysis. Here, we discuss issues in plasma miRNA studies from a statistical point of view. The interaction effect of different ways of calculating fold-change, the choice of housekeeping genes, and methods of normalization are among the issues we discuss, with data demonstrations. P values are calculated and compared to determine the effect of those issues on statistical conclusions. Statistical methods such as analysis of variance and analysis of covariance are crucial in the analysis of miRNA but investigators are often confused about them; therefore, a brief explanation of these statistical methods is also included. In addition, 3-group classification is discussed, as it is often challenging, compared with 2-group classification. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
3. Controlling Nuclear NF-κB Dynamics by β-TrCP—Insights from a Computational Model
- Author
-
Uwe Benary and Jana Wolf
- Subjects
mathematical modelling ,β-TrCP ,NF-κB signaling ,drug target ,oscillation ,fold-change ,area under curve ,ordinary differential equations ,Biology (General) ,QH301-705.5 - Abstract
The canonical nuclear factor kappa-light-chain-enhancer of activated B cells (NF-κB) signaling pathway regulates central processes in mammalian cells and plays a fundamental role in the regulation of inflammation and immunity. Aberrant regulation of the activation of the transcription factor NF-κB is associated with severe diseases such as inflammatory bowel disease and arthritis. In the canonical pathway, the inhibitor IκB suppresses NF-κB’s transcriptional activity. NF-κB becomes active upon the degradation of IκB, a process that is, in turn, regulated by the β-transducin repeat-containing protein (β-TrCP). β-TrCP has therefore been proposed as a promising pharmacological target in the development of novel therapeutic approaches to control NF-κB’s activity in diseases. This study explores the extent to which β-TrCP affects the dynamics of nuclear NF-κB using a computational model of canonical NF-κB signaling. The analysis predicts that β-TrCP influences the steady-state concentration of nuclear NF-κB, as well as changes characteristic dynamic properties of nuclear NF-κB, such as fold-change and the duration of its response to pathway stimulation. The results suggest that the modulation of β-TrCP has a high potential to regulate the transcriptional activity of NF-κB.
- Published
- 2019
- Full Text
- View/download PDF
4. EventPointer 3.0: flexible and accurate splicing analysis that includes studying the differential usage of protein-domains
- Author
-
Ferrer-Bonsom, J.A. (Juan Ángel), Gimeno-Combarro, M. (Marian), Olaverri, D. (Danel), Sacristan, P. (Pablo), Lobato, C. (Cesar), Castilla-Ruiz, C.(Carlos), Carazo-Melo, F.(Fernando), and Rubio-Díaz-Cordovés, Á. (Ángel)
- Subjects
Kinase ,Identification ,Inhibitor ,Genes ,Área de Biomedicina ,Quantification ,Growth ,Fold-Change ,Cancer ,Homology - Abstract
Alternative splicing (AS) plays a key role in cancer: all its hallmarks have been associated with different mechanisms of abnormal AS. The improvement of the human transcriptome annotation and the availability of fast and accurate software to estimate isoform concentrations has boosted the analysis of transcriptome profiling from RNA-seq. The statistical analysis of AS is a challenging problem not yet fully solved. We have included in EventPointer (EP), a Bioconductor package, a novel statistical method that can use the bootstrap of the pseudoaligners. We compared it with other state-of-the-art algorithms to analyze AS. Its performance is outstanding for shallow sequencing conditions. The statistical framework is very flexible since it is based on design and contrast matrices. EP now includes a convenient tool to find the primers to validate the discoveries using PCR. We also added a statistical module to study alteration in protein domain related to AS. Applying it to 9514 patients from TCGA and TARGET in 19 different tumor types resulted in two conclusions: i) aberrant alternative splicing alters the relative presence of Protein domains and, ii) the number of enriched domains is strongly correlated with the age of the patients.
- Published
- 2022
5. Multi-platform assessment of transcriptional profiling technologies utilizing a precise probe mapping methodology.
- Author
-
Jinsheng Yu, Cliften, Paul F., Juehne, Twyla I., Sinnwell, Toni M., Sawyer, Chris S., Sharma, Mala, Lutz, Andrew, Tycksen, Eric, Johnson, Mark R., Minton, Matthew R., Klotz, Elliott T., Schriefer, Andrew E., Wei Yang, Heinz, Michael E., Crosby, Seth D., and Head, Richard D.
- Subjects
- *
RNA sequencing , *GENETIC transcription , *MICROARRAY technology , *GENE expression , *EXONS (Genetics) - Abstract
Background: The arrival of RNA-seq as a high-throughput method competitive to the established microarray technologies has necessarily driven a need for comparative evaluation. To date, cross-platform comparisons of these technologies have been relatively few in number of platforms analyzed and were typically gene name annotation oriented. Here, we present a more extensive and yet precise assessment to elucidate differences and similarities in performance of numerous aspects including dynamic range, fidelity of raw signal and fold-change with sample titration, and concordance with qRT-PCR (TaqMan). To ensure that these results were not confounded by incompatible comparisons, we introduce the concept of probe mapping directed "transcript pattern". A transcript pattern identifies probe(set)s across platforms that target a common set of transcripts for a specific gene. Thus, three levels of data were examined: entire data sets, data derived from a subset of 15,442 RefSeq genes common across platforms, and data derived from the transcript pattern defined subset of 7,034 RefSeq genes. Results: In general, there were substantial core similarities between all 6 platforms evaluated; but, to varying degrees, the two RNA-seq protocols outperformed three of the four microarray platforms in most categories. Notably, a fourth microarray platform, Agilent with a modified protocol, was comparable, or marginally superior, to the RNA-seq protocols within these same assessments, especially in regards to fold-change evaluation. Furthermore, these 3 platforms (Agilent and two RNA-seq methods) demonstrated over 80 % fold-change concordance with the gold standard qRT-PCR (TaqMan). Conclusions: This study suggests that microarrays can perform on nearly equal footing with RNA-seq, in certain key features, specifically when the dynamic range is comparable. Furthermore, the concept of a transcript pattern has been introduced that may minimize potential confounding factors of multi-platform comparison and may be useful for similar evaluations. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
6. Fuzzy clustering with biological knowledge for gene selection.
- Author
-
Ghosh, Sampreeti, Mitra, Sushmita, and Dattagupta, Rana
- Subjects
FUZZY systems ,CLUSTER analysis (Statistics) ,GENE expression ,GENE ontology ,DATA analysis - Abstract
Highlights: [•] An application of FCLARANS for attribute clustering and dimensionality reduction in gene expression data has been demonstrated. [•] Domain knowledge based on gene ontology and differential gene expressions are employed in the process. [•] Important representative features are extracted from each enriched gene partition to form the reduced gene space. [•] External validation using various well-known classifiers, establishes the effectiveness of the proposed methodology. [Copyright &y& Elsevier]
- Published
- 2014
- Full Text
- View/download PDF
7. Representing dynamic biological networks with multi-scale probabilistic models
- Author
-
Oliver Pötz, Michael Kühl, Silke D. Kühlwein, Barbara Kracher, Luc De Raedt, Hans A. Kestler, Astrid S. Pfister, Alexander Groß, Thomas O. Joos, Sebastian Wiese, Dries Van Daele, Katrin Luckert, and Johann M. Kraus
- Subjects
Life Sciences & Biomedicine - Other Topics ,Computer science ,Bayesian probability ,Gene regulatory network ,Medicine (miscellaneous) ,Transfection ,computer.software_genre ,Models, Biological ,Article ,BETA-CATENIN ,General Biochemistry, Genetics and Molecular Biology ,Feedback ,03 medical and health sciences ,Bayes' theorem ,0302 clinical medicine ,SYSTEMS ,INTACT ,Humans ,Gene Regulatory Networks ,Phosphorylation ,Wnt Signaling Pathway ,Biology ,lcsh:QH301-705.5 ,beta Catenin ,030304 developmental biology ,0303 health sciences ,Models, Statistical ,Science & Technology ,IDENTIFICATION ,Systems Biology ,Scale (chemistry) ,Probabilistic logic ,Robustness (evolution) ,Bayes Theorem ,Multidisciplinary Sciences ,Range (mathematics) ,HEK293 Cells ,lcsh:Biology (General) ,Gene Knockdown Techniques ,030220 oncology & carcinogenesis ,Science & Technology - Other Topics ,Data mining ,General Agricultural and Biological Sciences ,Life Sciences & Biomedicine ,computer ,Biological network ,Signal Transduction ,FOLD-CHANGE - Abstract
Dynamic models analyzing gene regulation and metabolism face challenges when adapted to modeling signal transduction networks. During signal transduction, molecular reactions and mechanisms occur in different spatial and temporal frames and involve feedbacks. This impedes the straight-forward use of methods based on Boolean networks, Bayesian approaches, and differential equations. We propose a new approach, ProbRules, that combines probabilities and logical rules to represent the dynamics of a system across multiple scales. We demonstrate that ProbRules models can represent various network motifs of biological systems. As an example of a comprehensive model of signal transduction, we provide a Wnt network that shows remarkable robustness under a range of phenotypical and pathological conditions. Its simulation allows the clarification of controversially discussed molecular mechanisms of Wnt signaling by predicting wet-lab measurements. ProbRules provides an avenue in current computational modeling by enabling systems biologists to integrate vast amounts of available data on different scales., Alexander Gross et al. present ProbRules, a dynamic modeling approach that combines probabilities and logical rules to represent network dynamics over multiple scales. They apply ProbRules to a Wnt network, predicting gene expression readouts that they confirm with wet-lab experiments.
- Published
- 2019
- Full Text
- View/download PDF
8. Statistical Issues and Group Classification in Plasma MicroRNA Studies With Data Application
- Author
-
Maurice R Eichenberger, Chen Qian, Jianmin Pan, Susan Galandiuk, Shesh N. Rai, Craig J. McClain, and Marion McClain
- Subjects
0106 biological sciences ,Normalization (statistics) ,fold-change ,Computer science ,housekeeping genes ,lcsh:Evolution ,Batch effect ,Machine learning ,computer.software_genre ,010603 evolutionary biology ,01 natural sciences ,Data modeling ,03 medical and health sciences ,operator effect ,plasma miRNA ,Genetics ,lcsh:QH359-425 ,Statistical analysis ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,Quantile normalization ,Original Research ,Analysis of covariance ,ANCOVA ,0303 health sciences ,ANOVA ,batch effect ,varying threshold ,business.industry ,quantile normalization ,Data application ,Computer Science Applications ,normalization ,classification ,Analysis of variance ,Artificial intelligence ,business ,computer - Abstract
The analysis of plasma microRNAs (miRNAs) has been widely used as a method for finding potential biomarkers for human diseases, especially those with a link to cancer. Methods of analyzing plasma miRNA have been thoroughly discussed from sample extraction to data modeling. However, some issues exist within the process that have rarely been talked about. Rice et al. discussed some issues in plasma miRNA studies, such as the lack of standard methodology including the use of different cycle threshold, time to plasma extraction, among others. These issues can lead to inconsistent data, and thus impact the result and assay reproducibility. Other external issues, such as batch effect and operator effect, may also indirectly impact the statistical analysis. Here, we discuss issues in plasma miRNA studies from a statistical point of view. The interaction effect of different ways of calculating fold-change, the choice of housekeeping genes, and methods of normalization are among the issues we discuss, with data demonstrations. P values are calculated and compared to determine the effect of those issues on statistical conclusions. Statistical methods such as analysis of variance and analysis of covariance are crucial in the analysis of miRNA but investigators are often confused about them; therefore, a brief explanation of these statistical methods is also included. In addition, 3-group classification is discussed, as it is often challenging, compared with 2-group classification.
- Published
- 2020
9. Selection of differentially expressed genes in microarray data analysis.
- Author
-
Chen, J. J., Wang, S.-J., Tsai, C.-A., and Lin, C.-J.
- Subjects
- *
GENES , *THERAPEUTICS , *DNA microarrays , *STATISTICS , *TUMORS , *TISSUES - Abstract
One common objective in microarray experiments is to identify a subset of genes that express differentially among different experimental conditions, for example, between drug treatment and no drug treatment. Often, the goal is to determine the underlying relationship between poor versus good gene signatures for identifying biological functions or predicting specific therapeutic outcomes. Because of the complexity in studying hundreds or thousands of genes in an experiment, selection of a subset of genes to enhance relationships among the underlying biological structures or to improve prediction accuracy of clinical outcomes has been an important issue in microarray data analysis. Selection of differentially expressed genes is a two-step process. The first step is to select an appropriate test statistic and compute the P-value. The genes are ranked according to their P-values as evidence of differential expression. The second step is to assign a significance level, that is, to determine a cutoff threshold from the P-values in accordance with the study objective. In this paper, we consider four commonly used statistics, t-, S- (SAM), U-(Mann–Whitney) and M-statistics to compute the P-values for gene ranking. We consider the family-wise error and false discovery rate false-positive error-controlled procedures to select a limited number of genes, and a receiver-operating characteristic (ROC) approach to select a larger number of genes for assigning the significance level. The ROC approach is particularly useful in genomic/genetic profiling studies. The well-known colon cancer data containing 22 normal and 40 tumor tissues are used to illustrate different gene ranking and significance level assignment methods for applications to genomic/genetic profiling studies. The P-values computed from the t-, U- and M-statistics are very similar. We discuss the common practice that uses the P-value, false-positive error probability, as the primary criterion, and then uses the fold-change as a surrogate measure of biological significance for gene selection. The P-value and the fold-change can be pictorially shown simultaneously in a volcano plot. We also address several issues on gene selection.The Pharmacogenomics Journal (2007) 7, 212–220. doi:10.1038/sj.tpj.6500412; published online 29 August 2006 [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
10. TargetScore used to reveal potential targets of miRNA203 and miRNA-146a in psoriasis by integrating microRNA overexpression and microarray data
- Author
-
Yan-Song Liu, Xiu-Jie Zhang, and Hai-Xia Chen
- Subjects
0301 basic medicine ,fold-change ,Gene Expression ,Context (language use) ,Computational biology ,03 medical and health sciences ,0302 clinical medicine ,Psoriasis ,microRNA ,Gene expression ,TargetScore ,Humans ,Medicine ,Gene ,Oligonucleotide Array Sequence Analysis ,business.industry ,Antigen processing ,Microarray analysis techniques ,Bayes Theorem ,Clinical Trial/Experimental Study ,psoriasis ,General Medicine ,medicine.disease ,MicroRNAs ,030104 developmental biology ,Case-Control Studies ,030220 oncology & carcinogenesis ,business ,KIR3DL1 ,Research Article - Abstract
Objective: Systematic tracking of microRNA (miRNA) targets remains a challenge. In our work, we aimed to use TargetScore to investigate the potential targets of miRNA203 and miRNA-146a in psoriasis by integrating miRNA overexpression information and sequence data, and to further uncover the functions of miRNA203 and miRNA-146a in psoriasis. Methods: This was a case-control bioinformatics analysis using already published microarray data of psoriasis. We calculated targetScores by combining log fold-change and sequence scores obtained from TargetScan context score, probabilities of conserved targeting, and derived the distribution of targetScores. The scoring cutoff was chosen based on the different targetScore distributions for the nonvalidated and validated targets. The potential target genes for miRNA-203 and miRNA-146a were predicted based on the targetScore threshold. To reveal the functions of miRNA-203 and miRNA-146a, we implemented pathway enrichment analyses for the targets of miRNA-203 and miRNA-146a. Results: TargetScore >0.4 was selected as the threshold to filter out less confidence targets because we observed little overlap between the 2 distribution at targetScore = 0.4. Based on the targetScore >0.4, 49 target genes for miRNA-203 and 17 targets for miRNA-146a were identified. Pathway enrichment results showed that the target genes of miRNA-203 (including KIR2DL1, HLA-DQA1, KIR3DL1) only participated in antigen processing and presentation. The target genes of miRNA-146a (covering ADORA3, CYSLTR2, HRH4) were only involved in neuroactive ligand-receptor interaction. Conclusion: MiRNA203 and miRNA-146a played important roles in psoriasis progression, partially through regulating the pathways of antigen processing and presentation, and neuroactive ligand-receptor interaction, respectively.
- Published
- 2018
- Full Text
- View/download PDF
11. Controlling Nuclear NF-κB Dynamics by β-TrCP—Insights from a Computational Model.
- Author
-
Benary, Uwe and Wolf, Jana
- Subjects
INFLAMMATORY bowel diseases ,IMMUNOREGULATION ,B cells ,TRANSCRIPTION factors ,TALL-1 (Protein) ,NF-kappa B - Abstract
The canonical nuclear factor kappa-light-chain-enhancer of activated B cells (NF-κB) signaling pathway regulates central processes in mammalian cells and plays a fundamental role in the regulation of inflammation and immunity. Aberrant regulation of the activation of the transcription factor NF-κB is associated with severe diseases such as inflammatory bowel disease and arthritis. In the canonical pathway, the inhibitor IκB suppresses NF-κB's transcriptional activity. NF-κB becomes active upon the degradation of IκB, a process that is, in turn, regulated by the β-transducin repeat-containing protein (β-TrCP). β-TrCP has therefore been proposed as a promising pharmacological target in the development of novel therapeutic approaches to control NF-κB's activity in diseases. This study explores the extent to which β-TrCP affects the dynamics of nuclear NF-κB using a computational model of canonical NF-κB signaling. The analysis predicts that β-TrCP influences the steady-state concentration of nuclear NF-κB, as well as changes characteristic dynamic properties of nuclear NF-κB, such as fold-change and the duration of its response to pathway stimulation. The results suggest that the modulation of β-TrCP has a high potential to regulate the transcriptional activity of NF-κB. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
12. Multi-platform assessment of transcriptional profiling technologies utilizing a precise probe mapping methodology
- Author
-
Chris S. Sawyer, Elliott T. Klotz, Jinsheng Yu, Mala Sharma, Richard D. Head, Paul F. Cliften, Eric Tycksen, Wei Yang, Michael E. Heinz, Matthew R Minton, Seth D. Crosby, Mark R. Johnson, Twyla Juehne, Andrew E. Schriefer, Andrew Lutz, and Toni M. Sinnwell
- Subjects
media_common.quotation_subject ,Fidelity ,Fold-change ,RNA-Seq ,Computational biology ,Biology ,Microarray ,Proteomics ,03 medical and health sciences ,Annotation ,0302 clinical medicine ,RefSeq ,Genetics ,Profiling (information science) ,030304 developmental biology ,media_common ,Oligonucleotide Array Sequence Analysis ,0303 health sciences ,Methodology Article ,Gene Expression Profiling ,Reproducibility of Results ,Transcript pattern ,Gene expression profiling ,TaqMan assay ,RNA ,DNA microarray ,RNA-seq ,030217 neurology & neurosurgery ,Biotechnology - Abstract
Background The arrival of RNA-seq as a high-throughput method competitive to the established microarray technologies has necessarily driven a need for comparative evaluation. To date, cross-platform comparisons of these technologies have been relatively few in number of platforms analyzed and were typically gene name annotation oriented. Here, we present a more extensive and yet precise assessment to elucidate differences and similarities in performance of numerous aspects including dynamic range, fidelity of raw signal and fold-change with sample titration, and concordance with qRT-PCR (TaqMan). To ensure that these results were not confounded by incompatible comparisons, we introduce the concept of probe mapping directed “transcript pattern”. A transcript pattern identifies probe(set)s across platforms that target a common set of transcripts for a specific gene. Thus, three levels of data were examined: entire data sets, data derived from a subset of 15,442 RefSeq genes common across platforms, and data derived from the transcript pattern defined subset of 7,034 RefSeq genes. Results In general, there were substantial core similarities between all 6 platforms evaluated; but, to varying degrees, the two RNA-seq protocols outperformed three of the four microarray platforms in most categories. Notably, a fourth microarray platform, Agilent with a modified protocol, was comparable, or marginally superior, to the RNA-seq protocols within these same assessments, especially in regards to fold-change evaluation. Furthermore, these 3 platforms (Agilent and two RNA-seq methods) demonstrated over 80 % fold-change concordance with the gold standard qRT-PCR (TaqMan). Conclusions This study suggests that microarrays can perform on nearly equal footing with RNA-seq, in certain key features, specifically when the dynamic range is comparable. Furthermore, the concept of a transcript pattern has been introduced that may minimize potential confounding factors of multi-platform comparison and may be useful for similar evaluations. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1913-6) contains supplementary material, which is available to authorized users.
- Published
- 2015
13. Input-output relations in biological systems: measurement, information and the Hill equation
- Author
-
Steven A. Frank
- Subjects
Chemical process ,Cellular Sensors ,Measurement Theory ,Natural selection ,Entropy ,Molecular Networks (q-bio.MN) ,Information Theory ,Review ,Symmetry ,0302 clinical medicine ,Models ,Cell Behavior (q-bio.CB) ,Medicine and Health Sciences ,Physical Sciences and Mathematics ,Biological Design ,Natural Selection ,Quantitative Biology - Molecular Networks ,Cellular biochemistry ,Statistical physics ,0303 health sciences ,Hill differential equation ,Agricultural and Biological Sciences(all) ,Systems Biology ,Applied Mathematics ,Life Sciences ,Biological design ,Dynamics ,Variable (computer science) ,Modeling and Simulation ,symbols ,Cellular Biochemistry ,Cellular sensors ,Measurement Scale ,Neurons and Cognition (q-bio.NC) ,General Agricultural and Biological Sciences ,Signal Transduction ,Signal processing ,Information theory ,Process (engineering) ,Immunology ,FOS: Physical sciences ,Signal Processing Probability-Distributions ,Biology ,Fold-Change ,Models, Biological ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,symbols.namesake ,Component (UML) ,Selection, Genetic ,Measurement theory ,Pathways ,Scaling ,Ecology, Evolution, Behavior and Systematics ,Condensed Matter - Statistical Mechanics ,030304 developmental biology ,Input/output ,Statistical Mechanics (cond-mat.stat-mech) ,Biochemistry, Genetics and Molecular Biology(all) ,Scale (chemistry) ,Kinetics ,Quantitative Biology - Neurons and Cognition ,FOS: Biological sciences ,Quantitative Biology - Cell Behavior ,Law ,030217 neurology & neurosurgery - Abstract
Biological systems produce outputs in response to variable inputs. Input-output relations tend to follow a few regular patterns. For example, many chemical processes follow the S-shaped Hill equation relation between input concentrations and output concentrations. That Hill equation pattern contradicts the fundamental Michaelis-Menten theory of enzyme kinetics. I use the discrepancy between the expected Michaelis-Menten process of enzyme kinetics and the widely observed Hill equation pattern of biological systems to explore the general properties of biological input-output relations. I start with the various processes that could explain the discrepancy between basic chemistry and biological pattern. I then expand the analysis to consider broader aspects that shape biological input-output relations. Key aspects include the input-output processing by component subsystems and how those components combine to determine the system's overall input-output relations. That aggregate structure often imposes strong regularity on underlying disorder. Aggregation imposes order by dissipating information as it flows through the components of a system. The dissipation of information may be evaluated by the analysis of measurement and precision, explaining why certain common scaling patterns arise so frequently in input-output relations. I discuss how aggregation, measurement and scale provide a framework for understanding the relations between pattern and process. The regularity imposed by those broader structural aspects sets the contours of variation in biology. Thus, biological design will also tend to follow those contours. Natural selection may act primarily to modulate system properties within those broad constraints., Comment: Biology Direct 8:31
- Published
- 2013
- Full Text
- View/download PDF
14. CDS: A Fold-change Based Statistical Test for Concomitant Identification of Distinctness and Similarity in Gene Expression Analysis
- Author
-
Brice Targat, Nicolas Tchitchek, Jose Felipe Golib Dzib, Sebastian Noth, Annick Lesne, Arndt Benecke, Expression des Gènes et comportement adaptatifs = Molecular Genetics, Neurophysiology and Behavior (NPS-15), Neuroscience Paris Seine (NPS), Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut de Biologie Paris Seine (IBPS), Institut National de la Santé et de la Recherche Médicale (INSERM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut de Biologie Paris Seine (IBPS), Institut National de la Santé et de la Recherche Médicale (INSERM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS), Institut des Hautes Études Scientifiques (IHES), IHES, Institut de Recherche Interdisciplinaire [Villeneuve d'Ascq] (IRI), Université de Lille, Sciences et Technologies-Université de Lille, Droit et Santé-Centre National de la Recherche Scientifique (CNRS), Laboratoire de Physique Théorique de la Matière Condensée (LPTMC), Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), Mazalérat, Charlotte, Institut des Hautes Etudes Scientifiques (IHES), Neurosciences Paris Seine (NPS), Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Institut de Biologie Paris Seine (IBPS), Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Centre National de la Recherche Scientifique (CNRS)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Institut de Biologie Paris Seine (IBPS), Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Centre National de la Recherche Scientifique (CNRS), Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut de Biologie Paris Seine (IBPS), and Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)
- Subjects
Adenoma ,media_common.quotation_subject ,Coefficient of variation ,Fold-change ,computer.software_genre ,01 natural sciences ,Biochemistry ,Similarity ,010104 statistics & probability ,03 medical and health sciences ,Gene expression ,Confidence Intervals ,Genetics ,Humans ,Digital polymerase chain reaction ,[SDV.NEU] Life Sciences [q-bio]/Neurons and Cognition [q-bio.NC] ,0101 mathematics ,Molecular Biology ,Normality ,Mathematics ,030304 developmental biology ,media_common ,Statistical hypothesis testing ,Original Research ,Oligonucleotide Array Sequence Analysis ,Single measurement ,Patient study ,0303 health sciences ,Distinctness ,business.industry ,Gene Expression Profiling ,Carcinoma ,Pattern recognition ,Fold change ,Confidence interval ,Adrenal Cortex Neoplasms ,Computational Mathematics ,Adrenal Cortex ,[SDV.NEU]Life Sciences [q-bio]/Neurons and Cognition [q-bio.NC] ,Artificial intelligence ,Data mining ,business ,Statistical test ,computer ,Type I and type II errors - Abstract
International audience; The problem of identifying differential activity such as in gene expression is a major defeat in biostatistics and bioinformatics. Equally important, however much less frequently studied, is the question of similar activity from one biological condition to another. The fold-change, or ratio, is usually considered a relevant criterion for stating difference and similarity between measurements. Importantly, no statistical method for concomitant evaluation of similarity and distinctness currently exists for biological applications. Modern microarray, digital PCR (dPCR), and Next-Generation Sequencing (NGS) technologies frequently provide a means of coefficient of variation estimation for individual measurements. Using fold-change, and by making the assumption that measurements are normally distributed with known variances, we designed a novel statistical test that allows us to detect concomitantly, thus using the same formalism, differentially and similarly expressed genes (http://cds.ihes.fr). Given two sets of gene measurements in different biological conditions, the probabilities of making type I and type II errors in stating that a gene is differentially or similarly expressed from one condition to the other can be calculated. Furthermore, a confidence interval for the fold-change can be delineated. Finally, we demonstrate that the assumption of normality can be relaxed to consider arbitrary distributions numerically. The Concomitant evaluation of Distinctness and Similarity (CDS) statistical test correctly estimates similarities and differences between measurements of gene expression. The implementation, being time and memory efficient, allows the use of the CDS test in high-throughput data analysis such as microarray, dPCR, and NGS experiments. Importantly, the CDS test can be applied to the comparison of single measurements (N = 1) provided the variance (or coefficient of variation) of the signals is known, making CDS a valuable tool also in biomedical analysis where typically a single measurement per subject is available.
- Published
- 2012
- Full Text
- View/download PDF
15. Comparaison des méthodes d'analyse de l'expression différentielle basée sur la dépendance des niveaux d'expression
- Author
-
Lefebvre, François and Lemieux, Sébastien
- Subjects
puces à ADN ,fold-change ,Affymetrix ,expression différentielle ,microarrays ,differential expression - Abstract
La technologie des microarrays demeure à ce jour un outil important pour la mesure de l'expression génique. Au-delà de la technologie elle-même, l'analyse des données provenant des microarrays constitue un problème statistique complexe, ce qui explique la myriade de méthodes proposées pour le pré-traitement et en particulier, l'analyse de l'expression différentielle. Toutefois, l'absence de données de calibration ou de méthodologie de comparaison appropriée a empêché l'émergence d'un consensus quant aux méthodes d'analyse optimales. En conséquence, la décision de l'analyste de choisir telle méthode plutôt qu'une autre se fera la plupart du temps de façon subjective, en se basant par exemple sur la facilité d'utilisation, l'accès au logiciel ou la popularité. Ce mémoire présente une approche nouvelle au problème de la comparaison des méthodes d'analyse de l'expression différentielle. Plus de 800 pipelines d'analyse sont appliqués à plus d'une centaine d'expériences sur deux plateformes Affymetrix différentes. La performance de chacun des pipelines est évaluée en calculant le niveau moyen de co-régulation par l'entremise de scores d'enrichissements pour différentes collections de signatures moléculaires. L'approche comparative proposée repose donc sur un ensemble varié de données biologiques pertinentes, ne confond pas la reproductibilité avec l'exactitude et peut facilement être appliquée à de nouvelles méthodes. Parmi les méthodes testées, la supériorité de la sommarisation FARMS et de la statistique de l'expression différentielle TREAT est sans équivoque. De plus, les résultats obtenus quant à la statistique d'expression différentielle corroborent les conclusions d'autres études récentes à propos de l'importance de prendre en compte la grandeur du changement en plus de sa significativité statistique., Microarrays remain an important tool for the measurement of gene expression, and a myriad of methods for their pre-processing or statistical testing of differential expression has been proposed in the past. However, insufficient and sometimes contradictory evidence has prevented the emergence of a strong consensus over a preferred methodology. This leaves microarray practitioners to somewhat arbitrarily decide which method should be used to analyze their data. Here we present a novel approach to the problem of comparing methods for the identification of differentially expressed genes. Over eight hundred analytic pipelines were applied to more than a hundred independent microarray experiments. The accuracy of each analytic pipeline was assessed by measuring the average level of co-regulation uncovered across all data sets. This analysis thus relies on a varied set of biologically relevant data, does not confound reproducibility for accuracy and can easily be extended to future analytic pipelines. This procedure identified FARMS summarization and the TREAT gene ordering statistic as algorithms significantly more accurate than other alternatives. Most interestingly, our results corroborate recent findings about the importance of taking the magnitude of change into account along with an assessment of statistical significance.
- Published
- 2011
16. Nonparametric bayesian evaluation of differential protein quantification
- Author
-
Serang, O., Cansizoglu, A. E., Käll, Lukas, Steen, H., Steen, J. A., Serang, O., Cansizoglu, A. E., Käll, Lukas, Steen, H., and Steen, J. A.
- Abstract
Arbitrary cutoffs are ubiquitous in quantitative computational proteomics: maximum acceptable MS/MS PSM or peptide q value, minimum ion intensity to calculate a fold change, the minimum number of peptides that must be available to trust the estimated protein fold change (or the minimum number of PSMs that must be available to trust the estimated peptide fold change), and the "significant" fold change cutoff. Here we introduce a novel experimental setup and nonparametric Bayesian algorithm for determining the statistical quality of a proposed differential set of proteins or peptides. By comparing putatively nonchanging case-control evidence to an empirical null distribution derived from a control-control experiment, we successfully avoid some of these common parameters. We then apply our method to evaluating different fold-change rules and find that for our data a 1.2-fold change is the most permissive of the plausible fold-change rules., QC 20131029
- Published
- 2013
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.