1,066 results on '"expression data"'
Search Results
2. Describing and characterizing the WAK/WAKL gene family across plant species: a systematic review.
- Author
-
Harvey, Aaron, van den Berg, Noëlani, and Swart, Velushka
- Subjects
GENE families ,CULTIVARS ,PLANT species ,CELL physiology ,GENE expression - Abstract
Wall-associated kinases (WAKs) and WAK-likes (WAKLs) are transmembrane pectin receptors which have seen rising interest in recent years due to their roles in stress responses and developmental pathways. Consequently, the genes encoding these proteins are continuously identified, described and characterised across a wide variety of plant species. The primary goal of characterizing these genes is to classify, describe and infer cellular function, mostly through in silico methods. However, inconsistencies across characterizations have led to discrepancies in WAK/WAKL definitions resulting in sequences being classified as a WAK in one study but as a WAKL or not identified in another. The methods of characterization range widely with different combinations of analyses being conducted, to similar analyses but with varying inputs and parameters which are impacting the outputs. This review collates current knowledge about WAK/WAKL genes and the recent characterizations of this family and suggests a more robust strategy for increased consistency among the different gene members, as well as the characterizations thereof. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Describing and characterizing the WAK/WAKL gene family across plant species: a systematic review
- Author
-
Aaron Harvey, Noëlani van den Berg, and Velushka Swart
- Subjects
wall-associated kinase ,wall-associated kinase-like ,gene identification and classification ,expression data ,cis-acting elements ,Plant culture ,SB1-1110 - Abstract
Wall-associated kinases (WAKs) and WAK-likes (WAKLs) are transmembrane pectin receptors which have seen rising interest in recent years due to their roles in stress responses and developmental pathways. Consequently, the genes encoding these proteins are continuously identified, described and characterised across a wide variety of plant species. The primary goal of characterizing these genes is to classify, describe and infer cellular function, mostly through in silico methods. However, inconsistencies across characterizations have led to discrepancies in WAK/WAKL definitions resulting in sequences being classified as a WAK in one study but as a WAKL or not identified in another. The methods of characterization range widely with different combinations of analyses being conducted, to similar analyses but with varying inputs and parameters which are impacting the outputs. This review collates current knowledge about WAK/WAKL genes and the recent characterizations of this family and suggests a more robust strategy for increased consistency among the different gene members, as well as the characterizations thereof.
- Published
- 2024
- Full Text
- View/download PDF
4. Identification of cardiomyopathy-related core genes through human metabolic networks and expression data
- Author
-
Zherou Rong, Hongwei Chen, Zihan Zhang, Yue Zhang, Luanfeng Ge, Zhengyu Lv, Yuqing Zou, Junjie Lv, Yuehan He, Wan Li, and Lina Chen
- Subjects
Cardiomyopathy ,Human metabolic network ,Expression data ,Module ,Core genes ,Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Cardiomyopathy is a complex type of myocardial disease, and its incidence has increased significantly in recent years. Dilated cardiomyopathy (DCM) and ischemic cardiomyopathy (ICM) are two common and indistinguishable types of cardiomyopathy. Results Here, a systematic multi-omics integration approach was proposed to identify cardiomyopathy-related core genes that could distinguish normal, DCM and ICM samples using cardiomyopathy expression profile data based on a human metabolic network. First, according to the differentially expressed genes between different states (DCM/ICM and normal, or DCM and ICM) of samples, three sets of initial modules were obtained from the human metabolic network. Two permutation tests were used to evaluate the significance of the Pearson correlation coefficient difference score of the initial modules, and three candidate modules were screened out. Then, a cardiomyopathy risk module that was significantly related to DCM and ICM was determined according to the significance of the module score based on Markov random field. Finally, based on the shortest path between cardiomyopathy known genes, 13 core genes related to cardiomyopathy were identified. These core genes were enriched in pathways and functions significantly related to cardiomyopathy and could distinguish between samples of different states. Conclusion The identified core genes might serve as potential biomarkers of cardiomyopathy. This research will contribute to identifying potential biomarkers of cardiomyopathy and to distinguishing different types of cardiomyopathy.
- Published
- 2022
- Full Text
- View/download PDF
5. Identification of cardiomyopathy-related core genes through human metabolic networks and expression data.
- Author
-
Rong, Zherou, Chen, Hongwei, Zhang, Zihan, Zhang, Yue, Ge, Luanfeng, Lv, Zhengyu, Zou, Yuqing, Lv, Junjie, He, Yuehan, Li, Wan, and Chen, Lina
- Subjects
GENE regulatory networks ,MARKOV random fields ,DILATED cardiomyopathy ,HUMAN genes ,PEARSON correlation (Statistics) ,CARDIOMYOPATHIES - Abstract
Background: Cardiomyopathy is a complex type of myocardial disease, and its incidence has increased significantly in recent years. Dilated cardiomyopathy (DCM) and ischemic cardiomyopathy (ICM) are two common and indistinguishable types of cardiomyopathy. Results: Here, a systematic multi-omics integration approach was proposed to identify cardiomyopathy-related core genes that could distinguish normal, DCM and ICM samples using cardiomyopathy expression profile data based on a human metabolic network. First, according to the differentially expressed genes between different states (DCM/ICM and normal, or DCM and ICM) of samples, three sets of initial modules were obtained from the human metabolic network. Two permutation tests were used to evaluate the significance of the Pearson correlation coefficient difference score of the initial modules, and three candidate modules were screened out. Then, a cardiomyopathy risk module that was significantly related to DCM and ICM was determined according to the significance of the module score based on Markov random field. Finally, based on the shortest path between cardiomyopathy known genes, 13 core genes related to cardiomyopathy were identified. These core genes were enriched in pathways and functions significantly related to cardiomyopathy and could distinguish between samples of different states. Conclusion: The identified core genes might serve as potential biomarkers of cardiomyopathy. This research will contribute to identifying potential biomarkers of cardiomyopathy and to distinguishing different types of cardiomyopathy. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
6. Associating expression and genomic data using co-occurrence measures
- Author
-
Maarten Larmuseau, Lieven P. C. Verbeke, and Kathleen Marchal
- Subjects
Expression data ,Co-expression ,Data integration ,Breast cancer ,Biology (General) ,QH301-705.5 - Abstract
Abstract Recent technological evolutions have led to an exponential increase in data in all the omics fields. It is expected that integration of these different data sources, will drastically enhance our knowledge of the biological mechanisms behind genomic diseases such as cancer. However, the integration of different omics data still remains a challenge. In this work we propose an intuitive workflow for the integrative analysis of expression, mutation and copy number data taken from the METABRIC study on breast cancer. First, we present evidence that the expression profile of many important breast cancer genes consists of two modes or ‘regimes’, which contain important clinical information. Then, we show how the co-occurrence of these expression regimes can be used as an association measure between genes and validate our findings on the TCGA-BRCA study. Finally, we demonstrate how these co-occurrence measures can also be applied to link expression regimes to genomic aberrations, providing a more complete, integrative view on breast cancer. As a case study, an integrative analysis of the identified MLPH-FOXA1 association is performed, illustrating that the obtained expression associations are intimately linked to the underlying genomic changes. Reviewers This article was reviewed by Dirk Walther, Francisco Garcia and Isabel Nepomuceno.
- Published
- 2019
- Full Text
- View/download PDF
7. Independent Component Analysis to Remove Batch Effects from Merged Microarray Datasets
- Author
-
Renard, Emilie, Branders, Samuel, Absil, P.-A., Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Frith, Martin, editor, and Storm Pedersen, Christian Nørgaard, editor
- Published
- 2016
- Full Text
- View/download PDF
8. Expression of the ZIP/SLC39A transporters in β-cells: a systematic review and integration of multiple datasets
- Author
-
Rebecca Lawson, Wolfgang Maret, and Christer Hogstrand
- Subjects
Type 2 diabetes ,Zinc ,ZIP ,SLC39A ,Systematic review ,Expression data ,Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Pancreatic β-cells require a constant supply of zinc to maintain normal insulin secretory function. Following co-exocytosis with insulin, zinc is replenished via the Zrt- and Irt-like (ZIP; SLC39A) family of transporters. However the ZIP paralogues of particular importance for zinc uptake, and associations with β-cell function and Type 2 Diabetes remain largely unexplored. We retrieved and statistically analysed publically available microarray and RNA-seq datasets to perform a systematic review on the expression of β-cell SLC39A paralogues. We complemented results with experimental data on expression profiling of human islets and mouse β-cell derived MIN6 cells, and compared transcriptomic and proteomic sequence conservation between human, mouse and rat. Results The 14 ZIP paralogues have 73–98% amino sequence conservation between human and rodents. We identified 18 datasets for β-cell SLC39A analysis, which compared relative expression to non-β-cells, and expression in response to PDX-1 activity, cytokines, glucose and type 2 diabetic status. Published expression data demonstrate enrichment of transcripts for ZIP7 and ZIP9 transporters within rodent β-cells and of ZIP6, ZIP7 and ZIP14 within human β-cells, with ZIP1 most differentially expressed in response to cytokines and PDX-1 within rodent, and ZIP6 in response to diabetic status in human and glucose in rat. Our qPCR expression profiling data indicate that SLC39A6, −9, −13, and − 14 are the highest expressed paralogues in human β-cells and Slc39a6 and −7 in MIN6 cells. Conclusions Our systematic review, expression profiling and sequence alignment reveal similarities and potentially important differences in ZIP complements between human and rodent β-cells. We identify ZIP6, ZIP7, ZIP9, ZIP13 and ZIP14 in human and rodent and ZIP1 in rodent as potentially biologically important for β-cell zinc trafficking. We propose ZIP6 and ZIP7 are key functional orthologues in human and rodent β-cells and highlight these zinc importers as important targets for exploring associations between zinc status and normal physiology of β-cells and their decline in Type 2 Diabetes.
- Published
- 2017
- Full Text
- View/download PDF
9. CombiFlow
- Author
-
Jan Jacob Schuringa, Bauke de Boer, Andre B. Mulder, Nisha van der Meer, Shanna M. Hogeling, Roos Houtsma, Marije T. Nijk, Linde M. Morsink, Kees Meijer, Gerwin Huls, Carolien M. Woolthuis, Emanuele Ammatuna, Stem Cell Aging Leukemia and Lymphoma (SALL), and Guided Treatment in Optimal Selected Cancer Patients (GUTS)
- Subjects
Cell ,Interleukin-3 Receptor alpha Subunit ,MINIMAL RESIDUAL DISEASE ,Disease ,Computational biology ,ACUTE MYELOID-LEUKEMIA ,Biology ,RELAPSE ,Somatic evolution in cancer ,Clonal Evolution ,Track disease ,hemic and lymphatic diseases ,medicine ,Humans ,In patient ,TRANSCRIPTOMICS ,HETEROGENEITY ,RISK ,CELL TRANSPLANTATION ,Cell Membrane ,Myeloid leukemia ,Hematology ,Clone Cells ,Leukemia, Myeloid, Acute ,medicine.anatomical_structure ,Expression data ,PROTEOMICS ,Interleukin-3 receptor - Abstract
Acute myeloid leukemia (AML) often presents as an oligoclonal disease whereby multiple genetically distinct subclones can coexist within patients. Differences in signaling and drug sensitivity of such subclones complicate treatment and warrant tools to identify them and track disease progression. We previously identified >50 AML-specific plasma membrane (PM) proteins, and 7 of these (CD82, CD97, FLT3, IL1RAP, TIM3, CD25, and CD123) were implemented in routine diagnostics in patients with AML (n = 256) and myelodysplastic syndrome (n = 33). We developed a pipeline termed CombiFlow in which expression data of multiple PM markers is merged, allowing a principal component–based analysis to identify distinctive marker expression profiles and to generate single-cell t-distributed stochastic neighbor embedding landscapes to longitudinally track clonal evolution. Positivity for one or more of the markers after 2 courses of intensive chemotherapy predicted a shorter relapse-free survival, supporting a role for these markers in measurable residual disease (MRD) detection. CombiFlow also allowed the tracking of clonal evolution in paired diagnosis and relapse samples. Extending the panel to 36 AML-specific markers further refined the CombiFlow pipeline. In conclusion, CombiFlow provides a valuable tool in the diagnosis, MRD detection, clonal tracking, and understanding of clonal heterogeneity in AML.
- Published
- 2022
- Full Text
- View/download PDF
10. An Efficient Algorithm for Microarray Probes Re-annotation
- Author
-
Foszner, Pawel, Gruca, Aleksandra, Polanski, Andrzej, Marczyk, Michal, Jaksik, Roman, Polanska, Joanna, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Nguyen, Ngoc-Thanh, editor, and Le-Thi, Hoai An, editor
- Published
- 2014
- Full Text
- View/download PDF
11. Nonparametric Bayesian Two-Level Clustering for Subject-Level Single-Cell Expression Data
- Author
-
Xiangyu Luo and Qiuyu Wu
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,business.industry ,Computer science ,Inference ,Subject (documents) ,Pattern recognition ,Statistics - Applications ,Methodology (stat.ME) ,symbols.namesake ,Expression data ,Cell clustering ,symbols ,Applications (stat.AP) ,Nonparametric bayesian ,Artificial intelligence ,Statistics, Probability and Uncertainty ,business ,Cluster analysis ,Statistics - Methodology ,Gibbs sampling ,Count data - Abstract
The advent of single-cell sequencing opens new avenues for personalized treatment. In this paper, we address a two-level clustering problem of simultaneous subject subgroup discovery (subject level) and cell type detection (cell level) for single-cell expression data from multiple subjects. However, current statistical approaches either cluster cells without considering the subject heterogeneity or group subjects without using the single-cell information. To bridge the gap between cell clustering and subject grouping, we develop a nonparametric Bayesian model, Subject and Cell clustering for Single-Cell expression data (SCSC) model, to achieve subject and cell grouping simultaneously. SCSC does not need to prespecify the subject subgroup number or the cell type number. It automatically induces subject subgroup structures and matches cell types across subjects. Moreover, it directly models the single-cell raw count data by deliberately considering the data's dropouts, library sizes, and over-dispersion. A blocked Gibbs sampler is proposed for the posterior inference. Simulation studies and the application to a multi-subject iPSC scRNA-seq dataset validate the ability of SCSC to simultaneously cluster subjects and cells.
- Published
- 2023
- Full Text
- View/download PDF
12. Efficient Algorithm for Microarray Probes Re-annotation
- Author
-
Foszner, Pawel, Gruca, Aleksandra, Polanski, Andrzej, Marczyk, Michal, Jaksik, Roman, Polanska, Joanna, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, Jędrzejowicz, Piotr, editor, Nguyen, Ngoc Thanh, editor, and Hoang, Kiem, editor
- Published
- 2011
- Full Text
- View/download PDF
13. Normalization of Biological Expression Data Based on Selection of a Stable Element Set
- Author
-
Bouki, Yoshihiko, Yoshihiro, Takuya, Inoue, Etsuko, Nakagawa, Masaru, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, König, Andreas, editor, Dengel, Andreas, editor, Hinkelmann, Knut, editor, Kise, Koichi, editor, Howlett, Robert J., editor, and Jain, Lakhmi C., editor
- Published
- 2011
- Full Text
- View/download PDF
14. Prediction of Combinatorial Protein-Protein Interaction Networks from Expression Data Using Statistics on Conditional Probability
- Author
-
Fujiki, Takatoshi, Inoue, Etsuko, Yoshihiro, Takuya, Nakagawa, Masaru, Hutchison, David, Kanade, Takeo, Kittler, Josef, Kleinberg, Jon M., Mattern, Friedemann, Mitchell, John C., Naor, Moni, Nierstrasz, Oscar, Pandu Rangan, C., Steffen, Bernhard, Sudan, Madhu, Terzopoulos, Demetri, Tygar, Doug, Vardi, Moshe Y., Weikum, Gerhard, Goebel, Randy, Siekmann, Jörg, Wahlster, Wolfgang, Setchi, Rossitza, editor, Jordanov, Ivan, editor, Howlett, Robert J., editor, and Jain, Lakhmi C., editor
- Published
- 2010
- Full Text
- View/download PDF
15. DPEBic: detecting essential proteins in gene expressions using encoding and biclustering algorithm
- Author
-
Ali, Anooja, Hulipalled, Vishwanath R., Patil, S. S., and Abdulkader, Raees
- Published
- 2021
- Full Text
- View/download PDF
16. Comparative profiling of immune genes improves the prognoses of lower grade gliomas
- Author
-
Guanzhang Li, Anhua Wu, Chuanbao Zhang, Zheng Zhao, Wen Cheng, Zheng Wang, Zhiliang Wang, and Tao Jiang
- Subjects
Oncology ,Cancer Research ,medicine.medical_specialty ,Lower grade ,business.industry ,Microarray analysis techniques ,External validation ,Nomogram ,Risk groups ,Expression data ,Internal medicine ,Medicine ,business ,Immune gene ,Median survival - Abstract
Objective: Lower grade gliomas (LGGs), classified as World Health Organization (WHO) grade II and grade III gliomas, comprise a heterogeneous group with a median survival time ranging from 4–13 years. Accurate prediction of the survival times of LGGs remains a major challenge in clinical practice. Methods: We reviewed the expression data of 865 LGG patients from 5 transcriptomics cohorts. The comparative profile of immune genes was analyzed for signature identification and validation. In-house RNAseq and microarray data from the Chinese Glioma Genome Atlas (CGGA) dataset were used as training and internal validation cohorts, respectively. The samples from The Cancer Genome Atlas (TCGA) and GSE16011 cohorts were used as external validation cohorts, and the real-time PCR of frozen LGG tissue samples (n = 36) were used for clinical validation. Results: A total of 2,214 immune genes were subjected to pairwise comparison to generate 2,449,791 immune-related gene pairs (IGPs). A total of 402 IGPs were identified with prognostic values for LGGs. The HOXA9-related and CRH-related scores facilitated identification of patients with different prognoses. An immune signature based on 10 IGPs was constructed to stratify patients into low and high risk groups, exhibiting different clinical outcomes. A nomogram, combining immune signature, 1p/19q status, and tumor grade, was able to predict the overall survival (OS) with c-indices of 0.85, 0.80, 0.80, 0.79, and 0.75 in the training, internal validation, external validation, and tissue sample cohorts, respectively. Conclusions: This study was the first to report a comparative profiling of immune genes in large LGG cohorts. A promising individualized immune signature was developed to estimate the survival time for LGG patients.
- Published
- 2021
- Full Text
- View/download PDF
17. Identifying Coexpressed Genes
- Author
-
Wang, Qihua, Härdle, Wolfgang, Mori, Yuichi, and Vieu, Philippe
- Published
- 2007
- Full Text
- View/download PDF
18. Estimating Gene Networks from Expression Data and Binding Location Data via Boolean Networks
- Author
-
Hirose, Osamu, Nariai, Naoki, Tamada, Yoshinori, Bannai, Hideo, Imoto, Seiya, Miyano, Satoru, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Gervasi, Osvaldo, editor, Gavrilova, Marina L., editor, Kumar, Vipin, editor, Laganà, Antonio, editor, Lee, Heow Pueh, editor, Mun, Youngsong, editor, Taniar, David, editor, and Tan, Chih Jeng Kenneth, editor
- Published
- 2005
- Full Text
- View/download PDF
19. A Risk Signature Consisting of Eight m6A Methylation Regulators Predicts the Prognosis of Glioma
- Author
-
Yanna Su, Sizhong Guan, Ye He, and Liping Zhou
- Subjects
Context (language use) ,Cell Biology ,General Medicine ,Methylation ,Biology ,medicine.disease ,Genome ,nervous system diseases ,Cellular and Molecular Neuroscience ,chemistry.chemical_compound ,chemistry ,Expression data ,Glioma ,Cancer research ,medicine ,Epigenetics ,N6-Methyladenosine ,neoplasms ,Gene - Abstract
Glioma progression seriously correlates to the epigenetic context. This study aims to identify glioma subtypes by clustering analysis of patients using the multi-omics data of N6-methyladenosine (m6A) methylation regulators and to construct a risk signature for investigating the role of m6A methylation regulators in the prognosis of glioma. Multi-omics data of glioma and normal control tissues were obtained through The Cancer Genome Atlas (TCGA) database. The clustering analysis of multi-omics data of patients was conducted using the R package iClusterPlus software. The risk model was constructed by univariate and multivariate Cox analysis, and the glioma expression data and related clinical data were obtained by Chinese Glioma Genome Atlas (CGGA) datasets to verify the risk model. By analyzing the glioma data in TCGA, we found that the risk signature could be constructed according to the eight genes with m6A methylation modification function, including ALKBH5, HNRNPA2B1, IGF2BP2, IGF2BP3, RBM15, WTAP, YTHDF1, and YTHDF2. Meanwhile, we found that IGF2BP2 and IGF2BP3 were highly expressed in glioma subtypes with high-risk scores and closely related to the prognosis of glioma patients. m6A methylation regulators, especially IGF2BP2 and IGF2BP3, play important roles in the malignant progression of glioma. The risk signature constructed by eight m6A methylation regulators can predict the prognosis of glioma. IGF2BP2 and IGF2BP3 may be the key regulatory factors of m6A methylation regulators involved in the occurrence and development of glioma, and can serve as molecular markers for the prognosis of glioma.
- Published
- 2021
- Full Text
- View/download PDF
20. Inferring single cell expression profiles from overlapped pooling sequencing data with compressed sensing strategy
- Author
-
Weiqiang Xu, Xiao Sun, Zuhong Lu, Yixuan Yang, Jing Tu, Mengting Huang, Xingzhao Wen, and Na Lu
- Subjects
AcademicSubjects/SCI00010 ,Computation ,Pooling ,Sequencing data ,Biology ,03 medical and health sciences ,0302 clinical medicine ,Dimension (vector space) ,Databases, Genetic ,Genetics ,Animals ,Humans ,Computer Simulation ,Sensitivity (control systems) ,030304 developmental biology ,Narese/7 ,Gene Library ,0303 health sciences ,business.industry ,Sequence Analysis, RNA ,Gene Expression Profiling ,Gene regulation, Chromatin and Epigenetics ,Reproducibility of Results ,Pattern recognition ,Models, Theoretical ,Expression (mathematics) ,Compressed sensing ,Narese/24 ,Expression data ,Artificial intelligence ,Single-Cell Analysis ,business ,030217 neurology & neurosurgery ,Algorithms - Abstract
Though single cell RNA sequencing (scRNA-seq) technologies have been well developed, the acquisition of large-scale single cell expression data may still lead to high costs. Single cell expression profile has its inherent sparse properties, which makes it compressible, thus providing opportunities for solutions. Here, by computational simulation as well as experiment of 54 single cells, we propose that expression profiles can be compressed from the dimension of samples by overlapped assigning each cell into plenty of pools. And we prove that expression profiles can be inferred from these pool expression data with overlapped pooling design and compressed sensing strategy. We also show that by combining this approach with plate-based scRNA-seq measurement, it can maintain its superiorities in gene detection sensitivity and individual identity and recover the expression profile with high precision, while saving about half of the library cost. This method can inspire novel conceptions on the measurement, storage or computation improvements for other compressible signals in many biological areas.
- Published
- 2021
21. MMP9 Geninin Aort Diseksiyonundaki Olası Etkilerinin Araştırılması
- Author
-
Burcu Salman Yaylaz, Melda Seriman, Ahmet Ekmekçi, Emel Ergül, Mahmut Uluganyan, Fulya Coşan, Özgün Melike Gedar Totuk, Neslihan Abacı, ULUGANYAN, Mahmut, ISU, Rektörlük, Uygulama ve Araştırma Merkezleri, Moleküler Kanser Uygulama ve Araştırma Merkezi, and Sariman, Melda
- Subjects
Gynecology ,medicine.medical_specialty ,MMP9 ,business.industry ,P574R ,Gene ,Uluganyan M., -A Brief Reconnoitre about Effects of MMP9 on Aortic Dissection-, EXPERIMED, cilt.11, sa.1, ss.12-20, 2021 ,Tıp ,body regions ,Aortic Dissection ,Aortic Dissection,MMP9,Q279R,P574R,Polymorphism,Gene Expression Data ,Expression data ,Aort diseksiyon,MMP9,Q279R,P574R,gen ifade verisi ,Medicine ,Polymorphism ,business ,Q279R ,Expression Data - Abstract
Amaç: Kardiyovasküler hastalıklar ve kanser metastazında sıklıkla araştı-rılan matriks metalloproteinazlar (MMPs) ekstraselüler matriks düzenle-yicileridir. Çalışmamız, aort diseksiyonu olan hastalarda MMP9 genindeki spesifik polimorfizmleri incelemeyi ve MMP9'un aort diseksiyonu üzerin-deki etkisini ekspresyon veri setleri ile karşılaştırmayı amaçlamaktadır.Gereç ve Yöntem: Q279R ve P574R polimorfizmleri 44 aort diseksiyon tanısı almış ve 40 sağlıklı bireyde polimeraz zincir reaksiyonu - restrik-siyon parça uzunluğu polimorfizmi (PCR-RFLP) yöntemiyle çalışıldı. Q279R ve P574R prevalansı istatistiksel olarak hastaların tıbbi verileriyle karşılaştırıldı. Buna ek olarak, NCBI GEO veri tabanından aort diseksiyon veri setleri toplandı ve MMP9 ifadesindeki farklılıkları görmek amacıyla bu veri setleri GEO2R ve RStudio ile yeniden analiz edildi. Elde edilen sonuçların informatik analizi için çevrimiçi veri tabanları kullanıldı.Bulgular: CG alleli taşıyıcısı kadınların aort diseksiyonu geliştirme riski erkeklerden daha yüksek bulunmasına rağmen her iki çalışma grubun-da da allellerin genotipik dağılımı benzer bulunmuştur. MMP9'un pro-tein-protein etkileşim analizinin ve hastaların tıbbi verilerinin incelen-mesinin sonucu olarak, P574R hipertansiyonu olan hastalarda önemli bir bulgu olarak değerlendirilmiştir. Array verisi analizinde ise MMP9 ifa-desinde kritik bir değişim gözlemlenmemiş olup, birçok örnekte TIMPifade seviyelerinde azalma tespit edilmiştir. Ayrıca MMP9’u hedefleyen miRNA ekspresyon seviyelerinin aort dokusu ve kanda düşük olduğu saptanmıştır.Sonuç: Q279R ve P574R, MMP9 protein yapısını doğrudan etkilemeyen iki polimorfizmdir. İncelenen polimorfizmler ve gerçekleştirilen meta-a-nalizler, MMP9'un fenotipi doğrudan etkilemediği, ancak istatistiksel sonuçlarda görüldüğü gibi aort diseksiyonu gelişimi için zemin hazır-ladığını göstermektedir., Objective: Matrix metalloproteinases (MMPs) are the extracellular ma-trix regulators that frequently investigate cardiovascular diseases and cancer metastasis. Our study aimed to examine specific polymorphisms in the MMP9 gene in our patients with aortic dissection and compare the effect of MMP9 on aortic dissection with expression datasets.Materials and Methods: Q279R and P574R polymorphisms were analyzed in 44 aortic dissection patients and 40 healthy donors via polymerase chain reaction-restriction fragment length polymorphism. (PCR-RFLP) methods. Q279R and P574R prevalence was statistically compared with the medical data of the patients. Additionally, we col-lected datasets of aortic dissection from NCBI GEO to reanalyze GEO2R and RStudio to see metalloproteinase activity on samples. Later, enrich-ment analysis was processed on widely used databases.Results: Genotypic distribution of alleles was similar in the two study groups. In addition to this, female CG carriers had a higher risk of de-veloping aortic dissection than those of males. As the results of the protein-protein interaction analysis of MMP9 and patients’ clinical data, hypertension was found to be the significant outcome of P574R varia-tion in the patients. In array analysis, MMP9 expression did not change critically, but TIMPs had been downregulated in many samples. Also, MMP9 targeted miRNA expression levels were detected as low in aortic tissue and blood.Conclusion: Q279R and P574R are two polymorphisms that do not di-rectly affect MMP9 protein structure. Consequently, studied polymor-phisms and performed meta-analysis show that MMP9 does not spark off the phenotype but sets the stage for aortic dissection development as seen in the statistical results. Furthermore, enrichment analysis on datasets shows MMP9 was not a primary reason for vascular remodeling.
- Published
- 2021
- Full Text
- View/download PDF
22. miRNA and mRNA expression profiling reveals potential biomarkers for metastatic cutaneous melanoma
- Author
-
Queqiao Bian, Yiye Tao, and Jun Wang
- Subjects
0301 basic medicine ,Skin Neoplasms ,Metastatic Cutaneous Melanoma ,Mrna expression ,03 medical and health sciences ,0302 clinical medicine ,microRNA ,Biomarkers, Tumor ,Humans ,Medicine ,Pharmacology (medical) ,RNA, Messenger ,Melanoma ,Messenger RNA ,business.industry ,Gene Expression Profiling ,Computational Biology ,Gene Expression Regulation, Neoplastic ,MicroRNAs ,030104 developmental biology ,Oncology ,Expression data ,030220 oncology & carcinogenesis ,Potential biomarkers ,Cutaneous melanoma ,Cancer research ,business ,Biomarkers - Abstract
Purpose: This study aims to uncover potential biomarkers associated with cutaneous melanoma (CM) metastasis.Methods: The mRNA and microRNA (miRNA) expression data from the metastatic CM and non-met...
- Published
- 2021
- Full Text
- View/download PDF
23. Subtypes identification on heart failure with preserved ejection fraction via network enhancement fusion using multi-omics data
- Author
-
Huihui Wang, Jinfang Cheng, Yuehua Cui, Yongqing Wu, Zhi Li, Hongyan Cao, and Ruiling Fang
- Subjects
Biophysics ,Bioinformatics ,Biochemistry ,ne-SNF ,03 medical and health sciences ,0302 clinical medicine ,Structural Biology ,microRNA ,Clinical heterogeneity ,Genetics ,Medicine ,030304 developmental biology ,ComputingMethodologies_COMPUTERGRAPHICS ,Subtypes identification ,0303 health sciences ,business.industry ,Mortality rate ,HFpEF ,Computer Science Applications ,Multi-omics data integration ,Expression data ,030220 oncology & carcinogenesis ,DNA methylation ,Multi omics ,business ,Heart failure with preserved ejection fraction ,TP248.13-248.65 ,Biomarkers ,Biotechnology ,Research Article - Abstract
Graphical abstract, Heart failure with preserved ejection fraction (HFpEF) is associated with multiple etiologic and pathophysiologic factors. HFpEF leads to significant cardiovascular morbidity and mortality. There are various reasons that fail to identify effective therapeutic interventions for HFpEF, primarily due to its clinical heterogeneity causing significant difficulties in determining physiologic and prognostic implications for this syndrome. Thus, identifying clinical subtypes using multi-omics data has great implications for efficient treatment and prognosis of HFpEF patients. Here we proposed to integrate mRNA, DNA methylation and microRNA (miRNA) expression data of HFpEF with a similarity network fusion (SNF) method following a network enhancement (ne-SNF) denoising technique to form a fused network. A spectral clustering method was then used to obtain clusters of patient subtypes. Experiments on HFpEF datasets demonstrated that ne-SNF significantly outperforms single data subtype analysis and other integrated methods. The identified subgroups were shown to have statistically significant differences in survival. Two HFpEF subtypes were defined: a high-risk group (16.8%) and a low-risk group (83.2%). The 5-year mortality rates were 63.3% and 33.0% for the high- and low-risk group, respectively. After adjusting for the effects of clinical covariates, HFpEF patients in the high-risk group were 2.43 times more likely to die than the low-risk group. A total of 157 differentially expressed (DE) mRNAs, 2199 abnormal methylations and 121 DE miRNAs were identified between two subtypes. They were also enriched in many HFpEF-related biological processes or pathways. The ne-SNF method provides a novel pipeline for subtype identification in integrated analysis of multi-omics data.
- Published
- 2021
24. ModularBoost: an efficient network inference algorithm based on module decomposition
- Author
-
Xinyu Li, Wei Zhang, Guang Li, and Jianming Zhang
- Subjects
Gene module Decomposition ,Computer science ,0206 medical engineering ,Gene regulatory network ,Inference ,02 engineering and technology ,lcsh:Computer applications to medicine. Medical informatics ,Biochemistry ,GRNBoost2 ,Task (project management) ,03 medical and health sciences ,Structural Biology ,Gene Modules ,Decomposition (computer science) ,Gene Regulatory Networks ,Linear regression ,Molecular Biology ,lcsh:QH301-705.5 ,030304 developmental biology ,0303 health sciences ,Applied Mathematics ,Computational Biology ,Expression (mathematics) ,Computer Science Applications ,lcsh:Biology (General) ,Expression data ,lcsh:R858-859.7 ,Regulatory network inference ,Algorithm ,020602 bioinformatics ,Algorithms ,Research Article - Abstract
Background Given expression data, gene regulatory network(GRN) inference approaches try to determine regulatory relations. However, current inference methods ignore the inherent topological characters of GRN to some extent, leading to structures that lack clear biological explanation. To increase the biophysical meanings of inferred networks, this study performed data-driven module detection before network inference. Gene modules were identified by decomposition-based methods. Results ICA-decomposition based module detection methods have been used to detect functional modules directly from transcriptomic data. Experiments about time-series expression, curated and scRNA-seq datasets suggested that the advantages of the proposed ModularBoost method over established methods, especially in the efficiency and accuracy. For scRNA-seq datasets, the ModularBoost method outperformed other candidate inference algorithms. Conclusions As a complicated task, GRN inference can be decomposed into several tasks of reduced complexity. Using identified gene modules as topological constraints, the initial inference problem can be accomplished by inferring intra-modular and inter-modular interactions respectively. Experimental outcomes suggest that the proposed ModularBoost method can improve the accuracy and efficiency of inference algorithms by introducing topological constraints.
- Published
- 2021
25. Integrating expression data and genomic sequences to investigate the transcriptional regulation in barley in response to abiotic stress
- Author
-
Azar Delavari, Zahra Zinati, Ahmad Tahmasebi, and Sima Sazegari
- Subjects
Genetics ,motif discovery ,abiotic stress ,Abiotic stress ,food and beverages ,barley ,transcriptomics analysis ,Plant Science ,Biology ,Expression data ,Transcriptional regulation ,regulatory gene set ,motif enrichment ,TP248.13-248.65 ,Biotechnology - Abstract
Abiotic stress responses are regulated critically at the transcriptional level. Clarifying the intricate mechanisms that regulate gene expression in response to abiotic stress is crucial and challenging. For this purpose, the factors that regulate gene expression and their binding sites in DNA should be determined. By using bioinformatics tools, the differentially expressed probe sets were studied. A meta-analysis of transcriptomic responses to several abio¬tic stresses in barley was performed. Motif enrichments revealed that AP2/ERF (APETALA2/Ethylene-Res¬pon¬sive Factor) has the most frequent binding sites. We found that the bHLH transcription factor family has the high¬est number of transcription factor members. Moreover, network construction revealed that AP2 has the highest number of connections with other genes, which indicates its critical role in abiotic stress responses. The present research further predicted 49 miRNAs belonging to 23 miRNA families. This study identified the probable conserved and enriched motifs, which might have a role in the regulation of differentially expressed genes under abiotic stresses. In addition to shedding light on gene expression regulation, a toolbox of available promoters for genetic engineering of crop plants under such abiotic stresses was developed.
- Published
- 2021
26. Network‐based method for detecting dysregulated pathways in glioblastoma cancer.
- Author
-
Wu, Hao, Dong, Jihua, and Wei, Jicheng
- Abstract
The knowledge on the biological molecular mechanisms underlying cancer is important for the precise diagnosis and treatment of cancer patients. Detecting dysregulated pathways in cancer can provide insights into the mechanism of cancer and help to detect novel drug targets. Based on the wide existing mutual exclusivity among mutated genes and the interrelationship between gene mutations and expression changes, this study presents a network‐based method to detect the dysregulated pathways from gene mutations and expression data of the glioblastoma cancer. First, the authors construct a gene network based on mutual exclusivity between each pair of genes and the interaction between gene mutations and expression changes. Then they detect all complete subgraphs using CFinder clustering algorithm in the constructed gene network. Next, the two gene sets whose overlapping scores are above a specific threshold are merged. Finally, they obtain two dysregulated pathways in which there are glioblastoma‐related multiple genes which are closely related to the two subtypes of glioblastoma. The results show that one dysregulated pathway revolving around epidermal growth factor receptor is likely to be associated with the primary subtype of glioblastoma, and the other dysregulated pathway revolving around TP53 is likely to be associated with the secondary subtype of glioblastoma. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
27. Reframed Genome-Scale Metabolic Model to Facilitate Genetic Design and Integration with Expression Data.
- Author
-
Gu, Deqing, Jian, Xingxing, Zhang, Cheng, and Hua, Qiang
- Abstract
Genome-scale metabolic network models (GEMs) have played important roles in the design of genetically engineered strains and helped biologists to decipher metabolism. However, due to the complex gene-reaction relationships that exist in model systems, most algorithms have limited capabilities with respect to directly predicting accurate genetic design for metabolic engineering. In particular, methods that predict reaction knockout strategies leading to overproduction are often impractical in terms of gene manipulations. Recently, we proposed a method named logical transformation of model (LTM) to simplify the gene-reaction associations by introducing intermediate pseudo reactions, which makes it possible to generate genetic design. Here, we propose an alternative method to relieve researchers from deciphering complex gene-reactions by adding pseudo gene controlling reactions. In comparison to LTM, this new method introduces fewer pseudo reactions and generates a much smaller model system named as gModel. We showed that gModel allows two seldom reported applications: identification of minimal genomes and design of minimal cell factories within a modified OptKnock framework. In addition, gModel could be used to integrate expression data directly and improve the performance of the E-Fmin method for predicting fluxes. In conclusion, the model transformation procedure will facilitate genetic research based on GEMs, extending their applications. [ABSTRACT FROM PUBLISHER]
- Published
- 2017
- Full Text
- View/download PDF
28. Identification of key genes and pathways in Parkinson's disease through integrated analysis.
- Author
-
JINGRU WANG, YINING LIU, and TUANZHI CHEN
- Subjects
- *
GENES , *PARKINSON'S disease , *NEURODEGENERATION , *SYNUCLEINS , *EXTRAPYRAMIDAL disorders - Abstract
Parkinson's disease (PD) is a progressive, degenerative neurological disease, typically characterized by tremors and muscle rigidity. The present study aimed to identify differentially expressed genes (DEGs) between patients with PD and healthy patients, and clarify their association with additional biological processes that may regulate factors that lead to PD. An integrated analysis of publicly available Gene Expression Omnibus datasets of PD was performed. DEGs were identified between PD and normal blood samples. Gene Ontology enrichment and Kyoto Encyclopedia of Genes and Genomes pathway analyses, as well as protein‑protein interaction (PPI) networks were used to predict the functions of identified DEGs. Reverse transcription‑quantitative polymerase chain reaction (RT‑qPCR) was performed to validate the predicted expression levels of identified DEGs in whole blood samples obtained from patients with PD and normal healthy controls. A total of 292DEGs were identified between the PD and normal blood samples. Of these, 156 genes were significantly upregulated and 136 genes were significantly downregulated in PD samples following integrated analysis of four PD expression datasets. The 10 most upregulated and downregulated genes were used to construct a PPI network, where ubiquitin C‑terminal hydrolase L1 (UCHL1), 3‑phosphoinositide dependent protein kinase 1 (PDPK1) and protein kinase cAMP‑activated catalytic subunit β (PRKACB) demonstrated the highest connectivity in the network. DEGs were significantly enriched in amoebiasis, vascular smooth muscle contraction, and the Wnt and calcium signaling pathways. The expression levels of significant DEGs, UCHL1, PDPK1 and PRKACB were validated using RT‑qPCR analysis. The findings revealed that UCHL1 and PDPK1 were upregulated and PRKACB was downregulated in patients with PD when compared with normal healthy controls. In conclusion, the results indicate that the significant DEGs, including UCHL1, PDPK1 and PRKACB may be associated with the development of PD. In addition, these factors may be involved in various signaling pathways, including amoebiasis, vascular smooth muscle contraction and the Wnt and calcium signaling pathways. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
29. The Influence of Polyploidy on the Evolution of Yeast Grown in a Sub-Optimal Carbon Source.
- Author
-
Scott, Amber L., Richmond, Phillip A., Dowell, Robin D., and Selmecki, Anna M.
- Abstract
Polyploidization events have occurred during the evolution of many fungi, plant, and animal species and are thought to contribute to speciation and tumorigenesis, however little is known about how ploidy level contributes to adaptation at the molecular level. Here we integrate whole genome sequencing, RNA expression analysis, and relative fitness of ~100 evolved clones at three ploidy levels. Independent haploid, diploid, and tetraploid populations were grown in a low carbon environment for 250 generations. We demonstrate that the key adaptive mutation in the evolved clones is predicted by a gene expression signature of just five genes. All of the adaptive mutations identified encompass a narrow set of genes, however the tetraploid clones gain a broader spectrumof adaptivemutations than haploid or diploid clones. While many of the adaptive mutations occur in genes that encode proteins with known roles in glucose sensing and transport, we discover mutations in genes with no canonical role in carbon utilization (IPT1 and MOT3), as well as identify novel dominant mutations in glucose signal transducers thought to only accumulate recessive mutations in carbon limited environments (MTH1 and RGT1). We conclude that polyploid cells explore more genotypic and phenotypic space than lower ploidy cells. Our study provides strong evidence for the beneficial role of polyploidization events that occur during the evolution of many species and during tumorigenesis. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
30. Expression of the ZIP/SLC39A transporters in ß-cells: a systematic review and integration of multiple datasets.
- Author
-
Lawson, Rebecca, Maret, Wolfgang, and Hogstrand, Christer
- Subjects
TYPE 2 diabetes ,ZINC transporters ,PANCREATIC beta cells ,RNA sequencing ,GENE expression ,CYTOKINES ,SYSTEMATIC reviews - Abstract
Background: Pancreatic ß-cells require a constant supply of zinc to maintain normal insulin secretory function. Following co-exocytosis with insulin, zinc is replenished via the Zrt- and Irt-like (ZIP; SLC39A) family of transporters. However the ZIP paralogues of particular importance for zinc uptake, and associations with ß-cell function and Type 2 Diabetes remain largely unexplored. We retrieved and statistically analysed publically available microarray and RNA-seq datasets to perform a systematic review on the expression of ß-cell SLC39A paralogues. We complemented results with experimental data on expression profiling of human islets and mouse ß-cell derived MIN6 cells, and compared transcriptomic and proteomic sequence conservation between human, mouse and rat. Results: The 14 ZIP paralogues have 73-98% amino sequence conservation between human and rodents. We identified 18 datasets for â-cell SLC39A analysis, which compared relative expression to non-â-cells, and expression in response to PDX-1 activity, cytokines, glucose and type 2 diabetic status. Published expression data demonstrate enrichment of transcripts for ZIP7 and ZIP9 transporters within rodent â-cells and of ZIP6, ZIP7 and ZIP14 within human â-cells, with ZIP1 most differentially expressed in response to cytokines and PDX-1 within rodent, and ZIP6 in response to diabetic status in human and glucose in rat. Our qPCR expression profiling data indicate that SLC39A6, .9, .13, and .14 are the highest expressed paralogues in human â-cells and Slc39a6 and .7 in MIN6 cells. Conclusions: Our systematic review, expression profiling and sequence alignment reveal similarities and potentially important differences in ZIP complements between human and rodent ß-cells. We identify ZIP6, ZIP7, ZIP9, ZIP13 and ZIP14 in human and rodent and ZIP1 in rodent as potentially biologically important for ß-cell zinc trafficking. We propose ZIP6 and ZIP7 are key functional orthologues in human and rodent ß-cells and highlight these zinc importers as important targets for exploring associations between zinc status and normal physiology of ß-cells and their decline in Type 2 Diabetes. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
31. Enhancing Research Through the Use of the Genotype-Tissue Expression (GTEx) Database
- Author
-
Ansley Grimes Stanfill and Xueyuan Cao
- Subjects
Genotype ,Research and Theory ,Database ,Computer science ,Counterintuitive ,Articles ,computer.software_genre ,030227 psychiatry ,03 medical and health sciences ,0302 clinical medicine ,Resource (project management) ,Data access ,Expression (architecture) ,Genotype-Tissue Expression ,Expression data ,quantitative trait loci ,Common fund ,gene expression ,Humans ,transcriptome ,computer ,030217 neurology & neurosurgery - Abstract
Despite a growing interest in multi-omic research, individual investigators may struggle to collect large-scale omic data, particularly from human subjects. Publicly available datasets can help to address this problem, including those sponsored by the NIH Common Fund, such as the Genotype-Tissue Expression (GTEx) database. This database contains genotype and expression data obtained from 54 non-diseased tissues in human subjects. But these data are often underutilized, because users may find the browsing tools to be counterintuitive or have difficulty navigating the procedures to request controlled data access. Furthermore, there is limited knowledge of these resources among nurse scientists interested in incorporating such information into their programs of research. This article outlines the procedures for using the GTEx database. Next, we provide one exemplar of using this resource to enhance existing research by investigating expression of dopamine receptor type 2 ( DRD2) across brain tissues in human subjects.
- Published
- 2021
- Full Text
- View/download PDF
32. Whole Genome microRNA Expression Data in Childhood Acute Lymphoblastic Leukemia and Evaluation of microRNA Pathways Using Fuzzy C-means
- Author
-
Bakiye Göker Bağca, Su Özgür, Özgür Çoğulu, Muhterem Duyu, and Mehmet Nurullah Orman
- Subjects
business.industry ,Expression data ,microRNA ,Medicine ,Computational biology ,business ,Genome ,Health informatics ,Childhood Acute Lymphoblastic Leukemia - Abstract
Objective: Hard clustering approaches may cause some of the relationships to be overlooked due to their nature of algorithms especially in genetic datasets. But hidden relationships can be revealed by fuzzy approaches. Purpose of this study was evaluating effect of microRNAs (miRNA) on children with acute lymphoblastic leukaemia (ALL) by using miRNA expression data obtained from bone marrow samples with sets containing different numbers of elements of fuzzy Cmeans (FCM). Material and Methods: miRNA expression levels of 43 newly diagnosed ALL patients and 14 healthy subjects were analysed via FCM. Clusters containing different numbers of miRNAs were evaluated, common properties in messenger RNA (mRNA) pathways were investigated and new pathways associated with ALL and cancer were described via miRNA target prediction tools. Results: Significant miRNA profile was compared to control cases. Only 46 out of 108 miRNAs were found to be significantly upregulated or downregulated. Of forty six miRNAs: 8 miRNAs were labelled as tumour suppressor (17.4%), 17 miRNAs were labelled as onco-miR (37.0%) and 21 miRNAs could not be labelled (45.6%) for hematological malignancy. Fourteen (%30.4) miRNAs were found to be apoptosis-related, 27 miRNAs were in leukemia-related (58.7%) and 15 labelled miRNAs were related with cancer pathways (32.6%). hsa-miR-181b, hsa-miR- 146a, hsa-miR-155, hsa-miR-181c-5p, hsa-miR-7-1-3p, hsa-miR-708- 5p onco-miRs constituted a set. These miRNAs targeted 801 common mRNAs (p
- Published
- 2021
- Full Text
- View/download PDF
33. Small-world networks of prognostic genes associated with lung adenocarcinoma development
- Author
-
Asim Bikas Das
- Subjects
Male ,0106 biological sciences ,Lung Neoplasms ,Transcription, Genetic ,Carcinogenesis ,Adenocarcinoma of Lung ,Computational biology ,Biology ,Malignancy ,Network topology ,01 natural sciences ,03 medical and health sciences ,Protein Interaction Mapping ,Genetics ,medicine ,Humans ,Gene Regulatory Networks ,RNA-Seq ,Gene ,Neoplasm Staging ,030304 developmental biology ,Sex Characteristics ,0303 health sciences ,Small-world network ,Lung ,Cancer ,Prognosis ,medicine.disease ,Gene Expression Regulation, Neoplastic ,medicine.anatomical_structure ,Expression data ,Adenocarcinoma ,Female ,010606 plant biology & botany - Abstract
The present study investigates the role of network topology in lung adenocarcinoma (LUAD) development. Analysis of sex- and stage-specific whole-genome expression data revealed that co-expressed and highly connected prognostic genes common to all cancer stages form a small-world network in each stage of LUAD. These small-world networks are present within stage-specific scale-free networks, conserved across the cancer stages, and linked to cancer-specific events. The presence of small-world networks across the cancer stages presents a synchronized system in the disordered environment of cancer, resulting in the evolution of malignancy. Our study reported that these small-world networks are resilient to random and systematic attacks, indicating the least opportunity to introduce perturbations by drugs as a therapeutic intervention. We concluded that highly clustered small-world networks could be controlled through transcriptional modulation for the improved treatment of LUAD.
- Published
- 2020
- Full Text
- View/download PDF
34. A Review of Drug Repositioning Based Chemical-induced Cell Line Expression Data
- Author
-
Fei Wang, Xiujuan Lei, and Fang-Xiang Wu
- Subjects
0301 basic medicine ,Pharmacology ,Databases, Factual ,Computer science ,Drug candidate ,Organic Chemistry ,Drug Repositioning ,Computational Biology ,Computational biology ,Biochemistry ,Cell Line ,03 medical and health sciences ,Drug repositioning ,030104 developmental biology ,0302 clinical medicine ,Expression data ,Cell culture ,Drug Discovery ,Molecular Medicine ,Transcriptome ,Gene ,030217 neurology & neurosurgery - Abstract
Drug repositioning is an important area of biomedical research. The drug repositioning studies have shifted to computational approaches. Large-scale perturbation databases, such as the Connectivity Map and the Library of Integrated Network-Based Cellular Signatures, contain a number of chemical-induced gene expression profiles and provide great opportunities for computational biology and drug repositioning. One reason is that the profiles provided by the Connectivity Map and the Library of Integrated Network-Based Cellular Signatures databases show an overall view of biological mechanism in drugs, diseases and genes. In this article, we provide a review of the two databases and their recent applications in drug repositioning.
- Published
- 2020
- Full Text
- View/download PDF
35. Linking metabolic phenotypes to pathogenic traits among 'Candidatus Liberibacter asiaticus' and its hosts
- Author
-
Georgios Vidalakis, Cristal Zuñiga, Alejandro Zepeda, James Borneman, Clarisse Marotz, Greg McCollum, Karsten Zengler, Diego Tec-Campos, Sonia Irigoyen, Nien-Chen Weng, Bo Liang, Beth B. Peacock, and Kranthi K. Mandadi
- Subjects
0106 biological sciences ,0301 basic medicine ,Citrus ,Diaphorina citri ,Microorganism ,01 natural sciences ,General Biochemistry, Genetics and Molecular Biology ,Article ,03 medical and health sciences ,Liberibacter ,Drug Discovery ,Genetics ,lcsh:QH301-705.5 ,Plant Diseases ,Comparative genomics ,Candidatus Liberibacter asiaticus ,biology ,Biochemical networks ,Applied Mathematics ,biology.organism_classification ,Phenotype ,Computer Science Applications ,Good Health and Well Being ,030104 developmental biology ,lcsh:Biology (General) ,Expression data ,Modeling and Simulation ,Host-Pathogen Interactions ,Plant sciences ,010606 plant biology & botany - Abstract
Candidatus Liberibacter asiaticus (CLas) has been associated with Huanglongbing, a lethal vector-borne disease affecting citrus crops worldwide. While comparative genomics has provided preliminary insights into the metabolic capabilities of this uncultured microorganism, a comprehensive functional characterization is currently lacking. Here, we reconstructed and manually curated genome-scale metabolic models for the six CLas strains A4, FL17, gxpsy, Ishi-1, psy62, and YCPsy, in addition to a model of the closest related culturable microorganism, L. crescens BT-1. Predictions about nutrient requirements and changes in growth phenotypes of CLas were confirmed using in vitro hairy root-based assays, while the L. crescens BT-1 model was validated using cultivation assays. Host-dependent metabolic phenotypes were revealed using expression data obtained from CLas-infected citrus trees and from the CLas-harboring psyllid Diaphorina citri Kuwayama. These results identified conserved and unique metabolic traits, as well as strain-specific interactions between CLas and its hosts, laying the foundation for the development of model-driven Huanglongbing management strategies.
- Published
- 2020
- Full Text
- View/download PDF
36. Screening of Prognosis-Related Genes in Primary Breast Carcinoma Using Genomic Expression Data
- Author
-
Yibao Du, Yifan Gu, and Guoqing Chen
- Subjects
Oncology ,medicine.medical_specialty ,Breast Neoplasms ,03 medical and health sciences ,0302 clinical medicine ,Survival data ,Breast cancer ,Internal medicine ,Gene expression microarray data ,Biomarkers, Tumor ,Genetics ,Cluster Analysis ,Humans ,Medicine ,Molecular Biology ,Gene ,Oligonucleotide Array Sequence Analysis ,Proportional Hazards Models ,Prostaglandin-E Synthases ,030304 developmental biology ,0303 health sciences ,Primary (chemistry) ,business.industry ,Computational Biology ,Janus Kinase 2 ,Prognosis ,medicine.disease ,Gene Expression Regulation, Neoplastic ,Computational Mathematics ,Gene Ontology ,Computational Theory and Mathematics ,Expression data ,Multigene Family ,030220 oncology & carcinogenesis ,Modeling and Simulation ,Female ,business ,Breast carcinoma ,Primary breast cancer - Abstract
This study aimed at exploring the genes that may be related to the prognosis of primary breast cancer (BC) patients. The gene expression microarray data, together with sample survival data were acquired from The Cancer Genome Atlas database. The top 20% genes according to expression value variance were subjected to hierarchical cluster analysis. Bootstrap methods were utilized to assess the stability of cluster. Cox regression was applied to screen genes related to the survival time of patients with BC, and the Beta-Uniform Mixture model was applied to adjust the significance of numerous tests. Further, ingenuity pathway analysis (IPA) was carried out to analyze the functions of the potential prognostic genes. Cluster analysis revealed that there were at least five stable BC subtypes, each with specific gene expression. Further, 42 survival time-associated genes were found (
- Published
- 2020
- Full Text
- View/download PDF
37. Gene Ontology Curation of Neuroinflammation Biology Improves the Interpretation of Alzheimer’s Disease Gene Expression Data
- Author
-
Milagros Rodríguez-López, David Brough, Shirin C C Saverimuttu, Maria Jesus Martin, Rina Bandopadhyay, Nigel M. Hooper, Sandra Orchard, Ruth C. Lovering, Barbara Kramarz, Helen Parkinson, Rachael P. Huntley, and Paola Roncaglia
- Subjects
0301 basic medicine ,Alzeheimer's disease ,Cytoscape network analysis ,microglia ,Gene Expression ,Disease ,Computational biology ,Biology ,cytoscape network analysis ,neuroinflammation ,03 medical and health sciences ,Annotation ,0302 clinical medicine ,Alzheimer Disease ,microRNA ,PANTHER ,Humans ,Neuroinflammation ,Disease gene ,Gene ontology ,General Neuroscience ,Neuron projection ,Computational Biology ,Molecular Sequence Annotation ,General Medicine ,Psychiatry and Mental health ,Clinical Psychology ,030104 developmental biology ,Gene Ontology ,Expression data ,gene ontology ,Encephalitis ,Geriatrics and Gerontology ,Alzheimer’s disease ,030217 neurology & neurosurgery ,Research Article - Abstract
Background:Gene Ontology (GO) is a major bioinformatic resource used for analysis of large biomedical datasets, for example from genome-wide association studies, applied universally across biological fields, including Alzheimer’s disease (AD) research.Objective:We aim to demonstrate the applicability of GO for interpretation of AD datasets to improve the understanding of the underlying molecular disease mechanisms, including the involvement of inflammatory pathways and dysregulated microRNAs (miRs).Methods:We have undertaken a systematic full article GO annotation approach focused on microglial proteins implicated in AD and the miRs regulating their expression. PANTHER was used for enrichment analysis of previously published AD data. Cytoscape was used for visualizing and analyzing miR-target interactions captured from published experimental evidence.Results:We contributed 3,084 new annotations for 494 entities, i.e., on average six new annotations per entity. This included a total of 1,352 annotations for 40 prioritized microglial proteins implicated in AD and 66 miRs regulating their expression, yielding an average of twelve annotations per prioritized entity. The updated GO resource was then used to re-analyze previously published data. The re-analysis showed novel processes associated with AD-related genes, not identified in the original study, such as ‘gliogenesis’, ‘regulation of neuron projection development’, or ‘response to cytokine’, demonstrating enhanced applicability of GO for neuroscience research.Conclusions:This study highlights ongoing development of the neurobiological aspects of GO and demonstrates the value of biocuration activities in the area, thus helping to delineate the molecular bases of AD to aid the development of diagnostic tools and treatments.
- Published
- 2020
- Full Text
- View/download PDF
38. Prognostic impact of p53 and/or NY‐ESO‐1 autoantibody induction in patients with gastroenterological cancers
- Author
-
Isamu Hoshino, Fumiaki Shiratori, Hisashi Gunji, Nobuhiro Takiguchi, Yosuke Iwatate, Yoshihiro Nabeya, Fumitaka Ishige, Rei Okada, Satoshi Yajima, and Hideaki Shimada
- Subjects
Oncology ,Poor prognosis ,medicine.medical_specialty ,RD1-811 ,RC799-869 ,Esophageal squamous cell carcinoma ,Internal medicine ,Medicine ,In patient ,neoplasms ,business.industry ,gastric cancer ,Gastroenterology ,Autoantibody ,NY‐ESO‐1 ,p53 gene ,Original Articles ,hepatocellular carcinoma ,Diseases of the digestive system. Gastroenterology ,medicine.disease ,digestive system diseases ,esophageal squamous cell carcinoma ,Expression data ,Tumor progression ,Hepatocellular carcinoma ,Original Article ,Surgery ,NY-ESO-1 ,business - Abstract
Background and Aim We evaluated the clinicopathological and prognostic significance of serum p53 (s‐p53‐Abs) and serum NY‐ESO‐1 autoantibodies (s‐NY‐ESO‐1‐Abs) in esophageal squamous cell carcinoma (ESCC), gastric cancer and hepatocellular carcinoma (HCC). Patients and Methods A total of 377 patients, 85 patients with ESCC, 248 patients with gastric cancer, and 44 patients with HCC were enrolled to measure s‐p53‐Abs and s‐NY‐ESO‐1‐Abs titers by the enzyme‐linked immunosorbent assay before treatment. The clinicopathological significance and prognostic impact of the presence of autoantibodies were evaluated. Expression data based on the Cancer Genome Atlas and the prognostic impact of gene expression was also examined for discussion. Results The positive rates of s‐p53‐Abs were 32.9% in ESCC, 15% in gastric cancer, and 4.5% in HCC. The positive rates of s‐NY‐ESO‐1‐Abs were 29.4% in ESCC, 9.7% in gastric cancer, and 13.6% in HCC. The presence of s‐p53‐Abs was not associated with tumor progression in these three cancer types. On the other hand, the presence of s‐NY‐ESO‐1‐Abs was significantly associated with tumor progression in ESCC and gastric cancer. The presence of s‐p53‐Abs and/or s‐NY‐ESO‐1‐Abs was significantly associated with poor prognosis in gastric cancer but not in ESCC nor HCC. Conclusions The presence of s‐p53‐Abs and/or s‐NY‐ESO‐1‐Abs was associated with tumor progression in ESCC and gastric cancer. These autoantibodies might have poor prognostic impacts on gastric cancer (UMIN000014530)., NY‐ESO‐1 antibodies were significantly associated with tumor progression in gastric cancer and esophageal squamous cell carcinoma, while the presence of p53 and/or NY‐ESO‐1 antibodies was significantly associated with poor prognosis only in gastric cancer.
- Published
- 2020
39. Exploring dynamic protein-protein interactions in cassava through the integrative interactome network
- Author
-
Saowalak Kalapanulak, Treenut Saithong, Ratana Thanasomboon, and Supatcharee Netrphan
- Subjects
0106 biological sciences ,0301 basic medicine ,Drought stress ,Manihot ,Dynamic networks ,lcsh:Medicine ,Computational biology ,Biology ,01 natural sciences ,Genome ,Interactome ,Article ,Protein–protein interaction ,03 medical and health sciences ,Gene Expression Regulation, Plant ,Protein Interaction Maps ,lcsh:Science ,Transcription factor ,Plant Diseases ,Plant Proteins ,Multidisciplinary ,lcsh:R ,food and beverages ,Potyviridae ,Starch biosynthesis ,Droughts ,Computational biology and bioinformatics ,Plant Leaves ,030104 developmental biology ,Expression data ,Host-Pathogen Interactions ,RNA, Viral ,Data integration ,lcsh:Q ,Systems biology ,010606 plant biology & botany - Abstract
Protein-protein interactions (PPIs) play an essential role in cellular regulatory processes. Despite, in-depth studies to uncover the mystery of PPI-mediated regulations are still lacking. Here, an integrative interactome network (MePPI-Ux) was obtained by incorporating expression data into the improved genome-scale interactome network of cassava (MePPI-U). The MePPI-U, constructed by both interolog- and domain-based approaches, contained 3,638,916 interactions and 24,590 proteins (59% of proteins in the cassava AM560 genome version 6). After incorporating expression data as information of state, the MePPI-U rewired to represent condition-dependent PPIs (MePPI-Ux), enabling us to envisage dynamic PPIs (DPINs) that occur at specific conditions. The MePPI-Ux was exploited to demonstrate timely PPIs of cassava under various conditions, namely drought stress, brown streak virus (CBSV) infection, and starch biosynthesis in leaf/root tissues. MePPI-Uxdrought and MePPI-UxCBSV suggested involved PPIs in response to stress. MePPI-UxSB,leaf and MePPI-UxSB,root suggested the involvement of interactions among transcription factor proteins in modulating how leaf or root starch is synthesized. These findings deepened our knowledge of the regulatory roles of PPIs in cassava and would undeniably assist targeted breeding efforts to improve starch quality and quantity.
- Published
- 2020
- Full Text
- View/download PDF
40. A computational system for identifying operons based on RNA-seq data
- Author
-
Brian Tjaden
- Subjects
0303 health sciences ,Models, Genetic ,Transcription, Genetic ,Operon ,Multicistronic message ,030302 biochemistry & molecular biology ,Bacterial genes ,RNA-Seq ,Genomics ,Computational biology ,Biology ,Genome ,Article ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,Expression data ,Transcription (biology) ,bacteria ,Gene Regulatory Networks ,Molecular Biology ,Gene ,Genome, Bacterial ,030304 developmental biology - Abstract
An operon is a set of neighboring genes in a genome that is transcribed as a single polycistronic message. Genes that are part of the same operon often have related functional roles or participate in the same metabolic pathways. The majority of all bacterial genes are co-transcribed with one or more other genes as part of a multi-gene operon. Thus, accurate identification of operons is important in understanding co-regulation of genes and their functional relationships. Here, we present a computational system that uses RNA-seq data to determine operons throughout a genome. The system takes the name of a genome and one or more files of RNA-seq data as input. Our method combines primary genomic sequence information with expression data from the RNA-seq files in a unified probabilistic model in order to identify operons. We assess our method's ability to accurately identify operons in a range of species through comparison to external databases of operons, both experimentally confirmed and computationally predicted, and through focused experiments that confirm new operons identified by our method. Our system is freely available at https://cs.wellesley.edu/~btjaden/Rockhopper/.
- Published
- 2020
- Full Text
- View/download PDF
41. Plant pangenomics: approaches, applications and advancements
- Author
-
Cassandria Tay Fernandez, Monica F. Danilevicz, Philipp E. Bayer, David Edwards, and Jacob I. Marsh
- Subjects
0106 biological sciences ,0301 basic medicine ,Exploit ,Biodiversity ,Plant Science ,Variation (game tree) ,Plants ,Biology ,Biological Evolution ,01 natural sciences ,Genome ,Structural variation ,03 medical and health sciences ,Variable (computer science) ,030104 developmental biology ,Evolutionary biology ,Expression data ,Taxonomic rank ,Genome, Plant ,010606 plant biology & botany - Abstract
With the assembly of increasing numbers of plant genomes, it is becoming accepted that a single reference assembly does not reflect the gene diversity of a species. The production of pangenomes, which reflect the structural variation and polymorphisms in genomes, enables in depth comparisons of variation within species or higher taxonomic groups. In this review, we discuss the current and emerging approaches for pangenome assembly, analysis and visualisation. In addition, we consider the potential of pangenomes for applied crop improvement, evolutionary and biodiversity studies. To fully exploit the value of pangenomes it is important to integrate broad information such as phenotypic, environmental, and expression data to gain insights into the role of variable regions within genomes.
- Published
- 2020
- Full Text
- View/download PDF
42. Genome-wide analysis of a putative lipid transfer protein LTP_2 gene family reveals CsLTP_2 genes involved in response of cucumber against root-knot nematode (Meloidogyne incognita)
- Author
-
Chunyan Cheng, Xing Wang, Qunfeng Lou, Ji Li, Qingrong Li, Jinfeng Chen, and Kaijing Zhang
- Subjects
0106 biological sciences ,0301 basic medicine ,Abiotic component ,Genetics ,biology ,Genome wide analysis ,General Medicine ,biology.organism_classification ,01 natural sciences ,03 medical and health sciences ,030104 developmental biology ,Expression data ,Meloidogyne incognita ,Root-knot nematode ,Gene family ,Molecular Biology ,Gene ,Plant lipid transfer proteins ,010606 plant biology & botany ,Biotechnology - Abstract
Plant lipid transfer proteins (LTPs) are small basic proteins that play important roles in the regulation of various plant biological processes as well as the response to biotic and abiotic stresses. However, knowledge is limited on how this family of proteins is regulated in response to nematode infection in cucumber. In the present study, a total of 39 CsLTP_2 genes were identified by querying databases for cucumber-specific LTP_2 using a Hidden Markov Model approach and manual curation. The family has a five-cysteine motif (5CM) with the basic form CC-Xn-CXC-Xn-C, which differentiates it from typical nsLTPs. The members of CsLTP_2 were grouped into six families according to their structure and their phylogenetic relationships. Expression data of CsLTP_2 genes in 10 cucumber tissues indicated that they were tissue-specific genes. Two genes showed significant expression change in roots of resistant and susceptible lines during nematode infection, indicating their involvement in response to Meloidogyne incognita. This systematic analysis provides a foundation of knowledge for future studies of the biological roles of CsLTP_2 genes in cucumber in response to nematode infection and may help in the efforts to improve M. incognita-resistance breeding in cucumber.
- Published
- 2020
- Full Text
- View/download PDF
43. A Machine-Learning Analysis of Flowering Gene Expression in the CDC Frontier Chickpea Cultivar
- Author
-
Maria Samsonova, Vitaly V. Gursky, and B. S. Podolny
- Subjects
0301 basic medicine ,Genetics ,030102 biochemistry & molecular biology ,Key genes ,fungi ,Biophysics ,food and beverages ,Repressor ,Meristem ,Biology ,03 medical and health sciences ,AP-1 transcription factor ,030104 developmental biology ,Expression data ,Gene expression ,Cultivar ,Gene - Abstract
—We have analyzed the gene expression dynamics in floral transition in the CDC Frontier chickpea cultivar. We provide a model, in several versions, to predict the expression dynamics of five flowering genes, taking the expression of their regulators as an input. The models were trained using the random forest method on the previously published expression data for ten flowering genes under the short- and long-day growing conditions. The resulting models correctly predict the dynamics of the average expression levels under long days. We show that the models for CDC Frontier mainly reproduce the regulatory interactions between the key genes described for the Arabidopsis thaliana model plant. Based on the analysis, we hypothesize that the short-day data and the long-day data contain qualitatively different information, which may be due to different regulatory modules that function in different conditions. For the regulators of the flower meristem identity genes AP1 and LFY, our models predict FTa3 as the main activator and TFL1c as the main repressor under long days.
- Published
- 2020
- Full Text
- View/download PDF
44. Generating Contexts for Expression Data Using Pathway Queries
- Author
-
Sohler, Florian, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Fages, François, editor, and Soliman, Sylvain, editor
- Published
- 2005
- Full Text
- View/download PDF
45. A universal approach for integrating super large-scale single-cell transcriptomes by exploring gene rankings
- Author
-
Chao Zhang, Jilong Yang, Xiangchun Li, Yichen Yang, Mengyao Feng, Xilin Shen, Wei Wang, Yang Li, Hongru Shen, Jilei Liu, Jiani Hu, Dan Wu, Meng Yang, Qiang Zhang, and Kexin Chen
- Subjects
Cell specific ,Computer science ,Scale (chemistry) ,Computational biology ,Expression (mathematics) ,Transcriptome ,Mice ,Identification (information) ,Expression data ,Exome Sequencing ,Animals ,Cluster Analysis ,Gene Regulatory Networks ,Single-Cell Analysis ,Cluster analysis ,Gene ,Molecular Biology ,Information Systems - Abstract
Advancement in single-cell RNA sequencing leads to exponential accumulation of single-cell expression data. However, there is still lack of tools that could integrate these unlimited accumulation of single-cell expression data. Here, we presented a universal approach iSEEEK for integrating super large-scale single-cell expression via exploring expression rankings of top-expressing genes. We developed iSEEEK with 13.7 million single-cells. We demonstrated the efficiency of iSEEEK with canonical single-cell downstream tasks on five heterogenous datasets encompassing human and mouse samples. iSEEEK achieved good clustering performance benchmarked against well-annotated cell labels. In addition, iSEEEK could transfer its knowledge learned from large-scale expression data on new dataset that was not involved in its development. iSEEEK enables identification of gene-gene interaction networks that are characteristic of specific cell types. Our study presents a simple and yet effective method to integrate super large-scale single-cell transcriptomes and would facilitate translational single-cell research from bench to bedside.
- Published
- 2022
- Full Text
- View/download PDF
46. The Lamellipodin homologue MIG-10 is not essential for dorsal intercalation in the embryonic epidermis of the C. elegans embryo
- Author
-
Serre, Joel M. and Hardin, Jeff
- Subjects
New Finding ,animal structures ,integumentary system ,Phenotype Data ,Expression Data ,C. Elegans - Abstract
Dorsal intercalation of the embryonic epidermis in the Caenorhabditis elegans embryo is a promising system for genetic analysis of convergent extension, a conserved process in animal embryos. We sought to identify functionally important actin regulators in dorsal epidermal cells. A promising candidate is MIG-10, the single MIG-10/RIAM/Lamellipodin (MRL) family member in C. elegans. We endogenously tagged all mig-10 isoforms with mNeonGreen and analyzed mig-10 mutants using 4-dimensional microscopy. MIG-10::mNG is expressed prominently in muscle progenitors but is not detectable in the dorsal epidermis. mig-10(ct41) homozygotes complete dorsal intercalation in a manner indistinguishable from wildtype, indicating MIG-10 is not essential during dorsal intercalation.
- Published
- 2022
- Full Text
- View/download PDF
47. Immune determinants of the association between tumor mutational burden and immunotherapy response across cancer types
- Author
-
Morris Lgt, Sanju Sinha, Kenneth Aldape, Chan Ta, Eytan Ruppin, Kevin Litchfield, Neelam Sinha, Valero C, and Alejandro A. Schäffer
- Subjects
Cancer Research ,medicine.medical_treatment ,Immune checkpoint inhibitors ,Pembrolizumab ,Article ,Signalling & Oncogenes ,Text mining ,Immune system ,Ecology,Evolution & Ethology ,Neoplasms ,medicine ,Biomarkers, Tumor ,Tumor Microenvironment ,Humans ,Immunologic Factors ,Immune Checkpoint Inhibitors ,Computational & Systems Biology ,Chemical Biology & High Throughput ,Human Biology & Physiology ,business.industry ,Genome Integrity & Repair ,Cancer ,Immunotherapy ,Tumour Biology ,medicine.disease ,Oncology ,Expression data ,Mutation ,Cancer research ,Biomarker (medicine) ,business ,Genetics & Genomics - Abstract
The FDA has recently approved a high tumor mutational burden (TMB-high) biomarker, defined by ≥10 mutations/Mb, for the treatment of solid tumors with pembrolizumab, an immune checkpoint inhibitor (ICI) that targets PD1. However, recent studies have shown that this TMB-high biomarker is only able to stratify ICI responders in a subset of cancer types, and the mechanisms underlying this observation have remained unknown. The tumor immune microenvironment (TME) may modulate the stratification power of TMB (termed TMB power), determining if it will be predictive of ICI response in a given cancer type. To systematically study this hypothesis, we inferred the levels of 31 immune-related factors characteristic of the TME of different cancer types in The Cancer Genome Atlas. Integration of this information with TMB and response data of 2,277 patients treated with anti-PD1 identified key immune factors that determine TMB power across 14 different cancer types. We find that high levels of M1 macrophages and low resting dendritic cells in the TME characterized cancer types with high TMB power. A model based on these two immune factors strongly predicted TMB power in a given cancer type during cross-validation and testing (Spearman Rho = 0.76 and 1, respectively). Using this model, we predicted the TMB power in nine additional cancer types, including rare cancers, for which TMB and ICI response data are not yet publicly available. Our analysis indicates that TMB-high may be highly predictive of ICI response in cervical squamous cell carcinoma, suggesting that such a study should be prioritized. Significance: This study uncovers immune-related factors that may modulate the relationship between high tumor mutational burden and ICI response, which can help prioritize cancer types for clinical trials.
- Published
- 2022
- Full Text
- View/download PDF
48. Normalization and Gene p-Value Estimation: Issues in Microarray Data Processing
- Author
-
Katrin Fundel, Robert Küffner, Thomas Aigner, and Ralf Zimmer
- Subjects
expression data ,normalization ,regulated genes ,data processing ,Biology (General) ,QH301-705.5 - Abstract
Introduction: Numerous methods exist for basic processing, e.g. normalization, of microarray gene expression data. These methods have an important effect on the final analysis outcome. Therefore, it is crucial to select methods appropriate for a given dataset in order to assure the validity and reliability of expression data analysis.Furthermore, biological interpretation requires expression values for genes, which are often represented by several spots or probe sets on a microarray. How to best integrate spot/probe set values into gene values has so far been a somewhat neglected problem.Results: We present a case study comparing different between-array normalization methods with respect to the identification of differentially expressed genes. Our results show that it is feasible and necessary to use prior knowledge on gene expression measurements to select an adequate normalization method for the given data. Furthermore, we provide evidence that combining spot/probe set p-values into gene p-values for detecting differentially expressed genes has advantages com- pared to combining expression values for spots/probe sets into gene expression values. The comparison of different methods suggests to use Stouffer’s method for this purpose.The study has been conducted on gene expression experiments investigating human joint cartilage samples of Osteoarthritis related groups: a cDNA microarray (83 samples, four groups) and an Affymetrix (26 samples, two groups) data set.Conclusion: The apparently straight forward steps of gene expression data analysis, e.g. between-array normalization and detection of differentially regulated genes, can be accomplished by numerous different methods. We analyzed multiple methods and the possible effects and thereby demonstrate the importance of the single decisions taken during data processing. We give guidelines for evaluating normalization outcomes. An overview of these effects via appropriate measures and plots compared to prior knowledge is essential for the biological interpretation of gene expression measurements.
- Published
- 2008
49. Using ICLite for deconvolution of bulk transcriptional data from mixed cell populations
- Author
-
Matthew J. Camiolo, Sally E. Wenzel, and Anuradha Ray
- Subjects
Linkage (software) ,Science (General) ,General Immunology and Microbiology ,Base Sequence ,Sequence analysis ,Sequence Analysis, RNA ,Bioinformatics ,General Neuroscience ,Systems biology ,Gene Expression Profiling ,Immunology ,Mixed cell ,Gene Expression ,Computational biology ,Biology ,General Biochemistry, Genetics and Molecular Biology ,Q1-390 ,Expression data ,Protocol ,Deconvolution ,Gene - Abstract
Summary Bulk expression data from heterogeneous cell populations pose a challenge for investigators, as differences in cell numbers and transcriptional programs may complicate analysis. To improve the performance of bulk RNA sequencing on mixed populations, we created Immune Cell Linkage through Exploratory Matrices (ICLite). The ICLite package for R constructs modules of correlated genes and identifies their relationship to specific lineages in mixed cell populations. This protocol details formatting, optimization of run parameters, and interpretation of results following implementation of ICLite. For complete details on the use and execution of this protocol, please refer to Camiolo et al. (2021)., Graphical abstract, Highlights • ICLite identifies gene modules in bulk transcriptional data from mixed cell populations • Protocol details how to run and interpret the results of ICLite • Discussion of parameter tuning, data formatting, and solution evaluation • Details for post-run exploration including gene ontology and semantic similarity, Bulk expression data from heterogeneous cell populations pose a challenge for investigators, as differences in cell numbers and transcriptional programs may complicate analysis. To improve the performance of bulk RNA sequencing on mixed populations, we created Immune Cell Linkage through Exploratory Matrices (ICLite). The ICLite package for R constructs modules of correlated genes and identifies their relationship to specific lineages in mixed cell populations. This protocol details formatting, optimization of run parameters, and interpretation of results following implementation of ICLite.
- Published
- 2021
50. SPECTRA: an Integrated Knowledge Base for Comparing Tissue and Tumor Specific PPI Networks in Human
- Author
-
Giovanni eMicale, Alfredo eFerro, Alfredo ePulvirenti, and Rosalba eGiugno
- Subjects
Tissues ,Network analysis ,tumors ,expression data ,PPI Network ,Cytoscape Visualization ,Biotechnology ,TP248.13-248.65 - Abstract
Protein-protein interaction (PPI) networks available in public repositories usually represent relationships between proteins within the cell. They ignore the specific set of tissues or tumors where the interactions take place. Indeed, genes can form tissue-selective complexes, while they remain inactive in other tissues. For these reasons, a great attention has been recently paid to tissue-specific PPI networks, in which nodes are proteins of the global PPI network that are preferentially expressed in specific tissues. In this paper we present SPECTRA, a knowledge base to build and compare tissue or tumor specific PPI networks. SPECTRA integrates gene expression and protein interaction data from the most authoritative online repositories. We also provide tools for visualizing and comparing such networks, in order to identify the expression and interaction changes of proteins across tissues, or between the normal and pathological states in the same tissue. SPECTRA isavailable as a web server at http://ferrolab.dmi.unict.it/spectra/spectra.php
- Published
- 2015
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.