205 results on '"Gyori, Benjamin M."'
Search Results
2. The O3 guidelines: open data, open code, and open infrastructure for sustainable curated scientific resources
- Author
-
Hoyt, Charles Tapley and Gyori, Benjamin M.
- Published
- 2024
- Full Text
- View/download PDF
3. Beyond protein lists: AI-assisted interpretation of proteomic investigations in the context of evolving scientific knowledge
- Author
-
Gyori, Benjamin M. and Vitek, Olga
- Published
- 2024
- Full Text
- View/download PDF
4. Democratising Knowledge Representation with BioCypher
- Author
-
Lobentanzer, Sebastian, Aloy, Patrick, Baumbach, Jan, Bohar, Balazs, Charoentong, Pornpimol, Danhauser, Katharina, Doğan, Tunca, Dreo, Johann, Dunham, Ian, Fernandez-Torras, Adrià, Gyori, Benjamin M., Hartung, Michael, Hoyt, Charles Tapley, Klein, Christoph, Korcsmaros, Tamas, Maier, Andreas, Mann, Matthias, Ochoa, David, Pareja-Lorente, Elena, Popp, Ferdinand, Preusse, Martin, Probul, Niklas, Schwikowski, Benno, Sen, Bünyamin, Strauss, Maximilian T., Turei, Denes, Ulusoy, Erva, Wodke, Judith Andrea Heidrun, and Saez-Rodriguez, Julio
- Subjects
Quantitative Biology - Molecular Networks - Abstract
Standardising the representation of biomedical knowledge among all researchers is an insurmountable task, hindering the effectiveness of many computational methods. To facilitate harmonisation and interoperability despite this fundamental challenge, we propose to standardise the framework of knowledge graph creation instead. We implement this standardisation in BioCypher, a FAIR (findable, accessible, interoperable, reusable) framework to transparently build biomedical knowledge graphs while preserving provenances of the source data. Mapping the knowledge onto biomedical ontologies helps to balance the needs for harmonisation, human and machine readability, and ease of use and accessibility to non-specialist researchers. We demonstrate the usefulness of this framework on a variety of use cases, from maintenance of task-specific knowledge stores, to interoperability between biomedical domains, to on-demand building of task-specific knowledge graphs for federated learning. BioCypher (https://biocypher.org) frees up valuable developer time; we encourage further development and usage by the community., Comment: 34 pages, 6 figures; submitted to Nature Biotechnology
- Published
- 2022
5. NDEx IQuery: a multi-method network gene set analysis leveraging the Network Data Exchange
- Author
-
Pillich, Rudolf T, Chen, Jing, Churas, Christopher, Fong, Dylan, Gyori, Benjamin M, Ideker, Trey, Karis, Klas, Liu, Sophie N, Ono, Keiichiro, Pico, Alexander, and Pratt, Dexter
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Biotechnology ,Networking and Information Technology R&D (NITRD) ,Underpinning research ,1.5 Resources and infrastructure (underpinning) ,Generic health relevance ,Computational Biology ,Software ,Protein Interaction Maps ,Publications ,Databases ,Factual ,Internet ,Mathematical Sciences ,Information and Computing Sciences ,Bioinformatics ,Biological sciences ,Information and computing sciences ,Mathematical sciences - Abstract
MotivationThe investigation of sets of genes using biological pathways is a common task for researchers and is supported by a wide variety of software tools. This type of analysis generates hypotheses about the biological processes that are active or modulated in a specific experimental context.ResultsThe Network Data Exchange Integrated Query (NDEx IQuery) is a new tool for network and pathway-based gene set interpretation that complements or extends existing resources. It combines novel sources of pathways, integration with Cytoscape, and the ability to store and share analysis results. The NDEx IQuery web application performs multiple gene set analyses based on diverse pathways and networks stored in NDEx. These include curated pathways from WikiPathways and SIGNOR, published pathway figures from the last 27 years, machine-assembled networks using the INDRA system, and the new NCI-PID v2.0, an updated version of the popular NCI Pathway Interaction Database. NDEx IQuery's integration with MSigDB and cBioPortal now provides pathway analysis in the context of these two resources.Availability and implementationNDEx IQuery is available at https://www.ndexbio.org/iquery and is implemented in Javascript and Java.
- Published
- 2023
6. Unifying the identification of biomedical entities with the Bioregistry
- Author
-
Hoyt, Charles Tapley, Balk, Meghan, Callahan, Tiffany J, Domingo-Fernández, Daniel, Haendel, Melissa A, Hegde, Harshad B, Himmelstein, Daniel S, Karis, Klas, Kunze, John, Lubiana, Tiago, Matentzoglu, Nicolas, McMurry, Julie, Moxon, Sierra, Mungall, Christopher J, Rutz, Adriano, Unni, Deepak R, Willighagen, Egon, Winston, Donald, and Gyori, Benjamin M
- Subjects
Networking and Information Technology R&D (NITRD) - Abstract
The standardized identification of biomedical entities is a cornerstone of interoperability, reuse, and data integration in the life sciences. Several registries have been developed to catalog resources maintaining identifiers for biomedical entities such as small molecules, proteins, cell lines, and clinical trials. However, existing registries have struggled to provide sufficient coverage and metadata standards that meet the evolving needs of modern life sciences researchers. Here, we introduce the Bioregistry, an integrative, open, community-driven metaregistry that synthesizes and substantially expands upon 23 existing registries. The Bioregistry addresses the need for a sustainable registry by leveraging public infrastructure and automation, and employing a progressive governance model centered around open code and open data to foster community contribution. The Bioregistry can be used to support the standardized annotation of data, models, ontologies, and scientific literature, thereby promoting their interoperability and reuse. The Bioregistry can be accessed through https://bioregistry.io and its source code and data are available under the MIT and CC0 Licenses at https://github.com/biopragmatics/bioregistry .
- Published
- 2022
7. A Unified Framework for Rank-based Evaluation Metrics for Link Prediction in Knowledge Graphs
- Author
-
Hoyt, Charles Tapley, Berrendorf, Max, Galkin, Mikhail, Tresp, Volker, and Gyori, Benjamin M.
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
The link prediction task on knowledge graphs without explicit negative triples in the training data motivates the usage of rank-based metrics. Here, we review existing rank-based metrics and propose desiderata for improved metrics to address lack of interpretability and comparability of existing metrics to datasets of different sizes and properties. We introduce a simple theoretical framework for rank-based metrics upon which we investigate two avenues for improvements to existing metrics via alternative aggregation functions and concepts from probability theory. We finally propose several new rank-based metrics that are more easily interpreted and compared accompanied by a demonstration of their usage in a benchmarking of knowledge graph embedding models., Comment: Accepted at the Workshop on Graph Learning Benchmarks @ The WebConf 2022
- Published
- 2022
8. ChemicalX: A Deep Learning Library for Drug Pair Scoring
- Author
-
Rozemberczki, Benedek, Hoyt, Charles Tapley, Gogleva, Anna, Grabowski, Piotr, Karis, Klas, Lamov, Andrej, Nikolov, Andriy, Nilsson, Sebastian, Ughetto, Michael, Wang, Yu, Derr, Tyler, and Gyori, Benjamin M
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
In this paper, we introduce ChemicalX, a PyTorch-based deep learning library designed for providing a range of state of the art models to solve the drug pair scoring task. The primary objective of the library is to make deep drug pair scoring models accessible to machine learning researchers and practitioners in a streamlined framework.The design of ChemicalX reuses existing high level model training utilities, geometric deep learning, and deep chemistry layers from the PyTorch ecosystem. Our system provides neural network layers, custom pair scoring architectures, data loaders, and batch iterators for end users. We showcase these features with example code snippets and case studies to highlight the characteristics of ChemicalX. A range of experiments on real world drug-drug interaction, polypharmacy side effect, and combination synergy prediction tasks demonstrate that the models available in ChemicalX are effective at solving the pair scoring task. Finally, we show that ChemicalX could be used to train and score machine learning models on large drug pair datasets with hundreds of thousands of compounds on commodity hardware., Comment: https://github.com/AstraZeneca/chemicalx
- Published
- 2022
9. A roadmap for the functional annotation of protein families: a community perspective
- Author
-
de Crécy-lagard, Valérie, de Hegedus, Rocio Amorin, Arighi, Cecilia, Babor, Jill, Bateman, Alex, Blaby, Ian, Blaby-Haas, Crysten, Bridge, Alan J, Burley, Stephen K, Cleveland, Stacey, Colwell, Lucy J, Conesa, Ana, Dallago, Christian, Danchin, Antoine, de Waard, Anita, Deutschbauer, Adam, Dias, Raquel, Ding, Yousong, Fang, Gang, Friedberg, Iddo, Gerlt, John, Goldford, Joshua, Gorelik, Mark, Gyori, Benjamin M, Henry, Christopher, Hutinet, Geoffrey, Jaroch, Marshall, Karp, Peter D, Kondratova, Liudmyla, Lu, Zhiyong, Marchler-Bauer, Aron, Martin, Maria-Jesus, McWhite, Claire, Moghe, Gaurav D, Monaghan, Paul, Morgat, Anne, Mungall, Christopher J, Natale, Darren A, Nelson, William C, O’Donoghue, Seán, Orengo, Christine, O’Toole, Katherine H, Radivojac, Predrag, Reed, Colbie, Roberts, Richard J, Rodionov, Dmitri, Rodionova, Irina A, Rudolf, Jeffrey D, Saleh, Lana, Sheynkman, Gloria, Thibaud-Nissen, Francoise, Thomas, Paul D, Uetz, Peter, Vallenet, David, Carter, Erica Watson, Weigele, Peter R, Wood, Valerie, Wood-Charlson, Elisha M, and Xu, Jin
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,Generic health relevance ,Base Sequence ,Computational Biology ,Genome ,Genomics ,Molecular Sequence Annotation ,Proteins ,Data Format ,Library and Information Studies ,Bioinformatics and computational biology ,Data management and data science - Abstract
Over the last 25 years, biology has entered the genomic era and is becoming a science of 'big data'. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3-4 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward.
- Published
- 2022
10. A Simple Standard for Sharing Ontological Mappings (SSSOM)
- Author
-
Matentzoglu, Nicolas, Balhoff, James P., Bello, Susan M., Bizon, Chris, Brush, Matthew, Callahan, Tiffany J., Chute, Christopher G, Duncan, William D., Evelo, Chris T., Gabriel, Davera, Graybeal, John, Gray, Alasdair, Gyori, Benjamin M., Haendel, Melissa, Harmse, Henriette, Harris, Nomi L., Harrow, Ian, Hegde, Harshad, Hoyt, Amelia L., Hoyt, Charles T., Jiao, Dazhi, Jiménez-Ruiz, Ernesto, Jupp, Simon, Kim, Hyeongsik, Koehler, Sebastian, Liener, Thomas, Long, Qinqin, Malone, James, McLaughlin, James A., McMurry, Julie A., Moxon, Sierra, Munoz-Torres, Monica C., Osumi-Sutherland, David, Overton, James A., Peters, Bjoern, Putman, Tim, Queralt-Rosinach, Núria, Shefchek, Kent, Solbrig, Harold, Thessen, Anne, Tudorache, Tania, Vasilevsky, Nicole, Wagner, Alex H., and Mungall, Christopher J.
- Subjects
Computer Science - Databases - Abstract
Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, are two terms equivalent or merely related? Are they narrow or broad matches? Are they associated in some other way? Such relationships between the mapped terms are often not documented, leading to incorrect assumptions and making them hard to use in scenarios that require a high degree of precision (such as diagnostics or risk prediction). Also, the lack of descriptions of how mappings were done makes it hard to combine and reconcile mappings, particularly curated and automated ones. The Simple Standard for Sharing Ontological Mappings (SSSOM) addresses these problems by: 1. Introducing a machine-readable and extensible vocabulary to describe metadata that makes imprecision, inaccuracy and incompleteness in mappings explicit. 2. Defining an easy to use table-based format that can be integrated into existing data science pipelines without the need to parse or query ontologies, and that integrates seamlessly with Linked Data standards. 3. Implementing open and community-driven collaborative workflows designed to evolve the standard continuously to address changing requirements and mapping practices. 4. Providing reference tools and software libraries for working with the standard. In this paper, we present the SSSOM standard, describe several use cases, and survey some existing work on standardizing the exchange of mappings, with the goal of making mappings Findable, Accessible, Interoperable, and Reusable (FAIR). The SSSOM specification is at http://w3id.org/sssom/spec., Comment: Corresponding author: Christopher J. Mungall
- Published
- 2021
- Full Text
- View/download PDF
11. A Simple Standard for Sharing Ontological Mappings (SSSOM)
- Author
-
Matentzoglu, Nicolas, Balhoff, James P, Bello, Susan M, Bizon, Chris, Brush, Matthew, Callahan, Tiffany J, Chute, Christopher G, Duncan, William D, Evelo, Chris T, Gabriel, Davera, Graybeal, John, Gray, Alasdair, Gyori, Benjamin M, Haendel, Melissa, Harmse, Henriette, Harris, Nomi L, Harrow, Ian, Hegde, Harshad B, Hoyt, Amelia L, Hoyt, Charles T, Jiao, Dazhi, Jiménez-Ruiz, Ernesto, Jupp, Simon, Kim, Hyeongsik, Koehler, Sebastian, Liener, Thomas, Long, Qinqin, Malone, James, McLaughlin, James A, McMurry, Julie A, Moxon, Sierra, Munoz-Torres, Monica C, Osumi-Sutherland, David, Overton, James A, Peters, Bjoern, Putman, Tim, Queralt-Rosinach, Núria, Shefchek, Kent, Solbrig, Harold, Thessen, Anne, Tudorache, Tania, Vasilevsky, Nicole, Wagner, Alex H, and Mungall, Christopher J
- Subjects
Data Management and Data Science ,Information and Computing Sciences ,Data Science ,Networking and Information Technology R&D (NITRD) ,Data Management ,Databases ,Factual ,Metadata ,Semantic Web ,Workflow ,Data Format ,Library and Information Studies ,Bioinformatics and computational biology ,Data management and data science - Abstract
Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, are two terms equivalent or merely related? Are they narrow or broad matches? Or are they associated in some other way? Such relationships between the mapped terms are often not documented, which leads to incorrect assumptions and makes them hard to use in scenarios that require a high degree of precision (such as diagnostics or risk prediction). Furthermore, the lack of descriptions of how mappings were done makes it hard to combine and reconcile mappings, particularly curated and automated ones. We have developed the Simple Standard for Sharing Ontological Mappings (SSSOM) which addresses these problems by: (i) Introducing a machine-readable and extensible vocabulary to describe metadata that makes imprecision, inaccuracy and incompleteness in mappings explicit. (ii) Defining an easy-to-use simple table-based format that can be integrated into existing data science pipelines without the need to parse or query ontologies, and that integrates seamlessly with Linked Data principles. (iii) Implementing open and community-driven collaborative workflows that are designed to evolve the standard continuously to address changing requirements and mapping practices. (iv) Providing reference tools and software libraries for working with the standard. In this paper, we present the SSSOM standard, describe several use cases in detail and survey some of the existing work on standardizing the exchange of mappings, with the goal of making mappings Findable, Accessible, Interoperable and Reusable (FAIR). The SSSOM specification can be found at http://w3id.org/sssom/spec. Database URL: http://w3id.org/sssom/spec.
- Published
- 2022
12. Democratizing knowledge representation with BioCypher
- Author
-
Lobentanzer, Sebastian, Aloy, Patrick, Baumbach, Jan, Bohar, Balazs, Carey, Vincent J., Charoentong, Pornpimol, Danhauser, Katharina, Doğan, Tunca, Dreo, Johann, Dunham, Ian, Farr, Elias, Fernandez-Torras, Adrià, Gyori, Benjamin M., Hartung, Michael, Hoyt, Charles Tapley, Klein, Christoph, Korcsmaros, Tamas, Maier, Andreas, Mann, Matthias, Ochoa, David, Pareja-Lorente, Elena, Popp, Ferdinand, Preusse, Martin, Probul, Niklas, Schwikowski, Benno, Sen, Bünyamin, Strauss, Maximilian T., Turei, Denes, Ulusoy, Erva, Waltemath, Dagmar, Wodke, Judith A. H., and Saez-Rodriguez, Julio
- Published
- 2023
- Full Text
- View/download PDF
13. Automated assembly of molecular mechanisms at scale from text mining and curated databases
- Author
-
Bachman, John A, Gyori, Benjamin M, and Sorger, Peter K
- Published
- 2023
- Full Text
- View/download PDF
14. Statistical Model Checking based Analysis of Biological Networks
- Author
-
Liu, Bing, Gyori, Benjamin M., and Thiagarajan, P. S.
- Subjects
Quantitative Biology - Quantitative Methods ,Quantitative Biology - Molecular Networks - Abstract
We introduce a framework for analyzing ordinary differential equation (ODE) models of biological networks using statistical model checking (SMC). A key aspect of our work is the modeling of single-cell variability by assigning a probability distribution to intervals of initial concentration values and kinetic rate constants. We propagate this distribution through the system dynamics to obtain a distribution over the set of trajectories of the ODEs. This in turn opens the door for performing statistical analysis of the ODE system's behavior. To illustrate this we first encode quantitative data and qualitative trends as bounded linear time temporal logic (BLTL) formulas. Based on this we construct a parameter estimation method using an SMC-driven evaluation procedure applied to the stochastic version of the behavior of the ODE system. We then describe how this SMC framework can be generalized to hybrid automata by exploiting the given distribution over the initial states and the--much more sophisticated--system dynamics to associate a Markov chain with the hybrid automaton. We then establish a strong relationship between the behaviors of the hybrid automaton and its associated Markov chain. Consequently, we sample trajectories from the hybrid automaton in a way that mimics the sampling of the trajectories of the Markov chain. This enables us to verify approximately that the Markov chain meets a BLTL specification with high probability. We have applied these methods to ODE based models of Toll-like receptor signaling and the crosstalk between autophagy and apoptosis, as well as to systems exhibiting hybrid dynamics including the circadian clock pathway and cardiac cell physiology. We present an overview of these applications and summarize the main empirical results. These case studies demonstrate that the our methods can be applied in a variety of practical settings., Comment: Accepted for Publication in Automated Reasoning for Systems Biology and Medicine on 2018-09-18
- Published
- 2018
15. Encoding Growth Factor Identity in the Temporal Dynamics of FOXO3 under the Combinatorial Control of ERK and AKT Kinases
- Author
-
Sampattavanich, Somponnat, Steiert, Bernhard, Kramer, Bernhard A, Gyori, Benjamin M, Albeck, John G, and Sorger, Peter K
- Subjects
Underpinning research ,1.1 Normal biological development and functioning ,Cell Line ,Cytosol ,Forkhead Box Protein O3 ,Forkhead Transcription Factors ,Humans ,Intercellular Signaling Peptides and Proteins ,MAP Kinase Signaling System ,MCF-7 Cells ,Phosphorylation ,Protein Transport ,Proto-Oncogene Proteins c-akt ,Signal Transduction ,AKT ,ERK ,FOXO proteins ,combinatorial control ,oncogenes ,signal transduction ,transcription ,Biochemistry and Cell Biology - Abstract
Extracellular growth factors signal to transcription factors via a limited number of cytoplasmic kinase cascades. It remains unclear how such cascades encode ligand identities and concentrations. In this paper, we use live-cell imaging and statistical modeling to study FOXO3, a transcription factor regulating diverse aspects of cellular physiology that is under combinatorial control. We show that FOXO3 nuclear-to-cytosolic translocation has two temporally distinct phases varying in magnitude with growth factor identity and cell type. These phases comprise synchronous translocation soon after ligand addition followed by an extended back-and-forth shuttling; this shuttling is pulsatile and does not have a characteristic frequency, unlike a simple oscillator. Early and late dynamics are differentially regulated by Akt and ERK and have low mutual information, potentially allowing the two phases to encode different information. In cancer cells in which ERK and Akt are dysregulated by oncogenic mutation, the diversity of states is lower.
- Published
- 2018
16. From knowledge to models: Automated modeling in systems and synthetic biology
- Author
-
Gyori, Benjamin M. and Bachman, John A.
- Published
- 2021
- Full Text
- View/download PDF
17. Drug-target identification in COVID-19 disease mechanisms using computational systems biology approaches
- Author
-
Niarakis, Anna, Ostaszewski, Marek, Mazein, Alexander, Kuperstein, Inna, Kutmon, Martina, Gillespie, Marc E., Funahashi, Akira, Acencio, Marcio Luis, Hemedan, Ahmed, Aichem, Michael, Klein, Karsten, Czauderna, Tobias, Burtscher, Felicia, Yamada, Takahiro G., Hiki, Yusuke, Hiroi, Noriko F., Hu, Finterly, Pham, Nhung, Ehrhart, Friederike, Willighagen, Egon L., Valdeolivas, Alberto, Dugourd, Aurelien, Messina, Francesco, Esteban-Medina, Marina, Peña-Chilet, Maria, Rian, Kinza, Soliman, Sylvain, Aghamiri, Sara Sadat, Puniya, Bhanwar Lal, Naldi, Aurélien, Helikar, Tomáš, Singh, Vidisha, Fernández, Marco Fariñas, Bermudez, Viviam, Tsirvouli, Eirini, Montagud, Arnau, Noël, Vincent, Ponce-de-Leon, Miguel, Maier, Dieter, Bauch, Angela, Gyori, Benjamin M., Bachman, John A., Luna, Augustin, Piñero, Janet, Furlong, Laura I., Balaur, Irina, Rougny, Adrien, Jarosz, Yohan, Overall, Rupert, Phair, Robert, Perfetto, Livia, Matthews, Lisa, Rex, Devasahayam Arokia Balaya, Orlic-Milacic, Marija, Gomez, Luis Cristobal Monraz, De Meulder, Bertrand, Ravel, Jean Marie, Jassal, Bijay, Satagopam, Venkata, Wu, Guanming, Golebiewski, Martin, Gawron, Piotr, Calzone, Laurence, Beckmann, Jacques S., Evelo, Chris T., D’Eustachio, Peter, Schreiber, Falk, Saez-Rodriguez, Julio, Dopazo, Joaquin, Kuiper, Martin, Valencia, Alfonso, Wolkenhauer, Olaf, Kitano, Hiroaki, Barillot, Emmanuel, Auffray, Charles, Balling, Rudi, Schneider, Reinhard, Niarakis, Anna, Ostaszewski, Marek, Mazein, Alexander, Kuperstein, Inna, Kutmon, Martina, Gillespie, Marc E., Funahashi, Akira, Acencio, Marcio Luis, Hemedan, Ahmed, Aichem, Michael, Klein, Karsten, Czauderna, Tobias, Burtscher, Felicia, Yamada, Takahiro G., Hiki, Yusuke, Hiroi, Noriko F., Hu, Finterly, Pham, Nhung, Ehrhart, Friederike, Willighagen, Egon L., Valdeolivas, Alberto, Dugourd, Aurelien, Messina, Francesco, Esteban-Medina, Marina, Peña-Chilet, Maria, Rian, Kinza, Soliman, Sylvain, Aghamiri, Sara Sadat, Puniya, Bhanwar Lal, Naldi, Aurélien, Helikar, Tomáš, Singh, Vidisha, Fernández, Marco Fariñas, Bermudez, Viviam, Tsirvouli, Eirini, Montagud, Arnau, Noël, Vincent, Ponce-de-Leon, Miguel, Maier, Dieter, Bauch, Angela, Gyori, Benjamin M., Bachman, John A., Luna, Augustin, Piñero, Janet, Furlong, Laura I., Balaur, Irina, Rougny, Adrien, Jarosz, Yohan, Overall, Rupert, Phair, Robert, Perfetto, Livia, Matthews, Lisa, Rex, Devasahayam Arokia Balaya, Orlic-Milacic, Marija, Gomez, Luis Cristobal Monraz, De Meulder, Bertrand, Ravel, Jean Marie, Jassal, Bijay, Satagopam, Venkata, Wu, Guanming, Golebiewski, Martin, Gawron, Piotr, Calzone, Laurence, Beckmann, Jacques S., Evelo, Chris T., D’Eustachio, Peter, Schreiber, Falk, Saez-Rodriguez, Julio, Dopazo, Joaquin, Kuiper, Martin, Valencia, Alfonso, Wolkenhauer, Olaf, Kitano, Hiroaki, Barillot, Emmanuel, Auffray, Charles, Balling, Rudi, and Schneider, Reinhard
- Abstract
Introduction: The COVID-19 Disease Map project is a large-scale community effort uniting 277 scientists from 130 Institutions around the globe. We use high-quality, mechanistic content describing SARS-CoV-2-host interactions and develop interoperable bioinformatic pipelines for novel target identification and drug repurposing. Methods: Extensive community work allowed an impressive step forward in building interfaces between Systems Biology tools and platforms. Our framework can link biomolecules from omics data analysis and computational modelling to dysregulated pathways in a cell-, tissue- or patient-specific manner. Drug repurposing using text mining and AI-assisted analysis identified potential drugs, chemicals and microRNAs that could target the identified key factors. Results: Results revealed drugs already tested for anti-COVID-19 efficacy, providing a mechanistic context for their mode of action, and drugs already in clinical trials for treating other diseases, never tested against COVID-19. Discussion: The key advance is that the proposed framework is versatile and expandable, offering a significant upgrade in the arsenal for virus-host interactions and other complex pathologies., Peer Reviewed
- Published
- 2024
18. Eliater: a Python package for estimating outcomes of perturbations in biomolecular networks.
- Author
-
Mohammad-Taheri, Sara, Navada, Pruthvi Prakash, Hoyt, Charles Tapley, Zucker, Jeremy, Sachs, Karen, Gyori, Benjamin M, and Vitek, Olga
- Subjects
GENE regulatory networks ,ESCHERICHIA coli ,MOLECULES ,DOCUMENTATION - Abstract
Summary We introduce Eliater , a Python package for estimating the effect of perturbation of an upstream molecule on a downstream molecule in a biomolecular network. The estimation takes as input a biomolecular network, observational biomolecular data, and a perturbation of interest, and outputs an estimated quantitative effect of the perturbation. We showcase the functionalities of Eliater in a case study of Escherichia coli transcriptional regulatory network. Availability and implementation The code, the documentation, and several case studies are available open source at https://github.com/y0-causal-inference/eliater. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Statistical Model Checking-Based Analysis of Biological Networks
- Author
-
Liu, Bing, Gyori, Benjamin M., Thiagarajan, P. S., Crippen, Gordon, Advisory Editor, Dress, Andreas, Editor-in-Chief, Giegerich, Robert, Editorial Board Member, Kelso, Janet, Editorial Board Member, Linial, Michal, Editor-in-Chief, Felsenstein, Joseph, Advisory Editor, Troyanskaya, Olga, Editor-in-Chief, Gusfield, Dan, Advisory Editor, Myers, Gene, Editorial Board Member, Istrail, Sorin, Advisory Editor, Pevzner, Pavel, Editorial Board Member, Vingron, Martin, Editor-in-Chief, Lengauer, Thomas, Advisory Editor, McClure, Marcella, Advisory Editor, Nowak, Martin, Advisory Editor, Sankoff, David, Advisory Editor, Shamir, Ron, Advisory Editor, Steel, Mike, Advisory Editor, Stormo, Gary, Advisory Editor, Tavaré, Simon, Advisory Editor, Warnow, Tandy, Advisory Editor, Welch, Lonnie, Advisory Editor, Liò, Pietro, editor, and Zuliani, Paolo, editor
- Published
- 2019
- Full Text
- View/download PDF
20. Approximate probabilistic verification of hybrid systems
- Author
-
Gyori, Benjamin M., Liu, Bing, Paul, Soumya, Ramanathan, R., and Thiagarajan, P. S.
- Subjects
Computer Science - Systems and Control - Abstract
Hybrid systems whose mode dynamics are governed by non-linear ordinary differential equations (ODEs) are often a natural model for biological processes. However such models are difficult to analyze. To address this, we develop a probabilistic analysis method by approximating the mode transitions as stochastic events. We assume that the probability of making a mode transition is proportional to the measure of the set of pairs of time points and value states at which the mode transition is enabled. To ensure a sound mathematical basis, we impose a natural continuity property on the non-linear ODEs. We also assume that the states of the system are observed at discrete time points but that the mode transitions may take place at any time between two successive discrete time points. This leads to a discrete time Markov chain as a probabilistic approximation of the hybrid system. We then show that for BLTL (bounded linear time temporal logic) specifications the hybrid system meets a specification iff its Markov chain approximation meets the same specification with probability $1$. Based on this, we formulate a sequential hypothesis testing procedure for verifying -approximately- that the Markov chain meets a BLTL specification with high probability. Our case studies on cardiac cell dynamics and the circadian rhythm indicate that our scheme can be applied in a number of realistic settings.
- Published
- 2014
21. Probabilistic verification of partially observable dynamical systems
- Author
-
Gyori, Benjamin M., Paulin, Daniel, and Palaniappan, Sucheendra K.
- Subjects
Computer Science - Systems and Control ,Computer Science - Logic in Computer Science ,Quantitative Biology - Quantitative Methods - Abstract
The construction and formal verification of dynamical models is important in engineering, biology and other disciplines. We focus on non-linear models containing a set of parameters governing their dynamics. The value of these parameters is often unknown and not directly observable through measurements, which are themselves noisy. When treating parameters as random variables, one can constrain their distribution by conditioning on observations and thereby constructing a posterior probability distribution. We aim to perform model verification with respect to this posterior. The main difficulty in performing verification on a model under the posterior distribution is that in general, it is difficult to obtain \emph{independent} samples from the posterior, especially for non-linear dynamical models. Standard statistical model checking methods require independent realizations of the system and are therefore not applicable in this context. We propose a Markov chain Monte Carlo based statistical model checking framework, which produces a sequence of dependent random realizations of the model dynamics over the parameter posterior. Using this sequence of samples, we use statistical hypothesis tests to verify whether the model satisfies a bounded temporal logic property with a certain probability. We use sample size bounds tailored to the setting of dependent samples for fixed sample size and sequential tests. We apply our method to a case-study from the domain of systems biology, to a model of the JAK-STAT biochemical pathway. The pathway is modeled as a system of non-linear ODEs containing a set of unknown parameters. Noisy, indirect observations of the system state are available from an experiment. The results show that the proposed method enables probabilistic verification with respect to the parameter posterior with specified error bounds., Comment: 21 pages, 6 figures
- Published
- 2014
22. Hypothesis testing for Markov chain Monte Carlo
- Author
-
Gyori, Benjamin M. and Paulin, Daniel
- Subjects
Statistics - Methodology ,62M02 - Abstract
Testing between hypotheses, when independent sampling is possible, is a well developed subject. In this paper, we propose hypothesis tests that are applicable when the samples are obtained using Markov chain Monte Carlo. These tests are useful when one is interested in deciding whether the expected value of a certain quantity is above or below a given threshold. We show non-asymptotic error bounds and bounds on the expected number of samples for three types of tests, a fixed sample size test, a sequential test with indifference region, and a sequential test without indifference region. Our tests can lead to significant savings in sample size. We illustrate our results on an example of Bayesian parameter inference involving an ODE model of a biochemical pathway., Comment: 21 pages, 8 figures
- Published
- 2014
- Full Text
- View/download PDF
23. Erratum To: COVID‐19 Disease Map, a computational knowledge repository of virus‐host interaction mechanisms
- Author
-
Ostaszewski, Marek, Niarakis, Anna, Mazein, Alexander, Kuperstein, Inna, Phair, Robert, Orta‐Resendiz, Aurelio, Singh, Vidisha, Aghamiri, Sara Sadat, Acencio, Marcio Luis, Glaab, Enrico, Ruepp, Andreas, Fobo, Gisela, Montrone, Corinna, Brauner, Barbara, Frishman, Goar, Monraz Gómez, Luis Cristóbal, Somers, Julia, Hoch, Matti, Kumar Gupta, Shailendra, Scheel, Julia, Borlinghaus, Hanna, Czauderna, Tobias, Schreiber, Falk, Montagud, Arnau, Ponce de Leon, Miguel, Funahashi, Akira, Hiki, Yusuke, Hiroi, Noriko, Yamada, Takahiro G, Dräger, Andreas, Renz, Alina, Naveez, Muhammad, Bocskei, Zsolt, Messina, Francesco, Börnigen, Daniela, Fergusson, Liam, Conti, Marta, Rameil, Marius, Nakonecnij, Vanessa, Vanhoefer, Jakob, Schmiester, Leonard, Wang, Muying, Ackerman, Emily E, Shoemaker, Jason E, Zucker, Jeremy, Oxford, Kristie, Teuton, Jeremy, Kocakaya, Ebru, Summak, Gökçe Yağmur, Hanspers, Kristina, Kutmon, Martina, Coort, Susan, Eijssen, Lars, Ehrhart, Friederike, Rex, D A B, Slenter, Denise, Martens, Marvin, Pham, Nhung, Haw, Robin, Jassal, Bijay, Matthews, Lisa, Orlic‐Milacic, Marija, Senff‐Ribeiro, Andrea, Rothfels, Karen, Shamovsky, Veronica, Stephan, Ralf, Sevilla, Cristoffer, Varusai, Thawfeek, Ravel, Jean‐Marie, Fraser, Rupsha, Ortseifen, Vera, Marchesi, Silvia, Gawron, Piotr, Smula, Ewa, Heirendt, Laurent, Satagopam, Venkata, Wu, Guanming, Riutta, Anders, Golebiewski, Martin, Owen, Stuart, Goble, Carole, Hu, Xiaoming, Overall, Rupert W, Maier, Dieter, Bauch, Angela, Gyori, Benjamin M, Bachman, John A, Vega, Carlos, Grouès, Valentin, Vazquez, Miguel, Porras, Pablo, Licata, Luana, Iannuccelli, Marta, Sacco, Francesca, Nesterova, Anastasia, Yuryev, Anton, de Waard, Anita, Turei, Denes, Luna, Augustin, Babur, Ozgun, Soliman, Sylvain, Valdeolivas, Alberto, Esteban‐Medina, Marina, Peña‐Chilet, Maria, Rian, Kinza, Helikar, Tomáš, Puniya, Bhanwar Lal, Modos, Dezso, Treveil, Agatha, Olbei, Marton, De Meulder, Bertrand, Ballereau, Stephane, Dugourd, Aurélien, Naldi, Aurélien, Noël, Vincent, Calzone, Laurence, Sander, Chris, Demir, Emek, Korcsmaros, Tamas, Freeman, Tom C, Augé, Franck, Beckmann, Jacques S, Hasenauer, Jan, Wolkenhauer, Olaf, Willighagen, Egon L, Pico, Alexander R, Evelo, Chris T, Gillespie, Marc E, Stein, Lincoln D, Hermjakob, Henning, D'Eustachio, Peter, Saez‐Rodriguez, Julio, Dopazo, Joaquin, Valencia, Alfonso, Kitano, Hiroaki, Barillot, Emmanuel, Auffray, Charles, Balling, Rudi, and Schneider, Reinhard
- Published
- 2021
- Full Text
- View/download PDF
24. GeneWalk identifies relevant gene functions for a biological context using network representation learning
- Author
-
Ietswaart, Robert, Gyori, Benjamin M., Bachman, John A., Sorger, Peter K., and Churchman, L. Stirling
- Published
- 2021
- Full Text
- View/download PDF
25. COVID19 Disease Map, a computational knowledge repository of virus–host interaction mechanisms
- Author
-
Ostaszewski, Marek, Niarakis, Anna, Mazein, Alexander, Kuperstein, Inna, Phair, Robert, Orta‐Resendiz, Aurelio, Singh, Vidisha, Aghamiri, Sara Sadat, Acencio, Marcio Luis, Glaab, Enrico, Ruepp, Andreas, Fobo, Gisela, Montrone, Corinna, Brauner, Barbara, Frishman, Goar, Monraz Gómez, Luis Cristóbal, Somers, Julia, Hoch, Matti, Kumar Gupta, Shailendra, Scheel, Julia, Borlinghaus, Hanna, Czauderna, Tobias, Schreiber, Falk, Montagud, Arnau, Ponce de Leon, Miguel, Funahashi, Akira, Hiki, Yusuke, Hiroi, Noriko, Yamada, Takahiro G, Dräger, Andreas, Renz, Alina, Naveez, Muhammad, Bocskei, Zsolt, Messina, Francesco, Börnigen, Daniela, Fergusson, Liam, Conti, Marta, Rameil, Marius, Nakonecnij, Vanessa, Vanhoefer, Jakob, Schmiester, Leonard, Wang, Muying, Ackerman, Emily E, Shoemaker, Jason E, Zucker, Jeremy, Oxford, Kristie, Teuton, Jeremy, Kocakaya, Ebru, Summak, Gökçe Yağmur, Hanspers, Kristina, Kutmon, Martina, Coort, Susan, Eijssen, Lars, Ehrhart, Friederike, Rex, Devasahayam Arokia Balaya, Slenter, Denise, Martens, Marvin, Pham, Nhung, Haw, Robin, Jassal, Bijay, Matthews, Lisa, Orlic‐Milacic, Marija, Senff-Ribeiro, Andrea, Rothfels, Karen, Shamovsky, Veronica, Stephan, Ralf, Sevilla, Cristoffer, Varusai, Thawfeek, Ravel, Jean‐Marie, Fraser, Rupsha, Ortseifen, Vera, Marchesi, Silvia, Gawron, Piotr, Smula, Ewa, Heirendt, Laurent, Satagopam, Venkata, Wu, Guanming, Riutta, Anders, Golebiewski, Martin, Owen, Stuart, Goble, Carole, Hu, Xiaoming, Overall, Rupert W, Maier, Dieter, Bauch, Angela, Gyori, Benjamin M, Bachman, John A, Vega, Carlos, Grouès, Valentin, Vazquez, Miguel, Porras, Pablo, Licata, Luana, Iannuccelli, Marta, Sacco, Francesca, Nesterova, Anastasia, Yuryev, Anton, de Waard, Anita, Turei, Denes, Luna, Augustin, Babur, Ozgun, Soliman, Sylvain, Valdeolivas, Alberto, Esteban‐Medina, Marina, Peña‐Chilet, Maria, Rian, Kinza, Helikar, Tomáš, Puniya, Bhanwar Lal, Modos, Dezso, Treveil, Agatha, Olbei, Marton, De Meulder, Bertrand, Ballereau, Stephane, Dugourd, Aurélien, Naldi, Aurélien, Noël, Vincent, Calzone, Laurence, Sander, Chris, Demir, Emek, Korcsmaros, Tamas, Freeman, Tom C, Augé, Franck, Beckmann, Jacques S, Hasenauer, Jan, Wolkenhauer, Olaf, Willighagen, Egon L, Pico, Alexander R, Evelo, Chris T, Gillespie, Marc E, Stein, Lincoln D, Hermjakob, Henning, D'Eustachio, Peter, Saez‐Rodriguez, Julio, Dopazo, Joaquin, Valencia, Alfonso, Kitano, Hiroaki, Barillot, Emmanuel, Auffray, Charles, Balling, Rudi, and Schneider, Reinhard
- Published
- 2021
- Full Text
- View/download PDF
26. Non-asymptotic confidence intervals for MCMC in practice
- Author
-
Gyori, Benjamin M. and Paulin, Daniel
- Subjects
Mathematics - Probability ,65C05, 60J10, 62M05, 82B20, 68Q87, 68W20 - Abstract
Using concentration inequalities, we give non-asymptotic confidence intervals for estimates obtained by Markov chain Monte Carlo (MCMC) simulations, when using the approximation $\mathbb{E}_{\pi} f\approx (1/(N-t_0))\cdot \sum_{i=t_0+1}^N f(X_i)$. To allow the application of non-asymptotic error bounds in practice, here we state bounds formulated in terms of the spectral properties of the chain and the properties of $f$ and propose estimators of the parameters appearing in the bounds, including the spectral gap, mixing time, and asymptotic variance. We introduce a method for setting the burn-in time and the initial distribution that is theoretically well-founded and yet is relatively simple to apply. We also investigate the estimation of $\mathbb{E}_{\pi}f$ via subsampling and by using parallel runs instead of a single run. Our results are applicable to both reversible and non-reversible Markov chains on discrete as well as general state spaces. We illustrate our methods by simulations for three examples of Bayesian inference in the context of risk models and clinical trials., Comment: 33 pages, 2 figures
- Published
- 2012
27. Open code, open data, and open infrastructure to promote the sustainability of curated scientific resources
- Author
-
Hoyt, Charles Tapley, primary and Gyori, Benjamin M., additional
- Published
- 2023
- Full Text
- View/download PDF
28. Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment
- Author
-
Scholten, Bernice, Simon, Laura Guerrero, Krishnan, Shaji, Vermeulen, Roel, Pronk, Anjoeka, Gyori, Benjamin M., Bachman, John A., Vlaanderen, Jette, and Stierum, Rob
- Subjects
Cocarcinogens -- Evaluation -- Health aspects ,Meta-analysis -- Methods ,Hazardous substances -- Risk assessment ,Machine learning -- Usage ,Carcinogens -- Evaluation -- Health aspects ,Technology application ,Environmental issues ,Health - Abstract
BACKGROUND: Mechanistic data is increasingly used in hazard identification of chemicals. However, the volume of data is large, challenging the efficient identification and clustering of relevant data. OBJECTIVES: We investigated whether evidence identification for hazard assessment can become more efficient and informed through an automated approach that combines machine reading of publications with network visualization tools. METHODS: We chose 13 chemicals that were evaluated by the International Agency for Research on Cancer (IARC) Monographs program incorporating the key characteristics of carcinogens (KCCs) approach. Using established literature search terms for KCCs, we retrieved and analyzed literature using Integrated Network and Dynamical Reasoning Assembler (INDRA). INDRA combines large-scale literature processing with pathway databases and extracts relationships between biomolecules, bioprocesses, and chemicals into statements (e.g., 'benzene activates DNA damage'). These statements were subsequently assembled into networks and compared with the KCC evaluation by the IARC, to evaluate the informativeness of our approach. RESULTS: We found, in general, larger networks for those chemicals which the IARC has evaluated the evidence to be strong for KCC induction. Larger networks were not directly linked to publication count, given that we retrieved small networks for several chemicals with little support for KCC activation according to the IARC, despite the significant volume of literature for these specific chemicals. In addition, interpreting networks for genotoxicity and DNA repair showed concordance with the IARC KCC evaluation. DISCUSSION: Our method is an automated approach to condense mechanistic literature into searchable and interpretable networks based on an a priori ontology. The approach is no replacement of expert evaluation but, instead, provides an informed structure for experts to quickly identify which statements are made in which papers and how these could connect. We focused on the KCCs because these are supported by well-described search terms. The method needs to be tested in other frameworks as well to demonstrate its generalizability. https://doi.org/10.1289/EHP9112, Introduction Risk assessment of chemicals is commonly based on toxicological or epidemiological studies. Mechanistic studies can be used to complement animal or epidemiological data to inform mechanisms of toxicity, dose-response [...]
- Published
- 2022
- Full Text
- View/download PDF
29. Parallelized Parameter Estimation of Biological Pathway Models
- Author
-
Ramanathan, R., Zhang, Yan, Zhou, Jun, Gyori, Benjamin M., Wong, Weng-Fai, Thiagarajan, P. S., Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Abate, Alessandro, editor, and Šafránek, David, editor
- Published
- 2015
- Full Text
- View/download PDF
30. Approximate Probabilistic Verification of Hybrid Systems
- Author
-
Gyori, Benjamin M., Liu, Bing, Paul, Soumya, Ramanathan, R., Thiagarajan, P. S., Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Abate, Alessandro, editor, and Šafránek, David, editor
- Published
- 2015
- Full Text
- View/download PDF
31. A Simple Standard for Ontological Mappings 2023: Updates on data model, collaborations and tooling
- Author
-
Matentzoglu, Nicolas, Braun, Ian, Caron, Anita R., Goutte-Gattat, Damien, Gyori, Benjamin M., Harris, Nomi L., Hartley, Emily, Hegde, Harshad B., Hertling, Sven, Hoyt, Charles Tapley, Kim, HyeongSik, Li, Huanyu, McLaughlin, James, Trojahn, Cassia, Vasilevsky, Nicole, Mungall, Christopher J., Matentzoglu, Nicolas, Braun, Ian, Caron, Anita R., Goutte-Gattat, Damien, Gyori, Benjamin M., Harris, Nomi L., Hartley, Emily, Hegde, Harshad B., Hertling, Sven, Hoyt, Charles Tapley, Kim, HyeongSik, Li, Huanyu, McLaughlin, James, Trojahn, Cassia, Vasilevsky, Nicole, and Mungall, Christopher J.
- Abstract
The Simple Standard for Ontological Mappings (SSSOM) was first published in December 2021 (v. 0.9). After a number of revisions prompted by community feedback, we have published version 0.15.0 in July 2023. Here we report on the progress made since August 2022, in particular changes to tooling, data model and summary of ongoing standardisation efforts.
- Published
- 2023
32. Drug-target identification in COVID-19 disease mechanisms using computational systems biology approaches
- Author
-
Sanofi, Instituto de Salud Carlos III, Ministerio de Ciencia, Innovación y Universidades (España), Agencia Estatal de Investigación (España), German Research Foundation, Ministero della Salute, European Commission, Generalitat de Catalunya, National Institutes of Health (US), Klaus Tschira Foundation, National Library of Medicine (US), Niarakis, Anna, Ostaszewski, Marek, Mazein, Alexander, Kuperstein, Inna, Kutmon, Martina, Gillespie, Marc E., Funahashi, Akira, Acencio, Marcio Luis, Hemedan, Ahmed, Aichem, Michael, Klein, Karsten, Czauderna, Tobias, Burtscher, Felicia, Yamada, Takahiro G., Hiki, Yusuke, Hiroi, Noriko F., Hu, Finterly, Pham, Nhung, Ehrhart, Friederike, Willighagen, Egon L., Valdeolivas, Alberto, Dugourd, Aurelien, Messina, Francesco, Esteban-Medina, Marina, Peña-Chilet, María, Rian, Kinza, Soliman, Sylvain, Aghamiri, Sara Sadat, Lal Puniya, Bhanwar, Naldi, Aurelien, Helikar, Tomas, Singh, Vidisha, Fariñas Fernández, Marco, Bermudez, Viviam, Tsirvouli, Eirini, Montagud, Arnau, Noël, Vincent, Ponce de León, Miguel, Maier, Dieter, Bauch, Angela, Gyori, Benjamin M., Bachman, John A., Luna, Augustin, Piñero, Janet, Furlong, Laura I., Balaur, Irina BalaurIrina, Rougny, Adrien, Jarosz, Yohan, Overall, Rupert W., Phair, Robert, Perfetto, Livia, Matthews, Lisa, Balaya Rex, Devasahayam Arokia, Orlic-Milacic, Marija, Monraz Gómez, Luis Cristóbal, De Meulder, Bertrand, Ravel, Jean Marie, Jassal, Bijay, Satagopam, Venkata, Wu, Guanming, Golebiewski, Martin, Gawron, Piotr, Calzone, Laurence, Beckmann, Jacques S., Evelo, Chris T., D’Eustachio, Peter, Schreiber, Falk, Sáez-Rodríguez, Julio, Dopazo, Joaquín, Kuiper, Martin, Valencia, Alfonso, Wolkenhauer, Olaf, Kitano, Hiroaki, Barillot, Emmanuel, Auffray, Charles, Balling, Rudi, Schneider, Reinhard, COVID- Disease Map Community the COVID-19 Disease Map Community, Sanofi, Instituto de Salud Carlos III, Ministerio de Ciencia, Innovación y Universidades (España), Agencia Estatal de Investigación (España), German Research Foundation, Ministero della Salute, European Commission, Generalitat de Catalunya, National Institutes of Health (US), Klaus Tschira Foundation, National Library of Medicine (US), Niarakis, Anna, Ostaszewski, Marek, Mazein, Alexander, Kuperstein, Inna, Kutmon, Martina, Gillespie, Marc E., Funahashi, Akira, Acencio, Marcio Luis, Hemedan, Ahmed, Aichem, Michael, Klein, Karsten, Czauderna, Tobias, Burtscher, Felicia, Yamada, Takahiro G., Hiki, Yusuke, Hiroi, Noriko F., Hu, Finterly, Pham, Nhung, Ehrhart, Friederike, Willighagen, Egon L., Valdeolivas, Alberto, Dugourd, Aurelien, Messina, Francesco, Esteban-Medina, Marina, Peña-Chilet, María, Rian, Kinza, Soliman, Sylvain, Aghamiri, Sara Sadat, Lal Puniya, Bhanwar, Naldi, Aurelien, Helikar, Tomas, Singh, Vidisha, Fariñas Fernández, Marco, Bermudez, Viviam, Tsirvouli, Eirini, Montagud, Arnau, Noël, Vincent, Ponce de León, Miguel, Maier, Dieter, Bauch, Angela, Gyori, Benjamin M., Bachman, John A., Luna, Augustin, Piñero, Janet, Furlong, Laura I., Balaur, Irina BalaurIrina, Rougny, Adrien, Jarosz, Yohan, Overall, Rupert W., Phair, Robert, Perfetto, Livia, Matthews, Lisa, Balaya Rex, Devasahayam Arokia, Orlic-Milacic, Marija, Monraz Gómez, Luis Cristóbal, De Meulder, Bertrand, Ravel, Jean Marie, Jassal, Bijay, Satagopam, Venkata, Wu, Guanming, Golebiewski, Martin, Gawron, Piotr, Calzone, Laurence, Beckmann, Jacques S., Evelo, Chris T., D’Eustachio, Peter, Schreiber, Falk, Sáez-Rodríguez, Julio, Dopazo, Joaquín, Kuiper, Martin, Valencia, Alfonso, Wolkenhauer, Olaf, Kitano, Hiroaki, Barillot, Emmanuel, Auffray, Charles, Balling, Rudi, Schneider, Reinhard, and COVID- Disease Map Community the COVID-19 Disease Map Community
- Abstract
Introduction: The COVID-19 Disease Map project is a large-scale community effort uniting 277 scientists from 130 Institutions around the globe. We use high-quality, mechanistic content describing SARS-CoV-2-host interactions and develop interoperable bioinformatic pipelines for novel target identification and drug repurposing., Methods: Extensive community work allowed an impressive step forward in building interfaces between Systems Biology tools and platforms. Our framework can link biomolecules from omics data analysis and computational modelling to dysregulated pathways in a cell-, tissue- or patient-specific manner. Drug repurposing using text mining and AI-assisted analysis identified potential drugs, chemicals and microRNAs that could target the identified key factors., Results: Results revealed drugs already tested for anti-COVID-19 efficacy, providing a mechanistic context for their mode of action, and drugs already in clinical trials for treating other diseases, never tested against COVID-19., Discussion: The key advance is that the proposed framework is versatile and expandable, offering a significant upgrade in the arsenal for virus-host interactions and other complex pathologies.
- Published
- 2023
33. Prediction and curation of missing biomedical identifier mappings with Biomappings
- Author
-
Hoyt, Charles Tapley, primary, Hoyt, Amelia L, additional, and Gyori, Benjamin M, additional
- Published
- 2023
- Full Text
- View/download PDF
34. Nociceptor neuroimmune interactomes reveal cell type- and injury-specific inflammatory pain pathways
- Author
-
Jain, Aakanksha, primary, Gyori, Benjamin M., additional, Hakim, Sara, additional, Bunga, Samuel, additional, Taub, Daniel G, additional, Ruiz-Cantero, Mari Carmen, additional, Tong-Li, Candace, additional, Andrews, Nicholas, additional, Sorger, Peter K, additional, and Woolf, Clifford J, additional
- Published
- 2023
- Full Text
- View/download PDF
35. Statistical Model Checking Based Calibration and Analysis of Bio-pathway Models
- Author
-
Palaniappan, Sucheendra K., Gyori, Benjamin M., Liu, Bing, Hsu, David, Thiagarajan, P. S., Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Istrail, Sorin, editor, Pevzner, Pavel, editor, Waterman, Michael S., editor, Gupta, Ashutosh, editor, and Henzinger, Thomas A., editor
- Published
- 2013
- Full Text
- View/download PDF
36. Prediction and Curation of Missing Biomedical Identifier Mappings with Biomappings
- Author
-
Hoyt, Charles Tapley, primary, Hoyt, Amelia L., additional, and Gyori, Benjamin M., additional
- Published
- 2022
- Full Text
- View/download PDF
37. OpenComet: An automated tool for comet assay image analysis
- Author
-
Gyori, Benjamin M., Venkatachalam, Gireedhar, Thiagarajan, P.S., Hsu, David, and Clement, Marie-Veronique
- Published
- 2014
- Full Text
- View/download PDF
38. FamPlex: a resource for entity recognition and relationship resolution of human protein families and complexes in biomedical text mining
- Author
-
Bachman, John A., Gyori, Benjamin M., and Sorger, Peter K.
- Published
- 2018
- Full Text
- View/download PDF
39. Hypothesis testing for Markov chain Monte Carlo
- Author
-
Gyori, Benjamin M. and Paulin, Daniel
- Published
- 2016
- Full Text
- View/download PDF
40. Automated assembly of molecular mechanisms at scale from text mining and curated databases
- Author
-
Bachman, John A., primary, Gyori, Benjamin M., additional, and Sorger, Peter K., additional
- Published
- 2022
- Full Text
- View/download PDF
41. ChemicalX: A Deep Learning Library for Drug Pair Scoring
- Author
-
Rozemberczki, Benedek, primary, Hoyt, Charles Tapley, additional, Gogleva, Anna, additional, Grabowski, Piotr, additional, Karis, Klas, additional, Lamov, Andrej, additional, Nikolov, Andriy, additional, Nilsson, Sebastian, additional, Ughetto, Michael, additional, Wang, Yu, additional, Derr, Tyler, additional, and Gyori, Benjamin M., additional
- Published
- 2022
- Full Text
- View/download PDF
42. Unifying the Identification of Biomedical Entities with the Bioregistry
- Author
-
Hoyt, Charles Tapley, primary, Balk, Meghan, additional, Callahan, Tiffany J., additional, Domingo-Fernández, Daniel, additional, Haendel, Melissa A., additional, Hegde, Harshad B., additional, Himmelstein, Daniel S., additional, Karis, Klas, additional, Kunze, John, additional, Lubiana, Tiago, additional, Matentzoglu, Nicolas, additional, McMurry, Julie, additional, Moxon, Sierra, additional, Mungall, Christopher J., additional, Rutz, Adriano, additional, Unni, Deepak R., additional, Willighagen, Egon, additional, Winston, Donald, additional, and Gyori, Benjamin M., additional
- Published
- 2022
- Full Text
- View/download PDF
43. Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment
- Author
-
IRAS OH Epidemiology Chemical Agents, dIRAS RA-2, Scholten, Bernice, Simón, Laura Guerrero, Krishnan, Shaji, Vermeulen, Roel, Pronk, Anjoeka, Gyori, Benjamin M, Bachman, John A, Vlaanderen, Jelle, Stierum, Rob, IRAS OH Epidemiology Chemical Agents, dIRAS RA-2, Scholten, Bernice, Simón, Laura Guerrero, Krishnan, Shaji, Vermeulen, Roel, Pronk, Anjoeka, Gyori, Benjamin M, Bachman, John A, Vlaanderen, Jelle, and Stierum, Rob
- Published
- 2022
44. ProtSTonKGs: A Sophisticated Transformer Trained on Protein Sequences, Text, and Knowledge Graphs
- Author
-
Balabin, Helena, Hoyt, Charles Tapley, Gyori, Benjamin M., Bachman, John, Tom Kodamullil, Alpha, Hofmann-Apitius, Martin, Domingo-Fernández, Daniel, Balabin, Helena, Hoyt, Charles Tapley, Gyori, Benjamin M., Bachman, John, Tom Kodamullil, Alpha, Hofmann-Apitius, Martin, and Domingo-Fernández, Daniel
- Abstract
While most approaches individually exploit unstructured data from the biomedical literature or structured data from biomedical knowledge graphs, their union can better exploit the advantages of such approaches, ultimately improving representations of biology. Using multimodal transformers for such purposes can improve performance on context dependent classication tasks, as demonstrated by our previous model, the Sophisticated Transformer Trained on Biomedical Text and Knowledge Graphs (STonKGs). In this work, we introduce ProtSTonKGs, a transformer aimed at learning all-encompassing representations of protein-protein interactions. ProtSTonKGs presents an extension to our previous work by adding textual protein descriptions and amino acid sequences (i.e., structural information) to the text- and knowledge graph-based input sequence used in STonKGs. We benchmark ProtSTonKGs against STonKGs, resulting in improved F1 scores by up to 0.066 (i.e., from 0.204 to 0.270) in several tasks such as predicting protein interactions in several contexts. Our work demonstrates how multimodal transformers can be used to integrate heterogeneous sources of information, paving the foundation for future approaches that use multiple modalities for biomedical applications.
- Published
- 2022
45. STonKGs: A Sophisticated Transformer Trained on Biomedical Text and Knowledge Graphs
- Author
-
Balabin, Helena, Hoyt, Charles Tapley, Birkenbihl, Colin, Gyori, Benjamin M., Bachman, John, Tom Kodamullil, Alpha, Plöger, Paul G., Hofmann-Apitius, Martin, Domingo-Fernández, Daniel, Balabin, Helena, Hoyt, Charles Tapley, Birkenbihl, Colin, Gyori, Benjamin M., Bachman, John, Tom Kodamullil, Alpha, Plöger, Paul G., Hofmann-Apitius, Martin, and Domingo-Fernández, Daniel
- Abstract
MOTIVATION The majority of biomedical knowledge is stored in structured databases or as unstructured text in scientific publications. This vast amount of information has led to numerous machine learning-based biological applications using either text through natural language processing (NLP) or structured data through knowledge graph embedding models (KGEMs). However, representations based on a single modality are inherently limited. RESULTS To generate better representations of biological knowledge, we propose STonKGs, a Sophisticated Transformer trained on biomedical text and Knowledge Graphs (KGs). This multimodal Transformer uses combined input sequences of structured information from KGs and unstructured text data from biomedical literature to learn joint representations in a shared embedding space. First, we pre-trained STonKGs on a knowledge base assembled by the Integrated Network and Dynamical Reasoning Assembler (INDRA) consisting of millions of text-triple pairs extracted from biomedical literature by multiple NLP systems. Then, we benchmarked STonKGs against three baseline models trained on either one of the modalities (i.e., text or KG) across eight different classification tasks, each corresponding to a different biological application. Our results demonstrate that STonKGs outperforms both baselines, especially on the more challenging tasks with respect to the number of classes, improving upon the F1-score of the best baseline by up to 0.084 (i.e., from 0.881 to 0.965). Finally, our pre-trained model as well as the model architecture can be adapted to various other transfer learning applications. AVAILABILITY We make the source code and the Python package of STonKGs available at GitHub (https://github.com/stonkgs/stonkgs) and PyPI (https://pypi.org/project/stonkgs/). The pre-trained STonKGs models and the task-specific classification models are respectively available at https://huggingface.co/stonkgs/stonkgs-150k and https://zenodo.org/communities/ston
- Published
- 2022
46. From word models to executable models of signaling networks using automated assembly
- Author
-
Gyori, Benjamin M, Bachman, John A, Subramanian, Kartik, Muhlich, Jeremy L, Galescu, Lucian, and Sorger, Peter K
- Published
- 2017
- Full Text
- View/download PDF
47. PyBioPAX: biological pathway exchange in Python
- Author
-
Gyori, Benjamin M., primary and Hoyt, Charles Tapley, additional
- Published
- 2022
- Full Text
- View/download PDF
48. STonKGs: a sophisticated transformer trained on biomedical text and knowledge graphs
- Author
-
Balabin, Helena, primary, Hoyt, Charles Tapley, additional, Birkenbihl, Colin, additional, Gyori, Benjamin M, additional, Bachman, John, additional, Kodamullil, Alpha Tom, additional, Plöger, Paul G, additional, Hofmann-Apitius, Martin, additional, and Domingo-Fernández, Daniel, additional
- Published
- 2022
- Full Text
- View/download PDF
49. Gilda: biomedical entity text normalization with machine-learned disambiguation as a service
- Author
-
Gyori, Benjamin M, primary, Hoyt, Charles Tapley, additional, and Steppi, Albert, additional
- Published
- 2022
- Full Text
- View/download PDF
50. A Simple Standard for Sharing Ontological Mappings (SSSOM)
- Author
-
Matentzoglu, Nicolas, primary, Balhoff, James P, additional, Bello, Susan M, additional, Bizon, Chris, additional, Brush, Matthew, additional, Callahan, Tiffany J, additional, Chute, Christopher G, additional, Duncan, William D, additional, Evelo, Chris T, additional, Gabriel, Davera, additional, Graybeal, John, additional, Gray, Alasdair, additional, Gyori, Benjamin M, additional, Haendel, Melissa, additional, Harmse, Henriette, additional, Harris, Nomi L, additional, Harrow, Ian, additional, Hegde, Harshad B, additional, Hoyt, Amelia L, additional, Hoyt, Charles T, additional, Jiao, Dazhi, additional, Jiménez-Ruiz, Ernesto, additional, Jupp, Simon, additional, Kim, Hyeongsik, additional, Koehler, Sebastian, additional, Liener, Thomas, additional, Long, Qinqin, additional, Malone, James, additional, McLaughlin, James A, additional, McMurry, Julie A, additional, Moxon, Sierra, additional, Munoz-Torres, Monica C, additional, Osumi-Sutherland, David, additional, Overton, James A, additional, Peters, Bjoern, additional, Putman, Tim, additional, Queralt-Rosinach, Núria, additional, Shefchek, Kent, additional, Solbrig, Harold, additional, Thessen, Anne, additional, Tudorache, Tania, additional, Vasilevsky, Nicole, additional, Wagner, Alex H, additional, and Mungall, Christopher J, additional
- Published
- 2022
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.