156 results on '"Martin Krzywinski"'
Search Results
2. Data from Sequence Variant Discovery in DNA Repair Genes from Radiosensitive and Radiotolerant Prostate Brachytherapy Patients
- Author
-
Richard A. Moore, Marco A. Marra, Peggy L. Olive, W. Jim Morris, Michael McKenzie, Alexander Agranovich, Cindy Yang, Karen Novik, Dallas Thomas, Martin Krzywinski, Allen Delaney, Lorena Barclay, Mira Keyes, and Trevor J. Pugh
- Abstract
Purpose: The presence of intrinsic radiosensitivity within prostate cancer patients may be an important factor contributing to development of radiation toxicity. We investigated whether variants in genes responsible for detecting and repairing DNA damage independently contribute to toxicity following prostate brachytherapy.Experimental Design: Genomic DNA was extracted from blood samples of 41 prostate brachytherapy patients, 21 with high and 20 with low late toxicity scores. For each patient, 242 PCR amplicons were generated containing 173 exons of eight candidate genes: ATM, BRCA1, ERCC2, H2AFX, LIG4, MDC1, MRE11A, and RAD50. These amplicons were sequenced and all sequence variants were subjected to statistical analysis to identify those associated with late radiation toxicity.Results: Across 41 patients, 239 sites differed from the human genome reference sequence; 170 of these corresponded to known polymorphisms. Sixty variants, 14 of them novel, affected protein coding regions and 43 of these were missense mutations. In our patient population, the high toxicity group was enriched for individuals with at least one LIG4 coding variant (P = 0.028). One synonymous variant in MDC1, rs28986317, was associated with increased radiosensitivity (P = 0.048). A missense variant in ATM, rs1800057, associated with increased prostate cancer risk, was found exclusively in two high toxicity patients but did not reach statistical significance for association with radiosensitivity (P = 0.488).Conclusions: Our data revealed new germ-line sequence variants, indicating that existing sequence databases do not fully represent the full extent of sequence variation. Variants in three DNA repair genes were linked to increased radiosensitivity but require validation in larger populations.
- Published
- 2023
3. Supplementary Data from Sequence Variant Discovery in DNA Repair Genes from Radiosensitive and Radiotolerant Prostate Brachytherapy Patients
- Author
-
Richard A. Moore, Marco A. Marra, Peggy L. Olive, W. Jim Morris, Michael McKenzie, Alexander Agranovich, Cindy Yang, Karen Novik, Dallas Thomas, Martin Krzywinski, Allen Delaney, Lorena Barclay, Mira Keyes, and Trevor J. Pugh
- Abstract
Supplementary Data from Sequence Variant Discovery in DNA Repair Genes from Radiosensitive and Radiotolerant Prostate Brachytherapy Patients
- Published
- 2023
4. Regression modeling of time-to-event data with censoring
- Author
-
Tanujit Dey, Stuart R. Lipsitz, Zara Cooper, Quoc-Dien Trinh, Martin Krzywinski, and Naomi Altman
- Subjects
Cell Biology ,Molecular Biology ,Biochemistry ,Biotechnology - Published
- 2022
5. Survival analysis—time-to-event data and censoring
- Author
-
Tanujit Dey, Stuart R. Lipsitz, Zara Cooper, Quoc-Dien Trinh, Martin Krzywinski, and Naomi Altman
- Subjects
Cell Biology ,Molecular Biology ,Biochemistry ,Biotechnology - Published
- 2022
6. Prokaryotic responses to a warm temperature anomaly in northeast subarctic Pacific waters
- Author
-
Brandon Kieft, Steven J. Hallam, Martin Krzywinski, Tetjana Ross, Angelica Peña, Sachia J. Traving, Marie Robert, Colleen T. E. Kellogg, Ryan J. McLaughlin, and Grace Y. Ho
- Subjects
Water microbiology ,Biogeochemical cycle ,Hot Temperature ,QH301-705.5 ,Climate Change ,Biodiversity ,Medicine (miscellaneous) ,Bacterial Physiological Phenomena ,General Biochemistry, Genetics and Molecular Biology ,The Blob ,Article ,Animal mortality ,Seawater ,Biology (General) ,Microbial biooceanography ,Pacific Ocean ,biology ,Community structure ,biology.organism_classification ,Subarctic climate ,Archaea ,Food web ,Oceanography ,Environmental science ,Seasons ,General Agricultural and Biological Sciences - Abstract
Recent studies on marine heat waves describe water temperature anomalies causing changes in food web structure, bloom dynamics, biodiversity loss, and increased plant and animal mortality. However, little information is available on how water temperature anomalies impact prokaryotes (bacteria and archaea) inhabiting ocean waters. This is a nontrivial omission given their integral roles in driving major biogeochemical fluxes that influence ocean productivity and the climate system. Here we present a time-resolved study on the impact of a large-scale warm water surface anomaly in the northeast subarctic Pacific Ocean, colloquially known as the Blob, on prokaryotic community compositions. Multivariate statistical analyses identified significant depth- and season-dependent trends that were accentuated during the Blob. Moreover, network and indicator analyses identified shifts in specific prokaryotic assemblages from typically particle-associated before the Blob to taxa considered free-living and chemoautotrophic during the Blob, with potential implications for primary production and organic carbon conversion and export., Traving et al. use small subunit ribosomal RNA gene sequencing to examine spatial and temporal trends in bacterial and archaeal community structure during a large marine warm water surface anomaly, the Blob. Their findings suggest that community structure shifted during the Blob, with taxa considered free-living and chemoautotrophic prevailing under these unusual conditions.
- Published
- 2021
7. Survival analysis-time-to-event data and censoring
- Author
-
Tanujit, Dey, Stuart R, Lipsitz, Zara, Cooper, Quoc-Dien, Trinh, Martin, Krzywinski, and Naomi, Altman
- Subjects
Models, Statistical ,Data Interpretation, Statistical ,Computer Simulation ,Survival Analysis - Published
- 2022
8. The class imbalance problem
- Author
-
Naomi Altman, Fadel M. Megahed, Ying-Ju Chen, Yuya Ong, Aly Megahed, and Martin Krzywinski
- Subjects
Class imbalance ,medicine.medical_specialty ,business.industry ,MEDLINE ,Medicine ,Cell Biology ,business ,Intensive care medicine ,Molecular Biology ,Biochemistry ,Biotechnology - Published
- 2021
9. Graphical assessment of tests and classifiers
- Author
-
Martin Krzywinski and Naomi Altman
- Subjects
Information retrieval ,Computer science ,MEDLINE ,Cell Biology ,Molecular Biology ,Biochemistry ,Biotechnology - Abstract
I do not think you can start with anything precise. You have to achieve such precision as you can, as you go along. —Bertrand Russell
- Published
- 2021
10. Improved structural variant interpretation for hereditary cancer susceptibility using long-read sequencing
- Author
-
My Linh Thibodeau, Dean Cheng, Janessa Laskin, Karen Mungall, Howard John Lim, Daniel J. Renouf, Kasmintan A. Schrader, Sophie Sun, Aly Karsan, Pawan Pandoh, Stephen Yip, Eric Chuah, Martin Krzywinski, Caralyn Reisle, Kane Tse, Richard D. Moore, Tina Wong, Katherine Dixon, Stephen Chia, Marco A. Marra, Carol Cremin, Yaoqing Shen, Stephen Pleasance, David F. Schaeffer, Alexandra Fok, Erin Pleasance, Steven J.M. Jones, Andrew J. Mungall, Kieran O'Neill, and Sean D. Young
- Subjects
0301 basic medicine ,Computational biology ,Carrier testing ,Biology ,Brief Communication ,Genome ,Germline ,DNA sequencing ,03 medical and health sciences ,0302 clinical medicine ,Neoplasms ,Cancer screening ,Humans ,Genetic Predisposition to Disease ,Gene ,Genetics (clinical) ,Base Sequence ,variant interpretation ,High-Throughput Nucleotide Sequencing ,structural variants ,Chromosome ,genome sequencing ,030104 developmental biology ,hereditary cancer ,030220 oncology & carcinogenesis ,long-read sequencing ,Nanopore sequencing - Abstract
Purpose Structural variants (SVs) may be an underestimated cause of hereditary cancer syndromes given the current limitations of short-read next-generation sequencing. Here we investigated the utility of long-read sequencing in resolving germline SVs in cancer susceptibility genes detected through short-read genome sequencing. Methods Known or suspected deleterious germline SVs were identified using Illumina genome sequencing across a cohort of 669 advanced cancer patients with paired tumor genome and transcriptome sequencing. Candidate SVs were subsequently assessed by Oxford Nanopore long-read sequencing. Results Nanopore sequencing confirmed eight simple pathogenic or likely pathogenic SVs, resolving three additional variants whose impact could not be fully elucidated through short-read sequencing. A recurrent sequencing artifact on chromosome 16p13 and one complex rearrangement on chromosome 5q35 were subsequently classified as likely benign, obviating the need for further clinical assessment. Variant configuration was further resolved in one case with a complex pathogenic rearrangement affecting TSC2. Conclusion Our findings demonstrate that long-read sequencing can improve the validation, resolution, and classification of germline SVs. This has important implications for return of results, cascade carrier testing, cancer screening, and prophylactic interventions.
- Published
- 2020
11. The standardization fallacy
- Author
-
Hanno Würbel, Naomi Altman, Martin Krzywinski, and Bernhard Voelkl
- Subjects
Fallacy ,Standardization ,MEDLINE ,Computational Biology ,Reproducibility of Results ,Cell Biology ,Reference Standards ,Biochemistry ,Epistemology ,Sample Size ,Humans ,Psychology ,Molecular Biology ,Biotechnology - Published
- 2021
12. A Platform for Oncogenomic Reporting and Interpretation
- Author
-
B. M. Pierce, I. Beckie, R. Stevenson, B. Pellegrini, Melika Bonakdar, L. Bailey, Laura Williamson, A. Muhammadzadeh, E. Chuah, M. Douglas, Stephen Yip, Martin Jones, Martin Krzywinski, D. W. Bleile, A. Fisic, Yussanne Ma, R. Matiello Pletz, T. Mitchell, Erin Pleasance, Janessa Laskin, Cameron J. Grisdale, D. Pham, Jessica Nelson, A. Davies, H. Wong, Karen Mungall, Marco A. Marra, C. Reisle, Daniel J. Renouf, A. Reisle, Sjm Jones, and Jun Li
- Subjects
Manual interpretation ,Matching (statistics) ,Carcinogenesis ,Computer science ,Science ,Knowledge Bases ,Genomic data ,General Physics and Astronomy ,General Biochemistry, Genetics and Molecular Biology ,Unmet needs ,Neoplasms ,Databases, Genetic ,Biomarkers, Tumor ,Humans ,Precision Medicine ,Multidisciplinary ,Interpretation (logic) ,business.industry ,Genetic Variation ,General Chemistry ,Genomics ,Data science ,Knowledge base ,Precision oncology ,Graph (abstract data type) ,InformationSystems_MISCELLANEOUS ,business - Abstract
Manual interpretation of variants remains rate limiting in precision oncology. The increasing scale and complexity of molecular data generated from comprehensive sequencing of cancer samples requires advanced interpretative platforms as precision oncology expands beyond individual patients to entire populations. To address this unmet need, we created the Platform for Oncogenomic Reporting and Interpretation (PORI), comprising an analytic framework created to facilitate the interpretation and reporting of somatic variants in cancer. PORI is unique in its integration of reporting and graph knowledge base tools combined with support for manual curation at the reporting stage. PORI represents one of the first open-source platform alternatives to commercial reporting solutions suitable for comprehensive genomic data sets in precision oncology. We demonstrate the utility of PORI by matching 9,961 TCGA tumours to the graph knowledge base, revealing that 88.2% have at least one potentially targetable alteration, and making available reports describing select individual samples.
- Published
- 2021
13. Uncertainty and the management of epidemics
- Author
-
Katriona Shea, Martin Krzywinski, Ottar N. Bjørnstad, and Naomi Altman
- Subjects
0303 health sciences ,History ,Health Policy ,MEDLINE ,Uncertainty ,Cell Biology ,Plague (disease) ,Biochemistry ,Communicable Diseases ,03 medical and health sciences ,Communicable Disease Control ,Humans ,Social science ,Epidemics ,Molecular Biology ,Health policy ,030304 developmental biology ,Biotechnology - Abstract
“I have no idea what’s awaiting me, or what will happen when this all ends. For the moment I know this: there are sick people and they need curing.” ―Albert Camus, The Plague
- Published
- 2020
14. Modeling infectious epidemics
- Author
-
Martin Krzywinski, Ottar N. Bjørnstad, Naomi Altman, and Katriona Shea
- Subjects
0303 health sciences ,History ,Vaccination ,Cell Biology ,Ancient history ,Models, Theoretical ,Plague (disease) ,Biochemistry ,Biological Evolution ,Communicable Diseases ,03 medical and health sciences ,Humans ,Epidemics ,Molecular Biology ,030304 developmental biology ,Biotechnology - Abstract
“Every day sadder and sadder news of its increase. In the City died this week 7496; and of them, 6102 of the plague. But it is feared that the true number of the dead this week is near 10,000 ....” —Samuel Pepys, 1665
- Published
- 2020
15. Testing for rare conditions
- Author
-
Naomi Altman and Martin Krzywinski
- Subjects
business.industry ,Computer science ,Cell Biology ,Sensitivity and Specificity ,Biochemistry ,Data science ,Data Accuracy ,Rare Diseases ,Text mining ,Humans ,False Positive Reactions ,Genetic Testing ,business ,False Negative Reactions ,Molecular Biology ,Biotechnology - Published
- 2021
16. Optimization of magnetic bead-based nucleic acid extraction for SARS-CoV-2 testing using readily available reagents
- Author
-
Jason Nguyen, Edmund Su, Simon Haile, Agatha N. Jassem, Amee R. Manges, Diane Eisler, Michelle Moksa, Mel Krajden, David D.W. Twa, Leah M Prentice, Qi Cao, Stephen Pleasance, Aidan M. Nikiforuk, Martin Krzywinski, Andrew J. Mungall, Natalie Prystajecky, Steven J. M. Jones, Frankie Tsang, Jessica Nelson, Marcus Wong, Duane E Smailus, Marco A. Marra, Yongjun Zhao, Angus Wong, Pawan Pandoh, Martin Hirst, and Robin J.N. Coope
- Subjects
2019-20 coronavirus outbreak ,Coronavirus disease 2019 (COVID-19) ,Computer science ,Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ,Short Communication ,Biology ,Sensitivity and Specificity ,Cross-well ,COVID-19 Testing ,Contamination ,Virology ,Nucleic Acids ,Humans ,Hamilton NIMBUS ,Process engineering ,NA, nucleic acid ,Pandemics ,Magnetic beads ,business.industry ,SARS-CoV-2 ,Nucleic acid extraction ,Magnetic Phenomena ,Extraction (chemistry) ,Ct, PCR cycle threshold ,Diagnostic test ,COVID-19 ,Reagent ,Magnetic bead ,Nucleic acid ,qPCR, quantitative polymerase chain reaction ,RNA ,RNA, Viral ,Indicators and Reagents ,business - Abstract
The COVID-19 pandemic has highlighted the need for generic reagents and flexible systems in diagnostic testing. Magnetic bead-based nucleic acid extraction protocols using 96-well plates on open liquid handlers are readily amenable to meet this need. Here, one such approach is rigorously optimized to minimize cross-well contamination while maintaining sensitivity.Article SummaryA scalable, non-proprietary, magnetic bead-based automated nucleic acid extraction protocol optimised for minimum cross-well contamination
- Published
- 2021
17. Author Correction: Graphical assessment of tests and classifiers
- Author
-
Martin Krzywinski and Naomi Altman
- Subjects
Computer science ,business.industry ,Published Erratum ,MEDLINE ,Cell Biology ,Artificial intelligence ,computer.software_genre ,business ,Molecular Biology ,Biochemistry ,computer ,Natural language processing ,Biotechnology - Published
- 2021
18. A collaborative filtering-based approach to biomedical knowledge discovery
- Author
-
Jake Lever, Celia Siu, Michael Gottlieb, Martin R. Jones, Maia Smith, Tahereh Rashnavadi, Sitanshu Gakkhar, Martin Krzywinski, Santina Lin, and Steven J.M. Jones
- Subjects
0301 basic medicine ,Statistics and Probability ,Computer science ,02 engineering and technology ,Biochemistry ,03 medical and health sciences ,Software ,Resource (project management) ,Text mining ,Knowledge extraction ,0202 electrical engineering, electronic engineering, information engineering ,Collaborative filtering ,Data Mining ,Representation (mathematics) ,Molecular Biology ,Information retrieval ,business.industry ,Knowledge economy ,Publications ,Data science ,Computer Science Applications ,Computational Mathematics ,030104 developmental biology ,Computational Theory and Mathematics ,Knowledge graph ,020201 artificial intelligence & image processing ,business - Abstract
Motivation The increase in publication rates makes it challenging for an individual researcher to stay abreast of all relevant research in order to find novel research hypotheses. Literature-based discovery methods make use of knowledge graphs built using text mining and can infer future associations between biomedical concepts that will likely occur in new publications. These predictions are a valuable resource for researchers to explore a research topic. Current methods for prediction are based on the local structure of the knowledge graph. A method that uses global knowledge from across the knowledge graph needs to be developed in order to make knowledge discovery a frequently used tool by researchers. Results We propose an approach based on the singular value decomposition (SVD) that is able to combine data from across the knowledge graph through a reduced representation. Using cooccurrence data extracted from published literature, we show that SVD performs better than the leading methods for scoring discoveries. We also show the diminishing predictive power of knowledge discovery as we compare our predictions with real associations that appear further into the future. Finally, we examine the strengths and weaknesses of the SVD approach against another well-performing system using several predicted associations. Availability and implementation All code and results files for this analysis can be accessed at https://github.com/jakelever/knowledgediscovery. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2017
19. Hive Panel Explorer: an interactive network visualization tool
- Author
-
Sarah E I Perez, Aria S. Hahn, Martin Krzywinski, and Steven J. Hallam
- Subjects
Statistics and Probability ,0303 health sciences ,Ecology ,AcademicSubjects/SCI01060 ,business.industry ,Computer science ,Systems Biology ,030302 biochemistry & molecular biology ,Applications Notes ,Biochemistry ,Computer Science Applications ,Rendering (computer graphics) ,Visualization ,World Wide Web ,03 medical and health sciences ,Computational Mathematics ,Software ,Computational Theory and Mathematics ,Graph drawing ,Data and Text Mining ,business ,Molecular Biology ,030304 developmental biology - Abstract
Motivation Networks are used to relate topological structure to system dynamics and function, particularly in ecology systems biology. Network analysis is often guided or complemented by data-driven visualization. Hive one of many network visualizations, distinguish themselves as providing a general, consistent and coherent rule-based representation to motivate hypothesis development and testing. Results Here, we present HyPE, Hive Panel Explorer, a software application that creates a panel of interactive hive plots. HyPE enables network exploration based on user-driven layout rules and parameter combinations for simultaneous of multiple network views. We demonstrate HyPE’s features by exploring a microbial co-occurrence network constructed from forest soil microbiomes. Availability and implementation HyPE is available under the GNU license: https://github.com/hallamlab/HivePanelExplorer. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2020
20. The SEIRS model for infectious disease dynamics
- Author
-
Ottar N. Bjørnstad, Katriona Shea, Martin Krzywinski, and Naomi Altman
- Subjects
business.industry ,Vaccination ,Basic Reproduction Number ,Infant, Newborn ,Cell Biology ,Communicable Diseases ,Models, Biological ,Biochemistry ,Virology ,Infectious disease (medical specialty) ,Communicable disease transmission ,Disease Transmission, Infectious ,Humans ,Medicine ,Computer Simulation ,business ,Molecular Biology ,Disease transmission ,Basic reproduction number ,Biotechnology - Published
- 2020
21. Markov models — training and evaluation of hidden Markov models
- Author
-
Jasleen K. Grewal, Martin Krzywinski, and Naomi Altman
- Subjects
0303 health sciences ,Markov chain ,Computer science ,business.industry ,Training (meteorology) ,Cell Biology ,Markov model ,Machine learning ,computer.software_genre ,Biochemistry ,03 medical and health sciences ,Artificial intelligence ,Hidden Markov model ,business ,Molecular Biology ,computer ,030304 developmental biology ,Biotechnology - Abstract
“With one eye you are looking at the outside world, while with the other you are looking within yourself.” —Amedeo Modigliani
- Published
- 2020
22. Predicting with confidence and tolerance
- Author
-
Naomi Altman and Martin Krzywinski
- Subjects
0301 basic medicine ,medicine.medical_specialty ,business.industry ,MEDLINE ,Cell Biology ,Biochemistry ,03 medical and health sciences ,030104 developmental biology ,Predictive value of tests ,Emergency medicine ,medicine ,business ,Molecular Biology ,Biotechnology - Published
- 2018
23. Author Correction: The SEIRS model for infectious disease dynamics
- Author
-
Naomi Altman, Katriona Shea, Ottar N. Bjørnstad, and Martin Krzywinski
- Subjects
business.industry ,Infectious disease (medical specialty) ,Published Erratum ,MEDLINE ,Medicine ,Cell Biology ,Computational biology ,business ,Molecular Biology ,Biochemistry ,Biotechnology - Published
- 2021
24. Author Correction: Nested designs
- Author
-
Paul C. Blainey, Martin Krzywinski, and Naomi Altman
- Subjects
Cell Biology ,Molecular Biology ,Biochemistry ,Biotechnology - Published
- 2020
25. Statistics versus Machine Learning
- Author
-
Danilo Bzdok, Martin Krzywinski, Naomi Altman, Bzdok, Danilo, Modelling brain structure, function and variability based on high-field MRI data (PARIETAL), Service NEUROSPIN (NEUROSPIN), Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Inria Saclay - Ile de France, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), Department of Psychiatry, Psychotherapy and Psychosomatics [Aachen], Rheinisch-Westfälische Technische Hochschule Aachen University (RWTH), Pennsylvania State University (Penn State), Penn State System, Canada's Michael Smith Genome Sciences Centre (CMSGSC), BC Cancer Agency (BCCRC), Inria Saclay - Ile de France, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Service NEUROSPIN (NEUROSPIN), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA), and Rheinisch-Westfälische Technische Hochschule Aachen (RWTH)
- Subjects
0301 basic medicine ,Computer science ,Population ,MEDLINE ,Sample (statistics) ,[INFO] Computer Science [cs] ,Machine learning ,computer.software_genre ,Biochemistry ,Article ,Machine Learning ,03 medical and health sciences ,0302 clinical medicine ,Text mining ,Statistics ,[INFO]Computer Science [cs] ,education ,Molecular Biology ,[INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM] ,education.field_of_study ,Models, Genetic ,business.industry ,Data interpretation ,Cell Biology ,030104 developmental biology ,Gene Expression Regulation ,Data Interpretation, Statistical ,Artificial intelligence ,[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM] ,business ,computer ,030217 neurology & neurosurgery ,Biotechnology - Abstract
International audience; Two major goals in the study of biological systems are inference andprediction. Inference creates a mathematical model of the datageneration process to formalize our understanding or test ahypothesis about how the system behaves. Prediction aims atforecasting unobserved outcomes or future behavior, such as whethera mouse with a given phenotype will have a disease. Prediction makesit possible to identify best courses of action (e.g. treatment choice)without requiring understanding of the underlying mechanisms. In atypical research project, both inference and prediction are of value—we want to know how biological processes work and what will happennext. For example, we might want to infer which biological processesare associated with the dysregulation of a gene in a disease as well asclassify whether a subject has the disease and predict the besttherapy, such as drug intervention or invasive surgery.
- Published
- 2018
26. Machine learning: supervised methods
- Author
-
Martin Krzywinski, Naomi Altman, and Danilo Bzdok
- Subjects
0301 basic medicine ,Computer Science::Machine Learning ,Support Vector Machine ,Computer science ,business.industry ,MEDLINE ,Cell Biology ,Machine learning ,computer.software_genre ,Biochemistry ,Article ,Support vector machine ,Machine Learning ,03 medical and health sciences ,030104 developmental biology ,Text mining ,Artificial Intelligence ,Humans ,Artificial intelligence ,business ,Molecular Biology ,computer ,Supervised training ,Biotechnology - Abstract
Supervised learning algorithms extract general principles from observed examples guided by a specific prediction objective.
- Published
- 2018
27. Markov models — hidden Markov models
- Author
-
Martin Krzywinski, Naomi Altman, and Jasleen K. Grewal
- Subjects
0303 health sciences ,Computer science ,business.industry ,Cell Biology ,Markov model ,Biochemistry ,03 medical and health sciences ,Artificial intelligence ,Hidden Markov model ,business ,Molecular Biology ,ComputingMilieux_MISCELLANEOUS ,030304 developmental biology ,Biotechnology - Abstract
“Everything we see hides another thing, we always want to see what is hidden by what we see” — Rene Magritte
- Published
- 2019
28. Markov models—Markov chains
- Author
-
Jasleen K. Grewal, Naomi Altman, and Martin Krzywinski
- Subjects
0303 health sciences ,03 medical and health sciences ,Markov chain ,Computer science ,Cell Biology ,Markov model ,Molecular Biology ,Biochemistry ,Mathematical economics ,030304 developmental biology ,Biotechnology - Abstract
You can look back there to explain things, but the explanation disappears. You’ll never find it there. Things are not explained by the past. They’re explained by what happens now. –Alan Watts
- Published
- 2019
29. Analyzing outliers: robust methods to the rescue
- Author
-
Naomi Altman, Martin Krzywinski, George Luta, and Luca Greco
- Subjects
0303 health sciences ,03 medical and health sciences ,Computer science ,Outlier ,Cell Biology ,Data mining ,computer.software_genre ,Molecular Biology ,Biochemistry ,computer ,030304 developmental biology ,Biotechnology ,Robust regression - Abstract
Robust regression generates more reliable estimates by detecting and downweighting outliers.
- Published
- 2019
30. Two-level factorial experiments
- Author
-
Naomi Altman, Byran J. Smucker, and Martin Krzywinski
- Subjects
Research design ,0303 health sciences ,fungi ,MEDLINE ,food and beverages ,Data interpretation ,Cell Biology ,Factorial experiment ,Biochemistry ,03 medical and health sciences ,Multiple factors ,Statistics ,Molecular Biology ,030304 developmental biology ,Biotechnology ,Mathematics - Abstract
Simultaneous examination of multiple factors at two levels can reveal which have an effect.
- Published
- 2019
31. Machine learning: A primer
- Author
-
Danilo Bzdok, Martin Krzywinski, and Naomi Altman
- Subjects
0301 basic medicine ,Computer Science::Machine Learning ,Computer science ,MEDLINE ,Machine learning ,computer.software_genre ,Biochemistry ,Models, Biological ,Sensitivity and Specificity ,Article ,Machine Learning ,03 medical and health sciences ,Computer Science::Hardware Architecture ,0302 clinical medicine ,Humans ,Molecular Biology ,Primer (cosmetics) ,business.industry ,Physics::Physics Education ,Cell Biology ,Hormones ,030104 developmental biology ,Artificial intelligence ,business ,computer ,030217 neurology & neurosurgery ,Algorithms ,Biotechnology - Abstract
Machine learning extracts general principles from observed examples without explicit instructions.
- Published
- 2017
32. Scientific Data Visualization
- Author
-
Martin Krzywinski
- Subjects
Diagrammatic reasoning ,Data visualization ,law ,Computer science ,Human–computer interaction ,business.industry ,CLARITY ,business ,law.invention - Published
- 2017
33. Enhancing knowledge discovery from cancer genomics data with Galaxy
- Author
-
Bruno M. Grande, Prasath Pararajalingam, Paul C. Boutros, Selin Jessa, Elie Ritch, Martin Krzywinski, Jasleen K. Grewal, Sohrab P. Shah, Ryan D. Morin, and Marco A. Albuquerque
- Subjects
0301 basic medicine ,Lymphoma ,Computer science ,Driver ,Health Informatics ,Genomics ,Cloud computing ,computer.software_genre ,Workflow ,03 medical and health sciences ,Knowledge extraction ,Pipeline ,Technical Note ,Humans ,Cancer ,Internet ,Genome ,Massive parallel sequencing ,business.industry ,Usability ,Virtualization ,Data science ,Computer Science Applications ,030104 developmental biology ,Mutation ,Tool ,Lymphoma, Large B-Cell, Diffuse ,business ,Cloud ,computer ,Algorithms ,Software ,Data integration - Abstract
The field of cancer genomics has demonstrated the power of massively parallel sequencing techniques to inform on the genes and specific alterations that drive tumor onset and progression. Although large comprehensive sequence data sets continue to be made increasingly available, data analysis remains an ongoing challenge, particularly for laboratories lacking dedicated resources and bioinformatics expertise. To address this, we have produced a collection of Galaxy tools that represent many popular algorithms for detecting somatic genetic alterations from cancer genome and exome data. We developed new methods for parallelization of these tools within Galaxy to accelerate runtime and have demonstrated their usability and summarized their runtimes on multiple cloud service providers. Some tools represent extensions or refinement of existing toolkits to yield visualizations suited to cohort-wide cancer genomic analysis. For example, we present Oncocircos and Oncoprintplus, which generate data-rich summaries of exome-derived somatic mutation. Workflows that integrate these to achieve data integration and visualizations are demonstrated on a cohort of 96 diffuse large B-cell lymphomas and enabled the discovery of multiple candidate lymphoma-related genes. Our toolkit is available from our GitHub repository as Galaxy tool and dependency definitions and has been deployed using virtualization on multiple platforms including Docker.
- Published
- 2017
34. Impact ofERBB2mutations on in vitro sensitivity of bladder cancer to lapatinib
- Author
-
Martin Krzywinski, Evanguelos Xylinas, Morgan Rouprêt, Shahrokh F. Shariat, Tobias Klatte, Thomas Clozel, Dazhong Zhuang, Michela de Martino, Olivier Elemento, and Malte Rieken
- Subjects
Cancer Research ,Receptor, ErbB-2 ,medicine.drug_class ,Mutant ,Antineoplastic Agents ,In Vitro Techniques ,Pharmacology ,Biology ,Lapatinib ,Tyrosine-kinase inhibitor ,Inhibitory Concentration 50 ,Epidermal growth factor ,Cell Line, Tumor ,Gene expression ,medicine ,Humans ,skin and connective tissue diseases ,neoplasms ,Gene ,Cell Proliferation ,fungi ,Urinary Bladder Neoplasms ,Oncology ,Cell culture ,Mutation ,Quinazolines ,Cancer research ,Molecular Medicine ,Phosphorylation ,Research Paper ,medicine.drug - Abstract
Lapatinib, a dual tyrosine kinase inhibitor of ErbB1 and ErbB2, shows a clinical benefit in a subset of patients with advanced urothelial bladder cancer (UBC). We hypothesized that the corresponding gene, ERBB2, is affected by mutations in a subset of UBC and that these mutations impact ErbB2 function, signaling, UBC proliferation, gene expression, and predict response to lapatinib. We found ERBB2 mutations in 5 of 33 UBC cell lines (15%), all of which were derived from invasive or high grade tumors. Phosphorylation and activation of ErbB2 and its downstream pathways were markedly enhanced in mutated cell lines compared with the ERBB2 wild-type. In addition, the gene expression profile was distinct, specifically for genes encoding for proteins of the extracellular matrix. RT112 cells infected with ERBB2 mutants showed a particular growth pattern (“mini-foci”). Upon treatment with lapatinib, 93% of these “mini-foci” were reversed. The sensitivity to lapatinib was greatest among cell lines with ERBB2 mutations. In conclusion, ERBB2 mutations occur in a subset of UBC and impact proliferation, signaling, gene expression and predict a greater response to lapatinib. If confirmed in the clinical setting, this may lead the way toward personalized treatment of a subset of UBC.
- Published
- 2014
35. Optimal experimental design
- Author
-
Naomi Altman, Byran J. Smucker, and Martin Krzywinski
- Subjects
0301 basic medicine ,Computer science ,business.industry ,010401 analytical chemistry ,MEDLINE ,Linear model ,Regression analysis ,Cell Biology ,computer.software_genre ,01 natural sciences ,Biochemistry ,0104 chemical sciences ,03 medical and health sciences ,030104 developmental biology ,Text mining ,Data mining ,business ,Molecular Biology ,computer ,Biotechnology - Abstract
Customize the experiment for the setting instead of adjusting the setting to fit a classical design.
- Published
- 2018
36. The curse(s) of dimensionality
- Author
-
Naomi Altman and Martin Krzywinski
- Subjects
0301 basic medicine ,Curse ,Information retrieval ,business.industry ,Computer science ,Big data ,MEDLINE ,Data interpretation ,Cell Biology ,Biochemistry ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,business ,Molecular Biology ,030217 neurology & neurosurgery ,Biotechnology ,Curse of dimensionality - Published
- 2018
37. Author Correction: Tabular data
- Author
-
Martin Krzywinski and Naomi Altman
- Subjects
Information retrieval ,Computer science ,Cell Biology ,Molecular Biology ,Biochemistry ,Biotechnology - Published
- 2019
38. Author Correction: Two-level factorial experiments
- Author
-
Martin Krzywinski, Byran J. Smucker, and Naomi Altman
- Subjects
ComputingMethodologies_DOCUMENTANDTEXTPROCESSING ,Table (database) ,Cell Biology ,Factorial experiment ,Molecular Biology ,Biochemistry ,Column (database) ,Algorithm ,Biotechnology ,Mathematics - Abstract
The initially published paper contained an error in Table 1: in the rightmost column (y), “0.09” should have been “–0.09.” This error has been corrected in the PDF and HTML versions of the article.
- Published
- 2019
39. The genetic landscape of high-risk neuroblastoma
- Author
-
Jenny Q. Qian, Nina Thiessen, Daniela S. Gerhard, Yvonne Moyer, Shahab Asgharzadeh, Inanc Birol, Jun S. Wei, Baljit Kamoh, Marco A. Marra, Gad Getz, Javed Khan, Adam Kiezun, Stacey Gabriel, Angela Tam, Jaime M. Guidry Auvil, Wendy B. London, Lee Lichenstein, Scott L. Carter, Chip Stewart, Jaegil Kim, Malcolm A. Smith, Readman Chiu, Kristina A. Cole, Maura Diamond, Richard Sposto, Aaron McKenna, Martin Hirst, Matthew Meyerson, Allan Lo, Julie M. Gastier-Foster, Martin Krzywinski, Alireza Hadj Khodabakshi, Michael S. Lawrence, Andrew Wood, Steven J.M. Jones, Richard Corbett, Daniel Auclair, Michael D. Hogarty, Trevor J. Pugh, Carrie Sougnez, Lingyun Ji, Shaun D. Jackman, Richard A. Moore, Kristian Cibulskis, Robert C. Seeger, Yongjun Zhao, Megan Hanna, Edward F. Attiyeh, Sharon J. Diskin, Adrian Ally, Yael P. Mosse, Erica Shefler, Andrey Sivachenko, Olena Morozova, John M. Maris, Chandra Sekhar Pedamallu, Alex H. Ramos, Karen Mungall, Thomas C. Badgett, Eric S. Lander, Massachusetts Institute of Technology. Department of Biology, and Lander, Eric S.
- Subjects
Neuroblastoma RAS viral oncogene homolog ,Biology ,medicine.disease_cause ,Polymorphism, Single Nucleotide ,Neuroblastoma ,03 medical and health sciences ,0302 clinical medicine ,Germline mutation ,Cell Line, Tumor ,Genetics ,medicine ,Humans ,Exome ,Genetic Predisposition to Disease ,Mutation frequency ,ATRX ,030304 developmental biology ,0303 health sciences ,Mutation ,Genome, Human ,Sequence Analysis, DNA ,medicine.disease ,3. Good health ,PTPN11 ,030220 oncology & carcinogenesis ,Cancer research ,Transcriptome - Abstract
Neuroblastoma is a malignancy of the developing sympathetic nervous system that often presents with widespread metastatic disease, resulting in survival rates of less than 50%. To determine the spectrum of somatic mutation in high-risk neuroblastoma, we studied 240 affected individuals (cases) using a combination of whole-exome, genome and transcriptome sequencing as part of the Therapeutically Applicable Research to Generate Effective Treatments (TARGET) initiative. Here we report a low median exonic mutation frequency of 0.60 per Mb (0.48 nonsilent) and notably few recurrently mutated genes in these tumors. Genes with significant somatic mutation frequencies included ALK (9.2% of cases), PTPN11 (2.9%), ATRX (2.5%, and an additional 7.1% had focal deletions), MYCN (1.7%, causing a recurrent p.Pro44Leu alteration) and NRAS (0.83%). Rare, potentially pathogenic germline variants were significantly enriched in ALK, CHEK2, PINK1 and BARD1. The relative paucity of recurrent somatic mutations in neuroblastoma challenges current therapeutic strategies that rely on frequently altered oncogenic drivers., National Human Genome Research Institute (U.S.) (Grant U54HG003067), National Cancer Institute (U.S.) (Contract HHSN261200800001E)
- Published
- 2013
40. Enhancing Knowledge Discovery from Cancer Genomics Data with Galaxy
- Author
-
Marco A. Albuquerque, Selin Jessa, Sohrab P. Shah, Ryan D. Morin, Paul C. Boutros, Bruno M. Grande, Jasleen K. Grewal, Elie Ritch, and Martin Krzywinski
- Subjects
0303 health sciences ,Computer science ,Somatic cell ,0206 medical engineering ,Cancer ,Genomics ,02 engineering and technology ,medicine.disease ,Data science ,03 medical and health sciences ,Germline mutation ,Knowledge extraction ,medicine ,Gene ,Exome ,020602 bioinformatics ,030304 developmental biology - Abstract
We present a collection of Galaxy tools representing many popular algorithms for detecting somatic genetic alterations from cancer genome and exome data. We implemented methods for parallelization of these tools within Galaxy to accelerate runtime and have demonstrated their usability on cloud-based infrastructure and commodity hardware. Some tools represents extensions or refinement of existing toolkits to yield visualizations suited to cohort-wide cancer genomic analysis. For example, we present Oncocircos and Oncoprintplus, which generate data-rich summaries of exome-derived somatic mutation. Workflows that integrate several of these to perform some standard data integration and visualization tasks are demonstrated on a cohort of 96 diffuse large B-cell lymphomas, enabling the discovery of multiple candidate lymphoma-related genes that have not been reported previously.
- Published
- 2016
- Full Text
- View/download PDF
41. Neural circuit diagrams
- Author
-
Barbara J. Hunnicutt and Martin Krzywinski
- Subjects
0301 basic medicine ,Neurons ,Computer science ,Models, Neurological ,Color ,Cell Biology ,Documentation ,Biochemistry ,Computer graphics ,03 medical and health sciences ,030104 developmental biology ,Computer graphics (images) ,Synapses ,Connectome ,Computer Graphics ,Animals ,Humans ,Nerve Net ,Molecular Biology ,Circuit diagram ,Biotechnology ,Research data ,Ultrasonography - Published
- 2016
42. Visualizing Clonal Evolution in Cancer
- Author
-
Martin Krzywinski
- Subjects
0301 basic medicine ,Time Factors ,genetic structures ,Population Dynamics ,Clone (cell biology) ,Computational biology ,Biology ,Somatic evolution in cancer ,Clonal Evolution ,Evolution, Molecular ,03 medical and health sciences ,Static image ,Single-cell analysis ,Phylogenetics ,Neoplasms ,medicine ,Biomarkers, Tumor ,Computer Graphics ,Animals ,Humans ,Genetic Predisposition to Disease ,Molecular Biology ,Phylogeny ,Genetics ,Phylogenetic tree ,Audiovisual Aids ,Cancer ,Cell Biology ,medicine.disease ,030104 developmental biology ,Phenotype ,Single-Cell Analysis - Abstract
Rapid and inexpensive single-cell sequencing is driving new visualizations of cancer instability and evolution. Krzywinski discusses how to present clone evolution plots in order to visualize temporal, phylogenetic, and spatial aspects of a tumor in a single static image.
- Published
- 2016
43. Points of View: Pathways
- Author
-
Barbara J, Hunnicutt and Martin, Krzywinski
- Subjects
Computational Biology - Published
- 2016
44. Study Design for Sequencing Studies
- Author
-
Loren A, Honaas, Naomi S, Altman, and Martin, Krzywinski
- Subjects
Research Design ,Sequence Analysis, RNA ,Animals ,High-Throughput Nucleotide Sequencing ,Humans ,Sequence Analysis, DNA ,Sequence Analysis - Abstract
Once a biochemical method has been devised to sample RNA or DNA of interest, sequencing can be used to identify the sampled molecules with high fidelity and low bias. High-throughput sequencing has therefore become the primary data acquisition method for many genomics studies and is being used more and more to address molecular biology questions. By applying principles of statistical experimental design, sequencing experiments can be made more sensitive to the effects under study as well as more biologically sound, hence more replicable.
- Published
- 2016
45. Study Design for Sequencing Studies
- Author
-
Naomi Altman, Loren A. Honaas, and Martin Krzywinski
- Subjects
0301 basic medicine ,Genetics ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,food and beverages ,DNA sequencing theory ,Genomics ,Computational biology ,Biology ,030217 neurology & neurosurgery ,Deep sequencing - Abstract
Once a biochemical method has been devised to sample RNA or DNA of interest, sequencing can be used to identify the sampled molecules with high fidelity and low bias. High-throughput sequencing has therefore become the primary data acquisition method for many genomics studies and is being used more and more to address molecular biology questions. By applying principles of statistical experimental design, sequencing experiments can be made more sensitive to the effects under study as well as more biologically sound, hence more replicable.
- Published
- 2016
46. Ensemble methods: bagging and random forests
- Author
-
Naomi Altman and Martin Krzywinski
- Subjects
0301 basic medicine ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,Computer science ,Statistics ,Cell Biology ,Molecular Biology ,Biochemistry ,Ensemble learning ,030217 neurology & neurosurgery ,Biotechnology ,Random forest - Published
- 2017
47. Classification and regression trees
- Author
-
Martin Krzywinski and Naomi Altman
- Subjects
0301 basic medicine ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,Computer science ,Statistics ,Cell Biology ,Molecular Biology ,Biochemistry ,030217 neurology & neurosurgery ,Regression ,Biotechnology ,Research data - Published
- 2017
48. Clustering
- Author
-
Martin Krzywinski and Naomi Altman
- Subjects
0301 basic medicine ,business.industry ,Computer science ,Single-linkage clustering ,Correlation clustering ,Pattern recognition ,Cell Biology ,Biochemistry ,03 medical and health sciences ,030104 developmental biology ,Consensus clustering ,Artificial intelligence ,business ,Cluster analysis ,Molecular Biology ,Biotechnology - Published
- 2017
49. Tabular data
- Author
-
Martin Krzywinski and Naomi Altman
- Subjects
0301 basic medicine ,education.field_of_study ,Population ,Cell Biology ,01 natural sciences ,Biochemistry ,010104 statistics & probability ,03 medical and health sciences ,030104 developmental biology ,Geography ,0101 mathematics ,education ,Molecular Biology ,Cartography ,Biotechnology ,Research data - Abstract
Tabulating the number of objects in categories of interest dates back to the earliest records of commerce and population censuses.
- Published
- 2017
50. Interpreting P values
- Author
-
Naomi Altman and Martin Krzywinski
- Subjects
0106 biological sciences ,0301 basic medicine ,genetic structures ,education ,Cell Biology ,010603 evolutionary biology ,01 natural sciences ,Biochemistry ,humanities ,03 medical and health sciences ,030104 developmental biology ,Statistics ,Compatibility (mechanics) ,p-value ,Molecular Biology ,health care economics and organizations ,Biotechnology ,Mathematics - Abstract
A P value measures a sample's compatibility with a hypothesis, not the truth of the hypothesis.
- Published
- 2017
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.