36 results on '"Filion GJ"'
Search Results
2. Causality in transcription and genome folding: Insights from X inactivation.
- Author
-
Bauer M, Payer B, and Filion GJ
- Subjects
- Animals, Genome genetics, Mammals genetics, X Chromosome, X Chromosome Inactivation genetics
- Abstract
The spatial organization of genomes is becoming increasingly understood. In mammals, where it is most investigated, this organization ties in with transcription, so an important research objective is to understand whether gene activity is a cause or a consequence of genome folding in space. In this regard, the phenomena of X-chromosome inactivation and reactivation open a unique window of investigation because of the singularities of the inactive X chromosome. Here we focus on the cause-consequence nexus between genome conformation and transcription and explain how recent results about the structural changes associated with inactivation and reactivation of the X chromosome shed light on this problem., (© 2022 The Authors. BioEssays published by Wiley Periodicals LLC.)
- Published
- 2022
- Full Text
- View/download PDF
3. Chromatin and viral integration in immunity: The challenge of silencing non-self genes.
- Author
-
Kabi M and Filion GJ
- Subjects
- Animals, Gene Silencing, Humans, Proviruses genetics, Chromatin genetics, Virus Integration genetics
- Abstract
Several viruses hide in the genome of their host. To complete their replication cycle, they need to integrate in the form of a provirus and express their genes. In vertebrates, integrated viruses can be silenced by chromatin, implying that some specific mechanisms exist to detect non-self genes. The known mechanisms depend on sequence features of retroelements, but the fluctuations of virus expression suggest that other determinants also exist. Here we review the mechanisms allowing chromatin to silence integrated viruses and propose that DNA repair may help flag them as 'non-self' shortly after their genomic insertion., Competing Interests: Declaration of interests The authors have no interests to declare., (Crown Copyright © 2022. Published by Elsevier Ltd. All rights reserved.)
- Published
- 2022
- Full Text
- View/download PDF
4. Strand asymmetry influences mismatch resolution during a single-strand annealing.
- Author
-
Pokusaeva VO, Diez AR, Espinar L, Pérez AT, and Filion GJ
- Subjects
- Animals, DNA, DNA Damage, Mice, DNA Breaks, Double-Stranded, DNA Repair
- Abstract
Background: Biases of DNA repair can shape the nucleotide landscape of genomes at evolutionary timescales. The molecular mechanisms of those biases are still poorly understood because it is difficult to isolate the contributions of DNA repair from those of DNA damage., Results: Here, we develop a genome-wide assay whereby the same DNA lesion is repaired in different genomic contexts. We insert thousands of barcoded transposons carrying a reporter of DNA mismatch repair in the genome of mouse embryonic stem cells. Upon inducing a double-strand break between tandem repeats, a mismatch is generated if the break is repaired through single-strand annealing. The resolution of the mismatch showed a 60-80% bias in favor of the strand with the longest 3' flap. The location of the lesion in the genome and the type of mismatch had little influence on the bias. Instead, we observe a complete reversal of the bias when the longest 3' flap is moved to the opposite strand by changing the position of the double-strand break in the reporter., Conclusions: These results suggest that the processing of the double-strand break has a major influence on the repair of mismatches during a single-strand annealing., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
5. Heterochromatin: did H3K9 methylation evolve to tame transposons?
- Author
-
Kabi M and Filion GJ
- Subjects
- Animals, Eukaryota, Humans, Protein Processing, Post-Translational, DNA Methylation, DNA Transposable Elements, Heterochromatin, Histones
- Published
- 2021
- Full Text
- View/download PDF
6. Author Correction: Spatially clustered loci with multiple enhancers are frequent targets of HIV-1 integration.
- Author
-
Lucic B, Chen HC, Kuzman M, Zorita E, Wegner J, Minneker V, Wang W, Fronza R, Laufs S, Schmidt M, Stadhouders R, Roukos V, Vlahovicek K, Filion GJ, and Lusic M
- Published
- 2021
- Full Text
- View/download PDF
7. Chromosome compartments on the inactive X guide TAD formation independently of transcription during X-reactivation.
- Author
-
Bauer M, Vidal E, Zorita E, Üresin N, Pinter SF, Filion GJ, and Payer B
- Subjects
- Animals, Cellular Reprogramming genetics, Chromatin Assembly and Disassembly, Induced Pluripotent Stem Cells cytology, Induced Pluripotent Stem Cells metabolism, Mice, RNA, Long Noncoding genetics, RNA, Long Noncoding metabolism, Sex Chromatin genetics, Sex Chromatin metabolism, X Chromosome genetics, Transcription, Genetic, X Chromosome metabolism, X Chromosome Inactivation genetics
- Abstract
A hallmark of chromosome organization is the partition into transcriptionally active A and repressed B compartments, and into topologically associating domains (TADs). Both structures were regarded to be absent from the inactive mouse X chromosome, but to be re-established with transcriptional reactivation and chromatin opening during X-reactivation. Here, we combine a tailor-made mouse iPSC reprogramming system and high-resolution Hi-C to produce a time course combining gene reactivation, chromatin opening and chromosome topology during X-reactivation. Contrary to previous observations, we observe A/B-like compartments on the inactive X harbouring multiple subcompartments. While partial X-reactivation initiates within a compartment rich in X-inactivation escapees, it then occurs rapidly along the chromosome, concomitant with downregulation of Xist. Importantly, we find that TAD formation precedes transcription and initiates from Xist-poor compartments. Here, we show that TAD formation and transcriptional reactivation are causally independent during X-reactivation while establishing Xist as a common denominator.
- Published
- 2021
- Full Text
- View/download PDF
8. Calibrating Seed-Based Heuristics to Map Short Reads With Sesame.
- Author
-
Filion GJ, Cortini R, and Zorita E
- Abstract
The increasing throughput of DNA sequencing technologies creates a need for faster algorithms. The fate of most reads is to be mapped to a reference sequence, typically a genome. Modern mappers rely on heuristics to gain speed at a reasonable cost for accuracy. In the seeding heuristic, short matches between the reads and the genome are used to narrow the search to a set of candidate locations. Several seeding variants used in modern mappers show good empirical performance but they are difficult to calibrate or to optimize for lack of theoretical results. Here we develop a theory to estimate the probability that the correct location of a read is filtered out during seeding, resulting in mapping errors. We describe the properties of simple exact seeds, skip seeds and MEM seeds (Maximal Exact Match seeds). The main innovation of this work is to use concepts from analytic combinatorics to represent reads as abstract sequences, and to specify their generative function to estimate the probabilities of interest. We provide several algorithms, which together give a workable solution for the problem of calibrating seeding heuristics for short reads. We also provide a C implementation of these algorithms in a library called Sesame. These results can improve current mapping algorithms and lay the foundation of a general strategy to tackle sequence alignment problems. The Sesame library is open source and available for download at https://github.com/gui11aume/sesame., (Copyright © 2020 Filion, Cortini and Zorita.)
- Published
- 2020
- Full Text
- View/download PDF
9. Spatially clustered loci with multiple enhancers are frequent targets of HIV-1 integration.
- Author
-
Lucic B, Chen HC, Kuzman M, Zorita E, Wegner J, Minneker V, Wang W, Fronza R, Laufs S, Schmidt M, Stadhouders R, Roukos V, Vlahovicek K, Filion GJ, and Lusic M
- Subjects
- Base Sequence, CD4-Positive T-Lymphocytes virology, Cell Nucleus metabolism, Cell Nucleus virology, Chromatin genetics, Chromatin virology, HIV Infections genetics, HIV Infections immunology, HIV Infections virology, HIV-1 physiology, Humans, Nuclear Pore genetics, Nuclear Pore virology, Promoter Regions, Genetic genetics, Transcription, Genetic, CD4-Positive T-Lymphocytes metabolism, Cell Nucleus genetics, Enhancer Elements, Genetic, HIV-1 genetics, Virus Integration genetics
- Abstract
HIV-1 recurrently targets active genes and integrates in the proximity of the nuclear pore compartment in CD4
+ T cells. However, the genomic features of these genes and the relevance of their transcriptional activity for HIV-1 integration have so far remained unclear. Here we show that recurrently targeted genes are proximal to super-enhancer genomic elements and that they cluster in specific spatial compartments of the T cell nucleus. We further show that these gene clusters acquire their location during the activation of T cells. The clustering of these genes along with their transcriptional activity are the major determinants of HIV-1 integration in T cells. Our results provide evidence of the relevance of the spatial compartmentalization of the genome for HIV-1 integration, thus further strengthening the role of nuclear architecture in viral infection.- Published
- 2019
- Full Text
- View/download PDF
10. Transcription factors and 3D genome conformation in cell-fate decisions.
- Author
-
Stadhouders R, Filion GJ, and Graf T
- Subjects
- Animals, Chromatin Assembly and Disassembly genetics, Chromosome Positioning, Gene Expression Regulation, Humans, Organ Specificity genetics, Cell Differentiation genetics, Cells cytology, Cells metabolism, Genome genetics, Transcription Factors metabolism
- Abstract
How cells adopt different identities has long fascinated biologists. Signal transduction in response to environmental cues results in the activation of transcription factors that determine the gene-expression program characteristic of each cell type. Technological advances in the study of 3D chromatin folding are bringing the role of genome conformation in transcriptional regulation to the fore. Characterizing this role of genome architecture has profound implications, not only for differentiation and development but also for diseases including developmental malformations and cancer. Here we review recent studies indicating that the interplay between transcription and genome conformation is a driving force for cell-fate decisions.
- Published
- 2019
- Full Text
- View/download PDF
11. An experimental assay of the interactions of amino acids from orthologous sequences shaping a complex fitness landscape.
- Author
-
Pokusaeva VO, Usmanova DR, Putintseva EV, Espinar L, Sarkisyan KS, Mishin AS, Bogatyreva NS, Ivankov DN, Akopyan AV, Avvakumov SY, Povolotskaya IS, Filion GJ, Carey LB, and Kondrashov FA
- Subjects
- Amino Acid Sequence, Amino Acid Substitution, Amino Acids genetics, Amino Acids metabolism, Epistasis, Genetic, Fungal Proteins chemistry, Genes, Fungal, Genotype, Hydro-Lyases chemistry, Hydro-Lyases genetics, Hydro-Lyases metabolism, Models, Genetic, Models, Molecular, Phylogeny, Saccharomyces cerevisiae genetics, Saccharomyces cerevisiae metabolism, Saccharomyces cerevisiae Proteins chemistry, Saccharomyces cerevisiae Proteins genetics, Saccharomyces cerevisiae Proteins metabolism, Evolution, Molecular, Fungal Proteins genetics, Fungal Proteins metabolism, Genetic Fitness, Yeasts genetics, Yeasts metabolism
- Abstract
Characterizing the fitness landscape, a representation of fitness for a large set of genotypes, is key to understanding how genetic information is interpreted to create functional organisms. Here we determined the evolutionarily-relevant segment of the fitness landscape of His3, a gene coding for an enzyme in the histidine synthesis pathway, focusing on combinations of amino acid states found at orthologous sites of extant species. Just 15% of amino acids found in yeast His3 orthologues were always neutral while the impact on fitness of the remaining 85% depended on the genetic background. Furthermore, at 67% of sites, amino acid replacements were under sign epistasis, having both strongly positive and negative effect in different genetic backgrounds. 46% of sites were under reciprocal sign epistasis. The fitness impact of amino acid replacements was influenced by only a few genetic backgrounds but involved interaction of multiple sites, shaping a rugged fitness landscape in which many of the shortest paths between highly fit genotypes are inaccessible., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2019
- Full Text
- View/download PDF
12. OneD: increasing reproducibility of Hi-C samples with abnormal karyotypes.
- Author
-
Vidal E, le Dily F, Quilez J, Stadhouders R, Cuartero Y, Graf T, Marti-Renom MA, Beato M, and Filion GJ
- Subjects
- Animals, Base Composition, Bias, Cell Line, Chromosome Aberrations, Computational Biology methods, Computational Biology statistics & numerical data, Computer Simulation, DNA Copy Number Variations, Genetic Techniques, Humans, Markov Chains, Mice, Models, Statistical, Reproducibility of Results, Abnormal Karyotype
- Abstract
The three-dimensional conformation of genomes is an essential component of their biological activity. The advent of the Hi-C technology enabled an unprecedented progress in our understanding of genome structures. However, Hi-C is subject to systematic biases that can compromise downstream analyses. Several strategies have been proposed to remove those biases, but the issue of abnormal karyotypes received little attention. Many experiments are performed in cancer cell lines, which typically harbor large-scale copy number variations that create visible defects on the raw Hi-C maps. The consequences of these widespread artifacts on the normalized maps are mostly unexplored. We observed that current normalization methods are not robust to the presence of large-scale copy number variations, potentially obscuring biological differences and enhancing batch effects. To address this issue, we developed an alternative approach designed to take into account chromosomal abnormalities. The method, called OneD, increases reproducibility among replicates of Hi-C samples with abnormal karyotype, outperforming previous methods significantly. On normal karyotypes, OneD fared equally well as state-of-the-art methods, making it a safe choice for Hi-C normalization. OneD is fast and scales well in terms of computing resources for resolutions up to 5 kb.
- Published
- 2018
- Full Text
- View/download PDF
13. Theoretical principles of transcription factor traffic on folded chromatin.
- Author
-
Cortini R and Filion GJ
- Subjects
- Cell Line, Transformed, Chromatin ultrastructure, Humans, Lymphocytes cytology, Lymphocytes metabolism, Models, Genetic, Molecular Dynamics Simulation, Transcription Factors genetics, Chromatin chemistry, Genome, Human, Transcription Factors metabolism, Transcription, Genetic
- Abstract
All organisms regulate transcription of their genes. To understand this process, a complete understanding of how transcription factors find their targets in cellular nuclei is essential. The DNA sequence and other variables are known to influence this binding, but the distribution of transcription factor binding patterns remains mostly unexplained in metazoan genomes. Here, we investigate the role of chromosome conformation in the trajectories of transcription factors. Using molecular dynamics simulations, we uncover the principles of their diffusion on chromatin. Chromosome contacts play a conflicting role: at low density they enhance transcription factor traffic, but at high density they lower it by volume exclusion. Consistently, we observe that in human cells, highly occupied targets, where protein binding is promiscuous, are found at sites engaged in chromosome loops within uncompacted chromatin. In summary, we provide a framework for understanding the search trajectories of transcription factors, highlighting the key contribution of genome conformation.
- Published
- 2018
- Full Text
- View/download PDF
14. A New Quinoline BRD4 Inhibitor Targets a Distinct Latent HIV-1 Reservoir for Reactivation from Other "Shock" Drugs.
- Author
-
Abner E, Stoszko M, Zeng L, Chen HC, Izquierdo-Bouldstridge A, Konuma T, Zorita E, Fanunza E, Zhang Q, Mahmoudi T, Zhou MM, Filion GJ, and Jordan A
- Subjects
- Azepines pharmacology, CD4-Positive T-Lymphocytes virology, Cell Cycle Proteins, Gene Expression Regulation, Viral drug effects, HEK293 Cells, HIV-1 metabolism, HeLa Cells, Histone Deacetylase Inhibitors pharmacology, Humans, Jurkat Cells, Protein Domains drug effects, Proto-Oncogene Proteins c-myc biosynthesis, Proviruses genetics, Triazoles pharmacology, Viral Load drug effects, Virus Integration drug effects, HIV Infections drug therapy, Nuclear Proteins antagonists & inhibitors, Quinolines pharmacology, Transcription Factors antagonists & inhibitors, Virus Activation drug effects, Virus Latency drug effects
- Abstract
Upon HIV-1 infection, a reservoir of latently infected resting T cells prevents the eradication of the virus from patients. To achieve complete depletion, the existing virus-suppressing antiretroviral therapy must be combined with drugs that reactivate the dormant viruses. We previously described a novel chemical scaffold compound, MMQO (8-methoxy-6-methylquinolin-4-ol), that is able to reactivate viral transcription in several models of HIV latency, including J-Lat cells, through an unknown mechanism. MMQO potentiates the activity of known latency-reversing agents (LRAs) or "shock" drugs, such as protein kinase C (PKC) agonists or histone deacetylase (HDAC) inhibitors. Here, we demonstrate that MMQO activates HIV-1 independently of the Tat transactivator. Gene expression microarrays in Jurkat cells indicated that MMQO treatment results in robust immunosuppression, diminishes expression of c-Myc, and causes the dysregulation of acetylation-sensitive genes. These hallmarks indicated that MMQO mimics acetylated lysines of core histones and might function as a bromodomain and extraterminal domain protein family inhibitor (BETi). MMQO functionally mimics the effects of JQ1, a well-known BETi. We confirmed that MMQO interacts with the BET family protein BRD4. Utilizing MMQO and JQ1, we demonstrate how the inhibition of BRD4 targets a subset of latently integrated barcoded proviruses distinct from those targeted by HDAC inhibitors or PKC pathway agonists. Thus, the quinoline-based compound MMQO represents a new class of BET bromodomain inhibitors that, due to its minimalistic structure, holds promise for further optimization for increased affinity and specificity for distinct bromodomain family members and could potentially be of use against a variety of diseases, including HIV infection. IMPORTANCE The suggested "shock and kill" therapy aims to eradicate the latent functional proportion of HIV-1 proviruses in a patient. However, to this day, clinical studies investigating the "shocking" element of this strategy have proven it to be considerably more difficult than anticipated. While the proportion of intracellular viral RNA production and general plasma viral load have been shown to increase upon a shock regimen, the global viral reservoir remains unaffected, highlighting both the inefficiency of the treatments used and the gap in our understanding of viral reactivation in vivo Utilizing a new BRD4 inhibitor and barcoded HIV-1 minigenomes, we demonstrate that PKC pathway activators and HDAC and bromodomain inhibitors all target different subsets of proviral integration. Considering the fundamental differences of these compounds and the synergies displayed between them, we propose that the field should concentrate on investigating the development of combinatory shock cocktail therapies for improved reservoir reactivation., (Copyright © 2018 American Society for Microbiology.)
- Published
- 2018
- Full Text
- View/download PDF
15. Using Barcoded HIV Ensembles (B-HIVE) for Single Provirus Transcriptomics.
- Author
-
Chen HC, Zorita E, and Filion GJ
- Subjects
- Computer Simulation, HEK293 Cells, Humans, Jurkat Cells, Polymerase Chain Reaction, RNA, Messenger genetics, RNA, Messenger metabolism, Recombination, Genetic genetics, Reproducibility of Results, Virus Latency genetics, DNA Barcoding, Taxonomic methods, HIV-1 genetics, Proviruses genetics, Transcriptome genetics
- Abstract
The latent HIV reservoir is the main barrier to curing AIDS, because infected cells escape the immune system and antiretroviral therapies. Developing new treatment strategies requires technologies to trace latent proviruses. Here, we describe a genome-wide technique called Barcoded HIV Ensembles (B-HIVE) to measure HIV expression at the single provirus level. The principle of B-HIVE is to tag the genome of HIV with DNA barcodes to trace viral transcripts produced by single proviruses in an infected cell population. This in turn reveals which proviruses are active and which are latent or expressed at low level. B-HIVE is a high-throughput method to identify and quantify thousands of individual viral transcripts per round of infection. It can be applied in different conditions, characterizing the response of single proviruses to different treatments. Overall, B-HIVE gives unprecedented insight into the expression of single proviruses in populations of HIV-infected cells. © 2018 by John Wiley & Sons, Inc., (Copyright © 2018 John Wiley & Sons, Inc.)
- Published
- 2018
- Full Text
- View/download PDF
16. Transcription factors orchestrate dynamic interplay between genome topology and gene regulation during cell reprogramming.
- Author
-
Stadhouders R, Vidal E, Serra F, Di Stefano B, Le Dily F, Quilez J, Gomez A, Collombet S, Berenguer C, Cuartero Y, Hecht J, Filion GJ, Beato M, Marti-Renom MA, and Graf T
- Subjects
- Animals, Binding Sites genetics, Cells, Cultured, Chromosome Structures metabolism, Dosage Compensation, Genetic genetics, Female, Gene Expression Regulation, Kruppel-Like Factor 4, Kruppel-Like Transcription Factors metabolism, Kruppel-Like Transcription Factors physiology, Mice, Mice, Transgenic, Protein Binding, X Chromosome Inactivation genetics, Cellular Reprogramming genetics, Chromatin Assembly and Disassembly genetics, Chromosome Structures genetics, Genome, Transcription Factors physiology
- Abstract
Chromosomal architecture is known to influence gene expression, yet its role in controlling cell fate remains poorly understood. Reprogramming of somatic cells into pluripotent stem cells (PSCs) by the transcription factors (TFs) OCT4, SOX2, KLF4 and MYC offers an opportunity to address this question but is severely limited by the low proportion of responding cells. We have recently developed a highly efficient reprogramming protocol that synchronously converts somatic into pluripotent stem cells. Here, we used this system to integrate time-resolved changes in genome topology with gene expression, TF binding and chromatin-state dynamics. The results showed that TFs drive topological genome reorganization at multiple architectural levels, often before changes in gene expression. Removal of locus-specific topological barriers can explain why pluripotency genes are activated sequentially, instead of simultaneously, during reprogramming. Together, our results implicate genome topology as an instructive force for implementing transcriptional programs and cell fate in mammals.
- Published
- 2018
- Full Text
- View/download PDF
17. Methylation of DNA Ligase 1 by G9a/GLP Recruits UHRF1 to Replicating DNA and Regulates DNA Methylation.
- Author
-
Ferry L, Fournier A, Tsusaka T, Adelmant G, Shimazu T, Matano S, Kirsh O, Amouroux R, Dohmae N, Suzuki T, Filion GJ, Deng W, de Dieuleveult M, Fritsch L, Kudithipudi S, Jeltsch A, Leonhardt H, Hajkova P, Marto JA, Arita K, Shinkai Y, and Defossez PA
- Subjects
- Animals, CCAAT-Enhancer-Binding Proteins chemistry, CCAAT-Enhancer-Binding Proteins genetics, DNA genetics, DNA Ligase ATP chemistry, DNA Ligase ATP genetics, Embryonic Stem Cells enzymology, HEK293 Cells, HeLa Cells, Histocompatibility Antigens chemistry, Histocompatibility Antigens genetics, Histone-Lysine N-Methyltransferase chemistry, Histone-Lysine N-Methyltransferase genetics, Histones metabolism, Humans, Lysine, Methylation, Mice, Models, Molecular, Molecular Mimicry, Mutation, Protein Binding, Protein Conformation, Structure-Activity Relationship, Transfection, Tudor Domain, Ubiquitin-Protein Ligases, CCAAT-Enhancer-Binding Proteins metabolism, DNA biosynthesis, DNA Ligase ATP metabolism, DNA Methylation, DNA Replication, Epigenesis, Genetic, Histocompatibility Antigens metabolism, Histone-Lysine N-Methyltransferase metabolism, Protein Processing, Post-Translational
- Abstract
DNA methylation is an essential epigenetic mark in mammals that has to be re-established after each round of DNA replication. The protein UHRF1 is essential for this process; it has been proposed that the protein targets newly replicated DNA by cooperatively binding hemi-methylated DNA and H3K9me2/3, but this model leaves a number of questions unanswered. Here, we present evidence for a direct recruitment of UHRF1 by the replication machinery via DNA ligase 1 (LIG1). A histone H3K9-like mimic within LIG1 is methylated by G9a and GLP and, compared with H3K9me2/3, more avidly binds UHRF1. Interaction with methylated LIG1 promotes the recruitment of UHRF1 to DNA replication sites and is required for DNA methylation maintenance. These results further elucidate the function of UHRF1, identify a non-histone target of G9a and GLP, and provide an example of a histone mimic that coordinates DNA replication and DNA methylation maintenance., (Copyright © 2017 Elsevier Inc. All rights reserved.)
- Published
- 2017
- Full Text
- View/download PDF
18. Automatic analysis and 3D-modelling of Hi-C data using TADbit reveals structural features of the fly chromatin colors.
- Author
-
Serra F, Baù D, Goodstadt M, Castillo D, Filion GJ, and Marti-Renom MA
- Subjects
- Algorithms, Animals, Chromatin genetics, Chromatin ultrastructure, Computational Biology methods, Drosophila melanogaster genetics, Genome, Insect genetics, Imaging, Three-Dimensional methods, Software
- Abstract
The sequence of a genome is insufficient to understand all genomic processes carried out in the cell nucleus. To achieve this, the knowledge of its three-dimensional architecture is necessary. Advances in genomic technologies and the development of new analytical methods, such as Chromosome Conformation Capture (3C) and its derivatives, provide unprecedented insights in the spatial organization of genomes. Here we present TADbit, a computational framework to analyze and model the chromatin fiber in three dimensions. Our package takes as input the sequencing reads of 3C-based experiments and performs the following main tasks: (i) pre-process the reads, (ii) map the reads to a reference genome, (iii) filter and normalize the interaction data, (iv) analyze the resulting interaction matrices, (v) build 3D models of selected genomic domains, and (vi) analyze the resulting models to characterize their structural properties. To illustrate the use of TADbit, we automatically modeled 50 genomic domains from the fly genome revealing differential structural features of the previously defined chromatin colors, establishing a link between the conformation of the genome and the local chromatin composition. TADbit provides three-dimensional models built from 3C-based experiments, which are ready for visualization and for characterizing their relation to gene expression and epigenetic states. TADbit is an open-source Python library available for download from https://github.com/3DGenomes/tadbit.
- Published
- 2017
- Full Text
- View/download PDF
19. Clustering of Drosophila housekeeping promoters facilitates their expression.
- Author
-
Corrales M, Rosado A, Cortini R, van Arensbergen J, van Steensel B, and Filion GJ
- Subjects
- Animals, Cell Line, Drosophila melanogaster, Gene Expression Regulation physiology, Genes, Essential physiology, Multigene Family physiology
- Abstract
Housekeeping genes of animal genomes cluster in the same chromosomal regions. It has long been suggested that this organization contributes to their steady expression across all the tissues of the organism. Here, we show that the activity of Drosophila housekeeping gene promoters depends on the expression of their neighbors. By measuring the expression of ∼85,000 reporters integrated in Kc167 cells, we identified the best predictors of expression as chromosomal contacts with the promoters and terminators of active genes. Surprisingly, the chromatin composition at the insertion site and the contacts with enhancers were less informative. These results are substantiated by the existence of genomic "paradoxical" domains, rich in euchromatic features and enhancers, but where the reporters are expressed at low level, concomitant with a deficit of interactions with promoters and terminators. This indicates that the proper function of housekeeping genes relies not on contacts with long distance enhancers but on spatial clustering. Overall, our results suggest that spatial proximity between genes increases their expression and that the linear architecture of the Drosophila genome contributes to this effect., (© 2017 Corrales et al.; Published by Cold Spring Harbor Laboratory Press.)
- Published
- 2017
- Full Text
- View/download PDF
20. Position effects influence HIV latency reversal.
- Author
-
Chen HC, Martinez JP, Zorita E, Meyerhans A, and Filion GJ
- Subjects
- Genes, Viral, Humans, Jurkat Cells, Virus Integration, HIV Infections virology, HIV-1 physiology, Virus Latency
- Abstract
The main obstacle to curing HIV is the presence of latent proviruses in the bodies of infected patients. The partial success of reactivation therapies suggests that the genomic context of integrated proviruses can interfere with treatment. Here we developed a method called Barcoded HIV ensembles (B-HIVE) to map the chromosomal locations of thousands of individual proviruses while tracking their transcriptional activities in an infected cell population. B-HIVE revealed that, in Jurkat cells, the expression of HIV is strongest close to endogenous enhancers. The insertion site also affects the response to latency-reversing agents, because we found that phytohemagglutinin and vorinostat reactivated proviruses inserted at distinct genomic locations. From these results, we propose that combinations of drugs targeting all areas of the genome will be most effective. Overall, our data suggest that the insertion context of HIV is a critical determinant of the viral response to reactivation therapies.
- Published
- 2017
- Full Text
- View/download PDF
21. Zerone: a ChIP-seq discretizer for multiple replicates with built-in quality control.
- Author
-
Cuscó P and Filion GJ
- Subjects
- Animals, DNA Replication, Genome, Quality Control, Reproducibility of Results, Sequence Analysis, DNA, Chromatin Immunoprecipitation
- Abstract
Motivation: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is the standard method to investigate chromatin protein composition. As the number of community-available ChIP-seq profiles increases, it becomes more common to use data from different sources, which makes joint analysis challenging. Issues such as lack of reproducibility, heterogeneous quality and conflicts between replicates become evident when comparing datasets, especially when they are produced by different laboratories., Results: Here, we present Zerone, a ChIP-seq discretizer with built-in quality control. Zerone is powered by a Hidden Markov Model with zero-inflated negative multinomial emissions, which allows it to merge several replicates into a single discretized profile. To identify low quality or irreproducible data, we trained a Support Vector Machine and integrated it as part of the discretization process. The result is a classifier reaching 95% accuracy in detecting low quality profiles. We also introduce a graphical representation to compare discretization quality and we show that Zerone achieves outstanding accuracy. Finally, on current hardware, Zerone discretizes a ChIP-seq experiment on mammalian genomes in about 5 min using less than 700 MB of memory., Availability and Implementation: Zerone is available as a command line tool and as an R package. The C source code and R scripts can be downloaded from https://github.com/nanakiksc/zerone The information to reproduce the benchmark and the figures is stored in a public Docker image that can be downloaded from https://hub.docker.com/r/nanakiksc/zerone/, Contact: : guillaume.filion@gmail.com, Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author 2016. Published by Oxford University Press.)
- Published
- 2016
- Full Text
- View/download PDF
22. Machine Learning: How Much Does It Tell about Protein Folding Rates?
- Author
-
Corrales M, Cuscó P, Usmanova DR, Chen HC, Bogatyreva NS, Filion GJ, and Ivankov DN
- Subjects
- Reproducibility of Results, Machine Learning, Protein Folding, Proteins chemistry
- Abstract
The prediction of protein folding rates is a necessary step towards understanding the principles of protein folding. Due to the increasing amount of experimental data, numerous protein folding models and predictors of protein folding rates have been developed in the last decade. The problem has also attracted the attention of scientists from computational fields, which led to the publication of several machine learning-based models to predict the rate of protein folding. Some of them claim to predict the logarithm of protein folding rate with an accuracy greater than 90%. However, there are reasons to believe that such claims are exaggerated due to large fluctuations and overfitting of the estimates. When we confronted three selected published models with new data, we found a much lower predictive power than reported in the original publications. Overly optimistic predictive powers appear from violations of the basic principles of machine-learning. We highlight common misconceptions in the studies claiming excessive predictive power and propose to use learning curves as a safeguard against those mistakes. As an example, we show that the current amount of experimental data is insufficient to build a linear predictor of logarithms of folding rates based on protein amino acid composition.
- Published
- 2015
- Full Text
- View/download PDF
23. 3D genome structure. Organization of the nucleus in space and time.
- Author
-
Filion GJ and Beato M
- Subjects
- Chromatin genetics, Chromatin ultrastructure, Epigenesis, Genetic, Humans, Cell Nucleus physiology, Genome, Human
- Published
- 2015
- Full Text
- View/download PDF
24. Starcode: sequence clustering based on all-pairs search.
- Author
-
Zorita E, Cuscó P, and Filion GJ
- Subjects
- Humans, Algorithms, Cluster Analysis, Computational Biology methods, High-Throughput Nucleotide Sequencing methods, Sequence Analysis, DNA methods, Software
- Abstract
Motivation: The increasing throughput of sequencing technologies offers new applications and challenges for computational biology. In many of those applications, sequencing errors need to be corrected. This is particularly important when sequencing reads from an unknown reference such as random DNA barcodes. In this case, error correction can be done by performing a pairwise comparison of all the barcodes, which is a computationally complex problem., Results: Here, we address this challenge and describe an exact algorithm to determine which pairs of sequences lie within a given Levenshtein distance. For error correction or redundancy reduction purposes, matched pairs are then merged into clusters of similar sequences. The efficiency of starcode is attributable to the poucet search, a novel implementation of the Needleman-Wunsch algorithm performed on the nodes of a trie. On the task of matching random barcodes, starcode outperforms sequence clustering algorithms in both speed and precision., Availability and Implementation: The C source code is available at http://github.com/gui11aume/starcode., (© The Author 2015. Published by Oxford University Press.)
- Published
- 2015
- Full Text
- View/download PDF
25. The signed Kolmogorov-Smirnov test: why it should not be used.
- Author
-
Filion GJ
- Subjects
- Reproducibility of Results, Data Interpretation, Statistical, Genomics methods, Statistics, Nonparametric
- Abstract
The two-sample Kolmogorov-Smirnov (KS) test is often used to decide whether two random samples have the same statistical distribution. A popular modification of the KS test is to use a signed version of the KS statistic to infer whether the values of one sample are statistically larger than the values of the other. The underlying hypotheses of the KS test are intrinsically incompatible with this approach and the test can produce false positives supported by extremely low p-values. This potentially makes the signed KS test a tool of p-hacking, which should be discouraged by replacing it with standard tests such as the t-test and by providing confidence intervals instead of p-values.
- Published
- 2015
- Full Text
- View/download PDF
26. A network model of the molecular organization of chromatin in Drosophila.
- Author
-
van Bemmel JG, Filion GJ, Rosado A, Talhout W, de Haas M, van Welsem T, van Leeuwen F, and van Steensel B
- Subjects
- Animals, Bayes Theorem, Binding Sites, Cell Line, Chromosomes, Insect metabolism, DNA Repair, DNA Replication, Drosophila Proteins genetics, Drosophila Proteins physiology, Drosophila melanogaster genetics, Gene Expression Regulation, Models, Biological, Molecular Sequence Annotation, Nuclear Proteins genetics, Nuclear Proteins physiology, Principal Component Analysis, Protein Binding, Protein Interaction Mapping, Protein Interaction Maps, Protein Processing, Post-Translational, Chromatin metabolism, Drosophila Proteins metabolism, Drosophila melanogaster metabolism, Nuclear Proteins metabolism
- Abstract
Chromatin governs gene regulation and genome maintenance, yet a substantial fraction of the chromatin proteome is still unexplored. Moreover, a global model of the chromatin protein network is lacking. By screening >100 candidates we identify 42 Drosophila proteins that were not previously associated with chromatin, which all display specific genomic binding patterns. Bayesian network modeling of the binding profiles of these and 70 known chromatin components yields a detailed blueprint of the in vivo chromatin protein network. We demonstrate functional compartmentalization of this network, and predict functions for most of the previously unknown chromatin proteins, including roles in DNA replication and repair, and gene activation and repression., (Copyright © 2013 Elsevier Inc. All rights reserved.)
- Published
- 2013
- Full Text
- View/download PDF
27. The inner nuclear membrane proteins Man1 and Ima1 link to two different types of chromatin at the nuclear periphery in S. pombe.
- Author
-
Steglich B, Filion GJ, van Steensel B, and Ekwall K
- Subjects
- Amino Acid Sequence, Chromatin metabolism, Genetic Loci genetics, Genome, Fungal genetics, Membrane Proteins chemistry, Membrane Proteins deficiency, Membrane Proteins metabolism, Molecular Sequence Data, Nuclear Envelope metabolism, Nuclear Proteins chemistry, Nuclear Proteins deficiency, Nuclear Proteins metabolism, RNA Interference, Schizosaccharomyces metabolism, Schizosaccharomyces pombe Proteins chemistry, Schizosaccharomyces pombe Proteins metabolism, Site-Specific DNA-Methyltransferase (Adenine-Specific) metabolism, Telomere genetics, Telomere metabolism, Chromatin genetics, Membrane Proteins genetics, Nuclear Envelope genetics, Nuclear Proteins genetics, Schizosaccharomyces cytology, Schizosaccharomyces genetics, Schizosaccharomyces pombe Proteins genetics
- Abstract
Metazoan chromatin at the nuclear periphery is generally characterized by lowly expressed genes and repressive chromatin marks and presents a sub-compartment with properties distinct from the nuclear interior. To test whether the S. pombe nuclear periphery behaves similarly, we used DNA adenine methyltransferase identification (DamID) to map the target loci of two inner nuclear membrane proteins, Ima1 and Man1. We found that peripheral chromatin shows low levels of RNA-Polymerase II and nucleosome occupancy, both characteristic of repressed chromatin regions. Consistently, lowly expressed genes preferentially associate with the periphery and highly expressed genes are depleted from it. When looking at peripheral intergenic regions (IGRs), we found that divergent IGRs are enriched compared with convergent IGRs, indicating that transcription preferentially points away from the periphery rather than toward it. Interestingly, we found that Ima1 and Man1 have common, but also separate target regions in the genome. Ima1-interacting loci were enriched for the RNAi components Dcr1 and Rdp1. This agrees with previous findings that Dcr1 is localized at the nuclear periphery. In contrast, Man1 target loci were bound by the heterochromatin protein Swi6, especially at subtelomeric regions. Subtelomeric chromatin was shown to form a unique chromatin type lacking both repressive and active chromatin features and containing low levels of the histone variant H2A.Z. Thus, we find that the fission yeast nuclear periphery shows similar properties to those of metazoan cells, despite the absence of a nuclear lamina. Our results point to a role of nuclear membrane proteins in organizing chromatin domains and loops.
- Published
- 2012
- Full Text
- View/download PDF
28. Systematic protein location mapping reveals five principal chromatin types in Drosophila cells.
- Author
-
Filion GJ, van Bemmel JG, Braunschweig U, Talhout W, Kind J, Ward LD, Brugman W, de Castro IJ, Kerkhoven RM, Bussemaker HJ, and van Steensel B
- Subjects
- Animals, Cell Line, Chromatin metabolism, DNA-Binding Proteins metabolism, Drosophila Proteins metabolism, Drosophila melanogaster metabolism, Euchromatin metabolism, Heterochromatin metabolism, Histones metabolism, Principal Component Analysis, Chromatin classification, DNA-Binding Proteins analysis, Drosophila Proteins analysis, Drosophila melanogaster genetics
- Abstract
Chromatin is important for the regulation of transcription and other functions, yet the diversity of chromatin composition and the distribution along chromosomes are still poorly characterized. By integrative analysis of genome-wide binding maps of 53 broadly selected chromatin components in Drosophila cells, we show that the genome is segmented into five principal chromatin types that are defined by unique yet overlapping combinations of proteins and form domains that can extend over > 100 kb. We identify a repressive chromatin type that covers about half of the genome and lacks classic heterochromatin markers. Furthermore, transcriptionally active euchromatin consists of two types that differ in molecular organization and H3K36 methylation and regulate distinct classes of genes. Finally, we provide evidence that the different chromatin types help to target DNA-binding factors to specific genomic regions. These results provide a global view of chromatin diversity and domain organization in a metazoan cell., (Copyright © 2010 Elsevier Inc. All rights reserved.)
- Published
- 2010
- Full Text
- View/download PDF
29. Bayesian network analysis of targeting interactions in chromatin.
- Author
-
van Steensel B, Braunschweig U, Filion GJ, Chen M, van Bemmel JG, and Ideker T
- Subjects
- Animals, Bayes Theorem, Models, Biological, Protein Interaction Mapping, Chromatin metabolism, DNA-Binding Proteins metabolism, Drosophila melanogaster genetics, Gene Regulatory Networks, Metabolic Networks and Pathways
- Abstract
In eukaryotes, many chromatin proteins together regulate gene expression. Chromatin proteins often direct the genomic binding pattern of other chromatin proteins, for example, by recruitment or competition mechanisms. The network of such targeting interactions in chromatin is complex and still poorly understood. Based on genome-wide binding maps, we constructed a Bayesian network model of the targeting interactions among a broad set of 43 chromatin components in Drosophila cells. This model predicts many novel functional relationships. For example, we found that the homologous proteins HP1 and HP1C each target the heterochromatin protein HP3 to distinct sets of genes in a competitive manner. We also discovered a central role for the remodeling factor Brahma in the targeting of several DNA-binding factors, including GAGA factor, JRA, and SU(VAR)3-7. Our network model provides a global view of the targeting interplay among dozens of chromatin components.
- Published
- 2010
- Full Text
- View/download PDF
30. Reassessing the abundance of H3K9me2 chromatin domains in embryonic stem cells.
- Author
-
Filion GJ and van Steensel B
- Subjects
- Animals, Cell Differentiation, Chromosome Mapping, Embryonic Stem Cells cytology, Histone-Lysine N-Methyltransferase metabolism, Humans, Methylation, Mice, Protein Processing, Post-Translational, Embryonic Stem Cells metabolism, Histones metabolism, Lysine metabolism
- Published
- 2010
- Full Text
- View/download PDF
31. Sensing X chromosome pairs before X inactivation via a novel X-pairing region of the Xic.
- Author
-
Augui S, Filion GJ, Huart S, Nora E, Guggiari M, Maresca M, Stewart AF, and Heard E
- Subjects
- Alleles, Animals, Cell Differentiation, Cell Line, Chromosomes, Artificial, Bacterial, Down-Regulation, Embryonic Stem Cells, Female, Mice, Mice, Transgenic, RNA, Long Noncoding, RNA, Untranslated genetics, RNA, Untranslated metabolism, S Phase, Transfection, Transgenes, Up-Regulation, X Chromosome physiology, Chromosome Pairing, X Chromosome genetics, X Chromosome Inactivation
- Abstract
Mammalian dosage compensation involves silencing of one of the two X chromosomes in females and is controlled by the X-inactivation center (Xic). The Xic, which includes Xist and its antisense transcription unit Tsix/Xite, somehow senses the number of X chromosomes and triggers Xist up-regulation from one of the two X chromosomes in females. We found that a segment of the mouse Xic lying several hundred kilobases upstream of Xist brings the two Xics together before the onset of X inactivation. This region can autonomously drive Xic trans-interactions even as an ectopic single-copy transgene. Its introduction into male embryonic stem cells is strongly selected against, consistent with a possible role in trans-activating Xist. We propose that homologous associations driven by this novel X-pairing region (Xpr) of the Xic enable a cell to sense that more than one X chromosome is present and coordinate reciprocal Xist/Tsix expression.
- Published
- 2007
- Full Text
- View/download PDF
32. Using reverse electrophoretic mobility shift assay to measure and compare protein-DNA binding affinities.
- Author
-
Filion GJ, Fouvry L, and Defossez PA
- Subjects
- Binding Sites, DNA Methylation, Humans, Protein Binding, DNA metabolism, DNA-Binding Proteins metabolism, Electrophoretic Mobility Shift Assay methods, Repressor Proteins metabolism
- Published
- 2006
- Full Text
- View/download PDF
33. Transmission and immunity: the importance of heterogeneity in the fight against malaria.
- Author
-
Filion GJ, Paul RE, and Robert V
- Subjects
- Africa epidemiology, Animals, Child, Host-Parasite Interactions, Humans, Immunity, Innate, Malaria, Falciparum epidemiology, Prevalence, Malaria, Falciparum immunology, Malaria, Falciparum transmission, Plasmodium falciparum immunology, Public Health
- Abstract
The complex relationship between transmission and parasite prevalence in humans is an important issue. Using a large dataset matching estimates of malaria transmission and Plasmodium falciparum prevalence in African children, a stimulating study published in Nature provides evidence that heterogeneity in susceptibility crucially determines the prevalence of infection. Moreover, it suggests that children who clear infections are not immune to new infections, irrespective of the amount of transmission. It is important to question the relevance of such results based on mathematical models when discussing host-parasite interactions, especially their implications for public health interventions.
- Published
- 2006
- Full Text
- View/download PDF
34. [Epigenetics and cancer].
- Author
-
Filion GJ and Defossez PA
- Subjects
- DNA Methylation, Humans, Transcription, Genetic, Genes, Tumor Suppressor, Neoplasms genetics
- Abstract
The term epigenetics encompasses all the modifications that are stable across cell generations, but which do not imply any change in DNA sequence. Post-translational modifications of the histones and DNA methylation are the most studied types of epigenetic information due to their major impact on transcription. The link between epigenetics and cancer arises from the fact that epigenetic deregulations frequently participate in tumorigenesis by inactivation of tumour-suppressor genes. Since these deregulations are reversible, hopes of treatment rely on a better understanding of the maintenance mechanisms of the epigenetic information. Among the different pathways of transcription inhibition, DNA methylation is the simplest and one of the best characterized at the present time. Inhibitors of DNA methyltransferases are currently under clinical trials and already show promising results.
- Published
- 2006
35. A family of human zinc finger proteins that bind methylated DNA and repress transcription.
- Author
-
Filion GJ, Zhenilo S, Salozhin S, Yamada D, Prokhortchouk E, and Defossez PA
- Subjects
- Amino Acid Sequence, Animals, Brain metabolism, Brain Chemistry, CpG Islands, DNA metabolism, DNA-Binding Proteins analysis, DNA-Binding Proteins genetics, Gene Expression Regulation, Humans, Mice, Molecular Sequence Data, Phylogeny, Repressor Proteins analysis, Repressor Proteins genetics, Transcription Factors analysis, Transcription Factors metabolism, Transcription, Genetic, DNA Methylation, DNA-Binding Proteins metabolism, Repressor Proteins metabolism, Zinc Fingers
- Abstract
In vertebrates, densely methylated DNA is associated with inactive transcription. Actors in this process include proteins of the MBD family that can recognize methylated CpGs and repress transcription. Kaiso, a structurally unrelated protein, has also been shown to bind methylated CGCGs through its three Krüppel-like C2H2 zinc fingers. The human genome contains two uncharacterized proteins, ZBTB4 and ZBTB38, that contain Kaiso-like zinc fingers. We report that ZBTB4 and ZBTB38 bind methylated DNA in vitro and in vivo. Unlike Kaiso, they can bind single methylated CpGs. When transfected in mouse cells, the proteins colocalize with foci of heavily methylated satellite DNA and become delocalized upon loss of DNA methylation. Chromatin immunoprecipitation suggests that both of these proteins specifically bind to the methylated allele of the H19/Igf2 differentially methylated region. ZBTB4 and ZBTB38 repress the transcription of methylated templates in transfection assays. The two genes have distinct tissue-specific expression patterns, but both are highly expressed in the brain. Our results reveal the existence of a family of Kaiso-like proteins that bind methylated CpGs. Like proteins of the MBD family, they are able to repress transcription in a methyl-dependent manner, yet their tissue-specific expression pattern suggests nonoverlapping functions.
- Published
- 2006
- Full Text
- View/download PDF
36. The human enhancer blocker CTC-binding factor interacts with the transcription factor Kaiso.
- Author
-
Defossez PA, Kelly KF, Filion GJ, Pérez-Torrado R, Magdinier F, Menoni H, Nordgaard CL, Daniel JM, and Gilson E
- Subjects
- Antibodies, Monoclonal chemistry, Base Sequence, Binding Sites, CCCTC-Binding Factor, Chromatin Immunoprecipitation, CpG Islands, DNA Methylation, DNA, Complementary metabolism, DNA-Binding Proteins metabolism, Enhancer Elements, Genetic, Genome, Genome, Human, Globins chemistry, HeLa Cells, Humans, Immunoblotting, Immunoprecipitation, Molecular Sequence Data, Protein Binding, Protein Structure, Tertiary, Repressor Proteins metabolism, Spectrophotometry, Transcription Factors chemistry, Transcription, Genetic, Two-Hybrid System Techniques, Ultraviolet Rays, Zinc Fingers, DNA-Binding Proteins physiology, Repressor Proteins physiology, Transcription Factors metabolism
- Abstract
CTC-binding factor (CTCF) is a DNA-binding protein of vertebrates that plays essential roles in regulating genome activity through its capacity to act as an enhancer blocker. We performed a yeast two-hybrid screen to identify protein partners of CTCF that could regulate its activity. Using full-length CTCF as bait we recovered Kaiso, a POZ-zinc finger transcription factor, as a specific binding partner. The interaction occurs through a C-terminal region of CTCF and the POZ domain of Kaiso. CTCF and Kaiso are co-expressed in many tissues, and CTCF was specifically co-immunoprecipitated by several Kaiso monoclonal antibodies from nuclear lysates. Kaiso is a bimodal transcription factor that recognizes methylated CpG dinucleotides or a conserved unmethylated sequence (TNGCAGGA, the Kaiso binding site). We identified one consensus unmethylated Kaiso binding site in close proximity to the CTCF binding site in the human 5' beta-globin insulator. We found, in an insulation assay, that the presence of this Kaiso binding site reduced the enhancer-blocking activity of CTCF. These data suggest that the Kaiso-CTCF interaction negatively regulates CTCF insulator activity.
- Published
- 2005
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.