Author: "Degalez F" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Degalez F"' showing total 9 results

Start Over Author "Degalez F"

9 results on '"Degalez F"'

1. GEGA (Gallus Enriched Gene Annotation): an online tool providing genomics and functional information across 47 tissues for a chicken gene-enriched atlas gathering Ensembl and Refseq genome annotations.

Author: Degalez F, Bardou P, and Lagarrigue S
Abstract: GEGA is a user-friendly tool designed to navigate through various genomic and functional information related to an enriched gene atlas in chicken that integrates the gene catalogues from the two reference databases, NCBI-RefSeq and EMBL-Ensembl/GENCODE, along with four additional rich resources such as FAANG and NONCODE. Using the latest GRCg7b genome assembly, GEGA encompasses a total of 78 323 genes, including 24 102 protein-coding genes (PCGs) and 44 428 long non-coding RNAs (lncRNAs), significantly increasing the number of genes provided by each resource independently. However, GEGA is more than just a gene database. It offers a range of features that allow us to go deeper into the functional aspects of these genes. Users can explore gene expression and co-expression profiles across 47 tissues from 36 datasets and 1400 samples, discover tissue-specific variations and their expression as a function of sex or age and extract orthologous genes or their genomic configuration relative to the closest gene. For the communities interested in a specific gene, a list of genes or a quantitative trait locus region in chicken, GEGA's user-friendly interface facilitates efficient gene analysis, easy downloading of results and a multitude of graphical representations, from genomic information to detailed visualization of expression levels., (© The Author(s) 2024. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.)
Published: 2024
Full Text: View/download PDF

2. Variant calling and genotyping accuracy of ddRAD-seq: Comparison with 20X WGS in layers.

Author: Doublet M, Degalez F, Lagarrigue S, Lagoutte L, Gueret E, Allais S, and Lecerf F
Subjects: Animals, Genotype, Reproducibility of Results, Female, High-Throughput Nucleotide Sequencing methods, Sequence Analysis, DNA methods, Polymorphism, Single Nucleotide, Chickens genetics, Genotyping Techniques methods, Whole Genome Sequencing methods
Abstract: Whole Genome Sequencing (WGS) remains a costly or unsuitable method for routine genotyping of laying hens. Until now, breeding companies have been using or developing SNP chips. Nevertheless, alternatives methods based on sequencing have been developed. Among these, reduced representation sequencing approaches can offer sequencing quality and cost-effectiveness by reducing the genomic regions covered by sequencing. The aim of this study was to evaluate the ability of double digested Restriction site Associated DNA sequencing (ddRAD-seq) to identify and genotype SNPs in laying hens, by comparison with a presumed reliable WGS approach. Firstly, the sensitivity and precision of variant calling and the genotyping reliability of ddRADseq were determined. Next, the SNP Call Rate (CRSNP) and mean depth of sequencing per SNP (DPSNP) were compared between both methods. Finally, the effect of multiple combinations of thresholds for these parameters on genotyping reliability and amount of remaining SNPs in ddRAD-seq was studied. In raw form, the ddRAD-seq identified 349,497 SNPs evenly distributed on the genome with a CRSNP of 0.55, a DPSNP of 11X and a mean genotyping reliability rate per SNP of 80%. Considering genomic regions covered by expected enzymatic fragments (EFs), the sensitivity of the ddRAD-seq was estimated at 32.4% and its precision at 96.4%. The low CRSNP and DPSNP values were explained by the detection of SNPs outside the EFs theoretically generated by the ddRAD-seq protocol. Indeed, SNPs outside the EFs had significantly lower CRSNP (0.25) and DPSNP (1X) values than SNPs within the EFs (0.7 and 17X, resp.). The study demonstrated the relationship between CRSNP, DPSNP, genotyping reliability and the number of SNPs retained, to provide a decision-support tool for defining filtration thresholds. Severe quality control over ddRAD-seq data allowed to retain a minimum of 40% of the SNPs with a CcR of 98%. Then, ddRAD-seq was defined as a suitable method for variant calling and genotyping in layers., Competing Interests: The authors declare that they have no competing interests., (Copyright: © 2024 Doublet et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
Published: 2024
Full Text: View/download PDF

3. Impact of genome build on RNA-seq interpretation and diagnostics.

Author: Ungar RA, Goddard PC, Jensen TD, Degalez F, Smith KS, Jin CA, Bonner DE, Bernstein JA, Wheeler MT, and Montgomery SB
Subjects: Humans, Genomics methods, Transcriptome, Rare Diseases genetics, Rare Diseases diagnosis, Gene Expression Profiling methods, Genome, Human, RNA-Seq methods
Abstract: Transcriptomics is a powerful tool for unraveling the molecular effects of genetic variants and disease diagnosis. Prior studies have demonstrated that choice of genome build impacts variant interpretation and diagnostic yield for genomic analyses. To identify the extent genome build also impacts transcriptomics analyses, we studied the effect of the hg19, hg38, and CHM13 genome builds on expression quantification and outlier detection in 386 rare disease and familial control samples from both the Undiagnosed Diseases Network and Genomics Research to Elucidate the Genetics of Rare Disease Consortium. Across six routinely collected biospecimens, 61% of quantified genes were not influenced by genome build. However, we identified 1,492 genes with build-dependent quantification, 3,377 genes with build-exclusive expression, and 9,077 genes with annotation-specific expression across six routinely collected biospecimens, including 566 clinically relevant and 512 known OMIM genes. Further, we demonstrate that between builds for a given gene, a larger difference in quantification is well correlated with a larger change in expression outlier calling. Combined, we provide a database of genes impacted by build choice and recommend that transcriptomics-guided analyses and diagnoses are cross referenced with these data for robustness., Competing Interests: Declaration of interests During this project R.A.U. was employed for an internship by Vertex Pharmaceuticals. P.C.G. is a consultant for BioMarin. S.B.M. is an advisor to BioMarin, MyOme, and Tenaya Therapeutics., (Copyright © 2024 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)
Published: 2024
Full Text: View/download PDF

4. Enriched atlas of lncRNA and protein-coding genes for the GRCg7b chicken assembly and its functional annotation across 47 tissues.

Author: Degalez F, Charles M, Foissac S, Zhou H, Guan D, Fang L, Klopp C, Allain C, Lagoutte L, Lecerf F, Acloque H, Giuffra E, Pitel F, and Lagarrigue S
Subjects: Animals, Humans, Chickens genetics, Chickens metabolism, Transcriptome, Molecular Sequence Annotation, Sequence Analysis, RNA, RNA, Long Noncoding genetics, RNA, Long Noncoding metabolism
Abstract: Gene atlases for livestock are steadily improving thanks to new genome assemblies and new expression data improving the gene annotation. However, gene content varies across databases due to differences in RNA sequencing data and bioinformatics pipelines, especially for long non-coding RNAs (lncRNAs) which have higher tissue and developmental specificity and are harder to consistently identify compared to protein coding genes (PCGs). As done previously in 2020 for chicken assemblies galgal5 and GRCg6a, we provide a new gene atlas, lncRNA-enriched, for the latest GRCg7b chicken assembly, integrating "NCBI RefSeq", "EMBL-EBI Ensembl/GENCODE" reference annotations and other resources such as FAANG and NONCODE. As a result, the number of PCGs increases from 18,022 (RefSeq) and 17,007 (Ensembl) to 24,102, and that of lncRNAs from 5789 (RefSeq) and 11,944 (Ensembl) to 44,428. Using 1400 public RNA-seq transcriptome representing 47 tissues, we provided expression evidence for 35,257 (79%) lncRNAs and 22,468 (93%) PCGs, supporting the relevance of this atlas. Further characterization including tissue-specificity, sex-differential expression and gene configurations are provided. We also identified conserved miRNA-hosting genes with human counterparts, suggesting common function. The annotated atlas is available at gega.sigenae.org., (© 2024. The Author(s).)
Published: 2024
Full Text: View/download PDF

5. Impact of genome build on RNA-seq interpretation and diagnostics.

Author: Ungar RA, Goddard PC, Jensen TD, Degalez F, Smith KS, Jin CA, Bonner DE, Bernstein JA, Wheeler MT, and Montgomery SB
Abstract: Transcriptomics is a powerful tool for unraveling the molecular effects of genetic variants and disease diagnosis. Prior studies have demonstrated that choice of genome build impacts variant interpretation and diagnostic yield for genomic analyses. To identify the extent genome build also impacts transcriptomics analyses, we studied the effect of the hg19, hg38, and CHM13 genome builds on expression quantification and outlier detection in 386 rare disease and familial control samples from both the Undiagnosed Diseases Network (UDN) and Genomics Research to Elucidate the Genetics of Rare Disease (GREGoR) Consortium. We identified 2,800 genes with build-dependent quantification across six routinely-collected biospecimens, including 1,391 protein-coding genes and 341 known rare disease genes. We further observed multiple genes that only have detectable expression in a subset of genome builds. Finally, we characterized how genome build impacts the detection of outlier transcriptomic events. Combined, we provide a database of genes impacted by build choice, and recommend that transcriptomics-guided analyses and diagnoses are cross-referenced with these data for robustness., Competing Interests: Declaration of interests During this project R.A.U. was employed for an internship by Vertex Pharmaceuticals. P.C.G. is a consultant for BioMarin. S.B.M. is an advisor to BioMarin, MyOme, and Tenaya Therapeutics.
Published: 2024
Full Text: View/download PDF

6. LncRNAs in domesticated animals: from dog to livestock species.

Author: Lagarrigue S, Lorthiois M, Degalez F, Gilot D, and Derrien T
Subjects: Animals, Animals, Domestic genetics, Dogs, Genome, Livestock genetics, Mice, Phylogeny, Transcriptome, RNA, Long Noncoding genetics
Abstract: Animal genomes are pervasively transcribed into multiple RNA molecules, of which many will not be translated into proteins. One major component of this transcribed non-coding genome is the long non-coding RNAs (lncRNAs), which are defined as transcripts longer than 200 nucleotides with low coding-potential capabilities. Domestic animals constitute a unique resource for studying the genetic and epigenetic basis of phenotypic variations involving protein-coding and non-coding RNAs, such as lncRNAs. This review presents the current knowledge regarding transcriptome-based catalogues of lncRNAs in major domesticated animals (pets and livestock species), covering a broad phylogenetic scale (from dogs to chicken), and in comparison with human and mouse lncRNA catalogues. Furthermore, we describe different methods to extract known or discover novel lncRNAs and explore comparative genomics approaches to strengthen the annotation of lncRNAs. We then detail different strategies contributing to a better understanding of lncRNA functions, from genetic studies such as GWAS to molecular biology experiments and give some case examples in domestic animals. Finally, we discuss the limitations of current lncRNA annotations and suggest research directions to improve them and their functional characterisation., (© 2021. The Author(s).)
Published: 2022
Full Text: View/download PDF

7. Fourth Report on Chicken Genes and Chromosomes 2022.

Author: Smith J, Alfieri JM, Anthony N, Arensburger P, Athrey GN, Balacco J, Balic A, Bardou P, Barela P, Bigot Y, Blackmon H, Borodin PM, Carroll R, Casono MC, Charles M, Cheng H, Chiodi M, Cigan L, Coghill LM, Crooijmans R, Das N, Davey S, Davidian A, Degalez F, Dekkers JM, Derks M, Diack AB, Djikeng A, Drechsler Y, Dyomin A, Fedrigo O, Fiddaman SR, Formenti G, Frantz LAF, Fulton JE, Gaginskaya E, Galkina S, Gallardo RA, Geibel J, Gheyas AA, Godinez CJP, Goodell A, Graves JAM, Griffin DK, Haase B, Han JL, Hanotte O, Henderson LJ, Hou ZC, Howe K, Huynh L, Ilatsia E, Jarvis ED, Johnson SM, Kaufman J, Kelly T, Kemp S, Kern C, Keroack JH, Klopp C, Lagarrigue S, Lamont SJ, Lange M, Lanke A, Larkin DM, Larson G, Layos JKN, Lebrasseur O, Malinovskaya LP, Martin RJ, Martin Cerezo ML, Mason AS, McCarthy FM, McGrew MJ, Mountcastle J, Muhonja CK, Muir W, Muret K, Murphy TD, Ng'ang'a I, Nishibori M, O'Connor RE, Ogugo M, Okimoto R, Ouko O, Patel HR, Perini F, Pigozzi MI, Potter KC, Price PD, Reimer C, Rice ES, Rocos N, Rogers TF, Saelao P, Schauer J, Schnabel RD, Schneider VA, Simianer H, Smith A, Stevens MP, Stiers K, Tiambo CK, Tixier-Boichard M, Torgasheva AA, Tracey A, Tregaskes CA, Vervelde L, Wang Y, Warren WC, Waters PD, Webb D, Weigend S, Wolc A, Wright AE, Wright D, Wu Z, Yamagata M, Yang C, Yin ZT, Young MC, Zhang G, Zhao B, and Zhou H
Published: 2022
Full Text: View/download PDF

8. Watch Out for a Second SNP: Focus on Multi-Nucleotide Variants in Coding Regions and Rescued Stop-Gained.

Author: Degalez F, Jehl F, Muret K, Bernard M, Lecerf F, Lagoutte L, Désert C, Pitel F, Klopp C, and Lagarrigue S
Abstract: Most single-nucleotide polymorphisms (SNPs) are located in non-coding regions, but the fraction usually studied is harbored in protein-coding regions because potential impacts on proteins are relatively easy to predict by popular tools such as the Variant Effect Predictor. These tools annotate variants independently without considering the potential effect of grouped or haplotypic variations, often called "multi-nucleotide variants" (MNVs). Here, we used a large RNA-seq dataset to survey MNVs, comprising 382 chicken samples originating from 11 populations analyzed in the companion paper in which 9.5M SNPs- including 3.3M SNPs with reliable genotypes-were detected. We focused our study on in-codon MNVs and evaluate their potential mis-annotation. Using GATK HaplotypeCaller read-based phasing results, we identified 2,965 MNVs observed in at least five individuals located in 1,792 genes. We found 41.1% of them showing a novel impact when compared to the effect of their constituent SNPs analyzed separately. The biggest impact variation flux concerns the originally annotated stop-gained consequences, for which around 95% were rescued; this flux is followed by the missense consequences for which 37% were reannotated with a different amino acid. We then present in more depth the rescued stop-gained MNVs and give an illustration in the SLC27A4 gene. As previously shown in human datasets, our results in chicken demonstrate the value of haplotype-aware variant annotation, and the interest to consider MNVs in the coding region, particularly when searching for severe functional consequence such as stop-gained variants., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2021 Degalez, Jehl, Muret, Bernard, Lecerf, Lagoutte, Désert, Pitel, Klopp and Lagarrigue.)
Published: 2021
Full Text: View/download PDF

9. RNA-Seq Data for Reliable SNP Detection and Genotype Calling: Interest for Coding Variant Characterization and Cis -Regulation Analysis by Allele-Specific Expression in Livestock Species.

Author: Jehl F, Degalez F, Bernard M, Lecerf F, Lagoutte L, Désert C, Coulée M, Bouchez O, Leroux S, Abasht B, Tixier-Boichard M, Bed'hom B, Burlot T, Gourichon D, Bardou P, Acloque H, Foissac S, Djebali S, Giuffra E, Zerjal T, Pitel F, Klopp C, and Lagarrigue S
Abstract: In addition to their common usages to study gene expression, RNA-seq data accumulated over the last 10 years are a yet-unexploited resource of SNPs in numerous individuals from different populations. SNP detection by RNA-seq is particularly interesting for livestock species since whole genome sequencing is expensive and exome sequencing tools are unavailable. These SNPs detected in expressed regions can be used to characterize variants affecting protein functions, and to study cis -regulated genes by analyzing allele-specific expression (ASE) in the tissue of interest. However, gene expression can be highly variable, and filters for SNP detection using the popular GATK toolkit are not yet standardized, making SNP detection and genotype calling by RNA-seq a challenging endeavor. We compared SNP calling results using GATK suggested filters, on two chicken populations for which both RNA-seq and DNA-seq data were available for the same samples of the same tissue. We showed, in expressed regions, a RNA-seq precision of 91% (SNPs detected by RNA-seq and shared by DNA-seq) and we characterized the remaining 9% of SNPs. We then studied the genotype (GT) obtained by RNA-seq and the impact of two factors (GT call-rate and read number per GT) on the concordance of GT with DNA-seq; we proposed thresholds for them leading to a 95% concordance. Applying these thresholds to 767 multi-tissue RNA-seq of 382 birds of 11 chicken populations, we found 9.5 M SNPs in total, of which ∼550,000 SNPs per tissue and population with a reliable GT (call rate ≥ 50%) and among them, ∼340,000 with a MAF ≥ 10%. We showed that such RNA-seq data from one tissue can be used to ( i ) detect SNPs with a strong predicted impact on proteins, despite their scarcity in each population (16,307 SIFT deleterious missenses and 590 stop-gained), ( ii ) study, on a large scale, cis -regulations of gene expression, with ∼81% of protein-coding and 68% of long non-coding genes (TPM ≥ 1) that can be analyzed for ASE, and with ∼29% of them that were cis -regulated, and ( iii ) analyze population genetic using such SNPs located in expressed regions. This work shows that RNA-seq data can be used with good confidence to detect SNPs and associated GT within various populations and used them for different analyses as GTEx studies., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2021 Jehl, Degalez, Bernard, Lecerf, Lagoutte, Désert, Coulée, Bouchez, Leroux, Abasht, Tixier-Boichard, Bed’hom, Burlot, Gourichon, Bardou, Acloque, Foissac, Djebali, Giuffra, Zerjal, Pitel, Klopp and Lagarrigue.)
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

9 results on '"Degalez F"'

1. GEGA (Gallus Enriched Gene Annotation): an online tool providing genomics and functional information across 47 tissues for a chicken gene-enriched atlas gathering Ensembl and Refseq genome annotations.

2. Variant calling and genotyping accuracy of ddRAD-seq: Comparison with 20X WGS in layers.

3. Impact of genome build on RNA-seq interpretation and diagnostics.

4. Enriched atlas of lncRNA and protein-coding genes for the GRCg7b chicken assembly and its functional annotation across 47 tissues.

5. Impact of genome build on RNA-seq interpretation and diagnostics.

6. LncRNAs in domesticated animals: from dog to livestock species.

7. Fourth Report on Chicken Genes and Chromosomes 2022.

8. Watch Out for a Second SNP: Focus on Multi-Nucleotide Variants in Coding Regions and Rescued Stop-Gained.

9. RNA-Seq Data for Reliable SNP Detection and Genotype Calling: Interest for Coding Variant Characterization and Cis -Regulation Analysis by Allele-Specific Expression in Livestock Species.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

9 results on '"Degalez F"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources