Back to Search
Start Over
Correction to: ATAC2GRN: optimized ATAC-seq and DNase1-seq pipelines for rapid and accurate genome regulatory network inference
- Source :
- BMC Genomics, BMC Genomics, Vol 20, Iss 1, Pp 1-1 (2019)
- Publication Year :
- 2019
- Publisher :
- BioMed Central, 2019.
-
Abstract
- Background Chromatin accessibility profiling assays such as ATAC-seq and DNase1-seq offer the opportunity to rapidly characterize the regulatory state of the genome at a single nucleotide resolution. Optimization of molecular protocols has enabled the molecular biologist to produce next-generation sequencing libraries in several hours, leaving the analysis of sequencing data as the primary obstacle to wide-scale deployment of accessibility profiling assays. To address this obstacle we have developed an optimized and efficient pipeline for the analysis of ATAC-seq and DNase1-seq data. Results We executed a multi-dimensional grid-search on the NIH Biowulf supercomputing cluster to assess the impact of parameter selection on biological reproducibility and ChIP-seq recovery by analyzing 4560 pipeline configurations. Our analysis improved ChIP-seq recovery by 15% for ATAC-seq and 3% for DNase1-seq and determined that PCR duplicate removal improves biological reproducibility by 36% without significant costs in footprinting transcription factors. Our analyses of down sampled reads identified a point of diminishing returns for increased library sequencing depth, with 95% of the ChIP-seq data of a 200 million read footprinting library recovered by 160 million reads. Conclusions We present optimized ATAC-seq and DNase-seq pipelines in both Snakemake and bash formats as well as optimal sequencing depths for ATAC-seq and DNase-seq projects. The optimized ATAC-seq and DNase1-seq analysis pipelines, parameters, and ground-truth ChIP-seq datasets have been made available for deployment and future algorithmic profiling. Electronic supplementary material The online version of this article (10.1186/s12864-018-4943-z) contains supplementary material, which is available to authorized users.
- Subjects :
- 0106 biological sciences
Optimization
Chromatin Immunoprecipitation
lcsh:QH426-470
lcsh:Biotechnology
Inference
ATAC-seq
Computational biology
Biology
01 natural sciences
Genome
03 medical and health sciences
lcsh:TP248.13-248.65
Pipeline
Genetics
Deoxyribonuclease I
Typographical error
030304 developmental biology
0303 health sciences
Genomic Library
Methodology Article
Correction
Computational Biology
High-Throughput Nucleotide Sequencing
Reproducibility of Results
Sequence Analysis, DNA
lcsh:Genetics
DNA footprinting
DNase1-seq
010606 plant biology & botany
Biotechnology
Regulation
Transcription Factors
Subjects
Details
- Language :
- English
- ISSN :
- 14712164
- Volume :
- 20
- Database :
- OpenAIRE
- Journal :
- BMC Genomics
- Accession number :
- edsair.doi.dedup.....48cd5f8117fc8d53259bd10926b7e75d