1. RNA Pol II transcription model and interpretation of GRO-seq data.
- Author
-
Lladser, Manuel, Azofeifa, Joseph, Allen, Mary, and Dowell, Robin
- Subjects
- *
RNA polymerases , *NUCLEOTIDE sequencing , *DROSOPHILA melanogaster , *GENOMES , *PROBABILITY theory - Abstract
A mixture model and statistical method is proposed to interpret the distribution of reads from a nascent transcriptional assay, such as global run-on sequencing (GRO-seq) data. The model is annotation agnostic and leverages on current understanding of the behavior of RNA polymerase II. Briefly, it assumes that polymerase loads at key positions (transcription start sites) within the genome. Once loaded, polymerase either remains in the initiation form (with some probability) or transitions into an elongating form (with the remaining probability). The model can be fit genome-wide, allowing patterns of Pol II behavior to be assessed on each distinct transcript. Furthermore, it allows for the first time a principled approach to distinguishing the initiation signal from the elongation signal; in particular, it implies a data driven method for calculating the pausing index, a commonly used metric that informs on the behavior of RNA polymerase II. We demonstrate that this approach improves on existing analyses of GRO-seq data and uncovers a novel biological understanding of the impact of knocking down the Male Specific Lethal (MSL) complex in Drosophilia melanogaster. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF