Back to Search Start Over

A Weighted Two-stage Sequence Alignment Framework to Identify DNA Motifs from ChIP-exo Data

Authors :
Yang Li
Yizhong Wang
Cankun Wang
Anne Fennell
Anjun Ma
Jing Jiang
Zhaoqian Liu
Qin Ma
Bingqiang Liu
Publication Year :
2023
Publisher :
Cold Spring Harbor Laboratory, 2023.

Abstract

Identifying precise transcription factor binding sites (TFBS) or regulatory DNA motifs plays a fundamental role in researching transcriptional regulatory mechanisms in cells and in helping construct regulatory networks. Current algorithms developed for motif searching focus on the analysis of ChIP-enriched peaks but are not able to integrate the ChIP signal in nucleotide resolution. We present a weighted two-stage alignment tool (TESA). Our framework implements an analysis workflow from experimental datasets to TFBS prediction results. It employs a binomial distribution model and graph searching model with ChIP-exonuclease (ChIP-exo) reads depth and sequence data. TESA can effectively measure the possibility for each position to be an actual TFBS in a given promoter sequence and predict statistically significant TFBS sequence segments. The algorithm substantially improves prediction accuracy and extends the scope of applicability of existing approaches. We apply the framework to a collection of 20 ChIP-exo datasets of E. coli from proChIPdb and evaluate the prediction performance through comparison with three existing programs. The performance evaluation against the compared programs indicates that TESA is more accurate for identifying regulatory motifs in prokaryotic genomes.

Details

Database :
OpenAIRE
Accession number :
edsair.doi...........cd0c1c7716723e730929f07b2468718f
Full Text :
https://doi.org/10.1101/2023.04.06.535915