Back to Search Start Over

Pairs and Pairix: a file format and a tool for efficient storage and retrieval for Hi-C read pairs.

Authors :
Lee, Soohyun
Bakker, Clara R
Vitzthum, Carl
Alver, Burak H
Park, Peter J
Source :
Bioinformatics. 3/15/2022, Vol. 38 Issue 6, p1729-1731. 3p.
Publication Year :
2022

Abstract

Summary As the amount of 3D chromosomal interaction data continues to increase, storing and accessing such data efficiently becomes paramount. We introduce Pairs, a block-compressed text file format for storing paired genomic coordinates from Hi-C data, and Pairix, an open-source C application to index and query Pairs files. Pairix (also available in Python and R) extends the functionalities of Tabix to paired coordinates data. We have also developed PairsQC, a collapsible HTML quality control report generator for Pairs files. Availability and implementation The format specification and source code are available at https://github.com/4dn-dcic/pairix , https://github.com/4dn-dcic/Rpairix and https://github.com/4dn-dcic/pairsqc. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13674803
Volume :
38
Issue :
6
Database :
Academic Search Index
Journal :
Bioinformatics
Publication Type :
Academic Journal
Accession number :
155584895
Full Text :
https://doi.org/10.1093/bioinformatics/btab870