Back to Search Start Over

INSPIIRED: A Pipeline for Quantitative Analysis of Sites of New DNA Integration in Cellular Genomes

Authors :
Anatoly Dryga
Emmanuelle Six
Yinghua Wu
Mary Jane Drake
Christopher L. Nobles
Eric J. Sherman
Aubrey Bailey
Laure Caccavelli
Nirav Malani
Paul Bates
Salima Hacein-Bey-Abina
Frances Male
Frederic D. Bushman
Shantan Reddy
Marina Cavazzana
Charles C. Berry
John K. Everett
Kyle Bittinger
Source :
Molecular Therapy: Methods & Clinical Development, Vol 4, Iss C, Pp 39-49 (2017), Molecular Therapy. Methods & Clinical Development, Sherman, E; Nobles, C; Berry, CC; Six, E; Wu, Y; Dryga, A; et al.(2017). INSPIIRED: A Pipeline for Quantitative Analysis of Sites of New DNA Integration in Cellular Genomes. MOLECULAR THERAPY-METHODS & CLINICAL DEVELOPMENT, 4, 39-49. doi: 10.1016/j.omtm.2016.11.002. UC San Diego: Retrieved from: http://www.escholarship.org/uc/item/2tq50500
Publication Year :
2017
Publisher :
Elsevier, 2017.

Abstract

Integration of new DNA into cellular genomes mediates replication of retroviruses and transposons; integration reactions have also been adapted for use in human gene therapy. Tracking the distributions of integration sites is important to characterize populations of transduced cells and to monitor potential outgrow of pathogenic cell clones. Here, we describe a pipeline for quantitative analysis of integration site distributions named INSPIIRED (integration site pipeline for paired-end reads). We describe optimized biochemical steps for site isolation using Illumina paired-end sequencing, including new technology for suppressing recovery of unwanted contaminants, then software for alignment, quality control, and management of integration site sequences. During library preparation, DNAs are broken by sonication, so that after ligation-mediated PCR the number of ligation junction sites can be used to infer abundance of gene-modified cells. We generated integration sites of known positions insilico, and we describe optimization of sample processing parameters refined by comparison to truth. We also present a novel graph-theory-based method for quantifying integration sites in repeated sequences, and we characterize the consequences using synthetic and experimental data. In an accompanying paper, we describe an additional set of statistical tools for data analysis and visualization. Software is available at https://github.com/BushmanLab/INSPIIRED.

Details

Language :
English
ISSN :
23290501
Volume :
4
Database :
OpenAIRE
Journal :
Molecular Therapy: Methods & Clinical Development
Accession number :
edsair.doi.dedup.....bbdf7982cad42256726d2cbc46a1b137
Full Text :
https://doi.org/10.1016/j.omtm.2016.11.002.