Back to Search Start Over

Finding haplotype block boundaries by using the minimum-description-length principle.

Authors :
Anderson EC
Novembre J
Source :
American journal of human genetics [Am J Hum Genet] 2003 Aug; Vol. 73 (2), pp. 336-54. Date of Electronic Publication: 2003 Jul 11.
Publication Year :
2003

Abstract

We present a method for detecting haplotype blocks that simultaneously uses information about linkage-disequilibrium decay between the blocks and the diversity of haplotypes within the blocks. By use of phased single-nucleotide polymorphism data, our method partitions a chromosome into a series of adjacent, nonoverlapping blocks. The partition is made by choosing among a family of Markov models for block structure in a chromosomal region. Specifically, in the model, the occurrence of haplotypes within blocks follows a time-inhomogeneous Markov process along the chromosome, and we choose among possible partitions by using the two-stage minimum-description-length criterion. When applied to data simulated from the coalescent with recombination hotspots, our method reliably situates block boundaries at the hotspots and infrequently places block boundaries at sites with background levels of recombination. We apply three previously published block-finding methods to the same data, showing that they either are relatively insensitive to recombination hotspots or fail to discriminate between background sites of recombination and hotspots. When applied to the 5q31 data of Daly et al., our method identifies more block boundaries in agreement with those found by Daly et al. than do other methods. These results suggest that our method may be useful for designing association-based mapping studies that exploit haplotype blocks.

Details

Language :
English
ISSN :
0002-9297
Volume :
73
Issue :
2
Database :
MEDLINE
Journal :
American journal of human genetics
Publication Type :
Academic Journal
Accession number :
12858289
Full Text :
https://doi.org/10.1086/377106