Linkage disequilibrium matches forensic genetic records to disjoint genomic marker sets.

Authors :
Edge, Michael D.
Algee-Hewitt, Bridget F. B.
Pemberton, Trevor J.
Li, Jun Z.
Rosenberg, Noah A.
Source :
Proceedings of the National Academy of Sciences of the United States of America. 5/30/2017, Vol. 114 Issue 22, p5671-5676. 6p.
Publication Year :
2017

Abstract

Combining genotypes across datasets is central in facilitating advances in genetics. Data aggregation efforts often face the challenge of record matching: the identification of dataset entries that represent the same individual. We show that records can be matched across genotype datasets that have no shared markers based on linkage disequilibrium between loci appearing in different datasets. Using two datasets for the same 872 people, one with 642,563 genome-wide SNPs and the other with 13 short tandem repeats (STRs) used in forensic applications, we find that 90-98% of forensic STR records can be connected to corresponding SNP records and vice versa. Accuracy increases to 99-100% when ∼30 STRs are used. Our method expands the potential of data aggregation, but it also suggests privacy risks intrinsic in maintenance of databases containing even small numbers of markers, including databases of forensic significance. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00278424
Volume :
114
Issue :
22
Database :
Academic Search Index
Journal :
Proceedings of the National Academy of Sciences of the United States of America
Publication Type :
Academic Journal
Accession number :
123357747
Full Text :
https://doi.org/10.1073/pnas.1619944114