1. Genomic GPS: using genetic distance from individuals to public data for genomic analysis without disclosing personal genomes
- Author
-
Kunhee Kim, Jin Kyung Roh, Hyungryul Baik, Chloe Soohyun Jang, Buhm Han, and Eskin E
- Subjects
lcsh:QH426-470 ,Short Report ,Sample (statistics) ,Genetic distance ,Biology ,Public domain ,Genome ,03 medical and health sciences ,0302 clinical medicine ,Personal genome ,Databases, Genetic ,Humans ,Privacy protection ,lcsh:QH301-705.5 ,030304 developmental biology ,0303 health sciences ,Information retrieval ,Genome, Human ,business.industry ,Multilateration ,Genomics ,Data sharing ,Open data ,lcsh:Genetics ,Genetics, Population ,lcsh:Biology (General) ,Global Positioning System ,business ,030217 neurology & neurosurgery ,Personal genomics - Abstract
Genomic global positioning system (GPS) applies the multilateration technique commonly used in the GPS to genomic data. In the framework we present here, investigators calculate genetic distances from their samples to reference samples, which are from data held in the public domain, and share this information with others. This sharing enables certain types of genomic analysis, such as identifying sample overlaps and close relatives, decomposing ancestry, and mapping of geographical origin without disclosing personal genomes. Thus, our method can be seen as a balance between open data sharing and privacy protection. Electronic supplementary material The online version of this article (10.1186/s13059-019-1792-2) contains supplementary material, which is available to authorized users.
- Published
- 2019
- Full Text
- View/download PDF