Start Over

A likelihood method for estimating present-day human contamination in ancient male samples using low-depth X-chromosome data.

Authors :: Moreno-Mayar, J Víctor
Schwartz, Russell1
Moreno-Mayar, J Víctor
Korneliussen, Thorfinn Sand
Dalal, Jyoti
Renaud, Gabriel
Albrechtsen, Anders
Nielsen, Rasmus
Malaspinas, Anna-Sapfo
Moreno-Mayar, J Víctor
Schwartz, Russell1
Moreno-Mayar, J Víctor
Korneliussen, Thorfinn Sand
Dalal, Jyoti
Renaud, Gabriel
Albrechtsen, Anders
Nielsen, Rasmus
Malaspinas, Anna-Sapfo
Source :: Bioinformatics (Oxford, England); vol 36, iss 3, 828-841; 1367-4803
Publication Year :: 2020
Abstract: MotivationThe presence of present-day human contaminating DNA fragments is one of the challenges defining ancient DNA (aDNA) research. This is especially relevant to the ancient human DNA field where it is difficult to distinguish endogenous molecules from human contaminants due to their genetic similarity. Recently, with the advent of high-throughput sequencing and new aDNA protocols, hundreds of ancient human genomes have become available. Contamination in those genomes has been measured with computational methods often developed specifically for these empirical studies. Consequently, some of these methods have not been implemented and tested for general use while few are aimed at low-depth nuclear data, a common feature in aDNA datasets.ResultsWe develop a new X-chromosome-based maximum likelihood method for estimating present-day human contamination in low-depth sequencing data from male individuals. We implement our method for general use, assess its performance under conditions typical of ancient human DNA research, and compare it to previous nuclear data-based methods through extensive simulations. For low-depth data, we show that existing methods can produce unusable estimates or substantially underestimate contamination. In contrast, our method provides accurate estimates for a depth of coverage as low as 0.5× on the X-chromosome when contamination is below 25%. Moreover, our method still yields meaningful estimates in very challenging situations, i.e. when the contaminant and the target come from closely related populations or with increased error rates. With a running time below 5 min, our method is applicable to large scale aDNA genomic studies.Availability and implementationThe method is implemented in C++ and R and is available in github.com/sapfo/contaminationX and popgen.dk/angsd.

Details

Database :: OAIster
Journal :: Bioinformatics (Oxford, England); vol 36, iss 3, 828-841; 1367-4803
Notes :: Bioinformatics (Oxford, England) vol 36, iss 3, 828-841 1367-4803
Publication Type :: Electronic Resource
Accession number :: edsoai.on1367406847
Document Type :: Electronic Resource

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

A likelihood method for estimating present-day human contamination in ancient male samples using low-depth X-chromosome data.

Abstract

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

A likelihood method for estimating present-day human contamination in ancient male samples using low-depth X-chromosome data.

Abstract

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources