Back to Search Start Over

Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST

Authors :
Marc A. Suchard
Mandev S. Gill
Guy Baele
Philippe Lemey
Paul Bastide
Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven
KU Leuven (KU Leuven)
Fielding School of Public Health
University of California [Los Angeles] (UCLA)
University of California-University of California
Department of Biomathematics, David Geffen School of Medicine at UCLA, University of California, Los Angeles
Department of Human Genetics, David Geffen School of Medicine at UCLA, University of California, Los Angeles
Posada, David
Source :
Systematic Biology, Systematic Biology, Oxford University Press (OUP), 2020, ⟨10.1093/sysbio/syaa037⟩, Systematic biology, vol 70, iss 1
Publication Year :
2020
Publisher :
HAL CCSD, 2020.

Abstract

Markov models of character substitution on phylogenies form the foundation of phylogenetic inference frameworks. Early models made the simplifying assumption that the substitution process is homogeneous over time and across sites in the molecular sequence alignment. While standard practice adopts extensions that accommodate heterogeneity of substitution rates across sites, heterogeneity in the process over time in a site-specific manner remains frequently overlooked. This is problematic, as evolutionary processes that act at the molecular level are highly variable, subjecting different sites to different selective constraints over time, impacting their substitution behavior. We propose incorporating time variability through Markov-modulated models (MMMs), which extend covarion-like models and allow the substitution process (including relative character exchange rates as well as the overall substitution rate) at individual sites to vary across lineages. We implement a general MMM framework in BEAST, a popular Bayesian phylogenetic inference software package, allowing researchers to compose a wide range of MMMs through flexible XML specification. Using examples from bacterial, viral, and plastid genome evolution, we show that MMMs impact phylogenetic tree estimation and can substantially improve model fit compared to standard substitution models. Through simulations, we show that marginal likelihood estimation accurately identifies the generative model and does not systematically prefer the more parameter-rich MMMs. To mitigate the increased computational demands associated with MMMs, our implementation exploits recent developments in BEAGLE, a high-performance computational library for phylogenetic inference. [Bayesian inference; BEAGLE; BEAST; covarion, heterotachy; Markov-modulated models; phylogenetics.]. ispartof: SYSTEMATIC BIOLOGY vol:70 issue:1 pages:181-189 ispartof: location:England status: published

Details

Language :
English
ISSN :
10635157 and 1076836X
Database :
OpenAIRE
Journal :
Systematic Biology, Systematic Biology, Oxford University Press (OUP), 2020, ⟨10.1093/sysbio/syaa037⟩, Systematic biology, vol 70, iss 1
Accession number :
edsair.doi.dedup.....27116b17582a26046fabc795fdc12593
Full Text :
https://doi.org/10.1093/sysbio/syaa037⟩