Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Bayesian inference of mixed Gaussian phylogenetic models

Brahmantio, Bayu ; Bartoszek, Krzysztof and Yapar, Etka LU orcid (2026) In BMC Bioinformatics 27(Suppl 1).
Abstract

Background: Continuous traits evolution of a group of taxa that are correlated through a phylogenetic tree is commonly modelled using parametric stochastic differential equations to represent deterministic change of traits through time, while incorporating noises that represent different unobservable evolutionary pressures. A heterogeneous Gaussian process that consists of multiple parametric sub-processes is often used when the observed data come from a very diverse set of taxa. In the maximum likelihood setting, challenges arise when exploring the involved likelihood surface and when interpreting the uncertainty around the parameters. Results: We extend the methods to tackle inference problems for mixed Gaussian phylogenetic models... (More)

Background: Continuous traits evolution of a group of taxa that are correlated through a phylogenetic tree is commonly modelled using parametric stochastic differential equations to represent deterministic change of traits through time, while incorporating noises that represent different unobservable evolutionary pressures. A heterogeneous Gaussian process that consists of multiple parametric sub-processes is often used when the observed data come from a very diverse set of taxa. In the maximum likelihood setting, challenges arise when exploring the involved likelihood surface and when interpreting the uncertainty around the parameters. Results: We extend the methods to tackle inference problems for mixed Gaussian phylogenetic models (MGPMs) by implementing a Bayesian scheme that can take into account biologically relevant priors. The posterior inference method is based on the Population Monte Carlo (PMC) algorithm that is easily parallelized, and uses an efficient algorithm to calculate the likelihood of phylogenetically correlated observations. A model evaluation method that is based on the proximity of the posterior predictive distribution to the observed data is also implemented. Simulation study is done to test the inference and evaluation capability of the method. Finally, we test our method on a real-world dataset. Conclusion: We implement the method in the R package bgphy, available at https://github.com/bayubeta/bgphy. Simulation study demonstrates that the method is capable to infer parameters to evaluate different models, while its implementation on the real-world dataset indicates that a carefully selected model of evolution based on naturally occurring classifications results in a better fit to the observed data.

(Less)
Please use this url to cite or link to this publication:
author
; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
Bayesian statistics, Evolution, Gaussian diffusion process, Phylogenetic comparative methods
in
BMC Bioinformatics
volume
27
issue
Suppl 1
article number
77
publisher
BioMed Central (BMC)
external identifiers
  • scopus:105034960625
  • pmid:41922960
ISSN
1471-2105
DOI
10.1186/s12859-026-06399-y
language
English
LU publication?
yes
additional info
Publisher Copyright: © The Author(s) 2026.
id
c000d438-15de-48e2-a102-71f79c31d875
date added to LUP
2026-05-27 14:21:11
date last changed
2026-06-10 15:45:52
@article{c000d438-15de-48e2-a102-71f79c31d875,
  abstract     = {{<p>Background: Continuous traits evolution of a group of taxa that are correlated through a phylogenetic tree is commonly modelled using parametric stochastic differential equations to represent deterministic change of traits through time, while incorporating noises that represent different unobservable evolutionary pressures. A heterogeneous Gaussian process that consists of multiple parametric sub-processes is often used when the observed data come from a very diverse set of taxa. In the maximum likelihood setting, challenges arise when exploring the involved likelihood surface and when interpreting the uncertainty around the parameters. Results: We extend the methods to tackle inference problems for mixed Gaussian phylogenetic models (MGPMs) by implementing a Bayesian scheme that can take into account biologically relevant priors. The posterior inference method is based on the Population Monte Carlo (PMC) algorithm that is easily parallelized, and uses an efficient algorithm to calculate the likelihood of phylogenetically correlated observations. A model evaluation method that is based on the proximity of the posterior predictive distribution to the observed data is also implemented. Simulation study is done to test the inference and evaluation capability of the method. Finally, we test our method on a real-world dataset. Conclusion: We implement the method in the R package bgphy, available at https://github.com/bayubeta/bgphy. Simulation study demonstrates that the method is capable to infer parameters to evaluate different models, while its implementation on the real-world dataset indicates that a carefully selected model of evolution based on naturally occurring classifications results in a better fit to the observed data.</p>}},
  author       = {{Brahmantio, Bayu and Bartoszek, Krzysztof and Yapar, Etka}},
  issn         = {{1471-2105}},
  keywords     = {{Bayesian statistics; Evolution; Gaussian diffusion process; Phylogenetic comparative methods}},
  language     = {{eng}},
  number       = {{Suppl 1}},
  publisher    = {{BioMed Central (BMC)}},
  series       = {{BMC Bioinformatics}},
  title        = {{Bayesian inference of mixed Gaussian phylogenetic models}},
  url          = {{http://dx.doi.org/10.1186/s12859-026-06399-y}},
  doi          = {{10.1186/s12859-026-06399-y}},
  volume       = {{27}},
  year         = {{2026}},
}