Bayesian approaches to Model Uncertainty in Phylogenetics

Show simple item record

dc.contributor.advisor Drummond, A en
dc.contributor.advisor Fewster, R en
dc.contributor.advisor Suchard, M en
dc.contributor.author Wu, Chieh-Hsi en
dc.date.accessioned 2014-11-23T20:19:26Z en
dc.date.issued 2014 en
dc.identifier.citation 2014 en
dc.identifier.uri http://hdl.handle.net/2292/23560 en
dc.description.abstract When inferring a phylogeny in a probabilistic framework, one is faced with many choices of how to model the underlying processes that give rise to the observed data. This includes how to accommodate the across-site variation in the properties of the nucleotide substitution process and how to incorporate the across-time variation in the parameters of the tree-generating process. To model across-site heterogeneity in the properties of the nucleotide substitution process, one commonly pre-de nes the partition scheme of the alignment thereby grouping sites into a number of categories and then independently estimates the substitution model of each category. This practice ignores the uncertainty associated with the partition scheme, and the pre-de ned partition scheme may not agree with the data. This thesis rst presents three new methods that accommodate the uncertainty associated with the partition scheme. They estimate the number of categories, the assignments of sites to the categories, the nucleotide substitution model and the site rate model for each category, and the uncertainty in these selections. These methods employ approaches of Bayesian model selection and/or Bayesian nonparametrics. They di er in the a priori assumptions on the assignments of sites to categories, and therefore provide di erent views on the across-site heterogeneity in the properties of the nucleotide substitution process. Analyses with all three methods have found statistical evidence for across-site heterogeneity in the nucleotide substitution process both within a single gene and among genes in various sets of empirical data. Recently proposed models based on the birth-death-sampling process allow the rate parameters of birth, death and sampling events to vary through time as piecewise constant functions, but require the number of rate shifts to be xed a priori. This thesis presents a new method that employs a transdimensional sampling algorithm for Bayesian model selection to directly estimate the number of shifts in the parameters of the birth-death-sampling process (or epidemiological parameters in the case of a phylogeny of a rapidly evolving infectious disease). In summary, I have developed a series of new phylogenetic methods based on Bayesian model selection and Bayesian nonparametrics to permit the direct inference of the across-site variation of the nucleotide substitution process and the acrosstime variation in the birth-death-sampling process. These new methods take into account the uncertainty associated with the alignment partition scheme and the tree generating process, avoiding potential model misspeci cation. en
dc.publisher ResearchSpace@Auckland en
dc.relation.ispartof PhD Thesis - University of Auckland en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/nz/ en
dc.title Bayesian approaches to Model Uncertainty in Phylogenetics en
dc.type Thesis en
thesis.degree.grantor The University of Auckland en
thesis.degree.level Doctoral en
thesis.degree.name PhD en
dc.rights.holder Copyright: The Author en
dc.rights.accessrights http://purl.org/eprint/accessRights/OpenAccess en
pubs.elements-id 462700 en
pubs.record-created-at-source-date 2014-11-24 en
dc.identifier.wikidata Q112907732


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics