Bayesian Computational Inference of Species Trees and Population Sizes

Heled, Yosef

dc.contributor.advisor	Drummond, A	en
dc.contributor.advisor	Bryant, D	en
dc.contributor.author	Heled, Yosef	en
dc.date.accessioned	2011-04-11T02:10:09Z	en
dc.date.issued	2011	en
dc.identifier.uri	http://hdl.handle.net/2292/6657	en
dc.description.abstract	Rapid advances in sequencing technology have allowed researchers to study many fundamental questions about Evolution and Life through the analysis of molecular sequence data. The computational analysis of sequence data can reveal the patterns and processes of speciation, domestication, epidemiology, evolution of traits, biogeography, and host/parasite co-evolution. Such analyses are also central to the systematic classification of life. All of these applications require a mathematical model that characterizes the random processes governing the evolution of DNA sequences over time, subject to the specific conditions of interest. It turns out that evolutionary models also share another common basis: the need to account for the ancestral relationships and sizes of the populations within which the genetic material is being propagated. Historically, those aspects were part of both the area of populations genetics and systematics - the study of biological diversity and its origins. Those two disciplines are experiencing a synthesis in the last few decades under the branch of Molecular Phylogenetics". In this thesis we develop two methods; one for estimating the effective size of a single population, the second for estimating the family (species) tree of several populations and their sizes. Both methods utilize multi-gene data from multiple individuals, a key ingredient for reducing the uncertainty of estimates. The methods are Bayesian, utilizing the generic Markov Chain Monte Carlo algorithm to compute posterior probabilities for high-dimensional and extremely complex models. Posterior probabilities are direct probabilities of the model under suitable prior assumptions, and uncertainty bounds are an integral part of the Bayesian approach. Uncertainty bounds are especially valuable in analysis of genetic data, which may contain an extremely weak signal. In this thesis we also explore some theoretical issues relating to the tree prior and model violations. Due to the complexity of the models, computer simulations play a major role in investigating model properties and performance as a function of the amount and quality of data. The methods are implemented as part of the software package BEAST and have already been used in several studies.	en
dc.publisher	ResearchSpace@Auckland	en
dc.relation.ispartof	PhD Thesis - University of Auckland	en
dc.relation.isreferencedby	UoA99212899714002091	en
dc.rights	Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher.	en
dc.rights.uri	https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm	en
dc.title	Bayesian Computational Inference of Species Trees and Population Sizes	en
dc.type	Thesis	en
thesis.degree.discipline	Computer Science	en
thesis.degree.grantor	The University of Auckland	en
thesis.degree.level	Doctoral	en
thesis.degree.name	PhD	en
dc.rights.holder	Copyright: The author	en
pubs.peer-review	false	en
pubs.elements-id	208881	en
pubs.record-created-at-source-date	2011-04-11	en
dc.identifier.wikidata	Q112886486

Files in this item

Name: whole.pdf

Size: 7.386Mb

Format: PDF

This item appears in the following Collection(s)

Doctoral Theses [6827]

Show simple item record

Bayesian Computational Inference of Species Trees and Population Sizes

Files in this item

This item appears in the following Collection(s)

Search ResearchSpace

Browse

All of ResearchSpace

This Collection

Statistics

Bayesian Computational Inference of Species Trees and Population Sizes

Files in this item

This item appears in the following Collection(s)

Share

Search ResearchSpace

Browse

All of ResearchSpace

This Collection

Statistics