dc.contributor.advisor |
Drummond, A |
en |
dc.contributor.advisor |
Bryant, D |
en |
dc.contributor.author |
Heled, Yosef |
en |
dc.date.accessioned |
2011-04-11T02:10:09Z |
en |
dc.date.issued |
2011 |
en |
dc.identifier.uri |
http://hdl.handle.net/2292/6657 |
en |
dc.description.abstract |
Rapid advances in sequencing technology have allowed researchers to study many fundamental questions about Evolution and Life through the analysis of molecular sequence data. The computational analysis of sequence data can reveal the patterns and processes of speciation, domestication, epidemiology, evolution of traits, biogeography, and host/parasite co-evolution. Such analyses are also central to the systematic classification of life. All of these applications require a mathematical model that characterizes the random processes governing the evolution of DNA sequences over time, subject to the specific conditions of interest. It turns out that evolutionary models also share another common basis: the need to account for the ancestral relationships and sizes of the populations within which the genetic material is being propagated. Historically, those aspects were part of both the area of populations genetics and systematics - the study of biological diversity and its origins. Those two disciplines are experiencing a synthesis in the last few decades under the branch of Molecular Phylogenetics". In this thesis we develop two methods; one for estimating the effective size of a single population, the second for estimating the family (species) tree of several populations and their sizes. Both methods utilize multi-gene data from multiple individuals, a key ingredient for reducing the uncertainty of estimates. The methods are Bayesian, utilizing the generic Markov Chain Monte Carlo algorithm to compute posterior probabilities for high-dimensional and extremely complex models. Posterior probabilities are direct probabilities of the model under suitable prior assumptions, and uncertainty bounds are an integral part of the Bayesian approach. Uncertainty bounds are especially valuable in analysis of genetic data, which may contain an extremely weak signal. In this thesis we also explore some theoretical issues relating to the tree prior and model violations. Due to the complexity of the models, computer simulations play a major role in investigating model properties and performance as a function of the amount and quality of data. The methods are implemented as part of the software package BEAST and have already been used in several studies. |
en |
dc.publisher |
ResearchSpace@Auckland |
en |
dc.relation.ispartof |
PhD Thesis - University of Auckland |
en |
dc.relation.isreferencedby |
UoA99212899714002091 |
en |
dc.rights |
Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. |
en |
dc.rights.uri |
https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm |
en |
dc.title |
Bayesian Computational Inference of Species Trees and Population Sizes |
en |
dc.type |
Thesis |
en |
thesis.degree.discipline |
Computer Science |
en |
thesis.degree.grantor |
The University of Auckland |
en |
thesis.degree.level |
Doctoral |
en |
thesis.degree.name |
PhD |
en |
dc.rights.holder |
Copyright: The author |
en |
pubs.peer-review |
false |
en |
pubs.elements-id |
208881 |
en |
pubs.record-created-at-source-date |
2011-04-11 |
en |
dc.identifier.wikidata |
Q112886486 |
|