Large-scale Inference Using Non-parametric Mixtures

Show simple item record

dc.contributor.advisor Wang, Yong
dc.contributor.author Xue, Xiangjie
dc.date.accessioned 2021-03-11T19:08:34Z
dc.date.available 2021-03-11T19:08:34Z
dc.date.issued 2021 en
dc.identifier.uri https://hdl.handle.net/2292/54659
dc.description.abstract A new approach to solving large-scale problems using non-parametric mixtures is proposed. The new approach not only takes the advantage of the great flexibility in nonparametric mixture methods and the fast computation of the non-parametric maximum likelihood estimate of a mixing distribution using the constrained Newton method proposed by Wang (2007), but it is also able to make use of the common features among observations in order for a more efficient and accurate estimation. This new approach is then applied to several practical fields. In the context of multiple hypothesis testing, a new method is proposed to compute the NPMLE given a fixed probability at the location of the null effect, and then is extended to estimate the proportion of null effects based on various threshold functions. Distance-based counterparts under the Cramér-von Mises and the Anderson-Darling distance are also introduced. Furthermore, modifications are also made to the likelihood- and distancebased methods, and hence they can be used in large-scale computation. A new procedure that controls the false discovery rate, based on the estimated null proportion and its estimated mixing distribution, is also introduced. Numerical studies show that the estimators of the null proportion using the non-parametric maximum likelihood estimators and their minimum distance counterparts (the non-parametric minimum distance estimators) perform well and the proposed controlling procedure makes more rejections than existing methods given a pre-specified level. A new method based on this new approach is also proposed for covariance matrix estimation. A new covariance matrix estimator is constructed using the empirical Bayes on the sample correlation coefficients or the transformed sample correlation coefficients. The estimated density required by the proposed estimator can be computed using the non-parametric maximum likelihood estimators or the non-parametric minimum distance estimators. Estimation under the sparsity assumption is also discussed. The numerical studies using the simulated and the real world datasets all suggest that the proposed covariance matrix estimator performs well and can be applied to a wide range of covariance structures.
dc.publisher ResearchSpace@Auckland en
dc.relation.ispartof PhD Thesis - University of Auckland en
dc.relation.isreferencedby UoA en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/nz/
dc.title Large-scale Inference Using Non-parametric Mixtures
dc.type Thesis en
thesis.degree.discipline Statistics
thesis.degree.grantor The University of Auckland en
thesis.degree.level Doctoral en
thesis.degree.name PhD en
dc.date.updated 2021-03-07T21:45:52Z
dc.rights.holder Copyright: The author en
dc.rights.accessrights http://purl.org/eprint/accessRights/OpenAccess en


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics