Vector Generalized Linear Models for Zipf-Mandelbrot Distributions

Show simple item record

dc.contributor.advisor Yee, Thomas William
dc.contributor.author Chou, Mu-Jou (Miriam)
dc.date.accessioned 2024-03-25T19:54:48Z
dc.date.available 2024-03-25T19:54:48Z
dc.date.issued 2020 en
dc.identifier.uri https://hdl.handle.net/2292/67853
dc.description.abstract The aim of this thesis is to implement an additional Zipf-MandelbrotII distribution (Mandelbrot 1961), as a piece of software, on top of the existing Vector Generalized Linear and Additive Models (VGLMs/VGAMs) framework. The implemented model follows Zipf’s law, and is particularly suitable for word frequency analysis, taking into account of the occurring frequency and occurring rank of words in a corpus. Through the VGAM R package, one can develop the Maximum Likelihood Estimate (MLE) of the scaling parameter on empirical data sets by passing in the implemented model. Along with the model, associate dpqr-type functions are also implemented. All functions are used and validated using real world data sets from the Gutenberg Database. Upon the completion of this work, a greater flexibility on analysing linguistics data is provided.
dc.publisher ResearchSpace@Auckland en
dc.relation.ispartof Masters Thesis - University of Auckland en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/nz/
dc.title Vector Generalized Linear Models for Zipf-Mandelbrot Distributions
dc.type Thesis en
thesis.degree.discipline Science
thesis.degree.grantor The University of Auckland en
thesis.degree.level Masters en
dc.date.updated 2024-03-21T18:37:57Z
dc.rights.holder Copyright: the author en
dc.rights.accessrights http://purl.org/eprint/accessRights/OpenAccess en


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics