Clustering Incomplete Ordinal Data

Show simple item record

dc.contributor.advisor Dobbie, G en
dc.contributor.advisor Alam, S en
dc.contributor.author Sun, Xiaobin en
dc.date.accessioned 2014-12-07T22:32:59Z en
dc.date.issued 2014 en
dc.identifier.citation 2014 en
dc.identifier.uri http://hdl.handle.net/2292/23699 en
dc.description Full text is available to authenticated members of The University of Auckland only. en
dc.description.abstract The issue of missing data is a common problem for researchers and data analysts working with surveys and other types of questionnaires that use ordinal data. Despite the frequent occurrence and the relevance of this missing data problem, many machine learning algorithms handle missing data in a rather naive way. The standard approach involves first imputing the missing values, and then giving the completed imputed data to the learning algorithm. One advantage of this approach is that it allows the user to select the most suitable imputation method for different datasets. However, the classification result is not promising. Su et al. proposed an algorithm called “Classifier-based Nominal Imputation” (CNI), which improves the classification problem for machine learning algorithms on incomplete nominal datasets, but the performance on ordinal data remains unknown. Our work applied this CNI technique to ordinal data and the experimental results showed that using this CNI algorithm to pre-process the incomplete ordinal dataset, resulted in significantly higher classification accuracy than learners that do not apply any imputation method and those using baseline imputation techniques, such as the most common value imputation. This CNI algorithm is found to be helpful for many learners such as K Nearest Neighbour, Naive Bayes and Multilayer Perceptron Neural Networks on incomplete ordinal data. en
dc.publisher ResearchSpace@Auckland en
dc.relation.ispartof Masters Thesis - University of Auckland en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. en
dc.rights Restricted Item. Available to authenticated members of The University of Auckland. en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/nz/ en
dc.title Clustering Incomplete Ordinal Data en
dc.type Thesis en
thesis.degree.grantor The University of Auckland en
thesis.degree.level Masters en
dc.rights.holder Copyright: The Author en
pubs.elements-id 468244 en
pubs.record-created-at-source-date 2014-12-08 en
dc.identifier.wikidata Q112907230


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics