Clustering Incomplete Ordinal Data

Sun, Xiaobin

dc.contributor.advisor	Dobbie, G	en
dc.contributor.advisor	Alam, S	en
dc.contributor.author	Sun, Xiaobin	en
dc.date.accessioned	2014-12-07T22:32:59Z	en
dc.date.issued	2014	en
dc.identifier.citation	2014	en
dc.identifier.uri	http://hdl.handle.net/2292/23699	en
dc.description	Full text is available to authenticated members of The University of Auckland only.	en
dc.description.abstract	Missing data is a common problem for researchers and data analysts working with surveys and other types of questionnaires that use ordinal data. Despite the frequency and relevance of this problem, many machine learning algorithms handle missing data in a rather naive way. The standard approach involves first imputing the missing values and then giving the completed data to the learning algorithm. One advantage of this approach is that it allows the user to select the most suitable imputation method for each dataset. However, the classification results obtained this way are not promising. Su et al. proposed an algorithm called “Classifier-based Nominal Imputation” (CNI), which improves classification performance for machine learning algorithms on incomplete nominal datasets, but its performance on ordinal data remains unknown. Our work applied the CNI technique to ordinal data. The experimental results showed that using CNI to pre-process an incomplete ordinal dataset resulted in significantly higher classification accuracy than that of learners that apply no imputation method or that use baseline imputation techniques, such as most common value imputation. CNI is found to be helpful for many learners on incomplete ordinal data, such as K Nearest Neighbour, Naive Bayes and Multilayer Perceptron Neural Networks.	en
dc.publisher	ResearchSpace@Auckland	en
dc.relation.ispartof	Masters Thesis - University of Auckland	en
dc.rights	Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher.	en
dc.rights	Restricted Item. Available to authenticated members of The University of Auckland.	en
dc.rights.uri	https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm	en
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/nz/	en
dc.title	Clustering Incomplete Ordinal Data	en
dc.type	Thesis	en
thesis.degree.grantor	The University of Auckland	en
thesis.degree.level	Masters	en
dc.rights.holder	Copyright: The Author	en
pubs.elements-id	468244	en
pubs.record-created-at-source-date	2014-12-08	en
dc.identifier.wikidata	Q112907230
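
The abstract names most common value imputation as a baseline: each missing entry is filled with the most frequent observed value in its column before the completed data is passed to a learner. The following is a minimal sketch of that baseline only, not of the CNI algorithm and not of the thesis's implementation. The function name `impute_most_common`, the use of `None` as the missing-value marker, the tie-breaking rule, and the sample survey data are all assumptions made for illustration.

```python
from collections import Counter

MISSING = None  # assumed marker for an unanswered item


def impute_most_common(rows):
    """Return a copy of rows with each missing cell replaced by the
    most common observed value in its column.

    Assumes every column has at least one observed value.
    """
    n_cols = len(rows[0])
    fill = []
    for j in range(n_cols):
        observed = [r[j] for r in rows if r[j] is not MISSING]
        counts = Counter(observed)
        best = max(counts.values())
        # Break ties by taking the smallest value so the result is deterministic.
        fill.append(min(v for v, c in counts.items() if c == best))
    return [[fill[j] if v is MISSING else v for j, v in enumerate(r)]
            for r in rows]


# Hypothetical responses on a 5-point ordinal scale (1 = lowest, 5 = highest).
survey = [
    [1, 4, None],
    [2, 4, 5],
    [2, None, 5],
    [None, 3, 1],
]

completed = impute_most_common(survey)
for row in completed:
    print(row)
```

The completed rows could then be given to any classifier. Note that this baseline treats the ordinal levels as plain categories and ignores their order, which is one reason it serves only as a point of comparison.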

Files in this item

Name: whole.pdf

Size: 1.244Mb

Format: PDF

This item appears in the following Collection(s)

Masters Theses - Authenticated Access [6749]
