dc.contributor.advisor |
Dobbie, G |
en |
dc.contributor.advisor |
Alam, S |
en |
dc.contributor.author |
Sun, Xiaobin |
en |
dc.date.accessioned |
2014-12-07T22:32:59Z |
en |
dc.date.issued |
2014 |
en |
dc.identifier.citation |
2014 |
en |
dc.identifier.uri |
http://hdl.handle.net/2292/23699 |
en |
dc.description |
Full text is available to authenticated members of The University of Auckland only. |
en |
dc.description.abstract |
The issue of missing data is a common problem for researchers and data analysts working with surveys and other types of questionnaires that use ordinal data. Despite the frequent occurrence and the relevance of this missing data problem, many machine learning algorithms handle missing data in a rather naive way. The standard approach involves first imputing the missing values, and then giving the completed imputed data to the learning algorithm. One advantage of this approach is that it allows the user to select the most suitable imputation method for different datasets. However, the classification result is not promising. Su et al. proposed an algorithm called “Classifier-based Nominal Imputation” (CNI), which improves the classification problem for machine learning algorithms on incomplete nominal datasets, but the performance on ordinal data remains unknown. Our work applied this CNI technique to ordinal data and the experimental results showed that using this CNI algorithm to pre-process the incomplete ordinal dataset, resulted in significantly higher classification accuracy than learners that do not apply any imputation method and those using baseline imputation techniques, such as the most common value imputation. This CNI algorithm is found to be helpful for many learners such as K Nearest Neighbour, Naive Bayes and Multilayer Perceptron Neural Networks on incomplete ordinal data. |
en |
dc.publisher |
ResearchSpace@Auckland |
en |
dc.relation.ispartof |
Masters Thesis - University of Auckland |
en |
dc.rights |
Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. |
en |
dc.rights |
Restricted Item. Available to authenticated members of The University of Auckland. |
en |
dc.rights.uri |
https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm |
en |
dc.rights.uri |
http://creativecommons.org/licenses/by-nc-sa/3.0/nz/ |
en |
dc.title |
Clustering Incomplete Ordinal Data |
en |
dc.type |
Thesis |
en |
thesis.degree.grantor |
The University of Auckland |
en |
thesis.degree.level |
Masters |
en |
dc.rights.holder |
Copyright: The Author |
en |
pubs.elements-id |
468244 |
en |
pubs.record-created-at-source-date |
2014-12-08 |
en |
dc.identifier.wikidata |
Q112907230 |
|