dc.contributor.advisor |
Dobbie, Gillian |
|
dc.contributor.author |
Derhamy, Seyed Hosein |
|
dc.date.accessioned |
2021-11-23T22:03:45Z |
|
dc.date.available |
2021-11-23T22:03:45Z |
|
dc.date.issued |
2021 |
en |
dc.identifier.uri |
https://hdl.handle.net/2292/57532 |
|
dc.description |
Full Text is available to authenticated members of The University of Auckland only. |
en |
dc.description.abstract |
New Zealand is divided into 20 independent District Health Boards (DHBs) for the delivery of healthcare services. Most DHBs have their own independent software system to manage their purchases and suppliers, and items in each of these systems have their own naming conventions. In order to ascertain and analyse the spread of spending in healthcare, there is a drive to standardise and centralise historic information automatically.
The aim of this research is to consolidate the spend data across all DHBs by classifying the data into categories . We propose the use of classification algorithms in order to achieve highest accuracy when categorising DHB spend data. Given the large number of response values which exceeds 1,800 unique values, accuracy is of paramount importance. The random forest model was the best performing model and given abundant labeled data gave us an F1-score of 0.98.
Since our labeled data for certain DHBs is larger than other DHBs we also explore transfer learning and domain adaptation techniques to improve the prediction accuracy of the DHB with scarce amounts of labeled data. We use the data from the DHB with abundant labeled data by either adapting the data to fit the target domain or transferring parts of an actual model built using the abundant data to the target model. All methods have positive outcomes and increase the accuracy of our prediction model. However, the transfer learning method where parts of the actual model (knowledge) are transferred improved the F1-score from 0.47 to 0.72. This method showed the greatest accuracy improvement of any of the techniques used in our thesis. |
|
dc.publisher |
ResearchSpace@Auckland |
en |
dc.relation.ispartof |
Masters Thesis - University of Auckland |
en |
dc.relation.isreferencedby |
UoA |
en |
dc.rights |
Restricted Item. Full Text is available to authenticated members of The University of Auckland only. |
en |
dc.rights |
Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. |
|
dc.rights.uri |
https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm |
en |
dc.rights.uri |
http://creativecommons.org/licenses/by-nc-sa/3.0/nz/ |
|
dc.title |
Categorisation of Healthcare spend data using Machine Learning and Transfer Learning |
|
dc.type |
Thesis |
en |
thesis.degree.discipline |
Computer Science |
|
thesis.degree.grantor |
The University of Auckland |
en |
thesis.degree.level |
Masters |
en |
dc.date.updated |
2021-10-25T07:57:10Z |
|
dc.rights.holder |
Copyright: the author |
en |
dc.identifier.wikidata |
Q112955124 |
|