Categorisation of Healthcare spend data using Machine Learning and Transfer Learning

Show simple item record

dc.contributor.advisor Dobbie, Gillian
dc.contributor.author Derhamy, Seyed Hosein
dc.date.accessioned 2021-11-23T22:03:45Z
dc.date.available 2021-11-23T22:03:45Z
dc.date.issued 2021 en
dc.identifier.uri https://hdl.handle.net/2292/57532
dc.description Full Text is available to authenticated members of The University of Auckland only. en
dc.description.abstract New Zealand is divided into 20 independent District Health Boards (DHBs) for the delivery of healthcare services. Most DHBs have their own independent software system to manage their purchases and suppliers, and items in each of these systems have their own naming conventions. In order to ascertain and analyse the spread of spending in healthcare, there is a drive to standardise and centralise historic information automatically. The aim of this research is to consolidate the spend data across all DHBs by classifying the data into categories . We propose the use of classification algorithms in order to achieve highest accuracy when categorising DHB spend data. Given the large number of response values which exceeds 1,800 unique values, accuracy is of paramount importance. The random forest model was the best performing model and given abundant labeled data gave us an F1-score of 0.98. Since our labeled data for certain DHBs is larger than other DHBs we also explore transfer learning and domain adaptation techniques to improve the prediction accuracy of the DHB with scarce amounts of labeled data. We use the data from the DHB with abundant labeled data by either adapting the data to fit the target domain or transferring parts of an actual model built using the abundant data to the target model. All methods have positive outcomes and increase the accuracy of our prediction model. However, the transfer learning method where parts of the actual model (knowledge) are transferred improved the F1-score from 0.47 to 0.72. This method showed the greatest accuracy improvement of any of the techniques used in our thesis.
dc.publisher ResearchSpace@Auckland en
dc.relation.ispartof Masters Thesis - University of Auckland en
dc.relation.isreferencedby UoA en
dc.rights Restricted Item. Full Text is available to authenticated members of The University of Auckland only. en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/nz/
dc.title Categorisation of Healthcare spend data using Machine Learning and Transfer Learning
dc.type Thesis en
thesis.degree.discipline Computer Science
thesis.degree.grantor The University of Auckland en
thesis.degree.level Masters en
dc.date.updated 2021-10-25T07:57:10Z
dc.rights.holder Copyright: the author en
dc.identifier.wikidata Q112955124


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics