dc.contributor.author |
Chen, Guocai |
en |
dc.contributor.author |
Warren, James |
en |
dc.contributor.author |
Riddle, Patricia |
en |
dc.date.accessioned |
2012-03-14T21:00:40Z |
en |
dc.date.issued |
2010 |
en |
dc.identifier.citation |
Journal of Biomedical Informatics 43(5):725-735 01 Oct 2010 |
en |
dc.identifier.issn |
1532-0464 |
en |
dc.identifier.uri |
http://hdl.handle.net/2292/14365 |
en |
dc.description.abstract |
To deal with the quantity and quality issues with online healthcare resources, creating web portals centred on particular health topics and/or communities of users is a strategy to provide access to a reduced corpus of information resources that meet quality and relevance criteria. In this paper we use hyperspace analogue to language (HAL) to model the language use patterns of webpages as Semantic Spaces. We have applied machine learning methods, including support vector machine (SVM), decision forest, and a novel summed similarity measure (SSM) to automatically classify online webpages on their Semantic Space models. We find classification accuracy on metadata attributes to be over 93% for ‘medical’ versus ‘supportive’ perspective, over 92% for disease stage of ‘early’ versus ‘advanced’, and over 90% for author credentials of ‘lay’ versus ‘clinician’ based on webpages of the Breast Cancer Knowledge Online portal. These results indicate that language use patterns can be used to automate such classification with useful levels of accuracy. |
en |
dc.publisher |
Elsevier |
en |
dc.relation.ispartofseries |
Journal of Biomedical Informatics |
en |
dc.rights |
Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. Details obtained from: http://www.sherpa.ac.uk/romeo/issn/1532-0464/ |
en |
dc.rights.uri |
https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm |
en |
dc.title |
Semantic Space Models for Classification of Consumer Web Pages on Metadata Attributes |
en |
dc.type |
Journal Article |
en |
dc.identifier.doi |
10.1016/j.jbi.2010.06.005 |
en |
pubs.begin-page |
725 |
en |
pubs.volume |
43 |
en |
dc.rights.holder |
Copyright: Elsevier |
en |
dc.identifier.pmid |
20601122 |
en |
pubs.end-page |
735 |
en |
dc.rights.accessrights |
http://purl.org/eprint/accessRights/RestrictedAccess |
en |
pubs.subtype |
Article |
en |
pubs.elements-id |
182781 |
en |
dc.relation.isnodouble |
9186 |
* |
pubs.org-id |
Science |
en |
pubs.org-id |
School of Computer Science |
en |
pubs.record-created-at-source-date |
2010-11-23 |
en |
pubs.dimensions-id |
20601122 |
en |