Semantic Space Models for Classification of Consumer Web Pages on Metadata Attributes

Show simple item record

dc.contributor.author Chen, Guocai en
dc.contributor.author Warren, James en
dc.contributor.author Riddle, Patricia en
dc.date.accessioned 2012-03-14T21:00:40Z en
dc.date.issued 2010 en
dc.identifier.citation Journal of Biomedical Informatics 43(5):725-735 01 Oct 2010 en
dc.identifier.issn 1532-0464 en
dc.identifier.uri http://hdl.handle.net/2292/14365 en
dc.description.abstract To deal with the quantity and quality issues with online healthcare resources, creating web portals centred on particular health topics and/or communities of users is a strategy to provide access to a reduced corpus of information resources that meet quality and relevance criteria. In this paper we use hyperspace analogue to language (HAL) to model the language use patterns of webpages as Semantic Spaces. We have applied machine learning methods, including support vector machine (SVM), decision forest, and a novel summed similarity measure (SSM) to automatically classify online webpages on their Semantic Space models. We find classification accuracy on metadata attributes to be over 93% for ‘medical’ versus ‘supportive’ perspective, over 92% for disease stage of ‘early’ versus ‘advanced’, and over 90% for author credentials of ‘lay’ versus ‘clinician’ based on webpages of the Breast Cancer Knowledge Online portal. These results indicate that language use patterns can be used to automate such classification with useful levels of accuracy. en
dc.publisher Elsevier en
dc.relation.ispartofseries Journal of Biomedical Informatics en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. Details obtained from: http://www.sherpa.ac.uk/romeo/issn/1532-0464/ en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.title Semantic Space Models for Classification of Consumer Web Pages on Metadata Attributes en
dc.type Journal Article en
dc.identifier.doi 10.1016/j.jbi.2010.06.005 en
pubs.begin-page 725 en
pubs.volume 43 en
dc.rights.holder Copyright: Elsevier en
dc.identifier.pmid 20601122 en
pubs.end-page 735 en
dc.rights.accessrights http://purl.org/eprint/accessRights/RestrictedAccess en
pubs.subtype Article en
pubs.elements-id 182781 en
dc.relation.isnodouble 9186 *
pubs.org-id Science en
pubs.org-id School of Computer Science en
pubs.record-created-at-source-date 2010-11-23 en
pubs.dimensions-id 20601122 en


Files in this item

There are no files associated with this item.

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics