Thematic signatures for cleansing and enriching place-related linked data

Show simple item record

dc.contributor.author Adams, Benjamin en
dc.contributor.author Janowicz, K en
dc.date.accessioned 2015-04-13T05:32:02Z en
dc.date.issued 2015 en
dc.identifier.citation International Journal of Geographical Information Science 29(4):556-579 2015 en
dc.identifier.issn 1365-8816 en
dc.identifier.uri http://hdl.handle.net/2292/25202 en
dc.description.abstract There has been significant progress transforming semi-structured data about places into knowledge graphs that can be used in a wide variety of geographic information systems such as digital gazetteers or geographic information retrieval systems. For instance, in addition to information about events, actors, and objects, DBpedia contains data about hundreds of thousands of places from Wikipedia and publishes it as Linked Data. Repositories that store data about places are among the most interlinked hubs on the Linked Data cloud. However, most content about places resides in unstructured natural language text, and therefore it is not captured in these knowledge graphs. Instead, place representations are limited to facts such as their population counts, geographic locations, and relations to other entities, for example, headquarters of companies or historical figures. In this paper, we present a novel method to enrich the information stored about places in knowledge graphs using thematic signatures that are derived from unstructured text through the process of topic modeling. As proof of concept, we demonstrate that this enables the automatic categorization of articles into place types defined in the DBpedia ontology (e.g., mountain) and also provides a mechanism to infer relationships between place types that are not captured in existing ontologies. This method can also be used to uncover miscategorized places, which is a common problem arising from the automatic lifting of unstructured and semi-structured data. en
dc.relation.ispartofseries International Journal of Geographical Information Science en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. Details obtained from http://www.sherpa.ac.uk/romeo/issn/1365-8816/ en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.title Thematic signatures for cleansing and enriching place-related linked data en
dc.type Journal Article en
dc.identifier.doi 10.1080/13658816.2014.989855 en
pubs.issue 4 en
pubs.begin-page 556 en
pubs.volume 29 en
pubs.end-page 579 en
dc.rights.accessrights http://purl.org/eprint/accessRights/RestrictedAccess en
pubs.subtype Article en
pubs.elements-id 462776 en
dc.identifier.eissn 1365-8824 en
pubs.record-created-at-source-date 2015-03-06 en


Files in this item

There are no files associated with this item.

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics