Automated Reasoning about the Entity Integrity of Big Data in Possibilistic SQL

Show simple item record

dc.contributor.author Litvinenko, I en
dc.contributor.author Wei, Z en
dc.contributor.author Link, S en
dc.date.accessioned 2022-01-14T03:42:09Z
dc.date.available 2022-01-14T03:42:09Z
dc.date.issued 2020 en
dc.identifier.citation CDMTCS Research Reports CDMTCS-541 (2020) en
dc.identifier.issn 1178-3540 en
dc.identifier.uri https://hdl.handle.net/2292/58004
dc.description.abstract SQL is the de-facto industry standard for data management. Besides relational data, SQL has been extended to also manage object-relational and Web-based data. It is likely that many instances of big data will also be managed by extensions of the current SQL standard. We introduce the classes of keys and functional dependencies over possibilistic databases with duplicate and missing information. Our main contribution is to equip SQL with reasoning capabilities about the semantics of big data that may feature the volume, variety, and veracity dimensions. These capabilities are fundamental to reason about entity integrity and essential for database design as functional dependencies are sources of data redundancy, and keys prevent data redundancy. Since SQL controls the occurrences of missing information with NOT NULL constraints, we also include possibilistic extensions of this constraint in our investigation. We illustrate applications, and establish axiomatic, algorithmic, and logical characterizations to the PTIME-complete implication problem associated with the combined class of these integrity constraints. Specifically, we show that keys behave just like goal clauses and FDs just like definite clauses in Boolean propositional Horn logic, and we can therefore apply linear resolution to reason about them.
dc.publisher Department of Computer Science, The University of Auckland, New Zealand en
dc.relation.ispartofseries CDMTCS Research Report Series en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.source.uri https://www.cs.auckland.ac.nz/research/groups/CDMTCS/researchreports/index.php en
dc.title Automated Reasoning about the Entity Integrity of Big Data in Possibilistic SQL en
dc.type Technical Report en
dc.subject.marsden Fields of Research en
dc.rights.holder Copyright: The author(s) en
dc.rights.accessrights http://purl.org/eprint/accessRights/OpenAccess en


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics