SQL Schema Design: Foundations, Normal forms, and Normalization

Show simple item record

dc.contributor.author Koehler, H en
dc.contributor.author Link, Sebastian en
dc.coverage.spatial San Francisco, CA, USA en
dc.date.accessioned 2016-06-27T04:43:39Z en
dc.date.issued 2016-06-01 en
dc.identifier.citation Proceedings of the 2016 International Conference on Management of Data, 2016, pp. 267 - 279 (13) en
dc.identifier.isbn 978-1-4503-3531-7 en
dc.identifier.uri http://hdl.handle.net/2292/29202 en
dc.description.abstract Normalization helps us find a database schema at design time that can process the most frequent updates efficiently at run time. Unfortunately, relational normalization only works for idealized database instances in which duplicates and null markers are not present. On one hand, these features occur frequently in real-world data compliant with the industry standard SQL, and especially in modern application domains. On the other hand, the features impose challenges that have made it impossible so far to extend the existing forty year old normalization framework to SQL. We introduce a new class of functional dependencies and show that they provide the right notion for SQL schema design. Axiomatic and linear-time algorithmic characterizations of the associated implication problem are established. These foundations enable us to propose a Boyce-Codd normal form for SQL. Indeed, we justify the normal form by showing that it permits precisely those SQL instances which are free from data redundancy. Unlike the relational case, there are SQL schemata that cannot be converted into Boyce-Codd normal form. Nevertheless, for an expressive sub-class of our functional dependencies we establish a normalization algorithm that always produces a schema in Value-Redundancy free normal form. This normal form permits precisely those instances which are free from any redundant data value occurrences other than the null marker. Experiments show that our functional dependencies occur frequently in real-world data and that they are effective in eliminating redundant values from these data sets without loss of information. en
dc.description.uri http://dl.acm.org/citation.cfm?doid=2882903.2915239 en
dc.relation.ispartof International Conference on Management of Data (SIGMOD) en
dc.relation.ispartofseries Proceedings of the 2016 International Conference on Management of Data en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.title SQL Schema Design: Foundations, Normal forms, and Normalization en
dc.type Conference Item en
dc.identifier.doi 10.1145/2882903.2915239 en
pubs.begin-page 267 en
dc.description.version AM - Accepted Manuscript en
pubs.author-url http://doi.acm.org/10.1145/2882903.2915239 en
pubs.end-page 279 en
pubs.finish-date 2016-07-01 en
pubs.publication-status Published en
pubs.start-date 2016-06-26 en
dc.rights.accessrights http://purl.org/eprint/accessRights/RestrictedAccess en
pubs.subtype Conference Paper en
pubs.elements-id 531375 en
dc.relation.isnodouble 656973 *
pubs.org-id Science en
pubs.org-id School of Computer Science en
pubs.record-created-at-source-date 2016-06-23 en


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics