Computing optimal repairs for functional dependencies

Ester Livshits, Benny Kimelfeld, Sudeepa Roy

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We investigate the complexity of computing an optimal repair of an inconsistent database, in the case where integrity constraints are Functional Dependencies (FDs). We focus on two types of repairs: an optimal subset repair (optimal S-repair) that is obtained by a minimum number of tuple deletions, and an optimal update repair (optimal U-repair) that is obtained by a minimum number of value (cell) updates. For computing an optimal S-repair, we present a polynomial-time algorithm that succeeds on certain sets of FDs and fails on others. We prove the following about the algorithm. When it succeeds, it can also incorporate weighted tuples and duplicate tuples. When it fails, the problem is NP-hard, and in fact, APX-complete (hence, cannot be approximated better than some constant). Thus, we establish a dichotomy in the complexity of computing an optimal S-repair. We present general analysis techniques for the complexity of computing an optimal U-repair, some based on the dichotomy for S-repairs. We also draw a connection to a past dichotomy in the complexity of finding a “most probable database” that satisfies a set of FDs with a single attribute on the left hand side; the case of general FDs was left open, and we show how our dichotomy provides the missing generalization and thereby settles the open problem.

Original languageEnglish
Title of host publicationPODS 2018 - Proceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems
EditorsJan Van den Bussche, Mart�n Ugarte, Marcelo Arenas
Pages225-237
Number of pages13
ISBN (Electronic)9781450347068
DOIs
StatePublished - 27 May 2018
Event37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2018 - Houston, United States
Duration: 10 Jun 201815 Jun 2018

Publication series

NameProceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems

Conference

Conference37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2018
Country/TerritoryUnited States
CityHouston
Period10/06/1815/06/18

Keywords

  • Approximation
  • Cardinality Repairs
  • Database Cleaning
  • Dichotomy
  • Functional Dependencies
  • Inconsistent Databases
  • Optimal Repairs
  • Value Repairs

All Science Journal Classification (ASJC) codes

  • Software
  • Information Systems
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Computing optimal repairs for functional dependencies'. Together they form a unique fingerprint.

Cite this