TY - GEN
T1 - Database repairing with soft functional dependencies
AU - Carmeli, Nofar
AU - Grohe, Martin
AU - Kimelfeld, Benny
AU - Livshits, Ester
AU - Tibi, Muhammad
N1 - Publisher Copyright: © Nofar Carmeli, Martin Grohe, Benny Kimelfeld, Ester Livshits, and Muhammad Tibi.
PY - 2021/3/1
Y1 - 2021/3/1
N2 - A common interpretation of soft constraints penalizes the database for every violation of every constraint, where the penalty is the cost (weight) of the constraint. A computational challenge is that of finding an optimal subset: a collection of database tuples that minimizes the total penalty when each tuple has a cost of being excluded. When the constraints are strict (i.e., have an infinite cost), this subset is a "cardinality repair"of an inconsistent database; in soft interpretations, this subset corresponds to a "most probable world"of a probabilistic database, a "most likely intention"of a probabilistic unclean database, and so on. Within the class of functional dependencies, the complexity of finding a cardinality repair is thoroughly understood. Yet, very little is known about the complexity of finding an optimal subset for the more general soft semantics. This paper makes a significant progress in this direction. In addition to general insights about the hardness and approximability of the problem, we present algorithms for two special cases: a single functional dependency, and a bipartite matching. The latter is the problem of finding an optimal "almost matching"of a bipartite graph where a penalty is paid for every lost edge and every violation of monogamy.
AB - A common interpretation of soft constraints penalizes the database for every violation of every constraint, where the penalty is the cost (weight) of the constraint. A computational challenge is that of finding an optimal subset: a collection of database tuples that minimizes the total penalty when each tuple has a cost of being excluded. When the constraints are strict (i.e., have an infinite cost), this subset is a "cardinality repair"of an inconsistent database; in soft interpretations, this subset corresponds to a "most probable world"of a probabilistic database, a "most likely intention"of a probabilistic unclean database, and so on. Within the class of functional dependencies, the complexity of finding a cardinality repair is thoroughly understood. Yet, very little is known about the complexity of finding an optimal subset for the more general soft semantics. This paper makes a significant progress in this direction. In addition to general insights about the hardness and approximability of the problem, we present algorithms for two special cases: a single functional dependency, and a bipartite matching. The latter is the problem of finding an optimal "almost matching"of a bipartite graph where a penalty is paid for every lost edge and every violation of monogamy.
KW - Database inconsistency
KW - Database repairs
KW - Functional dependencies
KW - Integrity constraints
KW - Soft constraints
UR - http://www.scopus.com/inward/record.url?scp=85115232551&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.ICDT.2021.16
DO - 10.4230/LIPIcs.ICDT.2021.16
M3 - منشور من مؤتمر
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 24th International Conference on Database Theory, ICDT 2021
A2 - Yi, Ke
A2 - Wei, Zhewei
T2 - 24th International Conference on Database Theory, ICDT 2021
Y2 - 23 March 2021 through 26 March 2021
ER -