TY - GEN
T1 - Discovering reliable causal rules
AU - Budhathoki, Kailash
AU - Boley, Mario
AU - Vreeken, Jilles
N1 - Publisher Copyright: © 2021 by SIAM.
PY - 2021
Y1 - 2021
N2 - We study the problem of deriving policies, or rules, that when enacted on a complex system, cause a desired outcome. Absent the ability to perform controlled experiments, such rules have to be inferred from past observations of the system’s behaviour. This is a challenging problem for two reasons: First, observational effects are often unrepresentative of the underlying causal effect because they are skewed by the presence of confounding factors. Second, naive empirical estimations of a rule’s effect have a high variance, and, hence, their maximisation typically leads to spurious results. To address these issues, we first identify conditions on the underlying causal system that—by correcting for the effect of potential confounders—allow estimating the causal effect from observational data. Importantly, we provide a criterion under which causal rule discovery is possible. Moreover, to discover reliable causal rules from a sample, we propose a conservative and consistent estimator of the causal effect, and derive an efficient and exact algorithm that maximises the estimator. Extensive experiments on a variety of real-world and synthetic datasets show that the proposed estimator converges faster to the ground truth than the naive estimator, recovers causal rules even at small sample sizes, and the proposed algorithm efficiently discovers meaningful rules.
AB - We study the problem of deriving policies, or rules, that when enacted on a complex system, cause a desired outcome. Absent the ability to perform controlled experiments, such rules have to be inferred from past observations of the system’s behaviour. This is a challenging problem for two reasons: First, observational effects are often unrepresentative of the underlying causal effect because they are skewed by the presence of confounding factors. Second, naive empirical estimations of a rule’s effect have a high variance, and, hence, their maximisation typically leads to spurious results. To address these issues, we first identify conditions on the underlying causal system that—by correcting for the effect of potential confounders—allow estimating the causal effect from observational data. Importantly, we provide a criterion under which causal rule discovery is possible. Moreover, to discover reliable causal rules from a sample, we propose a conservative and consistent estimator of the causal effect, and derive an efficient and exact algorithm that maximises the estimator. Extensive experiments on a variety of real-world and synthetic datasets show that the proposed estimator converges faster to the ground truth than the naive estimator, recovers causal rules even at small sample sizes, and the proposed algorithm efficiently discovers meaningful rules.
UR - http://www.scopus.com/inward/record.url?scp=85120951690&partnerID=8YFLogxK
U2 - 10.1137/1.9781611976700.1
DO - 10.1137/1.9781611976700.1
M3 - Conference contribution
T3 - SIAM International Conference on Data Mining, SDM 2021
SP - 1
EP - 9
BT - SIAM International Conference on Data Mining, SDM 2021
PB - Siam Society
T2 - 2021 SIAM International Conference on Data Mining, SDM 2021
Y2 - 29 April 2021 through 1 May 2021
ER -