Declarative probabilistic programming with datalog

Vince Barany, Balder Ten Cate, Benny Kimelfeld, Dan Olteanu, Zografoula Vagena

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Probabilistic programming languages are used for developing statistical models, and they typically consist of two components: a specification of a stochastic process (the prior), and a specification of observations that restrict the probability space to a conditional subspace (the posterior). Use cases of such formalisms include the development of algorithms in machine learning and artificial intelligence. We propose and investigate an extension of Datalog for specifying statistical models, and establish a declarative probabilistic-programming paradigm over databases. Our proposed extension provides convenient mechanisms to include common numerical probability functions; in particular, conclusions of rules may contain values drawn from such functions. The semantics of a program is a probability distribution over the possible outcomes of the input database with respect to the program. Observations are naturally incorporated by means of integrity constraints over the extensional and intensional relations. The resulting semantics is robust under different chases and invariant to rewritings that preserve logical equivalence.

Original languageEnglish
Title of host publication19th International Conference on Database Theory, ICDT 2016
EditorsThomas Zeume, Wim Martens
ISBN (Electronic)9783959770026
DOIs
StatePublished - 1 Mar 2016
Event19th International Conference on Database Theory, ICDT 2016 - Bordeaux, France
Duration: 15 Mar 201618 Mar 2016

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume48

Conference

Conference19th International Conference on Database Theory, ICDT 2016
Country/TerritoryFrance
CityBordeaux
Period15/03/1618/03/16

Keywords

  • Chase
  • Datalog
  • Probabilistic Programming
  • Probability Measure Space

All Science Journal Classification (ASJC) codes

  • Software

Cite this