Provenance-based analysis of data-centric processes

Research output: Contribution to journalArticlepeer-review

Abstract

We consider in this paper static analysis of the possible executions of data-dependent applications, namely applications whose control flow is guided by a finite-state machine, as well as by the state of an underlying database. We note that previous work in this context has not addressed two important features of such analysis, namely analysis under hypothetical scenarios, such as changes to the application’s state machine and/or to the underlying database, and the consideration of meta-data, such as cost or access privileges. Observing that semiring-based provenance has been proven highly effective in supporting these two features for database queries, we develop in this paper a semiring-based provenance framework for the analysis of data-dependent processes, accounting for hypothetical reasoning and meta-data. The development addresses two interacting new challenges: (1) combining provenance annotations for both information that resides in the database and information about external inputs (e.g., user choices) and (2) finitely capturing infinitely many process executions. We have implemented our framework as part of the PROPOLIS system.

Original languageEnglish
Pages (from-to)583-607
Number of pages25
JournalVLDB Journal
Volume24
Issue number4
DOIs
StatePublished - 24 Aug 2015

Keywords

  • Process analysis
  • Provenance semirings

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Provenance-based analysis of data-centric processes'. Together they form a unique fingerprint.

Cite this