A Formal Language Perspective on Factorized Representations

Benny Kimelfeld, Wim Martens, Matthias Niewerth

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Factorized representations (FRs) are a well-known tool to succinctly represent results of join queries and have been originally defined using the named database perspective. We define FRs in the unnamed database perspective and use them to establish several new connections. First, unnamed FRs can be exponentially more succinct than named FRs, but this difference can be alleviated by imposing a disjointness condition on columns. Conversely, named FRs can also be exponentially more succinct than unnamed FRs. Second, unnamed FRs are the same as (i.e., isomorphic to) context-free grammars for languages in which each word has the same length. This tight connection allows us to transfer a wide range of results on context-free grammars to database factorization; of which we offer a selection in the paper. Third, when we generalize unnamed FRs to arbitrary sets of tuples, they become a generalization of path multiset representations, a formalism that was recently introduced to succinctly represent sets of paths in the context of graph database query evaluation.

Original languageEnglish
Title of host publication28th International Conference on Database Theory, ICDT 2025
EditorsSudeepa Roy, Ahmet Kara
ISBN (Electronic)9783959773645
DOIs
StatePublished - 21 Mar 2025
Event28th International Conference on Database Theory, ICDT 2025 - Barcelona, Spain
Duration: 25 Mar 202528 Mar 2025

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume328

Conference

Conference28th International Conference on Database Theory, ICDT 2025
Country/TerritorySpain
CityBarcelona
Period25/03/2528/03/25

Keywords

  • compact representations
  • Databases
  • factorized databases
  • graph databases
  • regular path queries
  • relational databases

All Science Journal Classification (ASJC) codes

  • Software

Fingerprint

Dive into the research topics of 'A Formal Language Perspective on Factorized Representations'. Together they form a unique fingerprint.

Cite this