A relational framework for information extraction

Ronald Fagin, Benny Kimelfeld, Frederick Reiss, Stijn Vansummeren

Research output: Contribution to journalArticlepeer-review

Abstract

Information Extraction commonly refers to the task of populating a relational schema, having predefined underlying semantics, from textual content. This task is pervasive in contemporary computational challenges associated with Big Data. In this article we provide an overview of our work on document spanners-a relational framework for Information Extraction that is inspired by rule-based systems such as IBM's SystemT.

Original languageEnglish
Pages (from-to)5-16
Number of pages12
JournalSIGMOD Record
Volume44
Issue number4
DOIs
StatePublished - Dec 2015

Keywords

  • Automata
  • Document spanners
  • Inconsistency
  • Information extraction
  • Prioritized repairs
  • Regular expressions

All Science Journal Classification (ASJC) codes

  • Software
  • Information Systems

Fingerprint

Dive into the research topics of 'A relational framework for information extraction'. Together they form a unique fingerprint.

Cite this