Abstract
Information Extraction commonly refers to the task of populating a relational schema, having predefined underlying semantics, from textual content. This task is pervasive in contemporary computational challenges associated with Big Data. In this article we provide an overview of our work on document spanners-a relational framework for Information Extraction that is inspired by rule-based systems such as IBM's SystemT.
| Original language | English |
|---|---|
| Pages (from-to) | 5-16 |
| Number of pages | 12 |
| Journal | SIGMOD Record |
| Volume | 44 |
| Issue number | 4 |
| DOIs | |
| State | Published - Dec 2015 |
Keywords
- Automata
- Document spanners
- Inconsistency
- Information extraction
- Prioritized repairs
- Regular expressions
All Science Journal Classification (ASJC) codes
- Software
- Information Systems
Fingerprint
Dive into the research topics of 'A relational framework for information extraction'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver