Tutorial: Uncertain entity resolution: Re-evaluating entity resolution in the big data era

Research output: Contribution to journalConference articlepeer-review

Abstract

Entity resolution is a fundamental problem in data integration dealing with the combination of data from different sources to a unified view of the data. Entity resolution is inherently an uncertain process because the decision to map a set of records to the same entity cannot be made with certainty unless these are identical in all of their attributes or have a common key. In the light of recent advancement in data accumulation, management, and analytics landscape (known as big data) the tutorial re-evaluates the entity resolution process and in particular looks at best ways to handle data veracity. The tutorial ties entity resolution with recent advances in probabilistic database research, focusing on sources of uncertainty in the entity resolution process.

Original languageEnglish
Pages (from-to)1711-1712
Number of pages2
JournalProceedings of the VLDB Endowment
Volume7
Issue number13
DOIs
StatePublished - 2014
EventProceedings of the 40th International Conference on Very Large Data Bases, VLDB 2014 - Hangzhou, China
Duration: 1 Sep 20145 Sep 2014

Keywords

  • Big data
  • Data integration
  • Entity resolution
  • Uncertainty management

All Science Journal Classification (ASJC) codes

  • Computer Science (miscellaneous)
  • General Computer Science

Cite this