Breaking the subtopic barrier in cross-document event coreference resolution

Michael Bugert, Nils Reimers, Shany Barhom, Ido Dagan, Iryna Gurevych

Research output: Contribution to journalConference articlepeer-review

Abstract

Cross-document event coreference resolution (CDCR) is the task of detecting and clustering mentions of events across a set of documents. A major bottleneck in CDCR is a lack of appropriate datasets, which stems from the difficulty of annotating data for this task. We present the first scalable approach for annotating cross-subtopic event coreference links, a highly valuable but rarely occurring type of cross-document link. The annotation of these links requires combing through hundreds of documents - an endeavor for which conventional token-level annotation schemes with trained expert annotators are too expensive. We instead propose crowdsourcing annotation on sentence level to achieve scalability.

Original languageEnglish
Pages (from-to)23-29
Number of pages7
JournalCEUR Workshop Proceedings
Volume2593
StatePublished - 2020
Event3rd Workshop on Narrative Extraction From Texts, Text2Story 2020 - Lisbon, Portugal
Duration: 14 Apr 2020 → …

All Science Journal Classification (ASJC) codes

  • General Computer Science

Cite this