Skip to main navigation Skip to search Skip to main content

Semiautomatic construction of cross-period thesaurus

Research output: Contribution to journalArticlepeer-review

Abstract

A cross-period (diachronic) thesaurus enables users to search for information using modern terminology and obtain semantically related terms from earlier historical periods. The complex task of supporting the construction of a diachronic thesaurus by a domain expert lexicographer has hardly been addressed computationally until now. In this article, we introduce a semiautomatic iterative Query Expansion (QE) scheme for supporting diachronic thesaurus construction, which identifies candidate related terms based on statistical corpus-based measures. We use ancient-modern period classification to increase the performance of the statistical cooccurrence measures and extend our methods to deal with Multi-Word Expressions (MWEs). We demonstrate the empirical benefit of our scheme for a Jewish cross-period thesaurus and evaluate its impact on recall and on the effectiveness of the lexicographer's manual efforts.

Original languageEnglish
Article number22
JournalJournal on Computing and Cultural Heritage
Volume9
Issue number4
DOIs
StatePublished - Dec 2016

Keywords

  • Cultural heritage
  • Diachronic thesaurus
  • Hebrew
  • Semantic similarity

ASJC Scopus subject areas

  • Conservation
  • Information Systems
  • Computer Science Applications
  • Computer Graphics and Computer-Aided Design

Fingerprint

Dive into the research topics of 'Semiautomatic construction of cross-period thesaurus'. Together they form a unique fingerprint.

Cite this