Opening the knowledge dam: Speech recognition for video search

Vered Silber-Varod, Amir Winer, Nitza Geri

Research output: Contribution to journalArticlepeer-review

Abstract

Automatic Speech Recognition (ASR) may increase access to spoken information captured in videos. ASR is needed, especially for online academic video lectures that gradually replace class lectures and traditional textbooks. This conceptual article examines how technological barriers to ASR in under-resourced languages impair accessibility to video content and demonstrates it with the empirical findings of Hebrew ASR evaluations. We compare ASR with Optical Character Recognition (OCR) as facilitating access to textual and speech content and show their current performance in under-resourced languages. We target ASR of under-resourced languages as the main barrier to searching academic video lectures. We further show that information retrieval technologies, such as smart video players that combine both ASR and OCR capacities, must come to the fore once ASR technologies have matured. Therefore, suggesting that the current state of information retrieval from video lectures in under-resourced languages is equivalent to a knowledge dam.

Original languageEnglish
Pages (from-to)106-111
Number of pages6
JournalJournal of Computer Information Systems
Volume57
Issue number2
DOIs
StatePublished - 2017

Keywords

  • Academic video lectures
  • Automatic speech recognition (ASR)
  • Optical character recognition (OCR)
  • Search
  • Under-resourced languages

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Education
  • Computer Networks and Communications
  • Management Information Systems

Fingerprint

Dive into the research topics of 'Opening the knowledge dam: Speech recognition for video search'. Together they form a unique fingerprint.

Cite this