A semisupervised approach for language identification based on ladder networks

Ehud Ben-Reuven, Jacob Goldberger

Research output: Contribution to conferencePaperpeer-review

Abstract

In this study we address the problem of training a neural-network for language identification using both labeled and unlabeled speech samples in the form of i-vectors. We propose a neural network architecture that can also handle out-of-set languages. We utilize a modified version of the recently proposed Ladder Network semisupervised training procedure that optimizes the reconstruction costs of a stack of denoising autoencoders. We show that this approach can be successfully applied to the case where the training dataset is composed of both labeled and unlabeled acoustic data. The results show enhanced language identification on the NIST 2015 language identification dataset.

Original languageEnglish
Pages319-325
Number of pages7
DOIs
StatePublished - 2016
EventSpeaker and Language Recognition Workshop, Odyssey 2016 - Bilbao, Spain
Duration: 21 Jun 201624 Jun 2016

Conference

ConferenceSpeaker and Language Recognition Workshop, Odyssey 2016
Country/TerritorySpain
CityBilbao
Period21/06/1624/06/16

All Science Journal Classification (ASJC) codes

  • Signal Processing
  • Software
  • Human-Computer Interaction

Fingerprint

Dive into the research topics of 'A semisupervised approach for language identification based on ladder networks'. Together they form a unique fingerprint.

Cite this