Intra-cluster training strategy for deep learning with applications to language identification

Alan Joseph Bekker, Irit Opher, Itsik Lapidot, Jacob Goldberger

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this study we address the problem of training a neural network for language identification using speech samples in the form of i-vectors. Our approach involves training a classifier and analyzing the obtained confusion matrix. We cluster the languages by simultaneously clustering the columns and the rows of the confusion matrix. The language clusters are then used to define a modified cost function for training a neural-network that focuses on distinguishing between the true language and languages within the same cluster. The results show enhanced language identification on the NIST 2015 language identification dataset.

Original languageEnglish
Title of host publication2016 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings
EditorsKostas Diamantaras, Aurelio Uncini, Francesco A. N. Palmieri, Jan Larsen
PublisherIEEE Computer Society
ISBN (Electronic)9781509007462
DOIs
StatePublished - 8 Nov 2016
Event26th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings - Vietri sul Mare, Salerno, Italy
Duration: 13 Sep 201616 Sep 2016

Publication series

NameIEEE International Workshop on Machine Learning for Signal Processing, MLSP
Volume2016-November

Conference

Conference26th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings
Country/TerritoryItaly
CityVietri sul Mare, Salerno
Period13/09/1616/09/16

Keywords

  • Confusion matrix
  • clustering
  • language identification

All Science Journal Classification (ASJC) codes

  • Human-Computer Interaction
  • Signal Processing

Fingerprint

Dive into the research topics of 'Intra-cluster training strategy for deep learning with applications to language identification'. Together they form a unique fingerprint.

Cite this