Blind Vocoder Speech Reconstruction using Generative Adversarial Networks

Yoav Blum, David Burshtein

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The problem of reconstructing vocoder acoustic parameters using only encoded bit stream data is considered with applications to forensics and reverse engineering. Wasserstein generative adversarial networks (GANs) and CycleGANs, that map two unpaired domains, are used. It is shown that it is possible to reconstruct key acoustic parameters such as linear predictive coefficients (LPCs) when these parameters are encoded using scalar quantization. It is further shown that speech reconstruction is possible to some extent when it is known that the vocoder belongs to the family of code excited linear prediction (CELP) models, but the coded bit frame structure is unknown.

Original languageEnglish
Title of host publication2019 IEEE International Workshop on Information Forensics and Security, WIFS 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728132174
DOIs
StatePublished - Dec 2019
Event2019 IEEE International Workshop on Information Forensics and Security, WIFS 2019 - Delft, Netherlands
Duration: 9 Dec 201912 Dec 2019

Publication series

Name2019 IEEE International Workshop on Information Forensics and Security, WIFS 2019

Conference

Conference2019 IEEE International Workshop on Information Forensics and Security, WIFS 2019
Country/TerritoryNetherlands
CityDelft
Period9/12/1912/12/19

All Science Journal Classification (ASJC) codes

  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Library and Information Sciences
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Blind Vocoder Speech Reconstruction using Generative Adversarial Networks'. Together they form a unique fingerprint.

Cite this