TY - GEN
T1 - Deep Ranking-Based DOA Tracking Algorithm
AU - Opochinsky, Renana
AU - Chechik, Gal
AU - Gannot, Sharon
N1 - Publisher Copyright: © 2021 European Signal Processing Conference. All rights reserved.
PY - 2021
Y1 - 2021
N2 - In this study, we present a weak-supervised deep neural network-based tracking algorithm for a moving source. A triplet-loss network is trained with instantaneous spatial features to estimate the time-varying DOA. The core idea is to minimize the use of labeled samples (i.e. samples which are accurately localized, and difficult to acquire) by using instead partial knowledge drawn from an unlabeled, and much larger, dataset in which only the relative spatial ordering between the samples is known. We use a deep learning architecture that stochastically combines a triplet-ranking loss for the unlabeled samples and a spatial loss for the labelled samples and learns a nonlinear deep embedding that maps acoustic features to an azimuth angle of the source. We show that it is unnecessary to train the network with a large number of random trajectories of a moving source, and that triplets of static sources from the same locus, which can be more easily acquired, are sufficient. A simulation study demonstrates the applicability of the proposed method to dynamic problems.
AB - In this study, we present a weak-supervised deep neural network-based tracking algorithm for a moving source. A triplet-loss network is trained with instantaneous spatial features to estimate the time-varying DOA. The core idea is to minimize the use of labeled samples (i.e. samples which are accurately localized, and difficult to acquire) by using instead partial knowledge drawn from an unlabeled, and much larger, dataset in which only the relative spatial ordering between the samples is known. We use a deep learning architecture that stochastically combines a triplet-ranking loss for the unlabeled samples and a spatial loss for the labelled samples and learns a nonlinear deep embedding that maps acoustic features to an azimuth angle of the source. We show that it is unnecessary to train the network with a large number of random trajectories of a moving source, and that triplets of static sources from the same locus, which can be more easily acquired, are sufficient. A simulation study demonstrates the applicability of the proposed method to dynamic problems.
KW - Acoustic source tracking
KW - Deep embedding learning
KW - Relative transfer function
KW - Triplet-loss
UR - http://www.scopus.com/inward/record.url?scp=85123163117&partnerID=8YFLogxK
U2 - 10.23919/eusipco54536.2021.9616297
DO - 10.23919/eusipco54536.2021.9616297
M3 - منشور من مؤتمر
T3 - European Signal Processing Conference
SP - 1020
EP - 1024
BT - 29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings
T2 - 29th European Signal Processing Conference, EUSIPCO 2021
Y2 - 23 August 2021 through 27 August 2021
ER -