Collaborative Inference via Ensembles on the Edge

Nir Shlezinger, Erez Farhan, Hai Morgenstern, Yonina C Eldar

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

The success of deep neural networks (DNNs) as an enabler of artificial intelligence (AI) is heavily dependent on high computational resources. The increasing demands for accessible and personalized AI give rise to the need to operate DNNs on edge devices such as smartphones, sensors, and autonomous cars, whose computational powers are limited. Here we propose a framework for facilitating the application of DNNs on the edge in a manner which allows multiple users to collaborate during inference in order to improve their prediction accuracy. Our mechanism, referred to as edge ensembles, is based on having diverse predictors at each device, which can form a deep ensemble during inference. We analyze the latency induced in this collaborative inference approach, showing that the ability to improve performance via collaboration comes at the cost of a minor additional delay. Our experimental results demonstrate that collaborative inference via edge ensembles equipped with compact DNNs substantially improves the accuracy over having each user infer locally, and can outperform using a single centralized DNN larger than all the networks in the ensemble together.
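The ensemble step described in the abstract can be illustrated with a minimal sketch. It assumes that each device returns a vector of class logits and that the ensemble output is the average of the per-device softmax probabilities, a common deep-ensemble rule. The abstract does not state the aggregation rule actually used, and the function names and example values below are hypothetical.

```python
import math


def softmax(logits):
    """Convert one device's logits into class probabilities."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_predict(per_device_logits):
    """Average the per-device probabilities and pick the most likely class.

    Assumption: plain averaging of softmax outputs; this is an illustrative
    choice, not necessarily the rule used in the paper.
    """
    per_device_probs = [softmax(logits) for logits in per_device_logits]
    num_devices = len(per_device_probs)
    num_classes = len(per_device_probs[0])
    mean_probs = [
        sum(probs[c] for probs in per_device_probs) / num_devices
        for c in range(num_classes)
    ]
    label = max(range(num_classes), key=lambda c: mean_probs[c])
    return label, mean_probs


# Hypothetical logits from three collaborating devices for one input sample.
device_logits = [
    [2.0, 1.0, 0.1],  # device A favours class 0
    [0.5, 2.5, 0.3],  # device B favours class 1
    [0.2, 2.2, 0.1],  # device C favours class 1
]
label, mean_probs = ensemble_predict(device_logits)
print(label, [round(p, 3) for p in mean_probs])
```

In this toy case one device disagrees with the other two, and the averaged probabilities follow the majority. In the setting the abstract describes, each device would obtain these vectors from its neighbours over the network, which is where the additional delay comes from.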
Original language: English
Title of host publication: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Pages: 8478-8482
Number of pages: 5
Volume: 2021-June
ISBN (Electronic): 9781728176055
State: Published - 13 May 2021
Event: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Toronto, ON, Canada
Duration: 6 Jun 2021 to 11 Jun 2021

Conference

Conference: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Period: 6/06/21 to 11/06/21
