Functional Faults

Gali Sheffi, Erez Petrank

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Hardware and software faults increasingly surface in today's computing environment and vast theoretical and practical research efforts are devoted to ameliorate the effects of malfunctionality in the computing process. Most research to date, however, has focused on how to discover and handle faulty data. In this paper we formalize and study faulty functionality in a modern multicore shared-memory environment. Functional faults have been previously studied in the architecture community. However, they have never been formally defined and lower/upper bounds were not previously proven. We present a model of functional faults, and study avenues that allow tolerating functional faults while maintaining the correctness of the entire computation. We exemplify this model by constructing a robust consensus protocol from functionally-faulty compare-and-swap objects. We then show a (tight) impossibility result for the same construction, when the number of faults exceeds a certain threshold. Interestingly, for some fault types, more functional faults can be tolerated than the analogue data faults, beating an impossibility result for data faults and demonstrating the difference between the two models.

Original languageEnglish
Title of host publicationSPAA 2020 - Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures
Pages453-463
Number of pages11
ISBN (Electronic)9781450369350
DOIs
StatePublished - 6 Jul 2020
Event32nd ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2020 - Virtual, Online, United States
Duration: 15 Jul 202017 Jul 2020

Publication series

NameAnnual ACM Symposium on Parallelism in Algorithms and Architectures

Conference

Conference32nd ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2020
Country/TerritoryUnited States
CityVirtual, Online
Period15/07/2017/07/20

Keywords

  • concurrent algorithms
  • fault-tolerance
  • hardware faults
  • lowerbounds
  • shared memory
  • software faults

All Science Journal Classification (ASJC) codes

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Functional Faults'. Together they form a unique fingerprint.

Cite this