Slashing the Disaggregation Tax in Heterogeneous Data Centers with FractOS

Lluís Vilanova, Lina Maudlej, Shai Bergman, Till Miemietz, Matthias Hille, Nils Asmussen, Michael Roitzsch, Hermann Härtig, Mark Silberstein

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Disaggregated heterogeneous data centers promise higher efficiency, lower total costs of ownership, and more flexibility for data-center operators. However, current software stacks can levy a high tax on application performance. Applications and OSes are designed for systems where local PCIe-connected devices are centrally managed by CPUs, but this centralization introduces unnecessary messages through the shared data-center network in a disaggregated system. We present FractOS, a distributed OS that is designed to minimize the network overheads of disaggregation in heterogeneous data centers. FractOS elevates devices to be first-class citizens, enabling direct peer-to-peer data transfers and task invocations among them, without centralized application and OS control. FractOS achieves this through: (1) new abstractions to express distributed applications across services and disaggregated devices, (2) new mechanisms that enable devices to securely interact with each other and other data-center services, (3) a distributed and isolated OS layer that implements these abstractions and mechanisms, and can run on host CPUs and SmartNICs. Our prototype shows that FractOS accelerates real-world heterogeneous applications by 47%, while reducing their network traffic by 3×.

Original languageEnglish
Title of host publicationEuroSys 2022 - Proceedings of the 17th European Conference on Computer Systems
Pages352-367
Number of pages16
ISBN (Electronic)9781450391627
DOIs
StatePublished - 28 Mar 2022
Event17th European Conference on Computer Systems, EuroSys 2022 - Rennes, France
Duration: 5 Apr 2022 → …

Publication series

NameEuroSys 2022 - Proceedings of the 17th European Conference on Computer Systems

Conference

Conference17th European Conference on Computer Systems, EuroSys 2022
Country/TerritoryFrance
CityRennes
Period5/04/22 → …

Keywords

  • Capabilities
  • Data center
  • Distributed systems
  • Operating systems
  • Resource disaggregation

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Slashing the Disaggregation Tax in Heterogeneous Data Centers with FractOS'. Together they form a unique fingerprint.

Cite this