FELI: HW/SW support for on-chip distributed shared memory in multicores

Carlos Villavieja, Yoav Etsion, Alex Ramirez, Nacho Navarro

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Modern Chip Multiprocessors (CMPs) composed of accelerators and on-chip scratchpad memories are currently emerging as power-efficient architectures. However, these architectures are hard to program because they require efficient data allocation. In addition, when running legacy applications on these architectures, unless their code is adapted to utilize the distributed memory architecture, applications cannot benefit from their high computational power. In this paper, we propose FELI, a set of operating system mechanisms that allocate application data to on-chip memories without any user intervention. FELI, automatically maps data to on-chip memories using the address translation mechanism. It relies on a set of TLB counters, and dynamical migration of pages from off-chip memory to on-chip memory. We also introduce virtually tagged L0 caches to alleviate the address translation overhead. Moreover, we make a comparison in performance and power consumption versus a homogeneous cache-based CMP design. Our evaluation shows a 50% average improvement in power consumption with the scratchpad-based CMP compared to a cache-based CMP. And a 10% in average memory access time even accounting for the cost of page migrations and TLB invalidations. FELI can automatically allocate on-chip memory to an average of 90% of the applications working set.

Original languageEnglish
Title of host publicationEuro-Par 2011 Parallel Processing - 17th International Conference, Proceedings
Pages282-294
Number of pages13
EditionPART 1
DOIs
StatePublished - 2011
Externally publishedYes
Event17th International Conference on Parallel Processing, Euro-Par 2011 - Bordeaux, France
Duration: 29 Aug 20112 Sep 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume6852 LNCS

Conference

Conference17th International Conference on Parallel Processing, Euro-Par 2011
Country/TerritoryFrance
CityBordeaux
Period29/08/112/09/11

Keywords

  • Chip MultiProcessors
  • Scratchpad on-chip memories
  • page migration

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'FELI: HW/SW support for on-chip distributed shared memory in multicores'. Together they form a unique fingerprint.

Cite this