Flexible ECC Management for Low-Cost Transient Error Protection of Last-Level Caches

Jeongkyu Hong, Soontae Kim

Research output: Contribution to journal › Article › peer-review

7 Scopus citations

Abstract

Conventional error-correcting code (ECC) schemes for caches rely on a fixed mapping between cache data words and ECC check bits, and on a fixed ECC word granularity. This leads to inefficient use of the ECC check bits. We propose to manage the check bits flexibly for low-cost error protection of last-level caches. The proposed ECC schemes work at the word level, whereas conventional ECC schemes work at the cache line or set level. The proposed schemes protect only dirty words with ECC check bits, using a flexible mapping. Moreover, the proposed schemes use variable ECC word granularities: dirty (modified) words that are unlikely to be modified further before eviction are collectively protected with a larger ECC word granularity. The proposed schemes reduce DRAM and data bus energy overheads by 28% and 45%, respectively, with the same area overhead as previously proposed competitive schemes. The energy reductions are larger for multicore systems, without noticeable performance degradation.
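
The benefit of a larger ECC word granularity can be illustrated with simple arithmetic. The sketch below assumes a Hamming SEC-DED (single-error-correct, double-error-detect) code, which the abstract does not name; the function names and the chosen granularities are hypothetical and serve only to show how check-bit overhead shrinks as more data bits share one code word.

```python
def secded_check_bits(data_bits: int) -> int:
    """Check bits for a Hamming SEC-DED code over `data_bits` data bits.

    Assumption: SEC-DED is used for illustration only; the abstract does
    not specify the code used by the proposed schemes.
    """
    r = 1
    # Smallest r such that 2^r >= data_bits + r + 1 (single-error correction).
    while (1 << r) < data_bits + r + 1:
        r += 1
    # One extra overall parity bit adds double-error detection.
    return r + 1


def overhead(data_bits: int) -> float:
    """Check bits per data bit for one ECC word of `data_bits` bits."""
    return secded_check_bits(data_bits) / data_bits


if __name__ == "__main__":
    # Hypothetical ECC word granularities, in bits.
    for k in (64, 128, 256, 512):
        print(f"{k:4d} data bits: {secded_check_bits(k):2d} check bits, "
              f"overhead {overhead(k):.3%}")
```

Under this assumption, a 64-bit ECC word needs 8 check bits while a 512-bit ECC word needs 11, so grouping words that are unlikely to change again lets the same check-bit storage cover far more data.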

Original language: English
Article number: 7368208
Pages (from-to): 2152-2164
Number of pages: 13
Journal: IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Volume: 24
Issue number: 6
State: Published - Jun 2016

Keywords

  • Cache memory
  • error protection
  • reliability
  • transient error
