an integrated ecc and redundancy repair scheme for memory reliability enhancement

16
An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement National Tsing Hua University Hsinchu, Taiwan Chin-Lung Su, Yi-Ting Yeh, and Cheng-Wen Wu

Upload: step

Post on 26-Jan-2016

30 views

Category:

Documents


4 download

DESCRIPTION

An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement. Chin-Lung Su, Yi-Ting Yeh, and Cheng-Wen Wu. National Tsing Hua University H sinchu, Taiwan. Introduction. Memory cores are widely used in SOC designs They have higher density and occupy larger area - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

An Integrated ECC and Redundancy Repair Scheme for Memory

Reliability Enhancement

National Tsing Hua University

Hsinchu, Taiwan

Chin-Lung Su, Yi-Ting Yeh, and Cheng-Wen Wu

Page 2: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

2IC-DFN/08-06/cww

IntroductionIntroduction Memory cores are widely used in SOC designs

They have higher density and occupy larger area Dominate the chip yield

Their use is increasing in nano-technologies according to ITRS

Reliability is also an important issue for memory

ECC and redundancy repair are both widely used fault tolerance techniques

After production test, there may be some un-used redundancy Combine ECC and un-used redundancy

Higher yield and greater degree of fault tolerance

Page 3: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

3IC-DFN/08-06/cww

Chip Area BreakdownChip Area Breakdown

Source: International Technology Roadmap for Semiconductors (ITRS), 2001-2005

0%

20%

40%

60%

80%

100%

Area Memory

Area Reused Logic

Area New Logic

Page 4: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

4IC-DFN/08-06/cww

Typical RAM BIST ArchitectureTypical RAM BIST Architecture

RAMT

est C

ollar (M

UX

)

BIST Module

Controller

Comparator

Pattern

Generator

Go/No-Go

RAM Controller

Counter

LUT

LFSR

Microprogram

Hardwired

CPU core

IEEE 1149.1

Page 5: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

5IC-DFN/08-06/cww

Sharing Controller & SequencerSharing Controller & Sequencer

Page 6: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

6IC-DFN/08-06/cww

Typical RAM ECC ArchitectureTypical RAM ECC Architecture

RAMCb GenCb Gen

Decoder

Syndrome Gen

Corrector16 16

16 16 16

6

6 66

Single Double

Syndrome

Data Bus

Mainly for improving reliability

Page 7: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

7IC-DFN/08-06/cww

RAM Built-In Self-Repair (BISR)RAM Built-In Self-Repair (BISR)

RAM

MU

XBIST

Redundancy

Analyzer

Reconfiguration Mechanism

Sp

are E

lem

en

ts

I/O

Mainly for improving yield

Page 8: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

8IC-DFN/08-06/cww

Main Memory

Spare Memory

BIRA

BIST

Wrapper

Q

D

A

A Power-On BISR SchemeA Power-On BISR Scheme

MAO

POR

MAO: mask address output; POR: power-on reset Source: ITC’03

Page 9: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

9IC-DFN/08-06/cww

Proposed SchemeProposed Scheme Integrated ECC and Redundancy Repair Scheme

Hard errors are repaired by physical redundancy in field

Soft-error correction ability is not harmed by hard errors

Enhance reliability

Assumptions During Error Identification phase, no other faults

may occur Error rate << system clock speed

Page 10: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

10IC-DFN/08-06/cww

Phases of Proposed SchemePhases of Proposed Scheme

Page 11: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

11IC-DFN/08-06/cww

Error Identification PhaseError Identification Phase Write back process

Write the corrected data back to memory Read data from the same address

Soft error may be eliminated with this process

Assume that no other errors may occur

After error identification “Hard repair phase” for a hard error/fault “Fault-free phase” for a soft error

Page 12: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

12IC-DFN/08-06/cww

Hard Repair PhaseHard Repair Phase Repair this hard fault with spare

Map the faulty word to redundant word Write the corrected data into redundant word

Hard fault location In main memory: follow the above procedure In redundant memory: mark the faulty redundant

element

During this phase, memory cannot be accessed Idle mode

Hard fault is removed after this phase Reliability and MTTF is increased

Page 13: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

13IC-DFN/08-06/cww

Experimental ResultsExperimental Results Technology: TSMC 0.25um CMOS process

The redundant memory consists of eight spare rows and four spare columns

Memory Size ECC BIST RC Total (%)

32K x 32 1,084 2572 824 0.52

32K x 64 1,502 3895 824 0.48

32K x 128 2,033 6434 824 0.41

16K x 64 1,391 3855 783 0.77

8K x 128 1,955 6396 728 1.16

TotalMemory Area+BIST+RC

ECC

Page 14: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

14IC-DFN/08-06/cww

Experimental Results (cont.)Experimental Results (cont.)

Memory Size Area (gates) MTTF (hours) Cost

8k X 32 !ECC 195413 321.74 613.6

SEC 220862 37147 5.96

SECP 222030 39786 5.58

4k X 64 !ECC 171522 344.36 498.1

SEC 192610 28010 6.88

SECP 194153 30834 6.30

2k X 128 !ECC 157622 359.53 438.4

SEC 176159 20681 8.52

SECP 178380 23629 7.54

Cost = Area / MTTF

!ECC: Without ECCSEC: With SEC/DED ECCSECP: Proposed Scheme

Area = Memory + ECC + BIST

Page 15: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

15IC-DFN/08-06/cww

Reliability ImprovementReliability Improvement

8K x 64 memoryr+c = 12

Page 16: An Integrated ECC and Redundancy Repair Scheme for Memory Reliability Enhancement

16IC-DFN/08-06/cww

ConclusionsConclusions An integrated ECC and redundancy repair

scheme is proposed Enhancing memory reliability and MTTF

Low area overhead Integrating ECC Controller with BIST

No timing penalty in normal operation

Cost-effective way for reducing the effect of parametric defects