mahim mishra and seth c. goldstein carnegie mellon university

55
International Test Conference Charlotte, NC, Sep 30-Oct 2, 2003 Defect Tolerance at the End of the Roadmap Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Upload: deiondre

Post on 05-Jan-2016

39 views

Category:

Documents


0 download

DESCRIPTION

International Test Conference Charlotte, NC, Sep 30-Oct 2, 2003 Defect Tolerance at the End of the Roadmap. Mahim Mishra and Seth C. Goldstein Carnegie Mellon University. Purpose. Future technologies: EUV Lithography and Chemically Assembled Electronic Nanotechnology - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

International Test ConferenceCharlotte, NC, Sep 30-Oct 2, 2003

Defect Tolerance at the End of the Roadmap

Mahim Mishra and Seth C. Goldstein

Carnegie Mellon University

Page 2: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Purpose

• Future technologies: EUV Lithography and

Chemically Assembled Electronic Nanotechnology– Single-digit nanometer feature sizes– Extreme device densities

• Problem: much higher defect rates– Defect tolerance becomes key issue

• We outline a defect tolerance strategy– novel testing method

Page 3: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Talk Outline

• Introduction – need for defect tolerance

• Outline of defect tolerance strategy– Testing requirements

• Description of proposed test strategy

• Evaluation using simulations

Page 4: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Towards the end of the (ITRS) Roadmap

• Feature sizes approach

single-digit nanometers

• Physical and economic limits

to scalingRed

Brick Wall!

• New Technologies– Chemically Assembled Electronic Nanotech. (CAEN)– Extreme Ultraviolet (EUV) Lithography

Page 5: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

New technologies: caveats

• Extremely high defect densities– As high as 10% of fabric logic and routing resources

• Cannot throw away defective fabrics– Defect-free yield: close to 0%

• Must find a way to use defective fabrics

Page 6: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Part of the solution: reconfigurability

• Regular, periodic

computing fabric– e.g., Field-Programmable

Gate Arrays (FPGAs)

• Helps achieve defect tolerance

• User programs as circuits

Page 7: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

New challenges for defect tolerance

• New testing techniques to locate all the defects – generate a defect map

• New, quick, place-and-route algorithms – utilize the defect map

• Must scale with fabric size and number of defects

Page 8: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Proposed tool flow

Fabric

(Async.) CircuitNetlist

Tester

“Soft”Config.

DefectUnaware

Place-and-Route

Defect Map “Hard”Config.

Defect Aware

Place-and-Route

Page 9: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Testing to locate defects

• Required: scalable testing method to locate defects

in large reconf fabrics– Capable of dealing with large defect densities, quick

• Very different from previous FPGA testing methods

– Similar approach: Teramac custom computer at HP

• Goal of this work– Show that such a testing method is possible

– Develop some of the new, smart techniques required

Page 10: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Testing: previous methods

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

Page 11: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Testing: previous methods

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

Returnssuccess

Returnsfailure

Page 12: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Testing: previous methods

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

Returnssuccess

Returnsfailure

Page 13: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Testing: previous methods

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

Returnssuccess

Returnsfailure

Page 14: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Testing: previous methods

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

Returnssuccess

Returnsfailure

Page 15: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Testing with high defect rates

• Previous method: works for low defect rates– Uses “binary” circuits– Requires significant number of defect-free test-circuits

• Will not work for high defect rates– Each test circuit has multiple defects– Very, very few circuits with 0 defects

Page 16: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Testing: high defect rates

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

Page 17: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Testing: high defect rates

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

Returnssuccess

Returnsfailure

Page 18: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Testing: high defect rates

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

Page 19: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Dealing with high defect rates: our algorithm

Finds probabilities of being defective

Eliminates components w/ high prob.Probabilit

y-Assignme

ntPhase

Defect-Location

Phase

2 key ideas: More powerful test-circuits

More than binary info; e.g. approximate defect counts

More powerful analysis techniques

Eliminates remaining defects

Deterministic; no mistakes

Fabric “Probabilistic”Defect Map

DefectMap

Page 20: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Probability assignment: example

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

Page 21: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Probability assignment: example

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

1 2 1 2 1 2 1 3 More than binaryinformation!

Test circuits: Counter None-some-many

Page 22: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Probability assignment: example

Defect-free,unknown

Defective,unknown

Defect-free,known

Defective,known

1 2 1 2 1 2 1 3

1

2

1

3

2

1

1

2

Page 23: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Probability assignment: example

1 2 1 2 1 2 1 3

1

2

1

3

2

1

1

2

AnalysisMethod

ComponentDefect

Probabilities

Analysis methods: Sorting Bayesian

Page 24: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Probability assignment: example

DefectProbabilities

lower

higher

Mahim Mishra
FLIP HIGHER TO LOWER
Page 25: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Probability assignment: example

Removed:

&

AssumedDefective

Page 26: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Probability assignment: example

Page 27: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Defect location: example

Returnssuccess

Returnsfailure

BinaryInformation

AssumedDefective

Page 28: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Defect location: example

Returnssuccess

Returnsfailure

AssumedDefective

Page 29: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Defect location: example

Returnssuccess

Returnsfailure

AssumedDefective

Page 30: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Defect location: example

Returnssuccess

Returnsfailure

AssumedDefective

Page 31: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Final defect map

Page 32: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Test circuits for prob. assignment

• Idealized counter circuits– Conceptual circuits– Return defect counts, upto threshold t– Higher threshold more powerful circuit

• None-some-many circuits– Tell if none, some or many defects– Less powerful than counters, easier to realize– e.g., our LFSR-based design

Page 33: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

None-some-many circuits

manynoat most 1 of

(b), (c) wrong?

some

no

yes

(a) right?

none

yes

(a)

(b)

(c)

Page 34: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Analysis methods

• Sorting analysis– Example

• Bayesian analysis– See paper

• Comparison

Page 35: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Analysis methods: Sorting analysis

1 2 1 2 1 2 1 3

1

2

1

3

2

1

1

2

Page 36: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Analysis methods: Sorting analysis

1 2 1 2 1 2 1 3

1

2

1

3

2

1

1

2

5

Page 37: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Analysis methods: Sorting analysis

1 2 1 2 1 2 1 3

1

2

1

3

2

1

1

2

5

2

Page 38: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Analysis methods: Sorting analysis

1 2 1 2 1 2 1 3

1

2

1

3

2

1

1

2

5

2

Higher probabilityof being defective

Page 39: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Analysis methods: comparison

• Ease of implementation– Bayesian: harder to implement (restricted circuits)– Sorting: no restrictions

• Complexity– Bayesian: O(n2) best case– Sorting: O(n logn )

• Quality of results: ~10% better for Bayesian

Page 40: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Algorithm: discussion

• Minimally-adaptive algorithm– Minimal rerouting required at test time

• No false negatives– After defect-location phase, all defects identified

• Algorithm complexity:– circuit size k– defect rate p– k × k fabric – requires O(kp) test-circuit orientations

Page 41: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Evaluation

• Quality metric: recovery– percentage of defect-free components marked

not defective

• Each simulated test circuit: 100 components

• Simulated defect rates: 1 to 13%– 1 to 13 defects on average per test circuit– Results valid for circuits with this many defects

Page 42: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Evaluation results: comparison

Ctr

Nsm

sorting

Bayesian

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

5

4

3

2

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

5

4

3

2

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

inf

10

6

3

1

0

10

20

30

40

50

60

70

80

90

100

1 3 5 7 9 11 13Defect Rate (%)

Re

co

ve

ry (

%)

inf

10

6

3

1

Page 43: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Eval.: counter circuits, Bayesian anal.

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

inf

10

6

3

1

Page 44: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Evaluation results: comparison

Ctr

Nsm

sorting

Bayesian

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

5

4

3

2

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

5

4

3

2

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

inf

10

6

3

1

0

10

20

30

40

50

60

70

80

90

100

1 3 5 7 9 11 13Defect Rate (%)

Re

co

ve

ry (

%)

inf

10

6

3

1

Page 45: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

5

4

3

2

Eval.: n-s-m circuits, Bayesian anal.

Each line: represents a different number of

pieces into which the large LFSR is broken

Page 46: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Evaluation results: comparison

Ctr

Nsm

sorting

Bayesian

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

5

4

3

2

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

5

4

3

2

0

10

20

30

40

50

60

70

80

90

100

1 6 11Defect Rate (%)

Re

co

ve

ry (

%)

inf

10

6

3

1

0

10

20

30

40

50

60

70

80

90

100

1 3 5 7 9 11 13Defect Rate (%)

Re

co

ve

ry (

%)

inf

10

6

3

1

Page 47: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Evaluation results: clustered defects

• So far: uniformly

distributed

defects

• In VLSI: defects

often clustered

0

10

20

30

40

50

60

70

80

90

100

Rec

ove

ry (

%)

2 3

Normally TightlyDistributed Clustered

• Clustered defects higher recovery

Page 48: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Conclusions

• New manufacturing paradigm– Reduced manufacturing complexity and cost– Increased post-fabrication testing and defect-tolerant

place-and-route effort

• Scalable testing with high recovery is possible

• Defect tolerance is a major challenge

• Locate defects and configure around them

Page 49: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Backup slides

• Fabric architecture

• Algorithm

• Probability calculation

• Wave testing

• Individual results graphs

Page 50: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

nanoFabric architecture (ISCA’01)

Con

trol

, con

figu

rati

on &

defe

ct m

appi

ng s

eed

cluster

long-lines

nanoBlock

switch-block

Page 51: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Algorithm (Part 1: prob. assignment)

1 mark all fabric components not suspect

2 for iteration from 1 to N1 do

3 while termination condition not met do

4 for all fabric components marked not suspect do

5 configure components into type 1 test circuits using a

particular tiling

6 compute defect probability for each component using

circuit results from current iteration

7 done

8 done

9 mark components with high defect probability as suspect

10 done

Page 52: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Algorithm (Part 2: defect location)

11 for iteration from 1 to N2 do12 while termination condition not met do13 for all fabric components marked not suspect or not defective do14 configure components into type 2 test-circuits using a particular tiling15 for all circuits with correct output do16 mark all circuit components not defective17 done18 done19 done20 mark some suspect components not suspect21 done

Page 53: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Analysis methods: Sorting analysis

• Let component c1 have defect counts c11, c12, …,

c1n, and c2 have counts c21, c22, …, c2n, for n circuits

each

• Prob_defect(c1) > Prob_defect(c2) if Σc1i > Σc2i

• Complexity: O(n logn) for each probability calculation step

Page 54: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Probability calculation

A is the event of the component being good, and B is the event of obtaining the defect counts c1, c2, ….for it,

If k is the circuit size and p is the fabric defect rate,

Page 55: Mahim Mishra and Seth C. Goldstein Carnegie Mellon University

Scaling with fabric size

• Testing proceeds in a wave through fabric• darker areas test and

configure their adjacent lighter ones.

• Total time required: time for this wave to traverse the fabric• square root of the fabric

size.