pushing cmos to the limit - university of california,...

Pushing CMOS to the Limit

Ali M. Niknejad

Berkeley Wireless Research Center

University of California, Berkeley

Outline

Motivation

CMOS Device Scaling

Effect of scaling on:

Gain, Noise, Distortion, Power

Passive devices

Conclusion

MOTIVATION

Wideband High DR Front-End

Broadband, high dynamic range, reconfigurable front-end

Can we design such a front-end using 45nm CMOS?

32nm? When is the party over?

The “Last Inch”

Wireless HD Video

Required Link Budget

Need +10dBm power, +10dB system NF zero link

margin

Low Power Phased-Array

A fully-integrated low-cost Gb/s data communication using 60 GHz band.

10 Gb/s at 100mW per channel should be possible!

Distributed MIMO

Automotive Radar

Short range radar for

parking assist, object

detection

Long range radars for

automatic cruise

control, low visibility

(fog) object detection,

impact warning

Long range vision:

automatic driver

mm-Wave Imaging … THz?

Use of microwave scattering from

objects to predict image

A low-cost, noninvasive solution

(meV versus keV)

Active and Passive Microwave

Imaging

Ultrawideband imaging

THz detection … ?

TeraView Ltd

SCALING TO ATOMIC LIMITS

Process Cross Section

Typical process offers isolated high fT PMOS/NMOS

device, multiple Cu metal layers, high quality MIM caps

(or enough layers to realize it in standard metalization)

Scaling Rules

Originally formulated by Dennard (JSSC 1974)

Lgate scaling improved speed of device

Tox gate oxide scaling (now at atomic limits)

Xj junction scaling (keep gate control)

VDD supply scaling (reduce fields for better SCE)

Show stoppers:

Gate leakage

Parasitic resistance

Mobility degradation

CMOS Technology Scaling

CMOS technology is getting faster and faster…

But lower supply voltage hurts analog. At best,

constant dynamic range requires quadratic increase in

current with voltage scaling.

Vdd fT

Lmin

ITRS Scaling

CMOS Technology Trends

130nm 90 nm 65nm 45nm

Ft 70 GHz 100 GHz 150 GHz 280 GHz

Fmax 135 GHz 200-300 GHz 300 GHz 560 GHz

NFmin

(60 GHz)4-5 dB 3-4 dB 3 dB 2.3 dB

Measured ITRS+projection

dsschggggdmg

tmax

gg

mt

g)RrR()CCg(R2

ff

C2

gf

From Bulk to Multi-Gate

Scaled CMOS is not your uncles Si + SiO2 + Al:

Using materials from the entire periodic table!

45nm Technology Node:

Bulk CMOS

Metal gate/high-K (low leakage)

Strain for mobility enhancement

32nm likely to be a bulk device

22nm is going to be a multi-gate device

TEENY TINY BUT VERY FAST

fmax vs. Finger Width

Increasing gate resistance

Gate Drain

Source

fmax in GHz

Mason’s Unilateral Gain

Apply lossless feedback to unilateralize the 2-port

Properties of U:

If U > 1, the two-port is active. Otherwise, if U 1, the two-port is

passive.

U is the maximum unilateral power gain of a device under a lossless

reciprocal embedding.

U is the maximum gain of a three-terminal device regardless of the

common terminal.

U is very sensitive to any loss in the 2-port. Good way to test

accuracy of model and measurements.

VGS = 0.65 V

VDS = 1.2 V

IDS = 30 mA

W/L = 100x1u/0.13u

130-nm Device Performance

90nm Technology Performance

Use short fingers to minimize

gate resistance and substrate

resistance.

Can easily cascode devices

(shared junc).

Compared to 130nm, 90nm

technology has

~ 1 dB higher MSG

~1-2 dB lower noise

fmax ~ 180 GHz (compared to

135 GHz in 130nm)

“Roundtable” Layout

MSG 60GHz = 8.5 dB, NFmin ~ 3-4 dB

fmax= 300 GHz (extrapolated), fT= 100 GHz

Highest reported fmax/fT=3 ratio for CMOS!

104 GHz Oscillator (90nm)

Core Rountable Transistor

Output power -8 dBm

DC Power < 7 mW

Highest reported VCO efficiency > 100 GHz

Pre

sen

ted

at

ISS

CC

200

7

104 GHz CMOS Amplifier (90nm)P

resente

d a

t IS

SC

C 2

007

Operating above Ft (100 GHz)

Highest reported CMOS amp

Designed with non-optimal device

mm-Wave Cascode Device

Cascode HF Bilaterization

MSG of cascode close to U of common-source device

below 30 GHz

At 60 GHz MSG of cascode same as common-source …

device is no longer unilateral

Cascode Gain Enhancement

Tune out parasitics or design as a two-stage

amplifier.

Cannot use shared junction layout.

Unilateralization of Cascode Device

Possible to simultaneously cancel the real and imaginary of

Y12 using a gate inductance on cascode device.

Substantial gain enhancement possible (MSG 8 dB 20 dB).

SMALL SIGNALS: NOISE

Limits for RF Noise

MOSFET noise at high frequency limited by thermal noise

and correlated gate noise.

Using the Pospiezalski noise model, the lowest possible

device noise figure is given by

Using short transistor fingers to minimize the gate

resistance, Rg, noise, this is bounded by

gm

T

Rgf

fF

21min

5

121min

Tf

fF

Minimum Achievable Noise

NFmin (dB)

fT (GHz)

130nm

90nm

65nm

mm-Wave Noise Measurements

Broadband Techniques

LsZin

-A

LsZin

Cpad

Rf

Cpar

LsZin

A

ZL

Ls

Cgs

CpadZin

[A. Liscidini et al. VLSI’05] [D. Allstot et al. RFIC’04]

[F. Brucoleri et al. ISSCC02]

36

Rs

M1

M3

M4

M2

R1 RL

Vx

Vy

Vout

Vs

Noise Cancellation

in2

also in [ C-F, Liao et al. CICC’05 ] Reduce M1 noise by proper choice of gm3/gm4

LARGE SIGNALS: DISTORTION

Important MOS Non-Linearity

Square law high field/short channel effects

Mobility degradation

Universal mobility curve (phonon and surface

scattering) Vgs

Velocity saturation & ballistic transport Vds

Body effect (non-uniform doping)

Output impedance non-linearity (and interaction

with gm)

Cgs for high input swings

Mobility with Coulomb Scattering

2 4 6 8 100

1x102

2x102

3x102

4x102

5x102

6x102

123K

173K

300K

Ho

le M

ob

ility(c

m2/V

s)

Qinv(x1012

/cm2)

data model model w/o C.S.

inb

b

effeff

eff

QQ

QUCEUBEUA

2

0

1

Single Equation I-V Curve: “Smoothing Approach”

Since we have good models for weak/strong

inversion, but missing moderate inversion, we can

“smooth” between these regions and hope we

capture the region in between (BSIM 3/4 do this

too)

The factor θ models short-channel effects and η

models the weak-inversion slope

X

XKVVfI TGSDS

1)(

2

kT

VVq TGS

eq

kTX 2

)(

1ln2

Limiting Behavior

In strong inversion, the exponential

dominates giving us square law

In weak inversion, we expand the ln

function

TGS

kT

VVq

VVeq

kTX

TGS

2

)(

1ln2

X

XKVVfI TGSDS

1)(

2

xx 1ln

kT

VVq

DS

TGS

eq

kTKKX

X

XKI

)(2

22

21

kT

VVq TGS

eq

kTX 2

)(

2

I-V Derivatives

X

XKVfIDS

1)(

2

VXV Xff VVXVXXVV XfXff 2

VVVXVVVXXVXXXVVV XfXXfXff 33

2)1(

)2(

X

XXKfX

3)1(

2

XKfXX

4)1(

6

XKfXXX

21

1

s

XV

21

1

2

sskT

qXVV

31

1

2

2

2

ss

ss

kT

qXVVV

Distortion Calculation

Source: Manolis Terrovitis Analysis and Design of Current-Commutating CMOS Mixers

44

0 0.2 0.4 0.6 0.8 1-0.02

-0.01

0

0.01

0.02Drain current and its derivatives

VGS

(V)

(A),

(A

/V),

(A

/V2)

0 0.2 0.4 0.6 0.8 1-0.1

-0.05

0

0.05


VGS

(V)(A

/V3)

IDS

gm

gm

gm

0 0.2 0.4 0.6 0.8 1-20

-10

0

10

20

30

VGS

(V)

Viip

3 (

dB

v)

Taylor series representation

MOSFET Linearity Sweet Spot

Transconductor

TOHvg

vg

vg

TOHvV

Iv

V

Iv

V

Ii

gsm

gsm

gsm

gs

GS

DSgs

GS

DSgs

GS

DSd

..!3

"

!2

'

..!3

1

!2

1

32

3

3

32

2

2

"83

m

mIIP

g

gV

0 0.2 0.4 0.6 0.8 10

0.003

0.006

0.009

0.012


VGS

(V)

(A),

(A

/V),

(A

/V2)

IDS

gm

gm

0 0.2 0.4 0.6 0.8 10

1.25

2.5

3.75

5x 10

-3 Drain current and its derivatives

VGS

(V)

(A),

(A

/V)

IDS

gm

3rd order nonlinearity

i d

Vgs

Sweet spot

i d

Vgs

[0.13um]

“Multiple Gated Transistors” Notice that we can use two

parallel MOS devices, one

biased in weak inversion

(positive gm3) and one in strong

inversion (negative gm3).

The composite transistor has

zero gm3 at bias point!

Source: A Low-Power Highly Linear Cascoded Multiple-Gated Transistor CMOS

RF Amplifier With 10 dB IP3 Improvement, Tae Wook Kim et al., IEEE MWCL, Vol.

13, NO. 6, JUNE 2003

MGTR Linearization Range

VgsVgs1 Vgs2

Rs

M1

M3

M4

M2

R1 RL

Vx

Vy

Vout

Vs

Rs

M1

M3

M4

M2

R1 RL

Vx

Vy

Vout

Vs

47

0 0.2 0.4 0.6 0.8 1-1.5

-1

-0.5

0

0.5

1

1.5

2

Vgs (V)

2n

d d

eri

va

tive

of g

m

MGTR

single transistor

weak inversion bias

strong inversion bias

composite transistor

Noise and Distortion Cancellation

Vgs1 Vgs2

M3/M4 perform both IM3 cancellation and M1 noise

cancellation

[ Tae W. et al. IEEE MWCL’06 ]

Volterra Series Analysis

Vs

mgmg mg LmSmSfundout RgVBgVAV 3141,

][ 3

3

34

3

33, LmSmSoutRgVBgVAV rd

]66

[ 33

143

1 Lm

Sm

S Rg

VBg

VA

LmSmS RgVBBgVAA ][ 3

3

214

3

21

LmSmSfundout RgVBgVAV 3141,

][ 3

3

34

3


]66

[ 33

143

1 Lm

Sm

S Rg

VBg

VA

Both M3/M4 bias and size are

governed by the first 2

cancellations

Residual Vout IM3 due to

second-order interaction

could be substantial

m)

LmSmS RgVBBgVAA ][ 3

3

214

3

21

Single-ended “Differential Pair”

A2/B2 related to gm’ of the common-gate transistor

Single-ended PMOS/NMOS pair zeros out gm’

3

3

34

3


66 33

143

1 Lm

Sm

S Rg

VBg

VA

LmSmS RgVBBgVAA 3

3

214

3

21

Silicon Implementation

IFX 0.13um

Regular VT transistors

MIM caps

Inductorless

S-parameters and Noise Figure

Bandwidth

(S11<-8.5dB)0.8~2.1GHz

Gain (S21)>8.6dB

(6dB more on chip)

Noise Figure (NF) <2.6dB

52

WCDMA IIP3 Test

WCDMA blocker

compliant test

P1dB=-12dBm

53

IIP3/NF bias sensitivity

50mV

POWER IN NUMBERS

Power Amplifiers

Peak Output Power

Efficiency

Power Gain

Amplifier Linearity

Stability over VSWR

Ability to transmit into an unknown/varying

load

Power Control

Step size, range

CMOS Oxymoron of a PA

Is CMOS a viable technology for power amplifiers?

Low voltage means less power available for a given

device width

Higher current from given device increases sensitivity

to series loss

Low supply voltage means high impedance

transformation ratio (and thus higher insertion loss)

New power amplifiers should be more

linear and operate more efficiently over a

wide range of output power!

Cal Tech DAT

Use virtual grounds wisely to turn 1:1 coupled lines into a

transformer loop.

Power Combining and Control

Use transformer to

perform efficient

power combining

Can also use structure

for efficient power

back-off to improve

average power

efficiency

At moderate back-off

(6 dB), efficiency

close to peak level

Transformer FOM

Unlike inductor Q factor, there

is no obvious “silver bullet”

FOM for transformers.

For power combining

applications, the maximum

power gain (bi-conjugate

match) has been used as a

figure of merit

For a simple 1:1 transformer,

the maximum gain is a function

of only the Q and K factors

Transformer Efficiency

Notice that relatively low insertion loss is possible with moderate

on-chip Q and K factors, thus allowing fully-integrated

transformers.

CMOS “RF” Prototype

Each stage is a pseudo-differential cascode amplifier

Can scale output power by adding more stages to transformer

Use dual thick metal layers to implement high “K” low loss

transformer (80% efficient)

Measured Linear PA Performance

Fully 130nm CMOS integrated

prototype

27 dBm (30% efficiency)

Linear mode: 24 dBm (25%)

No external passives

Measurements with EDGE Signals

Freq = 2.4-GHz, Peak Power Mode

Measurements with 802.11g Signals

Pout = 14.5-dBm EVM = 4.48%

Freq = 2.4-GHz, Peak Power Mode

Table of PA Performance

Technology 0.13-μm RF CMOS

Supply voltage 1.2-V

DC Current 114-mA

P-1dB 24-dBm

Drain efficiency 25%

Saturated Power 27-dBm

Drain efficiency 32%

Improved Layout…

Using two primary windings

Improved coupling

Lower loss (current crowding at edge of conductors)

More symmetric primary/secondary for optimal power

transfer

Prototype PA in Digital CMOS

Four stage

differential

design

Single-ended

50Ω output

Thin oxide

90nm

transistors

Measured Output Power

Peak power is 24 dBm. Good match to simulation up to

1dB compression point.

24 dBm

Measured Efficiency

27%

THE FATE OF PASSIVE ELEMENTS

Passive Devices Don’t Scale

For fixed frequency, area of inductors roughly constant

Multiple metal layers allow more compact inductors but

high Q inductor still single layer (top thick layer)

Microprocessor or Inductor?

It is widely appreciated that the area of an inductor

in today’s CMOS is equivalent to a powerful CPU

Low frequencies: Get rid of them

Use broadband circuit techniques

Analog design = RF design … use feedback

Use linearity enhancement schemes for out of band

blockers

High frequencies:

Area is still reasonable

T-line versus lumped?

What to do with all that metal?

Slow wave structures

Scaling Helps and Hurts

Can build very high density caps

Cu and thick metal stacks were very

exciting (130nm, 90nm)

Metals are getting thinner (low K)

Inductors and T-lines are getting worse

Inductor or T-Line?

LC resonators have good Q factor … varactors are problematic

above 40GHz

High Z0 quarter wave resonators loop inductors

Optimum Taper Profile

Andress and Ham [JSSC

2005] showed that a tapered

resonator has improved Q

Assumed a constant Z0 line.

What if you remove this

constraint?

Result looks like an LC tank!

Transformers Scale to mm-Waves

Isolation, impedance

matching, biasing …

Good insertion loss

Compact layout

compared to T-lines

Conclusion

CMOS technology offers

Fast transistors (low noise)

Low supply voltage

Poor linearity

Circuit techniques can overcome many limitations

in linearity and power

Technology scaling mostly beneficial for mm-

wave

High fmax transistors, reduced power consumption

Increase in parasitic resistances can be detrimental to

fmax scaling

Si technology is the new playing field for

electromagnetic structures … room for creativity!

Acknowledgements

DARPA TEAM Program, C2S2

STMicroelectronics & Infineon

chip fabrication and support

BWRC Member Companies

pushing cmos to the limit - university of california,...

Documents