an analysis of virtual channel memory and enhanced memories technologies

38
San Jose San Jose January 23-24, 2001 January 23-24, 2001 Taipei Taipei February February 14-15, 2001 14-15, 2001 An Analysis of Virtual An Analysis of Virtual Channel Memory and Enhanced Channel Memory and Enhanced Memories Technologies Memories Technologies Bill Gervasi Technology Analyst Chairman, JEDEC Memory Parametrics Committee

Upload: hansel

Post on 10-Jan-2016

18 views

Category:

Documents


0 download

DESCRIPTION

An Analysis of Virtual Channel Memory and Enhanced Memories Technologies. Bill Gervasi Technology Analyst Chairman, JEDEC Memory Parametrics Committee. Agenda. Next Generation PC Controllers Concerns With Standard SDRAM Cached SDRAMs: Enhanced & Virtual Channel - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

San JoseSan Jose January 23-24, 2001 January 23-24, 2001 TaipeiTaipei February 14-15, 2001 February 14-15, 2001

An Analysis of Virtual Channel An Analysis of Virtual Channel Memory and Enhanced Memories Memory and Enhanced Memories

TechnologiesTechnologiesBill Gervasi

Technology Analyst

Chairman, JEDEC Memory

Parametrics Committee

Page 2: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

AgendaAgenda

• Next Generation PC Controllers• Concerns With Standard SDRAM• Cached SDRAMs:

Enhanced & Virtual Channel

• Controller Complexity vs DRAM type• Pros and Cons of Cached DRAMs• Conclusions & Call to Action

Page 3: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Next Gen PC ControllersNext Gen PC Controllers

R1

ReqArb

R2

R3

PathArb

DataFIFOs

CommandFIFOs

DRAM

I/O

Page 4: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

RequestorsRequestors

• CPU port: Cache fills dominate– DRAM frequency =

1/3 to 1/5 of CPU frequency– Big L2 caches randomize memory accesses

• Graphics port: Lots of random accesses– Especially 3D rendering

• South Bus port: Mix of short, long packets

Page 5: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

SDRAM RoadmapSDRAM Roadmap

DDR II

DDR-333

DDR-266

PC-133

PC-100

SDRAM 66

The good news: ever faster cores, power manageable, simple evolutionary changes, at about the same price

The bad news: random access time has not changed appreciably, and power is higher than it needs to be

Page 6: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Concerns With SDRAMConcerns With SDRAM

1. Power

2. Latency

3. Refresh Overhead

Page 7: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

SDRAM Power vs LatencySDRAM Power vs Latency

• Active power is very high– Active on power 500X inactive off power– Encourages controllers to close pages

• But access time to a closed page is long– Row activation time + column read time

Page 8: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

SDRAM Power ProfileSDRAM Power Profile  Relative

PowerCPU ClockLatency**

Active on 100% 0 x 5 = 0

Inactive on   3 x 5 = 15

Active off   1 x 5 = 5

Inactive off 0.2% 4 x 5 = 20

Sleep  0.4% 200 x 5=1000

PowerState*

12%

4%

* Not industry standard terms – simplified for brevity** Assuming memory clock frequency = 1/5 CPU frequency

Op

enP

age

Clo

sed

Pag

e

Page 9: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

RefreshRefresh

• Gigabit generation refresh overhead– 256Mb generation is 75ns each 15.6us– 1Gb generation will be 120ns each 7.8us

• This is a 3X performance penalty

Page 10: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

An Argument for Cached An Argument for Cached SDRAM ArchitecturesSDRAM Architectures

Page 11: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

What Are Cached SDRAMs?What Are Cached SDRAMs?

Narrow I/O

channelDRAM

ARRAYWideI/O

channelx4 to x32

x256 to

x1024

SRAM

ARRAY

onchip

Page 12: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Cached DRAM architectures can address SDRAM’s key limitations

However, only commodity memories are affordable for mass market use

I hope to encourage the adoption of cache for all standard future SDRAMs

Page 13: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Cached SDRAM SolutionsCached SDRAM Solutions

1. Power: Encourage closed page use

2. Latency: Fast access to closed pages

3. Refresh: Background operation

Page 14: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Cached DRAMCached DRAM

• SRAM cache before DRAM array– Much like CPU onchip caches– Exploit wide internal buses for fast block

transfer of DRAM to SRAM and back– Allow DRAM core to be deactivated while…– … I/O performed on SRAM

Page 15: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Two Leading Cached DRAMsTwo Leading Cached DRAMs

• Enhanced SDRAM– SRAM in sense amps– Direct mapped into array

• Virtual Channel SDRAM– SRAM in periphery– DRAM associativity maintained by controller

Note: Other cached DRAM architectures exist, however none have been proposed as a commodity DRAM standard.

Page 16: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Enhanced & Virtual ChannelEnhanced & Virtual Channel

DRAM array

ChannelSRAM

(1Kbx17)

I/O

I/O

Segment SenseAmps

SenseAmps & SRAM

(4Kbx4)

Page 17: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Basic FunctionsBasic Functions

• Enhanced– Activate– Read (& cache fill)– Write (array & cache

update)– Precharge– Auto Refresh– Self Refresh

• Virtual Channel– Activate– Prefetch (cache fill)– Read (from cache)– Write (cache update)– Restore (array update)– Precharge– Auto Refresh– Self Refresh

Note: Identical to standard SDRAM

Page 18: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Power FactorsPower Factors

• Enhanced– Page open to read to

cache– Closed while reading– Open during writes

• Virtual Channel– Page open to read to

cache– Closed while reading– Closed during writes– Reopen for restore

Page 19: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Latency FactorsLatency Factors

• Enhanced– Read automatically fills

cache– Reads on closed page

permitted– No write latencies– Masking increases write

recovery time (normal for SDRAM architectures)

• Virtual Channel– Prefetch for cache fill– Reads on closed page

permitted– Writes on closed page

permitted– Reactivation to restore– Inherently RMW – no

penalty for masking

Page 20: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Refresh FactorsRefresh Factors

• Enhanced– Activation not allowed– Permits reads during auto

refresh

• Virtual Channel– Activation not allowed– No reads during auto

refresh

Page 21: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Let’s Compare the Power and Let’s Compare the Power and Activity Profiles of SDRAM, Activity Profiles of SDRAM,

Enhanced, and Virtual Enhanced, and Virtual ChannelChannel

Page 22: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Closed Page SDRAM ProfileClosed Page SDRAM Profile

NOP ACT R R PRE NOP

NOP ACT W W PRE NOP

Command Activity

Power Profile

Power Profile

Command ActivityLower Power

Higher Power

Page 23: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Enhanced ProfileEnhanced Profile

NOP ACT R-PRE R R NOP

NOP ACT W W PRE NOP

Lower Power

Higher Power

Command Activity

Power Profile

Power Profile

Command Activity

Page 24: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Virtual Channel ProfileVirtual Channel Profile

NOP ACT F-PRE R W NOP

NOP R W ACT RST NOP

Lower Power

Higher Power

Command Activity

Power Profile

Power Profile

Command Activity

Page 25: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Power Profile FactorsPower Profile Factors

• Standard L3 cache hit analysis

• Profile of memory operations affected by randomness of accesses– Balance of activations & precharges,

reads & writes– Requestor channel profile depends on

application – games, video, or office app?

Page 26: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Cache Entry SizeCache Entry Size• Max words burst per hit before reload needed

Entry size bus width burst length

• Miss impacts performance & power• Affects controller association overhead• Die size impacted by entry size

Burst length = 4 x4 x8 x16

Enhanced 256 128 64

Virtual 64 32 16

Page 27: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

RandomnessRandomness

• Enhanced controllers replace any entry– Physical memory locality determines

• Virtual controllers can lock channels, e.g.– Screen refresh channel never replaced– Priorities can be assigned to channels– Weighted replacement algorithms possible

Page 28: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Enhanced Controller IssuesEnhanced Controller Issues

• Direct mapping restricts complexity

• 4 physical banks x 4 cache lines

• Comparitor for cache hit

Page 29: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Virtual Controller IssuesVirtual Controller Issues

• Tags required to do closed page writes– Keep track of where to restore

• 4 physical banks x 16 channels

• CAM needed to exploit large # channels– CAM routing challenging – metal over array?

Page 30: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Complexity vs ThroughputComplexity vs ThroughputTHROUGHPUT

COMPLEXITY

VCM

EMS

SDRAM

One entry hit logic

Closed bank

Visible refresh

2 cmd FIFO

1 entry per phys bank

Write flush

Direct map

3 cmd FIFO

4 entries per phys bank

Scheduled writeback

Hidden refresh

6-8 cmd FIFO

Full CAM16 channels per phys bank * 4 phys banks

Scheduled writeback with reorder

Gates 10-20K 35-50K 100-120K 200+K

Page 31: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Brian Davis @ University of Michigan

analysis

Page 32: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Brian Davis @ University of Michigan

analysis

Page 33: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Virtual ChannelVirtual Channel• PROS

– Best performance headroom from cache associativity

– Flexible channel maps– Saves power on reads and

writes– More hits in random

access environment– Pin compatible in SDR &

DDR I configurations

• CONS– Die penalty from 7-13%– Incompatible command

set• Cannot have one die for

standard and virtual

– Refresh blocks access– Simple controller

performance < “standard”– Small cache entry sizes

cause more replacement– 2 banks = higher power

Page 34: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Enhanced MemoryEnhanced Memory• PROS

– Die penalty from 2-5%– Compatible command set

• Same die supports standard or enhanced

– Optional features– Simple to implement– Pin compatible– No performance drop even

for simple controller– Refresh does not block

reads– 4 banks = lower power

• CONS– Royalty to DRAM suppliers– Performance boost lower

than VCM max– Less flexible maps– No power savings on writes

Page 35: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

User PerspectiveUser Perspective

• Users want a standard cached DRAM architecture – solves real system problems– Lower power for closed bank policy– Lower latency– Hidden refresh

• Costs cannot exceed 5% over non-cached

Page 36: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

ConclusionsConclusions• Cached DRAM is highly desirable

– Both Enhanced and Virtual offer significant benefits

• VCM requires more controller complexity– Low end not well served

• Next “commodity” must fit all markets• Enhanced preferred over Virtual Channel

– Better chance at becoming “standard”– Compatibility, simple transition lowers adoption risk– Like L2 went direct set associative, maybe VCM is a

good fit for DDR III in 2006/2007

Page 37: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Call to ActionCall to Action

• Act now!

• Demand vendors provide cached DRAM

• Express support for Enhanced style

• Commit controller designs

Industry likely to decide in March 2001

Page 38: An Analysis of Virtual Channel Memory and Enhanced Memories Technologies

Thank YouThank You