research in reconfigurable computing: an industrial ... · research in reconfigurable computing: an...
TRANSCRIPT
Research in Reconfigurable Computing: An Industrial Perspective
Juanjo Noguera
Xilinx Research Labs
September 2009
Page 2 © Copyright 2009 Xilinx
Agenda
Why Programmable Logic?
Impact of Research in Industry
Quick Update on Xilinx Technology
Page 3 © Copyright 2009 Xilinx
2,000 Patents
Introduction to Xilinx
Xilinx Patent Hall
� Worldwide Leader in Programmable Logic
– Founded in 1984
– Inventor of the FPGA
– Pioneer of the fabless model
– $1.8B in revenues in FY ’09
– ~3,100 employees worldwide
– 20,000+ customers worldwide
� Industry’s first 65 nm FPGAs
– Shipping 98% of high-end 65nm production FPGAs in the world
� Next-generation 40/45nm FPGAs
– Virtex-6 & Spartan-6 families
© Copyright 2009 Xilinx
14%6%
49%
31%
Revenue by Geography Revenue by End Market
37%
35%
20%
8%
North America
Europe
Japan Asia Pacific
Consumer& Auto
Communications
DataProcessing Industrial
& Other
Xilinx Revenue BreakdownQ2 Calendar Year 2009
Page 5 © Copyright 2009 Xilinx
CostPowerPerformance
The Time for Programmable Logic is Now!
Finan
cial
Constra
ints
Technology
Innovation
Market
Forces
Programmable Imperative!Programmable Imperative!
Do more with lessReduce risk profileFocus on core
competencies
Rapid changing standardsTime to Market (TTM)Fickle, fragmented markets
Page 6 © Copyright 2009 Xilinx
WW ASIC Design Starts22% Decline in 2009
"More likely, we will see a large percentage of these questionable designs not hit any production and die a slow death by indefinite push-outs”
Bryan Lewis, Gartner Analyst
Page 7 © Copyright 2009 Xilinx
$0
$20
$40
$60
$80
$100
$120
$140
$160
500 350 250 180 130 90 65 45 32 22 10
$0
$200
$400
$600
$800
Development Costs
Min Market Size
Development Costs ($M) Minimum Market Size ($M)
High Development Costs Driving
ASSPs to Ultra High Volume Markets
Source: Xilinx, Venture Capital Insights
Process Nodes (nm)
Page 8 © Copyright 2009 Xilinx
The Growing ASIC/ASSP Application GapM
ark
et S
ize
Application Market Segments + 100s More
AS
IC / A
SS
P C
las
s A
pp
lic
ati
on
s
Traditional FPGA Class Applications
UnderservedApplications
Page 9 © Copyright 2009 Xilinx
Agenda
Why Programmable Logic?
Impact of Research in Industry
Quick Update on Xilinx Technology
Page 10 © Copyright 2009 Xilinx
High Performance, Low PowerDSP Slices
Efficient LogicLUT-6 CLB
On-Chip MemoryBlockRAM
Abundant resourcesClocking
550K Logic Cell Device
150K Logic Cell Device
Built-InPCIe® Interface
Increased SerialBandwidthTransceivers
Enhanced Parallel ConnectivityParallel I/O
Virtex-6
Spartan-6
Combination of Flexibility (CLB’s), Integration and
Performance (heterogeneity of hard-IP Blocks)
� Xilinx Virtex-6 and Spartan-6 devices
Modern FPGA Architectures
Page 11 © Copyright 2009 Xilinx
Low Cost Solution Interfacing DDR3Integrated Memory Controllers
Built-InTri-mode EMAC
Higher EfficiencyFIFO Logic
Integrated ReliabilitySystem Monitor
Interface to Legacy Systems3.3 Volt compatible I/O
550K Logic Cell Device
150K Logic Cell Device
Virtex-6
Spartan-6
Key differences between both families
� Xilinx Virtex-6 and Spartan-6 devices
Modern FPGA Architectures
Page 12 © Copyright 2009 Xilinx
=
XX
+/+/--A
D
B
C
P
Power Consumption Benefits Performance Benefits Cost Benefits
� Lowest power operation of any FPGA solution
� 1.23mW/100Mz at 38% toggle rate
� 600MHz operations for any DSP operation including large filters
� ~1.2 TeraMACC in a single device
• Hardened pre-adder and adder cascade saves significant logic resources
• Logic functions can be mapped into DSP blocks
ALU-like Second Stage
Virtex-6 DSP48E1 Slice
Simplified Block Diagram of Virtex-6 DSP49 Slice
Page 13 © Copyright 2009 Xilinx
� Time-multiplex functionality not required at the same time on
the Xilinx FPGA
– Use an smaller device
– Reduces cost and power consumption
Partial Reconfiguration on Xilinx FPGAs
Reconfiguration
Static Implementation Reconfigurable Implementation
Function A
Function B
Sta
tic Fnct.
ASta
tic
Fnct.AS
tati
c
Page 14 © Copyright 2009 Xilinx
Virtex-5 Configuration Memory Architecture
Reconfigurable modulescan be located in the
same column
ReconfigurableRegion #1
ReconfigurableRegion #2
Reconfiguration framesare 20 CLBs high
Fast reconfigurationwith 32-bit wide100 MHz V-4 ICAP
*** Improvements in configuration memory architecture and design
flow have enabled wider use of partial reconfiguration ***
Page 15 © Copyright 2009 Xilinx
Lower Reconfiguration Latencies
Reconfiguration Latency (ms)
0.1 1.0 10 100
2VP22VP42VP72VP202VPX202VP302VP402VP502VP702VPX702VP100
V4LX15V4LX25V4LX40V4LX60V4LX80V4LX100V4LX160V4LX200
% of device reconfigured
10%
25%
50%
75%
100%
~ Order of magnitude improvement
in latency when reconfiguring
25% of a XC2VP100
versus 25% of a XCV4LX100
( Both devices have approximately
100,000 logic elements each )
Page 16 © Copyright 2009 Xilinx
PRM
DES /3DES Core
PPC405
ICAP
UART
Reconfigurable Encryption/Decryption PRMsDES/Triple DES cores connected to the PLB bus
PLB ArbiterArbitrates activity on the On-chip Peripheral Bus
UART
Provides user interface via PC terminal.
PPC405 Program code controls all peripherals and partial reconfiguration activity.
ICAP Internal Configuration Access Port. Provides interface to config logic.
ACE CTLR
SystemACE Controller
stores bitstreams from a Compact Flash card.
Bus MacroDedicated slice logic provides communication between static and reconfig modules.
PLB
Example Design using Partial Reconfiguration
*** Key Concepts and Prototypes Developed by
German Academics ***
Page 17 © Copyright 2009 Xilinx
Agenda
Why Programmable Logic?
Impact of Research in Industry
Quick Update on Xilinx Technology
Page 18 © Copyright 2009 Xilinx
Wired Communication Routers
Customer Needs
Xilinx Value Proposition
� Performance and bandwidth to support 40/100G
� Reduced power consumption
� Flexibility and scalability allow the same FPGA to be used in
multiple sockets
� Rapid development and deployment
� FPGA Power reduction = no change to power infrastructure
� Single FPGA platform using multiple configurations replaces
multiple ASICs
Low Power, High Performance, Scalable Solution FPGAs Provide:
*** Relevant Market in German Industry ***
Page 19 © Copyright 2009 Xilinx
� FPGA power optimization can be addressed
at multiple levels
� System (Application) and front-end tools
can achieve significant power optimizations
� Describe how FPGAs can achieve
competitive power advantage when
compared to other programmable platforms
– DSP processors and Network processors
– Various application domains (e.g., video and
networking)
Back end CADBack end CAD
Front end CAD
System
CircuitCircuit
ArchitectureArchitecture
ProcessProcess
FPGA Power-aware Design
Page 20 © Copyright 2009 Xilinx
� Reduced electricity consumption
– Lower operational expenses for the customer
� Why is heat a bad thing?
– Reliability; Speed; Leakage
– App-specific constraints (size, noise)
� Heat management solutions
– Advanced chip package
– Heat sink on top of the chip
– Active fan
– Spreader plate/heat pipe
– Liquid immersion
[Intel Tech Journal]
$0.15/W
$2.5/W
Benefits of Reduced Power Consumption
Page 21 © Copyright 2009 Xilinx
� Based on Network Processing Units (NPUs)
– NPUs offer software programmability with high data rates
– But also on CPUs, ASICs, ASSPs, FPGAs
� Typical NPU architectures
Intel IXP2805 AMCC nP3700 Xelerated X11
� Based on multiple processing engines that operate on packets in parallel
– 16 multi-threaded micro-engines (Intel)
– 3 multi-tasking processor cores (AMCC)
– 200-stage programmable pipeline of processor cores (Xelerated)
FPGAs NPUs
ASSPs
ASICs
CPUs
Packet Processing Domain
Page 22 © Copyright 2009 Xilinx
$$$11W40Xelerated X11-S200
$$$25.5W10Intel IXP2800
$4.8W1Wintegra WinPath1 787
$1.5W1.6Infineon Convergate C
$$9.8W2Intel IXP2350
$4W2LSI APP300
$<2W25.6Virtex-4 LX25 128-bit
$<2W12.8Virtex-4 LX25 64-bit
$<2W6.4Virtex-4 LX25 32-bit
CostPowerRate (Gb/s)Device
Case Study: DSL Access Multiplexer
� Comparable power consumption and price to low-end NPUs
� Performance in the high-end NPU range
Page 23 © Copyright 2009 Xilinx
Automotive Infotainment and Driver Assistance
Customer Needs
Xilinx Value Proposition
� Deploy scalable solution that supports multiple models
� Provide high level connectivity and entertainment in the
automobile
� Meet challenging performance and power requirements
� Use single FPGA platform to replace multiple similar
function ASSPs
� Provide low power, scalable programmable environment
� Flexibility to address changing standards and
requirements
“Customizable Standard Product”FPGAs Provide:
*** Strong German Industry Leadership in this Market ***
Page 24 © Copyright 2009 Xilinx
Adaptive Cruise Control & Collision Avoidance
Obstacle Detection
Discrete Logic
Lane Detection
32-bit Soft-CPU
LVDS Rx
Hardware Decisions
6-Channel CAN
Controller
DDR Memory I/F
PHY
PHY
PHY
PHY
SDRAMSDRAM
CCD
CCDLVDS Tx
Image Capture Pipeline
CLK DLLs
Other Peripherals
Other DSP Functions
Cameras
Gyroscopes
Laser
Radar
Wheel SensorsPHY
PHY
Accelerator
Brakes
*** Key Concepts and Prototypes Developed by
German Academics ***
Page 25 © Copyright 2009 Xilinx
Lane Departure Warning System Rear Back-up Camera System
� Partial reconfiguration can be exploited in the
automotive domain
– Mutually exclusive applications
Driver Assistance Applications
Page 26 © Copyright 2009 Xilinx
Wireless Base Stations
Customer Needs
Xilinx Value Proposition
� Ability to differentiate = Market Leadership
� Reduced power consumption
� Flexibility to adapt to changing wireless
standards
� Flexibility and Reduced power consumption
� Performance required for next generation
wireless standards
Flexible, High Performance DSPFPGAs Provide:
*** Relevant Market in German Industry ***
Page 27 © Copyright 2009 Xilinx
• Industry looking for base
station energy reductions
(~10% per annum)
• Femtocell drives power
and cost reductions
Longer-term trends:
• Multi-mode radio and
cognitive radio
(increasing adaptability)
• Collaborative
communication (diminished
base station role)
• Mobility as central feature
of Internet (increasing
demands)
Computation and programmability grow, power budgets shrink
(2001) (2005) (2008) (2011) (2015)
Wireless Coder-decoder Compute Requirements (NXP/Nokia)
Wireless Technology Trends
Page 28 © Copyright 2009 Xilinx
Market Requirements
TransmissionFrequency
Reduc
e
CapEx / OpEx
Air InterfaceStandard
Multi-Mode BTSEquipment
Manufacturers
EquipmentManufacturers
WCDMAWCDMA
WiMAXWiMAX
TD-SCDMATD-SCDMA
TD-LTETD-LTE
FDD-LTEFDD-LTE
CDMA2000CDMA2000
2.1GHz2.1GHz
2.5GHz2.5GHz 2.6GHz2.6GHz
3.5GHz3.5GHz 1900MHz1900MHz
700MHz700MHz
↓Power Consumption
↑Transmission eff
↓Power Consumption
↑Transmission eff
Multi-mode/SDR as common platform strategy to
reduce costs and/or increase flexibility
Page 29 © Copyright 2009 Xilinx
� Adapting coding to wireless channel conditions
Partial Reconfiguration for Power Savings:
Application to the Wireless Domain
*** Key Concepts and Prototypes Developed by
German Academics ***
Page 30 © Copyright 2009 Xilinx
30
� FPGA meets processing needs now and expected
in 2013 (~172 Mb/s rate), but…
� Only used for some DSP functions and as a co-
processor to a DSP processor because:
– No high-level programming for FPGA, to facilitate
scaleable integrated solutions
� And CPU/NPU gets the network processing
30
Example System Specification Xilinx technology
FFTChannel
equalizer
MIMO
decoderMACTurbo
decoder
RLC/
PDCP
TCP/IP
Baseband card
Analog
front
end
Digital
down
converter CPRI
OBSAI
LTE: 10 MHz, 2 antenna, 16-QAM 5/6 code rate
iFFTMIMO
coderMACTurbo
coder
RLC/
PDCP
TCP/IPAnalog
front
end
Digital
up
converter
Radio card
CPRI
OBSAI
����
����
����
����
����
����
����
����
During system
development
DSP
XilinxASSP
CPU/NPU
Wireless Base Station Overview
Page 31 © Copyright 2009 Xilinx
High-levelApplication Description
Imp. 1 Imp. N• • •
FPGA Implementations
High-level
Design ToolConstraints
(e.g., power)
� Generation of multiple
implementations from a single
high-level description
� Domain Specific Tools
– System Generator (DSP)
– EDK Plaftorm Studio
(Embedded)
• Customize connectivity
– AccelDSP
• Matlab to FPGA path
� Recent improvements in
general purpose tools
– e.g., PICO Express (Synfora)
High-level Tools for FPGA Design Space
Exploration
Page 32 © Copyright 2009 Xilinx
Design FlowFocus on algorithm functionality, not FPGA implementation details
System Specification (C)
All blocks programmed in C
Using APIs
Preprocessing & Analysis
High-level Optimization
(Memory) Optimized Specification (C)
Partitioning
SW for FPGASW Tuning
Integration
Functional/Architecture Model (parallel)
Implementation
Exploit and express parallelism
PICO Extrem
e FPGA
High-level (C) algorithmic and memory optimizations
(off-chip transfers, on chip size, #computations)
Rapid iterations over the
Design Flow
Page 33 © Copyright 2009 XilinxPage 33
The Answer: Targeted Design PlatformsEnabling customers to innovate faster…
Virtex®-6 FPGA • Spartan®-6 FPGABase IP, ISE program, base boardsBase Platform
Embedded • DSP • ConnectivityDomain IP, Domain tools, FMC daughter
cardsDomain-Specific
Communication • Video • AVB Market specific IP, custom tools, custom boards
Market-Specific
Focus on DifferentiationCustomer
Design
Targeted Design Platforms enable developers and designers to focus on differentiation instead of design infrastructure
Page 34 © Copyright 2009 Xilinx
� The Time for Programmable Logic is Now!
– Increasing markets segments served by Xilinx FPGA’s
– FPGA Benefits: Performance, Flexibility, Power efficiency
� Several Examples of Industrial Impact of this Research Program
– Wireless and wired communications
– Automotive
– Medical imaging
� German academics leadership in Reconfigurable Computing
– Strategic partnership for Xilinx Research Labs
• Multiple extended visits to Xilinx San Jose and Dublin
• Multiple Collaborations with Xilinx business units
Summary
Page 35 © Copyright 2009 Xilinx
Thank you
Juanjo Noguera
Xilinx Research Labs
+353 1 461 5556