high performance dsp with xilinx all programmable devices (design conference 2013)

42
High Performance DSP with Xilinx All Programmable Devices

Upload: analog-devices-inc

Post on 06-May-2015

1.712 views

Category:

Technology


4 download

DESCRIPTION

This session includes a discussion on rapid prototyping concepts using Xilinx All Programmable FPGAs and SoCs with Analog Devices high speed and precision products. Covered in this session will be common use cases for Xilinx devices in DSP applications that interface to high speed analog. An overview will be provided of how Xilinx accelerates development with DSP platforms that can be used to quickly evaluate and prototype systems that include high speed analog, programmable logic, and embedded processing. Also covered will be an introduction to Xilinx’s new Vivado Design Suite development environment that shortens design cycles by providing an IP centric design flow, easy to use design analysis and debug, and high level design flows supporting C/C++ and MATLAB/Simulink.

TRANSCRIPT

Page 1: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

High Performance DSP with Xilinx All Programmable Devices

Page 2: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

The Signal Processing Design Challenge

Requires 4 distinct skill sets Analog, Digital, Software, IO

Analog and Digital designs need to be optimized together to maximize system performance and minimize hardware cost

Requires real world data for testing to insure functional correctness

2

Page 3: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Rapid Prototyping Addresses this Challenge by Providing: Analog Interfaces that can be easily programmed

Digital hardware that can be easily configured with algorithms MATLAB/Simulink C/C++

Large ecosystem of interchangeable analog interfaces FMC PMODs

Partners working together! Algorithms Analog Digital Software High-Speed IO

3

Page 4: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Agenda

Xilinx All Programmable Devices for Signal Processing

Signal Processing System Design Considerations

Xilinx DSP Design Environment

DUC/DDC Design using Model Based Design

Xilinx JESD204B Solution

Rapid Prototyping using Xilinx DSP Platforms

4

Page 5: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Agenda

Xilinx All Programmable Devices for Signal Processing

Signal Processing System Design Considerations

Xilinx DSP Design Environment

DUC/DDC Design using Model Based Design

Xilinx JESD204B Solution

Rapid Prototyping using Xilinx DSP Platforms

5

Page 6: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Lowest Total Power and System Cost

Industry’s Best Price-Performance

Industry’s Highest System Performance

and Capacity

Scalable Optimized 28 nm Architecture Enables Design Portability

6

Low-end HDTV High-end

2 Channel 4 Channel 8+ Channel

Page 7: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Lowest Total Power and System Cost

Industry’s Best Price-Performance

Industry’s Highest System Performance

and Capacity

Scalable Optimized 28 nm Architecture Enables Design Portability

7

Low-end HDTV High-end

2 Channel 4 Channel 8+ Channel

Page 8: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

The First All Programmable SoC

8

Page 9: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Industry’s most Advanced DSP Slice Artix-7, Kintex-7, Virtex-7, Zynq-7000

+/- X

=

B

A

D

C

+ - Pre-Add

25x18

Pattern Detector

48-Bit Accum

P DSP48E1 Slice

DSP48E1 Slice In

terc

onne

ct

DSP48 Tile

DSP48E1 Slice

Two DSP48E1 slices / tile connected by 5 high-speed interconnects

Independent access to accumulators

Wide multiplies for greater numerical precision

Flexible access to dedicated pre-adder

Pattern detector for efficient rounding hardware

Resources / Family Artix Kintex Virtex Zynq

Max DSP48E1 Fmax 628 MHz 741 MHz 741 MHz 741 MHz

Max DSP48E1 Count 740 1920 3600 2020

9

Page 10: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

16x16 GMACs

Single Precision Multiply GFLOPs

DSP Silicon Performance Leadership at 28nm

10

160 196.0

4

N/A 602

1,024 1,129

0

200

400

600

800

1,000

1,200

Multicore DSP Artix Zynq PS Zynq PL Kintex Industry Best HP Virtex

GM

AC

s

40.0 212 465

1,496

717

1,422 1,963

2,667

0

1,000

2,000

3,000

Multicore DSP CompetitiveHigh Volume

FPGA

Artix Zynq CompetitiveCost

PerformanceFPGA

Kintex CompetitiveHigh

PerformanceFPGA

Virtex

GFL

OP

s

Page 11: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Xilinx 7 Series Transceiver

7 series offers a full transceiver portfolio for variant customer needs Ultra-high performance GTZ: 28.05Gb/s X 16 High-end Low-power GTH: 13.1Gb/s X 96 Mid-Range GTX: 12.5Gb/s X 32 High-Volume Low-Power GTP: 6.6Gb/s X 16

Virtex-7 HT has up to 2.802Tbps total transceiver bandwidth 16 GTZ and 72 GTH transceivers

11

Page 12: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Jitter Performance

7 series transceivers offer the best jitter performance at 6Gb/s, 10Gb/s+ and 28Gb/s in FPGA industry

Both transmitter and receiver use the high performance PLL

12

6.25Gb/s 10.3125Gb/s

28Gb/s 13.1Gb/s

Page 13: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Agenda

Xilinx All Programmable Devices for Signal Processing

Signal Processing System Design Considerations

Xilinx DSP Design Environment

DUC/DDC Design using Model Based Design

Xilinx JESD204B Solution

Rapid Prototyping using Xilinx DSP Platforms

13

Page 14: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Oversampling and Process Gain

For a Gaussian signal with signal power Ps and total noise power due to quantization Pn the Signal to Noise Ration is SNR = Ps / Pn

Oversampling by M improves the SNR Pnb = Noise power in the signal

bandwidth (B) M = Oversampling Factor Pnb = Pn / M

Each 2X over-sampling improves the SNR by 3db SNR = Ps / Pnb SNR = M*Ps / P

Faster Sample Rates However leads to More difficult timing closure in the FPGA design Extra FPGA Resources Higher Power Consumption

14

Ps = Signal Power Pn = Quantization Noise Power B = Bandwidth of Interest Nyquist = 2*B Fs = Actual Sampling Rate

Page 15: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Decimation and SNR of Ideal ADC

15

Page 16: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Decimation Filter Preserves Processing Gain

16

Page 17: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

System Design Considerations

Higher analog IF Sample rates result in better SNR

Designing a digital down converter that minimizes hardware resources

IF Sampling clock and the decimation filtering clocks need to be multiples of each other

17

DSP48

Page 18: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Improving Area Efficiency using Hardware Overclocking The DSP Resources in the FPGA can be overclocked to save

hardware resources DSP48E1 slice Fmax = 741 MHz at 28nm Implement more channels or more signal processing with smaller All

Programmable FPGAs or SoCs

18

X

X

X

100 MHz

X

Time Division Mux Time Division DeMux

100 MHz 300 MHz 100 MHz

3X Over Clocking

DSP48

Page 19: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Agenda

Xilinx All Programmable Devices for Signal Processing

Signal Processing System Design Considerations

Xilinx DSP Design Environment

DUC/DDC Design using Model Based Design

Xilinx JESD204B Solution

Rapid Prototyping using Xilinx DSP Platforms

19

Page 20: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

DSP Design Methodology Leadership

Flexible design environment

Floating-point and Fixed-point Hardware Generation

Real time analog data acquisition

World class C design flow

20

HDL Coder

System Generator

Vivado HLS

ZC706

MATLAB / Simulink

Toolbox IP Xilinx IP

MATLAB Simulink

MATLAB Simulink C/C++

C Libraries

Analog Signals

A/D

D/A

Vivado

RTL

IP Catalog

Vivado System Edition

Algorithm Specification

AD9250

AD9129

Page 21: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

DSP IP and Reference Designs Leadership

DSP IP libraries available for multiple design environments

Industries most advanced library of LTE specific wireless IP

Reference Designs provide working starting points DSP, SDR, Wireless, Aerospace and Defense

21

LTE Specific IP General Wireless IP Video IP HLS Libraries

LTE Turbo Enc/Dec Convolutional Encoder Scalar Math.h

LTE Channel Enc/Dec Viterbi Decoder Color Correction Floating Point library

LTE MIMO Enc/Dec DFT/iDFT Timing controller OpenCV

LTE FFT DUC/DDC Compiler On-Screen Display AXI Interfaces

LTE Channel Estimator FIR Compiler Image Edge Enhancement Linear Algebra *

LTE PUCCH Receiver DDS Compiler H.264 Encoder

LTE RACH Detector CORDIC Color Space Converter

RTL MATLAB/ Simulink C/C++

Page 22: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Xilinx System Generator for DSP

Enables MATLAB/Simulink for DSP FPGA Design 100+ Xilinx optimized DSP building blocks Automatic code generation Supports parameterization in MATLAB

Vivado Integration Import C-based IP from Vivado HLS Automatic bitstream generation Vivado Project File generation

22

System Generator for DSP

Page 23: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Vivado High-Level C/C++ Synthesis

Comprehensive coverage C/C++/SystemC Arbitrary precision Floating-point

Accelerated verification 2 to 3 orders of magnitude

faster than RTL for larger design

Fast compilation and design exploration Algorithm feasibility Architecture Iteration

Customer proven results

23

Page 24: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Introducing Vivado IP Integrator IP Deployment and Assembly Abstract IP Assembly Does not require RTL Debug integration

Intelligent IP Integration Real time design rule checks Parameter propagation Designer Assistance

IP Aware AXI Interconnect Automated interface Device driver & address map generation

Page 25: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Use with High-Level Tool Flows and Design Subsystems

IP Catalog

IP Integrator

Vivado HLS

System Generator for DSP

Vivado Integrated Design Environment

Populate the Vivado IP Catalog using High-Level Design tools • HLS • System Generator

Integrate IP into design subsystems for Xilinx platforms

Page 26: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Vivado Design Suite: From Months to Weeks

Page 27: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

High-level Hardware Debugging

Flexible, targeted probing of HDL and XDC using MARK_DEBUG property

RTL and synthesized netlist probing in multiple views

System-level probing inside of IP integrator view

27

entity FIR is port (clk : in rst : in din : in

HDL

Schematic Hierarchy

Debug Probe

Vivado IP Integrator

Page 28: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Agenda

Xilinx All Programmable Devices for Signal Processing

Signal Processing System Design Considerations

Xilinx DSP Design Environment

DUC/DDC Design using Model Based Design

Xilinx JESD204B Solution

Rapid Prototyping using Xilinx DSP Platforms

28

Page 29: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

DUC/DDC Architectural Considerations

Both system performance and hardware resources should be taken into account

Pulse shaping needs to be performed on TX / RX side

Wideband / Narrowband signal

Filter type and # of filter stages

IF Sampling rate vs Programmable Logic Fmax

29

Page 30: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Using Model Based Design to Explore Filter Configurations

30

Anti-Aliasing Filter

Filter Length and Sample Rate for 1st

Filter

Filter Length and Sample Rate for 2nd

Filter

Filter Length and Sample Rate for 3rd

Filter

Architecture of choice

Configuration 1 91 taps (↑8) 61.44 MSPS

Configuration 2 47 taps (↑4) 30.72 MSPS

11 taps (↑2) 61.44 MSPS

Configuration 3 23 taps (↑4) 15.38 MSPS

25 taps (↑2) 61.44 MSPS

Configuration 4 23 taps (↑2) 15.38 MSPS

11 taps (↑2) 30.72 MSPS

11 taps (↑2) 61.44 MSPS

MathWorks FDATool

Page 31: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Create Executable Specification in Simulink

31

Filters automatically generated by FDATool

Page 32: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Correct by Construction Hardware Design using System Generator

32

System Generator Hardware Design

Simulink Executable Spec

Xilinx Gateways define FPGA Boundary

Page 33: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Improve Results through Overclocking

33

Resource Separate Filter Chains TDM TDM + 3X Overclocking DUC IF Sample Rate 61.44 MHz 61.44 MHz 61.44 MHz

Hardware Clock Frequency 61.44 MHz 122.88 MHz 368.64 MHz

LUTs 1486 952 961

Registers 1987 1230 1176

DSP48E1 27 17 11

DSP48E1s used in Filters 20 10 4

Page 34: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Agenda

Xilinx All Programmable Devices for Signal Processing

Signal Processing System Design Considerations

Xilinx DSP Design Environment

DUC/DDC Design using Model Based Design

Xilinx JESD204B Solution

Rapid Prototyping using Xilinx DSP Platforms

34

Page 35: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Xilinx JESD204 IP v3.1 (Released in 2012.4)

Designed to JEDEC JESD204B standard specification

Supports Transmit, Receive, and Duplex (TX & RX) Modes

1 to 8 lane configurations up to 12.5 Gbps

Support for subclass 0, 1, and 2

Deterministic latency for subclass 1 and 2

Supports GTX, GTH, GTP

35

JESD204B State

Machine

GT TILE

Clocks

Management Registers

Control & Status

JESD204 Data (AXI4-Stream)

Test Data Gen/

Checker

Status & Control (AXI4-Lite)

Page 36: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Wizard Based JESD204 IP Core Generation

Easy to use Graphical Interface

Generated Files include The netlist file for the core XCO files Release notes and documentation A Verilog example design and

demonstration test bench Scripts to synthesize, implement

and simulate the example design

36

Page 37: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

JESD204B Example Design

37

Page 38: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Analog Devices Scan Viewer

Displays Eye Scan diagram for JESD204B transceivers

Leverages the eyescan block in the Xilinx 7-series transceivers Validate interface Reusable into production designs

38

Page 39: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Agenda

Xilinx All Programmable Devices for Signal Processing

Signal Processing System Design Considerations

Xilinx DSP Design Environment

DUC/DDC Design using Model Based Design

Xilinx JESD204B Solution

Rapid Prototyping using Xilinx DSP Platforms

39

Page 40: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

HDL Blocks: AD9250-FMC-250EBZ

Reference design for: Virtex-7, Kintex-7, Virtex-6 KC705 ZC706 VC707 ML605

40

ADI IP Core Xilinx IP Core

Page 41: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

JESD204B High-Speed ADC Demo

41

Analog Devices’ AD9250-FMC-250EBZ 14-bit / 250 MSPS 4-ch ADC

AD9250 High Speed JESD204B Serdes Outputs Data Eye @ 5Gbps

Analog Input (Single-Tone FFT with

fIN = 90.1 MHz)

Ethernet data connection to PC for Verification of Analog Signal on VisualAnalog™

Xilinx Kintex-7 FPGA KC705 Eval Kit

Recovered Eye (after EQ/CDR)

Page 42: High Performance DSP with Xilinx All Programmable Devices (Design Conference 2013)

Summary

View Demonstrations at Show Analog Devices Booth JESD204B Platform Demonstration

Xilinx Booth Zynq SDR Kit with ZC702

Development board Artix-7 AMS Demo

Get Started Today with the Zynq SDR Kit

Learn More at www.xilinx.com/DSP www.mathworks.com/Xilinx www.mathworks.com/zynq www.analog.com/xilinx

42