accelerating design productivity with 7 series fpgas … · matlab and simulink are registered ......

14
WP406 (v1.1) February 22, 2013 www.xilinx.com 1 © Copyright 2012–2013 Xilinx, Inc. Xilinx, the Xilinx logo, Artix, ISE, Kintex, Spartan, Virtex, Vivado, Zynq, and other designated brands included herein are trademarks of Xilinx in the United States and other countries. AMBA, AMBA Designer, ARM, ARM1176JZ-S, CoreSight, Cortex, and PrimeCell are trademarks of ARM in the EU and other countries. MATLAB and Simulink are registered trademarks of The MathWorks, Inc. PCI, PCIe, and PCI Express are trademarks of PCI-SIG and used under license. All other trademarks are the property of their respective owners. Designers tasked with delivering more capability and performance in today's sophisticated DSP applications are increasingly turning to programmable logic for their hardware solutions. Xilinx® 7 series FPGAs meet this demand with a family of devices uniquely developed to address specific market needs, such as highest performance, lowest cost, or lowest power. Xilinx 7 series DSP Targeted Design Platforms accelerate development of DSP applications by reducing schedule risk, enabling design reuse, and introducing new high-level design methodologies. White Paper: 7 Series FPGAs WP406 (v1.1) February 22, 2013 Accelerating Design Productivity with 7 Series FPGAs and DSP Platforms By: Tom Hill

Upload: ledien

Post on 17-Jun-2018

219 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

WP406 (v1.1) February 22, 2013 www.xilinx.com 1

© Copyright 2012–2013 Xilinx, Inc. Xilinx, the Xilinx logo, Artix, ISE, Kintex, Spartan, Virtex, Vivado, Zynq, and other designated brands included herein are trademarks of Xilinx in the United States and other countries. AMBA, AMBA Designer, ARM, ARM1176JZ-S, CoreSight, Cortex, and PrimeCell are trademarks of ARM in the EU and other countries. MATLAB and Simulink are registered trademarks of The MathWorks, Inc. PCI, PCIe, and PCI Express are trademarks of PCI-SIG and used under license. All other trademarks are the property of their respective owners.

Designers tasked with delivering more capabilityand performance in today's sophisticated DSPapplications are increasingly turning toprogrammable logic for their hardware solutions.Xilinx® 7 series FPGAs meet this demand with afamily of devices uniquely developed to addressspecific market needs, such as highest performance,lowest cost, or lowest power. Xilinx 7 series DSPTargeted Design Platforms accelerate developmentof DSP applications by reducing schedule risk,enabling design reuse, and introducing newhigh-level design methodologies.

White Paper: 7 Series FPGAs

WP406 (v1.1) February 22, 2013

Accelerating Design Productivity with 7 Series FPGAs and

DSP Platforms

By: Tom Hill

Page 2: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

2 www.xilinx.com WP406 (v1.1) February 22, 2013

7 Series FPGAs for DSP

7 Series FPGAs for DSPWith 7 series FPGAs, Xilinx introduced a new high-k metal gate (HKMG), high-performance, low-power (HPL) variant of 28 nm process technology. The resulting Artix™-7, Kintex™-7, and Virtex®-7 FPGAs allow designers to achieve both low power consumption and high DSP performance without compromise (Figure 1). The 7 series devices are built around a scalable, optimized structural concept that eliminates architectural differences between device families. This makes it easy to target designs towards the 7 series device that most perfectly meets the designer’s requirements.

Fourth Generation DSP48E1 SliceHigh DSP performance in the 7 series FPGAs is achieved through the capabilities of the DSP48E1 slice, the fourth generation of this innovative embedded block in 7 series FPGAs. The DSP48E1 slice provides designers with dynamic control input signals that can select different arithmetic operations on a clock-by-clock basis. The slice is implemented in fully customized silicon, and as a result, delivers superior power and performance.The structural elements of the slices and tile are illustrated in Figure 2.

X-Ref Target - Figure 1

Figure 1: 7 Series FPGAs Optimized, Scalable Architecture

WP406_01_100511

Low

Lower

Lowest

Highest

LowPower

HighPerformance

Higher

High

Virtex-7FPGAs

Kintex-7FPGAs

Artix-7FPGAs

Page 3: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

7 Series FPGAs for DSP

WP406 (v1.1) February 22, 2013 www.xilinx.com 3

Five high-speed interconnects are used to combine two DSP48E1 slices into a single DSP48E1 tile that can be used to implement a variety of arithmetic operations of variable precision with no loss of FMAX performance.

This enables high-resolution applications such as radar, medical imaging, and wireless to take full advantage of the 7 series FPGAs' industry-leading 28 nm DSP performances.

The Independent Multiplier and Accumulator Access section compares the capabilities of the 7 series DSP48E1 tile to the Altera Stratix V DSP block for a variety of arithmetic operations commonly used in DSP algorithms.

Independent Multiplier and Accumulator AccessThe DSP48E1 slice allows for greater flexibility by providing independent access to each multiplier and accumulator compared to the Altera Stratix V DSP block that limits access to one. See Figure 3.

X-Ref Target - Figure 2

Figure 2: DSP48E1 Tile (Two DSP48E1 Slices and Interconnects)

WP406_02_121411

DSP48E1Slice

B

DSP48 Tile

Inte

rcon

nect

DSP48E1Slice

APre-Adder

25x1848-Bit

Accumulator

PatternDetector

P+/-

D

C

X

=

+-

B

APre-Adder

25x1848-Bit

Accumulator

PatternDetector

P+/-

D

C

X

=

+-

Page 4: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

4 www.xilinx.com WP406 (v1.1) February 22, 2013

7 Series FPGAs for DSP

Support for Wider Multiply OperationsEach multiplier in the DSP48E1 slice supports up to 18x25 bits that allow the DSP48E1 tile to implement wider multiply operations up to 35x24 with pre-add and 35x26 without pre-add. See Figure 4 and Table 1.

X-Ref Target - Figure 3

Figure 3: Independent Multipliers in the DSP48E1 Tile

WP406_03_121312

B

Virtex-7 FPGA DSP48E1 Slices

Stratix V DSP BlockA Pre-Adder

25x18

Pre-AdderCoeffMem

18x18

18x18

64-BitAccumulator

48-BitAccumulator

PatternDetector

P+/-

+/-

+/-+/-

D

C

X

X

X

=

+-

B

A Pre-Adder

25x1848-Bit

Accumulator

PatternDetector

P+/-

D

C

X

=

+-

X-Ref Target - Figure 4

Figure 4: Wide Multiplier Comparison

WP406_04_011713

B

Virtex-7 FPGA DSP48E1 Tile35x25 Multiplier

A[24:0]

B[34:17]

A[17:0]

B[17:0]

A[17:0]

B[35:18]

A[24:0]

B[16:0]

Out[59:17]

Out[53:0]

Stratix-V DSP Block 36x18 MultiplierA

Pre-Adder

25x18

Pre-AddCoeffMem

18x18

18x18

64-BitAccumulator

48-BitAccumulator

PatternDetector

P+/-

+/-

+/-+/-

D

C

X

X

X

=

+-

B

APre-Adder

25x1848-Bit

Accumulator

PatternDetector

+/-D

C

X

=

+-

Out[16:0]

Page 5: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

7 Series FPGAs for DSP

WP406 (v1.1) February 22, 2013 www.xilinx.com 5

Symmetric Filter Coefficients Not Limited to 8 ValuesThe DSP48E1 slice provide complete flexibility to support coefficients of any length using the input ports for each accumulator. See Figure 5.

ALU Block for Efficient Rounding Each DSP48E1 slice supports advanced functions, such as an ALU and pattern detection block, which allow more efficient hardware implementations for common rounding modes, including convergent, symmetric, and random rounding. See Figure 6.

Table 1: DSP Resource Comparison for Wide Multipliers

Configuration Xilinx DSP48E1 Tile Altera Stratix V DSP Block

Independent Multipliers

Two 25x18 with full 43-bit resolution

Two 18x18 with limited 32-bit resolution

With Pre-add 35x24 26x22

Without Pre-add 35x25 or 42x18 27x27 or 36x18

Without Pre-add (Cport ext) 35x26 or 42x19 N/A

X-Ref Target - Figure 5

Figure 5: Support for Filter Coefficients

WP406_05_121312

DSP48E1Slice

B

DSP48 Tile

DSPBlock

Virtex-7 FPGA DSP SlicesDSP48E1 Slice

Stratix V DSP Block

Inte

rcon

nect

DSP48E1Slice

APre-Adder

25x18

Pre-AddCoeffMem

18x18

18x18

64-Bit Accumulator

48-BitAccumulator

PatternDetector

P+/-

+/-

+/-+/-

D

C B

A

D

C

X

X

X

=

+-

B

APre-Adder

25x1848-Bit

Accumulator

PatternDetector

P+/-

D

C

X

=

+-

Page 6: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

6 www.xilinx.com WP406 (v1.1) February 22, 2013

7 Series FPGAs for DSP

Larger, More Flexible AccumulatorsThe DSP48E1 tile offers the flexibility to use two 48-bit accumulators independently or to combine them into a single 96-bit accumulator. See Figure 7. See Table 2.

X-Ref Target - Figure 6

Figure 6: Hardware Support for Fixed-Point Rounding

WP406_06_121312

DSP48E1Slice

B

DSP48 Tile

DSPBlock

Virtex-7 FPGA DSP SlicesDSP48E1 Slice

Stratix V DSP Block

• Convergent Rounding• Symmetric Rounding• Random Rounding

Inte

rcon

nect

DSP48E1Slice

APre-Adder

25x18

Pre-AddCoeffMem

18x18

18x18

64-Bit Accumulator

48-BitAccumulator

PatternDetector

P+/-

+/-

+/-+/-

D

C B

A

D

C

X

X

X

=

+-

B

APre-Adder

25x1848-Bit

Accumulator

PatternDetector

P+/-

D

C

X

=

+-

N/A

X-Ref Target - Figure 7

Figure 7: Wide Accumulator Comparison

WP406_07_121312

DSP48E1Slice

B

DSP48 Tile

DSPBlock

Virtex-7 FPGA DSP SlicesDSP48E1 Slice

Stratix V DSP Block

Inte

rcon

nect

DSP48E1Slice

APre-Adder

25x18

Pre-AddCoeffMem

18x18

18x18

64-Bit Accumulator

48-BitAccumulator

PatternDetector

P+/-

+/-

+/-+/-

D

C B

A

D

C

X

X

X

=

+-

B

APre-Adder

25x1848-Bit

Accumulator

PatternDetector

P+/-

D

C

X

=

+-

Page 7: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

7 Series FPGAs for DSP

WP406 (v1.1) February 22, 2013 www.xilinx.com 7

More Efficient Complex Multiply ImplementationsThe DSP48E1 tile can be configured to implement two 25x18 multiply-add operations that can be used to build complex multipliers with 30% less logic:

out_r = (ar*br) - (ai*bi)

out_i = (ar*bi) + (ai*br)

See Figure 8.

Note: Figures only show configuration for the real output of the complex multiply.

Table 3 compares the DSP hardware resources for wide complex multiply operations commonly used in wireless and A&D applications

Table 2: DSP Resource Comparison for Wide Accumulators

Configuration Xilinx Altera

One Accumulator 96-bit 64-bit

Two Accumulators 48-bits N/A

X-Ref Target - Figure 8

Figure 8: Comparison of Complex Multiply Real Output

WP406_08_022113

B

Virtex-7 FPGA DSP48E1 Tile18x25 Complex Multiplier - Real Part

Ar[24:0]

Br[17:0]

Ai[17:0]

Bi[17:0]

Ar[17:0]

Br[17:0]

Ai[24:0]

Bi[17:0]

Out_r[43:0]

Out_r[36:0]

Stratix-V DSP Block 18x18 Complex Multiplier - Real PartA

Pre-Adder

25x18

Pre-AddCoeffMem

18x18

18x18

64-BitAccumulator

48-BitAccumulator

PatternDetector

P+/-

+/-

+/-+/-

D

C

X

X

X

=

+-

B

APre-Adder

25x1848-Bit

Accumulator

PatternDetector

P+/-

D

C

X

=

+-

Table 3: Wide Complex Multiply Operations

Operation Xilinx Altera

18x18 complex multiplier 3 DSP48 slices (1.5 DSP tiles) 2 DSP blocks

18x25 complex multiplier 2 DSP48 tiles 3 DSP blocks

18x25 complex multiplier FMAX 741 MHz 400 MHz

Page 8: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

8 www.xilinx.com WP406 (v1.1) February 22, 2013

Virtex-7 FPGAs: Highest Performance in a Programmable Device

Virtex-7 FPGAs: Highest Performance in a Programmable DeviceVirtex-7 FPGAs address a need for a hardware solution in applications that require the highest levels of DSP performance. Virtex-7 FPGAs include up to 3,600 DSP48E1 slices, capable of clocking at over 741 MHz, that combine to deliver over 5,000 GMAC/s of peak DSP performance, satisfying even the most demanding applications, as illustrated in Figure 9.

Virtex-7 FPGAs deliver over 2X the peak DSP processing bandwidth of other leading 28 nm alternatives for symmetric filtering operations typical of wireless, radar, and medical applications.

Kintex-7 FPGAs: Highest DSP Performance per DollarThe Kintex-7 FPGA shares the same high-performance silicon process and DSP48E1 slice as the Virtex-7 FPGA, but the amounts and ratios of these resources have been optimized to lower costs for high-volume applications. These applications include wireless infrastructure, broadcast, and medical, which previously required a high-performance class device; they can now use the Kintex-7 FPGA to reduce cost and power by half. In fact, Kintex-7 devices offer over twice the DSP processing bandwidth of competing mid-range 28 nm programmable devices, as shown in Figure 10.

X-Ref Target - Figure 9

Figure 9: High-End FPGA DSP Performance

WP406_09_121712Symmetric Filtering Operations

5,314 GMACs

~1,963GMACs

Virtex-7 FPGAs(25x18 Multipliers)

Competitive 28 nm FPGA(18x18 Multipliers)

Algorithm Complexity

5,314 GMultAdd/s

3,926 GMultAdd/s

Xilinx Virtex-7 HTFPGAs(25x18 Multipliers)

Altera Stratix V GS(18x18 Multipliers)

Algorithm Complexity

X-Ref Target - Figure 10

Figure 10: High-Volume FPGA DSP Performance

2,845 GMAC/s

717 GMAC/s

Kintex-7 FPGAs(25x18Multipliers)

Competitive 28 nm FPGA(18x18Multipliers)

WP406_10_121412Symmetric Filtering Operations

2,845GMultAdd/s

1,433GMultAdd/s

Xilinx Kintex-7 FPGAs(25x18 Multipliers)

Altera Arriva V GT(18x18Multipliers)

Algorithm ComplexityAlgorithm Complexity

Page 9: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

Artix-7 FPGAs: Highest DSP Performance per Watt in a High-Volume Device

WP406 (v1.1) February 22, 2013 www.xilinx.com 9

Artix-7 FPGAs: Highest DSP Performance per Watt in a High-Volume Device

The Artix-7 device offers the lowest power and lowest cost in an FPGA, addressing the high-volume market. The Artix-7 device shares a scalable, optimized architecture with Kintex-7 and Virtex-7 FPGAs, including the DSP48E1 slice, but the Artix-7 FPGA silicon processes have been further optimized for maximum power efficiency. The 50% power reduction achieved by 7 series FPGAs is the cumulative effect of multiple power optimizations, including an advanced, low-power (HPL) HKMG process that delivers a 65% reduction in static power. Additional architectural and process improvements have lowered dynamic power in the Artix-7 device by 25%, while I/O power has been reduced by 30%. See Figure 11.

Power-sensitive, high-volume applications can be implemented on Artix-7 devices with over 3X the DSP processing bandwidth and up to 87% less power than competitive devices, enabling more DSP processing bandwidth per clock and per watt.

Additionally, Artix-7 FPGAs offer up to a 5X performance-per-watt advantage compared to multi-core DSPs. For example, a TI TMC320C6678 multi-core DSP consumes an estimated 12.3 watts to deliver a peak DSP performance of 320 GMAC/s, which normalizes to 26 GMAC/s per watt. The Artix-7 FPGA can deliver up to 140 GMAC/s per watt, making it the ideal device for applications that demand both high performance and low power. Refer to Figure 12.

X-Ref Target - Figure 11

Figure 11: 7 Series Power Reduction

100%

90%

80%

70%

60%

50%

40%

30%

20%

10%

0%

Nor

mal

ized

Tot

al P

ower

40 nm 28 nm 28 nmWP406_11_121312

Up to50%

LowerPower

IncreaseUsable

Performanceand Capacity

I/O Power

DynamicPower

MAX StaticPower

TYP StaticPower

-30%

-65%

-25%+*

Page 10: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

10 www.xilinx.com WP406 (v1.1) February 22, 2013

Accelerate Development with DSP Targeted Design Platforms

When compared against competing 28 nm high-volume devices, the Artix-7 family offers over a 3X performance advantage, delivering up to 929 GMAC/s of DSP processing bandwidth for symmetric filters, as shown in Figure 13.

Artix-7 FPGAs offer both performance and flexibility advantages over off-the-shelf ASSP solutions and over 2X the performance of competitive devices.

Accelerate Development with DSP Targeted Design PlatformsDesigners with a variety of technical backgrounds can use the leading DSP Targeted Design Platforms from Xilinx (see Figure 14) to evaluate the benefits of 7 series FPGAs in their designs. Targeted Design Platforms provide development hardware, FPGA Mezzanine Card (FMC) I/O daughter cards, reference designs, IP and domain-specific design methodologies, and tools that reduce schedule risk and accelerate development of new designs.

X-Ref Target - Figure 12

Figure 12: DSP Performance per Watt

WP406_12_121312

140

20G

MA

C/s

per

Wat

t

40

60

80

100

120

Artix-7 FPGAs Multi-Core DSP

X-Ref Target - Figure 13

Figure 13: Low Power, High Volume DSP Performance

929GMAC/s

212GMAC/s

Artix-7 FPGAs(25x18 Multipliers)

Competitive 28 nm FPGA(18x18Multipliers)

WP406_13_121312Symmetric Filtering Operations

926GMultAdd/s

424GMultAdd/s

Xilinx Artix-7 FPGAs(25x18 Multipliers)

Altera Cyclone V GT(18x18 Multipliers)

Algorithm ComplexityAlgorithm Complexity

Page 11: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

Accelerate Development with DSP Targeted Design Platforms

WP406 (v1.1) February 22, 2013 www.xilinx.com 11

The Kintex-7 FPGA DSP Kit is the first DSP domain platform available from Xilinx for DSP development in 7 series devices. Technical highlights of this kit include:

• Kintex-7 FPGA KC705 Evaluation Kit equipped with the Kintex-7 XC7K325T device, featuring 840 DSP48E1 slices that can deliver up to 1,245 GMAC/s of DSP processing bandwidth

• 4DSP FMC150 ADC/DAC high-speed FMC mezzanine card that includes:

• Dual-channel 16-bit 800 MSPS DAC

• Dual-channel 14-bit 250 MSPS ADC data converters

• Clock generation/clock conditioning

• ISE® Design Suite: System Edition/Vivado™ Design Suite — Targeted to the KC705 board

• Includes System Generator for DSP

• DSP “Getting Started” Reference Design with digital up/down conversion and high-speed analog interface logic

• RTL DSP design tutorial• Model-based design tutorial with MATLAB® and Simulink® software

Additional RF and High-Speed analog cards are available from the Xilinx FMC ecosystem.

Reduce Schedule Risk with Proven DSP Targeted Reference DesignsXilinx DSP Development kits serve as a validation platform for DSP development that reduces schedule risk. Prior to release, each reference design shipped with the kits is thoroughly tested for functional correctness, design efficiency, timing closure, and high quality of results.

The digital signal processing logic and high-speed analog interfaces are designed to fully leverage the performance of the FPGA, including the DSP48E1 slice and the I/O. These reference designs are provided as precompiled bitstreams that can be directly downloaded onto the FPGA and executed in hardware. The reference designs reduce schedule risk by validating working devices at full performance. After silicon has been validated, step-by-step design tutorials are provided that take users through the process of regenerating IP cores and recompiling design source files through the ISE Design Suite tool to generate new programming files. Validating working silicon,

X-Ref Target - Figure 14

Figure 14: Xilinx Targeted Design Platforms

WP406_14_121312

Application

Market-Specific Industrial Imaging, Medical,Surveillance, A&D, Wireless

Domain-Specific

Base Platform

Focus on differentiation andinnovation instead of designinfrastructure to shrink systemdevelopment time

Base IP - Ethernet, USB, etcTools - ISE Design Suite Logic EditionFPGAs, EPPs, and base boards

DSP, Video, Embedded, Connectivity, Analog

Page 12: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

12 www.xilinx.com WP406 (v1.1) February 22, 2013

Accelerate Development with DSP Targeted Design Platforms

working IP, and working tool flows ensures that time-critical projects start with confidence.

The DSP reference design fully leverages the performance capability of the Kintex-7 FPGA. The programmable logic used to implement the DUC/DDC functionality is overclocked at 491.52 MHz to allow resource sharing of the DSP48E1 slices. The DUC output sampling rate supports 245.76 MSPS with a 2X interpolation through the ISERDES to achieve a final DAC output sampling rate of 491.52 MSPS. The ISERDES ADC data capture is configured to support either a single stream of 245.76 MSPS (per ADC channel) or a dual stream, where each stream is decimated by 2X at 122.88 MSPS.

An example of an initial design is shown in Figure 15.

DSP Targeted Reference Designs are fully supported designs that are updated for each design tool release. Portions of these designs are often reusable in end-user designs, saving weeks or even months of design effort recreating non-proprietary design infrastructure. For example, the targeted reference design for the Kintex-7 FPGA DSP Kit includes signal generation, digital up/down conversion blocks, and interface blocks to the high-speed data converters.

X-Ref Target - Figure 15

Figure 15: Kintex-7 FPGA DSP Kit Targeted Reference Design Block Diagram

X-Ref Target - Figure 16

Figure 16: Kintex-7 FPGA DSP Kit Design Reuse

PatternGenerator

DACInterface

WP406_15_121312

MMCMClocking

DAC

ADC

Digital Up-Converter

(DUC)

ChipScopeILA

ADCInterface

Digital Down-Converter

(DDC)

Kintex-7 FPGA

FMC150DAC/ADC

PatternGenerator

DACInterface

WP406_16_121312

MMCMClocking

DAC

ADC

Digital Up-Converter

(DUC)

ChipScopeILA

ADCInterface

Digital Down-Converter

(DDC)

Kintex-7 FPGA

FMC150DAC/ADC

Reusable DesignInfrastructure

Page 13: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

Summary

WP406 (v1.1) February 22, 2013 www.xilinx.com 13

Extend DSP Platform Capabilities through the FMC EcosystemDeveloped by a consortium of companies ranging from FPGA vendors to end users, the FPGA Mezzanine Card (FMC) is compliant to an ANSI standard that provides a standard form factor, connectors, and modular interface to an FPGA located on a base board. Decoupling the I/O interfaces from the FPGA simplifies I/O interface module design while maximizing carrier card reuse. See Figure 17. Today, roughly 30 FMC high-speed analog FMC daughter cards are available through Xilinx ecosystem partners.

Key Benefits of FMCs:• Data throughput: Support of individual signaling speeds up to 10 Gb/s, with

potential overall bandwidth of 40 Gb/s between mezzanine and carrier card• Latency: Elimination of protocol overhead removes latency and ensures

deterministic data delivery• Design simplicity: Expertise in protocol standards such as PCI™, PCI Express®,

or Serial RapidIO not required• System overhead: Power consumption, IP core costs, engineering time, and

material costs reduced through simplification of system design• Design reuse: Whether using a custom in-house board design or a commercial

off-the-shelf (COTS) mezzanine or carrier card, the FMC standard promotes the ability to retarget existing FPGA/carrier card designs to a new I/O. All that is required is swapping out the FMC module and slightly adjusting the FPGA design.

Over 30 high-speed analog FMC cards are currently available through the FMC ecosystem. A complete list of can be found at: www.xilinx.com/fmc.

SummaryXilinx DSP design platforms reduce schedule risk and accelerate productivity for any new DSP design that targets FPGAs. Xilinx 7 series FPGAs deliver high levels of DSP performance at half the cost and power compared to previous generation devices. The device family shares an optimized, scalable silicon architecture that enables designs to migrate across device families, and thus to highlight the device that offers the best combination of performance, cost, and power.

X-Ref Target - Figure 17

Figure 17: FMC Mezzanine Cards for Xilinx Development Boards

WP406_17_121712

Page 14: Accelerating Design Productivity with 7 Series FPGAs … · MATLAB and Simulink are registered ... Accelerating Design Productivity with 7 Series FPGAs and DSP ... performance.The

14 www.xilinx.com WP406 (v1.1) February 22, 2013

Revision History

As more designers turn to programmable logic to solve their design challenges, an amazing synergy is taking place. Designers are discovering that these many evolving technologies support and facilitate each other:

• Greater confidence in the working silicon• Greater confidence in the design tools• Easier design reuse• Increased design efficiency• Easier-to-use high-level design flows

All these advances, working together, enable more designers and product managers to get better products to market in record time.

For more information, go to http://www.xilinx.com/innovation/7-series-fpgas.htm.

Revision HistoryThe following table shows the revision history for this document:

Notice of DisclaimerThe information disclosed to you hereunder (the “Materials”) is provided solely for the selection and useof Xilinx products. To the maximum extent permitted by applicable law: (1) Materials are made available“AS IS” and with all faults, Xilinx hereby DISCLAIMS ALL WARRANTIES AND CONDITIONS,EXPRESS, IMPLIED, OR STATUTORY, INCLUDING BUT NOT LIMITED TO WARRANTIES OFMERCHANTABILITY, NON-INFRINGEMENT, OR FITNESS FOR ANY PARTICULAR PURPOSE; and(2) Xilinx shall not be liable (whether in contract or tort, including negligence, or under any other theoryof liability) for any loss or damage of any kind or nature related to, arising under, or in connection with,the Materials (including your use of the Materials), including for any direct, indirect, special, incidental,or consequential loss or damage (including loss of data, profits, goodwill, or any type of loss or damagesuffered as a result of any action brought by a third party) even if such damage or loss was reasonablyforeseeable or Xilinx had been advised of the possibility of the same. Xilinx assumes no obligation tocorrect any errors contained in the Materials or to notify you of updates to the Materials or to productspecifications. You may not reproduce, modify, distribute, or publicly display the Materials without priorwritten consent. Certain products are subject to the terms and conditions of the Limited Warranties whichcan be viewed at http://www.xilinx.com/warranty.htm; IP cores may be subject to warranty and supportterms contained in a license issued to you by Xilinx. Xilinx products are not designed or intended to befail-safe or for use in any application requiring fail-safe performance; you assume sole risk and liabilityfor use of Xilinx products in Critical Applications: http://www.xilinx.com/warranty.htm#critapps.

Date Version Description of Revisions

12/17/12 1.0 Initial Xilinx release.

02/22/13 1.1 Updated Support for Wider Multiply Operations, Table 1, Figure 4, Figure 8, and Table 3.