fujitsu supercomputer primehpc fx100€¦ · fujitsu supercomputer primehpc fx100 0. ... 2u rack...

13
Copyright 2014 FUJITSU LIMITED FUJITSU Supercomputer PRIMEHPC FX100 0

Upload: trinhthien

Post on 18-Jul-2018

225 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

Copyright 2014 FUJITSU LIMITED

FUJITSU Supercomputer PRIMEHPC FX100

0

Page 2: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

Copyright 2014 FUJITSU LIMITED

The K computer and the evolution of PRIMEHPC

Fujitsu has been developing supercomputers over 30 years, and will continue its development to deliver the best application performance.

SPARC64 VIIIfx: 8 cores / 128 GF11 PF, 2010~

K computer PRIMEHPC FX10SPARC64 IXfx: 16 cores / 236.5 GF23 PF, 2012~

PRIMEHPC FX100SPARC64 XIfx: 32 cores / over 1 TFOver 100 PF, 2015~

C RIKEN

1

Page 3: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

PRIMEHPC FX100 design concept

Copyright 2014 FUJITSU LIMITED

Designed for massively parallel supercomputer system

Enhance and inherit K computer features・Many-core CPU-based architecture for application productivity

Introduce new technologies to Exascale computing

・High performance for a wide range of real applications

・HPC-ACE2 : Wide SIMD enhancements・Assistant cores : Dedicated cores for non-calculation operation

2

・Enhanced VISIMPACT (hardware barrier synchronization, sector cache, etc.)

・HMC : Leading-edge memory technology

Page 4: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

SPARC64TM XIfx

Copyright 2014 FUJITSU LIMITED

Over 1 TF high performance processor・32 compute cores

HPC-ACE2: ISA enhancements・Two 256-bit wide SIMD units per core・64 bit x 4 / 32 bit x 8 FMA

Tofu2 interface

Tofu2 controller

L2 cache

L2 cache

HM

C interfa

ce

HM

C in

terf

ace

PCI interface

core core

core core

core core

core core

core core

core core

core core

core core

core

core

core

core

core

core

core

core

core

core

core

core

core

core

core

core

・2 assistant cores:

Assistant core

Assistant core

Daemon, IO, non-blocking MPI functions, etc.

・Addressing mode (stride load/store, indirect load/store)・Cross lane operation (compress, permutation)

Offloading non-calculation operations

Page 5: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

Peak performance per node K computer PRIMEHPC FX10 PRIMEHPC FX100

DP perf. (GFLOPS) 128 236.5 Over 1000

Memory BW (GB/s) 64 85 480

Hybrid Memory Cube (HMC)

Copyright 2014 FUJITSU LIMITED

Excellent byte/flop balance・HMC: high performance per watt in small footprint

4

Page 6: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

Tofu interconnect 2

Copyright 2014 FUJITSU LIMITED

Enhanced Tofu interconnect

CPU-integrated interconnect controller・Reduced communication latency

Optical cable connection between chassis

・Improved packaging density and energy efficiency

・Highly scalable, 6-dimensional mesh/torus topology・Increased link bandwidth by 2.5 times to 12.5GB/s

5

・Enable flexible installation

Page 7: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

Enhanced VISIMPACT

Copyright 2014 FUJITSU LIMITED

Advantages of hybrid parallelization

Technology for hybrid parallelization・ Automatic parallelization technology by Fujitsu’s compiler

Enhanced hardware barriers

・To reduce communication cost in highly parallel programs・To increase user memory space by reducing communication buffer

6

・8 set between 32 cores

・ Hardware barrier for fast synchronization

Flat MPI

MPIcom

MPIcom

Hybrid Parallel

Page 8: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

CPU memory board

Copyright 2014 FUJITSU LIMITED

Three identical computation nodes

SPARC64 XIfx

Optical module

HMC

7

Page 9: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

2U rack mountable chassis・High density: ・Water cooled:

Main unit

Copyright 2014 FUJITSU LIMITED

Cooling unit

CPU memory board x 4Optical connectors

Coolant water inlet/outlet

8

Four CPU memory boards per unit (total 12 nodes, over 12 TF)

High reliability

Page 10: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

System rack

Copyright 2014 FUJITSU LIMITED9

Optical cables

Coolant pipe

Coolant hose

Page 11: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

System rack : Front View

Copyright 2014 FUJITSU LIMITED10

Page 12: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack

System rack : Rear View

Copyright 2014 FUJITSU LIMITED11

Page 13: FUJITSU Supercomputer PRIMEHPC FX100€¦ · FUJITSU Supercomputer PRIMEHPC FX100 0. ... 2U rack mountable chassis ... (total 12 nodes, over 12 TF) High reliability. System rack