advancing weather prediction at noaa - nvidiaimages.nvidia.com/events/sc15/pdfs/sc5130... ·...

17
Advancing Weather Prediction at NOAA 18 November 2015 Tom Henderson NOAA / ESRL / GSD

Upload: vuongkhue

Post on 26-Sep-2018

219 views

Category:

Documents


0 download

TRANSCRIPT

Advancing Weather Prediction at NOAA

18 November 2015

Tom Henderson

NOAA / ESRL / GSD

The U. S. Needs Better Global Numerical Weather Prediction

“A European forecast that closely predicted Hurricane Sandy's onslaught days ahead of U.S. and other models is raising complaints in the meteorological community.”

"The U.S. does not lead the world; we are not No. 1 in weather forecasting, I'm very sorry to say that," says AccuWeather's Mike Smith…”

October 28, 2012 Hurricane Sandy

Source: USA Today, October 30, 2012

Congressional Response: High Impact Weather Prediction Program (HIWPP)

Three Years Later… Hurricane Joaquin

• US forecast track improved, but still not quite as good as European’s

• NOAA’s Hurricane Weather Research & Forecast Model intensity forecasts were ‘second to none’

• US research models had 20” precipitation forecasts in South Carolina 36 hours in advance (verified)

• The National Hurricane Center correctly never issued any hurricane watches or warnings for the mainland

October 2, 2015

Data

HPC

Assimilation

NWP

Post-Processing

Forecaster

Stakeholders

HIWPP (Sandy Supplemental)

The Forecast Process

HPC critical for Assimilation and Numerical Weather Prediction (NWP)

NWP Resolution Matters: Simulation of a Tornado-Producing Super-Cell Storm

4-km

1-km

Simulations with GFDL’s variable-resolution FV3, non-hydrostatic (aka cloud-permitting) model. Courtesy of Lin and Harris (2015 manuscript)

More Intense Updrafts

Produces a Tornado

Resolution of current operational GFS: ~13km

Better Data Assimilation = Better Forecasts Hurricane Joaquin

European model

US model w/old data assimilation

US model w/new data assimilation

Actual track (through 03Z 07

October)

00Z October 1, 2015

50°N

45°N

40°N

35°N

30°N

25°N

80°W 70°W 60°W Source: Corey Guastini EMC’s Model Evaluation Group

Enabler: High Performance Computing New supercomputing funded by the Sandy Supplemental Disaster Relief Act:

8 NOAA/ESRL/GSD and NOAA/NWS/EMC

Forecast Skill Improvement

0.0

10.0

20.0

30.0

40.0

50.0

60.0

70.0

80.0

90.0

1955 1960 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015

36 Hour Forecast 72 Hour Forecast

NCEP Central Operations January 2015

NCEP Operational Forecast Skill 36 and 72 Hour Forecasts @ 500 MB over North America

[100 * (1-S1/70) Method]

15 Years

IBM P690

IBM 701

IBM 704

IBM 7090

IBM 7094

CDC 6600

IBM 360/195

CYBER 205

CRAY Y-MP

CRAY C90

IBM SP

IBM P655+

IBM Power 6

Can Rate of Improvement be Maintained?

• Exponentially increasing computer power has enabled linear forecast improvement

• How to overcome CPU clock speed stall?

– More cores -> must exploit fine-grained parallelism

– MIC, GPU?

• Two components of modern NWP models

– “Dynamics”: large scale flows

– “Physics”: sub grid-scale parameterizations

• Key requirement: single source code

NWP Dynamics on GPU and MIC

• Non-Hydrostatic Icosahedral Model (NIM)

– Single-source => directives

• GPU OpenACC, F2C-ACC

• CPU OpenMP

• MIC OpenMP

• MPI SMS

Fine-Grained Parallelism

• GPU

• “Blocks” in horizontal

• “Threads” in vertical

• CPU, MIC

• “Threads” in horizontal

• “Vectors” in vertical

NIM Single Source Code

!SMS$PARALLEL (ipn) BEGIN

!$OMP PARALLEL DO

do i=1,nip

!$ACC loop gang(16), vector(32)

do k=1,nz

x(k,i) = z(k,i) + y(k,i)

end do

!$ACC end loop

end do

!$OMP END PARALLEL DO

!SMS$ PARALLEL END

NIM Dynamics Performance

Performance of the NIM dynamics: – CPU: Westmere (2010), SandyBridge (2012), IvyBridge (2013), Haswell (2014) – GPU: Fermi (2011), Kepler K20x (2012), Kepler K40 (2013), Kepler K80 (2014) – MIC: Knights Corner (2013) – NOTE: CPU times are for two sockets, GPU and MIC for one device

0

10

20

30

40

50

60

2010/11 2012 2013 2014

NIM DYNAMICS 110 KM RESOLUTION 96 VERTICAL LEVELS Intel CPU

NVIDIA GPU

Intel MIC

run

tim

e (s

ec)

Use F2C-ACC To Improve OpenACC

• OpenACC performance is now competitive with “domain-specific” F2C-ACC compiler

– In 2014 PGI and Cray OpenACC compilers were ~2x slower than F2C-ACC for the NIM routines shown below

NIMRoutine F2C-ACCv5.8(sec) PGIv15.7(sec) PGISpeedupvdmints 4.58 5.40 0.84vdmintv 1.63 2.29 0.71diag 1.24 0.65 1.90flux 0.98 1.03 0.95force 0.61 0.52 1.17trisol 0.36 0.28 1.28TimeDiff 0.18 0.15 1.20Total 11.77 12.34 0.95

Performance of WSM6 “Physics” Package

• Most performance improvement due to compile-time constants and alignment

• GPU result “preliminary”, expect ~2x further speedup on K80x – … and even more speedup on KNL

Device Threads Chunk Width

(DP words)

Baseline

Time

Optimized

Time

Speedup

KNC 240 8 13.2 8.7 1.52

SNB 32 4 9.4 7.5 1.25

IVB-EP 48 4 4.7 3.4 1.38

HSW-EP 56 4 -- 2.6 --

K20X GPU -- -- -- 5.3 --

NGGPS

• Implement a cloud-permitting coupled NWP

system

• Through accelerated development and

implementation of • Current global weather prediction models and physics

• Improved data assimilation techniques, and

• Improved software architecture and system engineering

• Extend forecast skill beyond two weeks

• Improve high-impact weather forecasts including

hurricane track and intensity

• Build upon HIWPP successes

• Apply NIM optimization techniques

• Continue strong vendor interactions

Next Generation Global Prediction System (NGGPS)

Summary

• Must make U.S. global NWP “second to none”

• More HPC needed for NWP and assimilation – BUT, clock frequencies have stalled

• “Accelerators” to the rescue? – Dynamics: some benefit

– Physics: TBD

• Expect “high-bandwidth memory” to give an edge to future MIC & GPU – KNL, “Pascal”

• Optimizations benefit MIC, GPU, and CPU

• Procurement of exploratory “fine-grained” system

Thanks to…

• GSD: Tim Schneider, Mark Govett, Jim Rosinski, Jacques Middlecoff, Paul Madden, Julie Schramm

• NWS: John Michalakes

• Strong vendor interactions – Cray

– Intel

– NVIDIA

– PGI

• HIWPP, NGGPS, NOAA HPCC