TAU Performance System http://tau.uoregon.edu tau-team@cs.uoregon.edu

Post on 15-Jan-2016

TRANSCRIPT

Page 1: TAU Performance System  tau-team@cs.uoregon.edu

TAU Performance System
http://tau.uoregon.edu

tau-team@cs.uoregon.edu

Page 2: TAU Performance System  tau-team@cs.uoregon.edu

TAU Performance System, IBM Blue Gene Consortium

TAU Parallel Performance System

http://tau.uoregon.edu/

• Multi-level performance instrumentation
  • Multi-language automatic source instrumentation
• Flexible and configurable performance measurement
• Widely-ported parallel performance profiling system
  • Computer system architectures and operating systems
  • Different programming languages and compilers
• Support for multiple parallel programming paradigms
  • Multi-threading, message passing, mixed-mode, hybrid

Page 3: TAU Performance System  tau-team@cs.uoregon.edu

TAU Port to IBM BG/P

• Supports automatic instrumentation at:
  • Source level (PDT, tau_instrumentor; KOJAK, opari)
  • MPI
• Flexible and configurable performance measurement
• Support for profiling and tracing
• Support for PAPI counters on BG/P
• Uses bgxlC_r, bgxlc_r, bgxlf90_r as compilers
• To configure TAU:

% ./installtau -arch=bgp -mpi -pdt=<dir> -pdt_c++=xlC -papi=<dir>
% ./tau_validate --html --build bgp >& results.html

• Parallel Profile Analysis:
  • ParaProf profile browser
  • PerfDMF profile database
  • PerfExplorer cross-experiment data analysis toolkit

Page 4: TAU Performance System  tau-team@cs.uoregon.edu

Using TAU on IBM BGP (surveyor.alcf.anl.gov)

• Choose measurement configuration

% ls /soft/apps/tau/tau_latest/bgp/lib/Makefile.*
Makefile.tau-mpi-pdt
Makefile.tau-mpi-pdt-trace
Makefile.tau-callpath-mpi-pdt
Makefile.tau-callpath-mpi-compensate-pdt
Makefile.tau-depthlimit-mpi-pdt
Makefile.tau-mpi-compensate-pdt
Makefile.tau-multiplecounters-mpi-papi-pdt
Makefile.tau-multiplecounters-mpi-papi-pdt-trace
Makefile.tau-multiplecounters-papi-pdt
Makefile.tau-multiplecounters-pthread-papi-pdt
Makefile.tau-pdt
Makefile.tau-phase-multiplecounters-mpi-compensate-papi-pdt
Makefile.tau-phase-multiplecounters-mpi-papi-pdt
Makefile.tau-pthread-pdt
…

% setenv TAU_MAKEFILE /soft/apps/tau/tau-2.17/bgp/lib/Makefile.tau-mpi-pdt
% set path=(/soft/apps/tau/tau-2.17/ppc64/bin $path) # Front-end binaries

• Replace mpixlf90_r with tau_f90.sh and compile your application
• Use tau_cxx.sh and tau_cc.sh for C++ and C compilers respectively
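
The makefile names in the listing encode the measurement options as hyphen-separated tokens (mpi, pdt, trace, papi, and so on), so choosing a configuration amounts to matching a set of options against those names. The sketch below shows one way to automate that choice. It is written in POSIX sh rather than the csh used on the slide, and pick_tau_makefile is a hypothetical helper introduced here for illustration; it is not part of TAU.

```sh
#!/bin/sh
# Sketch only: pick_tau_makefile is a hypothetical helper, not a TAU tool.
# Usage: pick_tau_makefile LIBDIR OPTION...
# Prints the Makefile.tau-* in LIBDIR whose name encodes exactly the
# requested options (in any order); returns 1 if none matches.
pick_tau_makefile() {
    libdir=$1
    shift
    # Normalize the requested options into a sorted, space-joined key.
    want=$(printf '%s\n' "$@" | sort | tr '\n' ' ')
    for f in "$libdir"/Makefile.tau-*; do
        [ -e "$f" ] || continue
        # Drop the directory and the "Makefile.tau-" prefix, then split
        # the remaining name on hyphens and normalize it the same way.
        opts=${f##*/Makefile.tau-}
        have=$(printf '%s\n' "$opts" | tr '-' '\n' | sort | tr '\n' ' ')
        if [ "$have" = "$want" ]; then
            printf '%s\n' "$f"
            return 0
        fi
    done
    return 1
}
```

In sh, the counterpart of the setenv line would then be `TAU_MAKEFILE=$(pick_tau_makefile /soft/apps/tau/tau-2.17/bgp/lib mpi pdt); export TAU_MAKEFILE`, after which the build calls tau_f90.sh wherever it previously called mpixlf90_r.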

Page 5: TAU Performance System  tau-team@cs.uoregon.edu

Using TAU on IBM BGP (surveyor.alcf.anl.gov)

Visualize performance data with paraprof, pprof, vampir, jumpshot

Page 6: TAU Performance System  tau-team@cs.uoregon.edu

TAU’s ParaProf 3D Profile Browser: Matmult

Page 7: TAU Performance System  tau-team@cs.uoregon.edu

Profiling FLASH3 on IBM BG/P

Page 8: TAU Performance System  tau-team@cs.uoregon.edu

Sedov 2D Auto

• Initial test run did not include a load-balanced problem
• Small problem: too little work for 1024 processors
• Proof of concept to validate porting of tools

Page 9: TAU Performance System  tau-team@cs.uoregon.edu

PerfExplorer: Cross Experiment Analysis

Page 10: TAU Performance System  tau-team@cs.uoregon.edu

TAU PerfExplorer: Runtime Breakdown

MPI_Barrier

IO_OUTPUT

Page 11: TAU Performance System  tau-team@cs.uoregon.edu

Relative Efficiency

Page 12: TAU Performance System  tau-team@cs.uoregon.edu

Relative Speedup for One Event
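
The charts for relative efficiency and relative speedup do not survive in this transcript. As a rough guide to reading such charts, the sketch below uses the conventional scaling definitions, with the smallest run in a series as the baseline: speedup is baseline time divided by run time, and efficiency is that speedup scaled by the ratio of baseline to run processor counts. Whether PerfExplorer computes exactly these quantities is an assumption here. The helper name rel_scaling and the sample numbers are hypothetical, not measurements from the FLASH3 runs. For a single event, the same formula would be applied to that event's time instead of the whole run's.

```sh
#!/bin/sh
# Sketch only: rel_scaling is a hypothetical helper, not a PerfExplorer API.
# Usage: rel_scaling BASE_PROCS BASE_TIME PROCS TIME
# Prints "<relative speedup> <relative efficiency>" for one run compared
# with the baseline (smallest) run of the same experiment series.
rel_scaling() {
    awk -v p0="$1" -v t0="$2" -v p="$3" -v t="$4" 'BEGIN {
        speedup = t0 / t                # how much faster than the baseline
        efficiency = speedup * p0 / p   # 1.0 means ideal scaling
        printf "%.3f %.3f\n", speedup, efficiency
    }'
}

# Hypothetical numbers: 100 s on 64 processors, 40 s on 256 processors.
rel_scaling 64 100 256 40   # prints "2.500 0.625"
```

An efficiency well below 1.0, as in the made-up example, would indicate that the extra processors are not being used fully, for instance because the problem is too small for the processor count.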

Page 13: TAU Performance System  tau-team@cs.uoregon.edu

TAU’s PerfExplorer: IBM BG/P

Page 14: TAU Performance System  tau-team@cs.uoregon.edu

TAU Portal

• TAU portal supports FLASH regression testing
• Allows groups to share profiling data in a secure way
• Allows users to launch TAU performance tools (paraprof, perfexplorer)
• Nightly regression test cases uploaded to the database automatically
• SVN checkout each night
• TAU: http://www.cs.uoregon.edu/research/tau
• TAU Portal: http://tau.nic.uoregon.edu

Page 15: TAU Performance System  tau-team@cs.uoregon.edu

Portal: Nightly Performance Regression Testing

Page 16: TAU Performance System  tau-team@cs.uoregon.edu

TAU Portal: Launch ParaProf/PerfExplorer

Page 17: TAU Performance System  tau-team@cs.uoregon.edu

PerfExplorer: Regression Testing

Page 18: TAU Performance System  tau-team@cs.uoregon.edu

PerfExplorer: Limiting Events (> 3%), Oct 2007

Page 19: TAU Performance System  tau-team@cs.uoregon.edu

PerfExplorer: Exclusive Time for Events (2007)

Page 20: TAU Performance System  tau-team@cs.uoregon.edu

ParaProf: 3D Visualization

Page 21: TAU Performance System  tau-team@cs.uoregon.edu

Support Acknowledgements

• Department of Energy (DOE)
• Office of Science
• LLNL, LANL, ASC
• Argonne National Laboratory
• University of Chicago
• Department of Defense
• NSF