Tests with MPICH-G2

Roberto Alfieri - Fabio Spataro

WP1 meeting

January 11, 2001

Gruppo Collegato di Parma

The Installation:

globus.pr.infn.it (GIIS server)
Dual PII350, 256MB, RedHat6.2, INFNgrid1.2

janus.pr.infn.it (MPIchG2-cluster)
Dual PII350, 256MB, RedHat6.2, INFNgrid1.1.2, MPIchG2

janus1.pr.infn.it (MPIchG2-cluster)
Dual PII350, 256MB, RedHat6.2, INFNgrid1.1.2, MPIchG2

janus2.pr.infn.it (MPIchG2-cluster)
Dual PII350, 256MB, RedHat6.2, INFNgrid1.1.2, MPIchG2

lxde02.pd.infn.it (MPIchG2-cluster)
Dual PII450, 256MB, RedHat6.2, INFNgrid1.1.2, MPIchG2

MPICH: our rpm packages

(ftp://ftp.pr.infn.it./pub/linux/rpm/contrib/)

mpich-1.2.1-2.i386.rpm (-with-device=ch_p4)

mpich-smp-1.2.1-2.i386.rpm (-with-device=ch_p4 -comm=shared)

mpich-G2-1.2.1-2.i386.rpm (-with-device=globus2)

mpich-G2-smp-1.2.1-2.i386.rpm (-with-device=globus2 -comm=shared)

TESTS PLAN

Test Tool:

Perftest (http://www-unix.mcs.anl.gov/mpi/mpptest/)

• mpptest (point-to-point communication performance)

• goptest (collective communication performance)

RETE_MPI (Neural Network Training - our MPI benchmark)

LAN Tests:

• mpptest on SMP using mpich, mpich-smp, mpichg2, and mpichg2-smp

• mpptest on TCP using mpich and mpichg2

• goptest on janus(2)-janus1(2)-janus2(2) using mpich and mpichg2

WAN Tests (Parma - Padova):

• Process distribution: mpptest, RETE_MPI

• Remote job submission
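
The point-to-point tests in the plan report latency and throughput from roundtrip timings. As a minimal sketch of how such figures are commonly derived in a ping-pong test, the snippet below assumes that the one-way time is half the roundtrip time; the slides do not confirm that this is the convention used here. The function names and the sample numbers are hypothetical and are not measurements from these tests.

```python
def one_way_time(roundtrip_s):
    """One-way message time, assuming a symmetric ping-pong exchange."""
    return roundtrip_s / 2.0


def throughput_mb_s(message_bytes, roundtrip_s):
    """Throughput in MB/s (1 MB = 10**6 bytes) for one message size."""
    return message_bytes / one_way_time(roundtrip_s) / 1e6


# Hypothetical sample: a 1,000,000-byte message with a 0.2 s roundtrip.
sample_bytes = 1_000_000
sample_roundtrip = 0.2
print(one_way_time(sample_roundtrip))
print(throughput_mb_s(sample_bytes, sample_roundtrip))
```

With a zero-length message the one-way time is a pure latency figure; with large messages the throughput term dominates.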

LAN point to point comm.: roundtrip throughput

LAN point to point comm.: head-to-head throughput

LAN point to point comm.: roundtrip latency

LAN point to point comm.: head-to-head latency

LAN collective test: reduction

LAN collective test: broadcast

WAN point to point communication

mympptest.rsl:

+
( &(resourceManagerContact="janus.pr.infn.it")
   (count= 1)
   (label="subjob 0")
   (environment=(GLOBUS_DUROC_SUBJOB_INDEX 0))
   (arguments= "-reps" "10" "-size" "0" "50" "2")
   (directory="/home/alfieri")
   (executable="/home/alfieri/mpptest")
)
( &(resourceManagerContact="lxde02.pd.infn.it")
   (count= 1)
   (label="subjob 1")
   (environment=(GLOBUS_DUROC_SUBJOB_INDEX 1))
   (arguments= "-reps" "10" "-size" "0" "50" "2")
   (directory="/home/alfieri")
   (executable="/home/alfieri/mpptest")
)

Submitted with:

mpirun -globusrsl mympptest.rsl
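
The RSL file for this test repeats the same subjob clause once per host, changing only the contact string and the subjob index. The following is a minimal sketch of generating such a file from a host list. The helper names `subjob` and `multi_request` are hypothetical, and the layout simply mirrors the attributes that appear in mympptest.rsl; it is not a general RSL writer.

```python
def subjob(host, index, executable, directory, arguments):
    """Build one subjob clause with the attributes used in mympptest.rsl."""
    args = " ".join(f'"{a}"' for a in arguments)
    return (
        f'( &(resourceManagerContact="{host}")\n'
        f'   (count=1)\n'
        f'   (label="subjob {index}")\n'
        f'   (environment=(GLOBUS_DUROC_SUBJOB_INDEX {index}))\n'
        f'   (arguments={args})\n'
        f'   (directory="{directory}")\n'
        f'   (executable="{executable}")\n'
        f')'
    )


def multi_request(hosts, executable, directory, arguments):
    """Join one subjob per host into a '+' multi-request."""
    clauses = [
        subjob(host, i, executable, directory, arguments)
        for i, host in enumerate(hosts)
    ]
    return "+\n" + "\n".join(clauses) + "\n"


# Values taken from the slide: two hosts, same mpptest arguments on both.
rsl = multi_request(
    ["janus.pr.infn.it", "lxde02.pd.infn.it"],
    executable="/home/alfieri/mpptest",
    directory="/home/alfieri",
    arguments=["-reps", "10", "-size", "0", "50", "2"],
)
print(rsl)
```

Writing the printed text to a file gives an input of the same shape as the one passed to `mpirun -globusrsl`.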

WAN point to point communication: latency

WAN point to point communication: throughput

RETE_MPI BENCH

Neural Network training using 1 node: 356 sec

Neural Network training using 2 local nodes:

- MPICH-G2 -> 165 sec

- MPICH -> 165 sec

Neural Network training using 2 remote nodes:

- MPICH-G2 (Parma-Padova) -> 429 sec
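
The timings above can be turned into speedup and parallel efficiency figures. The sketch below uses the standard definitions (speedup = one-node time / parallel time, efficiency = speedup / number of nodes); only the three timings come from the slide, and the function names are hypothetical.

```python
def speedup(t_one_node, t_parallel):
    """Ratio of the one-node time to the parallel time."""
    return t_one_node / t_parallel


def efficiency(t_one_node, t_parallel, nodes):
    """Speedup divided by the number of nodes used."""
    return speedup(t_one_node, t_parallel) / nodes


T_ONE_NODE = 356.0  # sec, from the slide

runs = {
    "2 local nodes (MPICH or MPICH-G2)": 165.0,        # sec, from the slide
    "2 remote nodes (MPICH-G2, Parma-Padova)": 429.0,  # sec, from the slide
}

for label, t in runs.items():
    s = speedup(T_ONE_NODE, t)
    e = efficiency(T_ONE_NODE, t, nodes=2)
    print(f"{label}: speedup {s:.2f}, efficiency {e:.2f}")
```

On these numbers the two local nodes give a speedup of about 2.16, while the Parma-Padova run gives about 0.83, that is, slower than the single-node run.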

Remote submission test

• globus-job-submit janus/jobmanager-pbs ~alfieri/mympi.sh => OK

• globus-job-submit janus/jobmanager-pbs ~alfieri/mympiG2.sh => FAILED

GSS authentication failure

GSS status: major:000a0000 minor: 00000000 token: 00000000

GSS_S_DEFECTIVE_CREDENTIAL - sslv3 handshake

Function:gss_accept_sec_context Reason:Peer is using (limited) proxy

Failure: GSS failed Major:000a0000 Minor:00000000 Token:00000000

GSS_S_DEFECTIVE_CREDENTIAL: Consistency checks performed on the credential failed.
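
The major status word in the log can be checked by hand. In the GSS-API C bindings (RFC 2744) the calling error occupies bits 24-31, the routine error bits 16-23, and the supplementary information bits 0-15. The sketch below decodes the value from the log under that layout; the function name `decode_major` is hypothetical, and the table lists the RFC 2744 routine error names.

```python
# Routine error codes as defined in RFC 2744 (GSS-API C bindings).
ROUTINE_ERRORS = {
    1: "GSS_S_BAD_MECH",
    2: "GSS_S_BAD_NAME",
    3: "GSS_S_BAD_NAMETYPE",
    4: "GSS_S_BAD_BINDINGS",
    5: "GSS_S_BAD_STATUS",
    6: "GSS_S_BAD_SIG",
    7: "GSS_S_NO_CRED",
    8: "GSS_S_NO_CONTEXT",
    9: "GSS_S_DEFECTIVE_TOKEN",
    10: "GSS_S_DEFECTIVE_CREDENTIAL",
    11: "GSS_S_CREDENTIALS_EXPIRED",
    12: "GSS_S_CONTEXT_EXPIRED",
    13: "GSS_S_FAILURE",
    14: "GSS_S_BAD_QOP",
    15: "GSS_S_UNAUTHORIZED",
    16: "GSS_S_UNAVAILABLE",
    17: "GSS_S_DUPLICATE_ELEMENT",
    18: "GSS_S_NAME_NOT_MN",
}


def decode_major(status_hex):
    """Split a hex major status into (calling, routine name, supplementary)."""
    status = int(status_hex, 16)
    calling = (status >> 24) & 0xFF
    routine = (status >> 16) & 0xFF
    supplementary = status & 0xFFFF
    if routine == 0:
        name = "no routine error"
    else:
        name = ROUTINE_ERRORS.get(routine, f"unknown routine error {routine}")
    return calling, name, supplementary


# Value copied from the log: "major:000a0000".
print(decode_major("000a0000"))
```

The routine error field of 000a0000 is 10, which matches the GSS_S_DEFECTIVE_CREDENTIAL name printed in the log.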
