test con mpich-g2 roberto alfieri - fabio spataro wp1 meeting january 11, 2001 gruppo collegato di...
TRANSCRIPT
Test con MPIch-G2
Roberto Alfieri - Fabio Spataro
WP1 meeting
January 11, 2001
Gruppo Collegato di Parma
Parma
Globus.pr.infn.it
GIIS server
Dual PII350, 256MB RedHat6.2, INFNgrid1.2
Janus.pr.infn.it
MPIchG2-cluster
Dual PII350, 256MB RedHat6.2, INFNgrid1.1.2, MPIchG2
Janus1.pr.infn.it
MPIchG2-cluster
Dual PII350, 256MB RedHat6.2, INFNgrid1.1.2, MPIchG2
Janus2.pr.infn.it
MPIchG2-cluster
Dual PII350, 256MB RedHat6.2, INFNgrid1.1.2, MPIchG2
Lxde02.pd.infn.it
MPIchG2-cluster
Dual PII450, 256MB RedHat6.2, INFNgrid1.1.2, MPIchG2
The Installation:
MPICH: our rpm packages
(ftp://ftp.pr.infn.it./pub/linux/rpm/contrib/)
mpich-1.2.1-2.i386.rpm (-with-device=ch_p4)
mpich-smp-1.2.1-2.i386.rpm (-with-device=ch_p4 -comm=shared)
mpich-G2 -1.2.1-2.i386.rpm (-with-device=globus2)
mpich-G2-smp -1.2.1-2.i386.rpm (-with-device=globus2 -comm=shared)
Parma
TESTS PLAN
Test Tool:
Perftest (http://www-unix.mcs.anl.gov/mpi/mpptest/)
• mpptest (point-to-point communication performance)
• goptest (collective communication performance)
Rete_Mpi (Neural Network Training - our MPI benchmark )
LAN Tests:
• mpptest on SMP using mpich mpich-smp mpichg2 mpichg2-smp
• mpptest on TCP using mpich and mpichg2
• goptest janus(2)-janus1(2)-janus2(2) using mpich and mpichg2
WAN Tests (Parma - Padova):
• Processes distribution: mpptest , RETE_MPI
• Remote submitting Parma
Parma
LAN point to point comm.: roundtrip throughput
Parma
LAN point to point comm.: head-to-head throughput
Parma
LAN point to point comm.: roundtrip latency
Parma
LAN point to point comm.: head-to-head latency
Parma
LAN collective test : reduction
Parma
LAN collective test : broadcast
Parma
+( &(resourceManagerContact="janus.pr.infn.it") (count= 1) (label="subjob 0") (environment=(GLOBUS_DUROC_SUBJOB_INDEX 0)) (arguments= "-reps" "10" "-size" "0" "50" "2" ) (directory="/home/alfieri") (executable="/home/alfieri/mpptest"))( &(resourceManagerContact="lxde02.pd.infn.it") (count= 1) (label="subjob 1") (environment=(GLOBUS_DUROC_SUBJOB_INDEX 1)) (arguments= "-reps" "10" "-size" "0" "50" "2") (directory="/home/alfieri") (executable="/home/alfieri/mpptest"))
mpirun -globusrsl mympptest.rsl
mympptest.rsl
WAN point to point communication
Parma
WAN point to point communication : latency
Parma
WAN point to point communication : throughput
RETE_MPI BENCH
Neural Network training using 1 node: 356 sec
Neural Network training using 2 local nodes:
-MPICH-G2 -> 165 sec
-MPICH -> 165 sec
Neural Network training using 2 remote nodes:
- MPICH-G2 (Parma-Padova) -> 429 sec
Parma
Remote submitting test
• globus-job-submit janus/jobmanager-pbs ~alfieri/mympi.sh => OK
• globus-job-submit janus/jobmanager-pbs ~alfieri/mympiG2.sh => FAILED
GSS authentication failure
GSS status: major:000a0000 minor: 00000000 token: 00000000
GSS_S_DEFECTIVE_CREDENTIAL - sslv3 handshake
Function:gss_accept_sec_context Reason:Peer is using (limited) proxy
Failure: GSS failed Major:000a0000 Minor:00000000 Token:00000000
GSS_S_DEFECTIVE_CREDENTIAL
Consistency checks
performed on the
credential failed.
Parma