Glenn Moloney The Australian National Grid Program EGEE'06, Geneva, 2006 1

The Australian National Grid Program

“providing advanced computing, information and grid infrastructure for eResearch”

Glenn Moloney
University of Melbourne

[email protected]

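APAC National Grid

[Figure: map of Australia showing the APAC partner sites and the network links between them]

Sites:
• Darwin
• Brisbane: QPSF
• Canberra: ANU, APAC National Facility
• Melbourne: VPAC, CSIRO
• Sydney: ac3
• Perth: IVEC, CSIRO
• Adelaide: SAPAC
• Hobart: TPAC, CSIRO

Links:
• GrangeNet Backbone
• Centie/GrangeNet Link
• AARNet Links
• International: Internet2, Canarie, Geant, APAN

Network features:
• 10 Gbps
• IPv6
• Multicast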

Australian Partnership for Advanced Computing

The APAC Partners:
• AC3: Australian Centre for Advanced Computing and Communications in NSW
• CSIRO: Commonwealth Scientific and Industrial Research Organisation
• QPSF: Queensland Parallel Supercomputing Foundation
• IVEC: Interactive Virtual Environments Centre in WA
• SAPAC: South Australian Partnership for Advanced Computing
• ANUSF: The Australian National University
• TPAC: The University of Tasmania
• VPAC: Victorian Partnership for Advanced Computing

“providing advanced computing, information and grid infrastructure for eResearch”

National Role of APAC

Advanced Computing Infrastructure
– Peak computing facilities

Information Infrastructure
– Support for community-based data collections
– Management of large-scale data collections (archiving)

Grid Infrastructure
– Access to national computing and information infrastructure
– Advanced collaborative services for research groups
  • collaborative visualisation, computational steering, tele-presence, virtual organisation support
– Support Australian participation in international research programs
  • e.g. astronomy, high-energy physics, earth systems, geosciences

APAC 2: The APAC Grid Program

Australian government provided AU$29m for stage 2 of APAC: providing the advanced computing and grid infrastructure for eResearch

· AU$12.5m for upgrade of National Facility, Canberra
  · Commissioned mid 2005
· National grid infrastructure projects:
  · Computing infrastructure
  · Information infrastructure
  · User Interface and Visualisation
· Application support projects:
  · Astronomy (Virtual Observatory)
  · Computational chemistry
  · Theoretical and experimental high energy physics
    · International Lattice Data Grid, ATLAS, Belle
  · Geosciences
  · Bioinformatics

APAC National Facility

Usage
• mainly biology, chemistry, physics
• currently 247 projects and 722 users (27 universities)

Computing Systems
• SGI Altix 3700 Bx2 system: 1680 processors
• Dell Linux cluster: 150 processors

Mass Data Storage System (MDSS)
• StorageTek (robotic silo) HSM tape library
– Petabyte capable storage

Visualisation Systems
• Virtual reality systems, Access Grid rooms

Staff
• User support, Systems support
• Computational tools and techniques
• Large-scale data collection management

http://nf.apac.edu.au

Global Connectivity

[Figure: global connectivity diagram, labelled "10 Gbps ring"]

APAC Grid Deployment

[Timeline: 2005, 2006]

APAC National Grid.v1: single sign-on, data sharing
• Base: VDT (GT2.4.3, MonALISA, Ganglia), GridSphere, SRB, OPeNDAP, Nimrod, LCG
• VO model: follow Grid3
• Use APAC CA
• Manually configured solutions

APAC National Grid.v2: add portals and workflow support
• Base: VDT -> GT4, GridSphere, SRB, OPeNDAP, Nimrod, LCG
• VO model: not yet determined
• Use national CAs
• Auto configuration

APAC National Grid.v3
• Interoperability: align with OSG, EGEE
• Use AARNet3 backbone

APAC Grid Gatekeeper Machines

Each partner site has a 'gateway' machine which 'hosts' Grid front-ends to the available resources.

Virtualisation: Xen Virtual Machine Monitor (University of Cambridge Computer Laboratory)

Hardware: Dual Xeon 2.8 GHz, 4 GB RAM, 300 GB mirrored SCSI disk, 5 GigE network cards (1 mgmt, 2 data VM, 2 other VMs)

Grid front-ends:
• Globus 2 (VDT-1.2.4), Globus 4 (VDT-1.4 ??), gLite 3
• Storage Resource Broker 3.3.1, Nimrod/G

[Diagram: gateway software stack, bottom to top]
• Physical hardware: CPU, disk, network
• Xen hypervisor
• Linux (2.6) dom0 alongside three guest VMs (domU)
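The gateway layout described on this slide can be sketched as a small data model. This is only an illustration: the class names, the VM names and the one-front-end-per-guest mapping are assumptions made for the example, and the slide does not say how front-ends are actually distributed across the domU guests.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class GuestVM:
    """One Xen guest (domU). The name is hypothetical."""
    name: str
    front_end: str


@dataclass
class Gateway:
    """A partner site's gateway machine running Xen."""
    site: str
    nic_allocation: Dict[str, int]  # purpose -> number of GigE cards
    guests: List[GuestVM] = field(default_factory=list)

    def total_nics(self) -> int:
        # Sum the cards over all purposes.
        return sum(self.nic_allocation.values())

    def front_ends(self) -> List[str]:
        # Grid front-ends hosted on this gateway, in guest order.
        return [g.front_end for g in self.guests]


def example_gateway(site: str) -> Gateway:
    """Build a gateway using the figures quoted on the slide.

    Assumption: each front-end gets its own guest VM.
    """
    return Gateway(
        site=site,
        nic_allocation={"management": 1, "data VMs": 2, "other VMs": 2},
        guests=[
            GuestVM("vm-globus2", "Globus 2 (VDT-1.2.4)"),
            GuestVM("vm-globus4", "Globus 4"),
            GuestVM("vm-glite", "gLite 3"),
            GuestVM("vm-srb", "Storage Resource Broker 3.3.1"),
            GuestVM("vm-nimrod", "Nimrod/G"),
        ],
    )


if __name__ == "__main__":
    gw = example_gateway("VPAC")
    print(gw.site, gw.total_nics(), gw.front_ends())
```

Keeping each front-end in a separate guest is one plausible reason for using a hypervisor here, since middleware stacks with conflicting dependencies can then share a single physical machine.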

APAC National Grid Data Management Infrastructure

[Figure: data management services linking the partner sites]

Sites: QPSF, QPSF (JCU), ANU, VPAC, ac3, TPAC, CSIRO, IVEC, SAPAC, APAC National Facility

• Data Transfer: RFT, GridFTP, Global File System
• Data Management: Globus, SRB, SRM, gLite
• Data Access: OGSA-DAI, Web services, OPeNDAP
• Mass Data Storage Systems: tape-based (silos), disc-based

Delivering National Grid Services

[Figure: research teams connected through the grid to external resources]

Connected resources:
• Other Grids: institutional, national, international
• Data Centres
• Instruments
• Sensor Networks

Services delivered to Research Teams:
• grid-based portals
• distributed computation
• federated data access
• remote control
• collaboratories

Astronomy and Astrophysics

• MACHO Project Data
– Largest online astro data set in Australia (~10 TB)
– Hosted by APAC as part of IVO collection
– Mapping metadata to VOTable 1.0 standard

• Australian Virtual Observatory
– Provide uniform access to key data collections
  • 2dFGRS, HIPASS, ATCA-OA, SUMSS, MACHO, TNO…
– Grids for theoretical astrophysics simulations
  • Portals for job configuration, submission and monitoring
  • MLAPM, GCD+, Zeus-MP, LensView, (x)oopic, Swift

• International Virtual Observatory
– SIAP service for ATCA Phoenix Deep Field Survey
  • SIAP is an International Virtual Observatory protocol

Bioinformatics

Accelerate progress on genome annotation, for genomes of national economic significance
Support lead discovery through molecular docking

• Data update and synchronisation services, including the BioMirror
• Grid-wide compute services for Ensembl, Blast, RepeatMasker and Glimmer
• Grid-wide compute services for molecular docking, including support for analysis workflows

Computational Chemistry

[Figure: partner sites on the APAC Grid: VPAC, QPSF, TPAC, IVEC, APAC National Facility, ANU, CSIRO, SAPAC, AC3]

Unified Grid-based portal to chemistry software
• Portal to computational chemistry software on APAC Grid
• Uniform access to software on a computer system
• Gaussian, Amber, GAMESS-US, Gromacs, Mopac and Molpro

Earth Systems Science

Access to Data Products
• Intergovernmental Panel on Climate Change scenarios of future climate (3 TB)
• Ocean Colour Products of Australasian and Antarctic region (10 TB)
• 1/8 degree ocean simulations (4 TB)
• Weather research products (4 TB)
• Earth Systems Simulations
• Terrestrial Land Surface Data

Grid Services
– Globus based version of OPeNDAP (UCAR/NCAR/URI)
– Server side analysis tools for data sets: GrADS, NOMADS
– Client side visualisation from on-line servers
– THREDDS (catalogues of OPeNDAP repositories)

Geosciences

Develop systems that support the real-time steering of complex geoscience analysis

This requires:

• Workflow support for mantle convection modelling with components running on distributed grid resources

• Portlets for compute services including ‘snark’ and ‘Finley’

• Hypothesis exploration through real-time ensemble management

High-Energy Particle Physics

Belle Physics Collaboration
• KEK B-factory detector
– Tsukuba, Japan
• Matter/anti-matter asymmetry in B meson decays
• 45 institutions, 400 users worldwide
– ~1 PB data currently
• Australian grid for Belle: simulation and data analysis
– Data grid centred on APAC National Facility

ATLAS Experiment
• Large Hadron Collider (LHC) at CERN
– Operational in 2007
• Deploying EGEE infrastructure on APAC Grid
– WLCG Tier 2 at University of Melbourne

APAC National Grid Status

• Core services installed
– Core services implemented
  • APAC CA and MyProxy, VOMRS, GT2

• First applications in operational status
– Some applications close to ‘production’ mode
  • Geosciences, HEP (Belle experiment)

• Systems coverage
– Users can access ALL systems at APAC partners
– About 4600 processors and hundreds of TB of disk
– Around 3 PB of disk-cached HSM systems

• Extension of the Grid
– Requests for service are spreading to multiple sites
  • leading to an affiliate model

The APAC Grid Program

The APAC grid program has been active in deploying a grid infrastructure in Australia

• Focussed on needs of Application Projects

• Interoperability – must work closely with international grids

• Tyranny of distance is being tamed: high bandwidth international connections

But we need to do more:

• Improved international collaboration

• More efficient deployment

• Operations: we are just beginning

Looking Forward...

APAC 3 funding in 2007:

• Interoperability:
– Expand engagement with GGF interoperability activities

• Operations:
– Establish APAC Grid Operations Centre
– Improve distributed team management

• Expand user community:
– All users of APAC facilities are Grid users
– Data management

• Expand infrastructure to include:
– Major data centres: e.g. university data repositories
– Facilities: telescopes, Synchrotron, ANSTO, ...

• Proposal to deploy gLite infrastructure across APAC facilities:
– To support HEP: Australian Tier 2 Federation
– Explore other user communities: collaborations with Europe and Asia