uabgrid : a campus-wide distributed computational infrastructure
DESCRIPTION
UABgrid : A campus-wide distributed computational infrastructure. University of Alabama at Birmingham UABgrid Architecture Team Jill Gemmill Purushotham Bangalore John-Paul Robinson. Acknowledgments. This work has been supported by: - PowerPoint PPT PresentationTRANSCRIPT
Inte
rnet
2 M
eet
ing
20
06
UABgrid : A campus-wide distributed computational infrastructure
University of Alabama at Birmingham
UABgrid Architecture TeamJill GemmillPurushotham BangaloreJohn-Paul Robinson
AcknowledgmentsThis work has been supported by: Office of the Vice President for Information Technology Department of Computer & Information Sciences, School of Natural Sciences and Mathematics Enabling Technology Laboratory, School of Engineering
National Science Foundationo ANI-0330543 “NMI Enabled Open Source Collaboration Tools for Virtual Organizations”o NSF ANI-0123937 via SURA-2002-103 Subcontract “UAB Middleware Testbed Program: Integrated Directory Services, PKI, Video, and Parallel Computing”oNSF CNR-0420614 “Computer and Information Sciences Grid Node Research Facility” Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation
3
UAB Background 36 yr. old urban medical research U. 82 city blocks 13 schools (= 13 deans) 17,000 students; 16,000 employees Students are 26.3 percent African American and
60.2 percent female 70 research centers 20th in NIH funding (4th in SE) $433 million in research funding; doubling every 10
years No history of centrally supported HPC or other
research-oriented computing services (eg Statistics) The Alabama Supercomputer Authority
4
What’s a Campus Grid?
Strategic View: Maximize use of university’s investment in
computational resources Minimize administrative effort involved in
campus-wide resource sharing By leveraging investments in Identity
Management, WebISO, Directories, and Network infrastructures
UABgrid is a federation of resource owners who happen to share a common identity provider
5
UABgrid Partners
Office of VPIT: Sheila Sanders, VPIT; IT Academic Computing: David L. Shealy, Jill Gemmill, John-Paul Robinson, 128 node cluster; 64 node P3 cluster; desktop condor pool; 6
terabytes IBP storage Department of Computer and Information Sciences:
Tony Skjellum, CIS Chair; Puri Bangalore, Asst. Prof. 256 processor & 64 processor clusters;, Viz Wall, Parallel Storage
System Engineering Enabling Technology Lab: Bharat Soni,
Chair Mechanical Engineering; Alan Shih, ETLab Director 256 processor and 128 processor clusters; Viz Wall, High Speed
Storage Systems
6
Current UABgrid Applications
BioInformatics BLAST, Gene Sequence Analysis, Structural
Biology, Micro-Array Data Analysis, Visualization PDE
Automotive & Industrial, Surface Simulations, Optimization
Grid and Middleware Research Scheduling, Load Balancing, Granular
Authorization
7
UABgrid Architecture Today: Phase I
GigaBit
8
UABgrid Phase II
Additional Grid Nodes
10 GigE
9
Factors Supporting Resource Sharing
Provost and VP Research are being inundated with competing school requests to purchase clusters;
Deans who’ve gotten clusters find themselves losing classroom space to equipment racks and facing large power and AC bills;
Clusters, large databases, schedulers, etc. require expensive expertise
10
Grid User Management
Grid identity comes from enterprise authentication system ( “BlazerID”)
WebISO leveraged to provide digital certificate, private key and proxy certs behind the scenes
Grid Portal and Per-System User Accounts Are Provisioned Automatically, saving much administrative effort (Phase I : grid-mapfiles; Phase II : LDAP stored posix accounts + GridShib)
11
12
Grids for Mere Mortals
For jobs run repeatedly where only the database or query varies, it is worthwhile to build a user-friendly interface and also to optimize use of resources
Example: BLAST (National Library of Medicine gene sequence matching software)
http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/information3.html
13
Access using BlazerID and password
Queries and Results easily uploaded & downloaded
Web UI can be hosted on any server
Web UI can be written in any development language
Improving the Interface : GridBLAST
14
Improving Performance: G-BLAST
A native Grid Service Interface for BLAST G-BLAST provides automatic BLAST
algorithm selection based on # of queries, length of queries, size of the database used, and machines available
BLAST algorithms employed: multi-threaded BLAST, database-splitting BLAST (e.g., mpiBLAST), query-splitting BLAST
15
G-BLAST architecture
Client Program
Web Interface
Users
… …BLAST1 BLAST2 BLASTn
GIS
Invoker
Grid Service Interface
Resource Information
Grid Service
Query (2)
Response (3)
Query(1), (7)
Dispatch (4) Result (5)
Notify(6)
AIS
Scheduler
Application Information
16
G-BLAST Scheduler Architecture
Jobs
Job ID’s(JIDs)
Analyzer
BLAST Benchmark database
Resource Broker
Resource Information
Resource Information
Job Submission
Agent
ResourceResource
GIIS/GRIS
AISAIS
17
UABgrid Funding and Management Today
All equipment has been purchased with various grant funds
ETLab has been designated as a campus resource; ETLab has contracted for 50% of one IT provided
unix administrator to manage its clusters Academic Computing has 2.3 employees and
provides other support in addition to HPC Computer science / NS&M resources are available
to other campus computational scienctists Computer Science has 1 administrator for all CIS
systems Each research department hires its own
programmer(s) Developing sustainable funding model(s) is
challenging
18
Federated Grids
Exploring cross-domain resource sharing scenarios Federated Identity : experiences in
SURAgrid Federated Attributes : myVocs and
GridShib
19
SURAgridwww.sura.org
Bla
zerI
D a
nd
pas
sw
ord
Grid Portal
Resources
Ke
rbe
ros
Lo
gin
Grid Portal
Resources
Louisiana State University
Dig
ital
Ce
rtif
ica
te
Lo
gin
Grid PortalResources
University of Virginia
SURAGrid CA Bridge
SURAGrid Portal
Texas Advanced Computating Center
UABgridCA
UVACA
LSUCA
20
Use of Shibboleth in Grids provides Attribute based Access Control (not just identity)
Example: Faculty may be assigned higher priority in job queues than students
For VO’s the most important attribute is “member of VO ABC”, and VO memberships typically cross domains.
myVocs offers easy, self management for VOs and expects web browser as primary access to resources
Combined with GridShib, myVocs enables VO membership-based access to grid resources
: a Virtual Organization Service Center
21
Inside myVocsAttribute Aggregation
22