rcac research computing presents: diagird overview

25
RCAC Research Computing Presents: DiaGird Overview Tuesday, September 24, 2013 INFORMATION TECHNOLOGY AT PURDUE

Upload: gayora

Post on 26-Feb-2016

33 views

Category:

Documents


2 download

DESCRIPTION

INFORMATION TECHNOLOGY AT PURDUE. RCAC Research Computing Presents: DiaGird Overview. Agenda. What is DiaGrid ? What can DiaGrid do for me today ? How can I make DiaGrid work for me ? User Experience (Prof. Wen Jiang, Biological Sciences) Q&A Posters Meet the team. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: RCAC Research Computing Presents: DiaGird  Overview

RCAC Research Computing Presents:DiaGird Overview

Tuesday, September 24, 2013

INFORMATION TECHNOLOGY AT PURDUE

Page 2: RCAC Research Computing Presents: DiaGird  Overview

Agenda• What is DiaGrid?• What can DiaGrid do for me today?• How can I make DiaGrid work for me?• User Experience (Prof. Wen Jiang, Biological

Sciences)• Q&A• Posters• Meet the team

2

Page 3: RCAC Research Computing Presents: DiaGird  Overview

What is DiaGrid?Carol SongScientific SolutionsRCAC

Page 4: RCAC Research Computing Presents: DiaGird  Overview

What is DiaGrid?

4

http://diagrid.org

Page 5: RCAC Research Computing Presents: DiaGird  Overview

What is DiaGrid?

5

• Science-as-a-Service• Online applications and tools, no software download or

installation• Interactive, graphics user interface

• Access to large computing resources, workflow engine• 50,000 HT Condor pool for high throughput computing• Nodes in the community clusters for parallel computation

• Instant access: no forms, no wait• Supported infrastructure, 24x7 availability• Feedback for tools, datasets, and other publications• Collaboration and dissemination platform

• Group, Project, Forum, Wiki, Sharing, Publishing, etc.

Page 6: RCAC Research Computing Presents: DiaGird  Overview

What can DiaGrid do for me today?Brian RaubScientific SolutionsRCAC

Page 7: RCAC Research Computing Presents: DiaGird  Overview

Tools available todayWhat tools can I use today?

• BLASTer• SubmitR• CryoEM• GROMACSIMUM• CESM

7

Page 8: RCAC Research Computing Presents: DiaGird  Overview

BLASTer• BLAST - popular tool to

scan genomes for target sequencing

• Searches can contain thousands of sequences– Split input file because

all sequences are independent

• Greatly improves search speed with the help of HTCondor

8

Page 9: RCAC Research Computing Presents: DiaGird  Overview

BLASTer

9

Page 10: RCAC Research Computing Presents: DiaGird  Overview

SubmitR• Allows users to submit their

R scripts for execution on the community clusters

• Supports different execution modes– Single – one process– Parallel – multiple

processes communicating with each other

– Sweep – many isolated processes with different parameters, inputs or both

10

Page 11: RCAC Research Computing Presents: DiaGird  Overview

SubmitR• Users upload and run scripts

without having to know technical details of where and how R is executing

• Supports a wide range of R libraries:– snow/snowfall– Rmpi– rgdal– Biobase– RSQLite

• Request more libraries!– Submit a ticket on DiaGrid to

request libraries currently unavailable

11

Page 12: RCAC Research Computing Presents: DiaGird  Overview

CryoEM• Analyzes images taken at cryogenic temperatures within an electron

microscope to reveal much about microscopic structure of samples• First user-developed tool for DiaGrid• Image processing is a good candidate for parallelization

12

Page 13: RCAC Research Computing Presents: DiaGird  Overview

CryoEM• DiaGrid staff helped adapt CryoEM to the HUB environment• Split tasks for image breakdown analysis (HTCondor)• Reassembled the images for 3D visualization using MPI

13

Page 14: RCAC Research Computing Presents: DiaGird  Overview

GROMACSIMUM• GROMACS – designed to perform molecular dynamics• First DiaGrid tool to modify and improve an existing open source

project– Extends the features of GROMACS GUI and jSimMacs to include new features

for high performance computing

14

Page 15: RCAC Research Computing Presents: DiaGird  Overview

GROMACSIMUM• A unified interface for all GROMACS simulation tools. • Advanced project management system.• A powerful molecular design and 3D protein visualization tool.• Access your models and data from anywhere in the world.

15

Page 16: RCAC Research Computing Presents: DiaGird  Overview

CESM• Global climate model

coupling many aspects of Earth sciences research

• First DiaGrid tool to provide access to existing Purdue gateway– Purdue developed CESM web

gateway and designed it to support multiple interfaces

– Provides an alternate interface to the CESM gateway service from within the HUB environment

16

Page 17: RCAC Research Computing Presents: DiaGird  Overview

How can I make DiaGrid work for me?Rob CampbellScientific SolutionsRCAC

Page 18: RCAC Research Computing Presents: DiaGird  Overview

How can I make DiaGrid work for me?

18

What are my options?

• Run an existing tool• Use your existing code• Create a new tool• Let us create - or adapt - a tool for you

Page 19: RCAC Research Computing Presents: DiaGird  Overview

How can I make DiaGrid work for me?

19

How can I use code I already have?

Command line? Use Rappture… (“Rapid APPlication infrastrucTURE”)

Graphical User Interface (GUI)? Enable it to run on DiaGrid…

Page 20: RCAC Research Computing Presents: DiaGird  Overview

How can I make DiaGrid work for me?

20

What is Rappture?

• Toolkit - makes it easy to develop a GUI for scientific modeling code

• Describe your code’s input & output, Rappture automatically builds GUI

• Rappture API: get input values, save results –bindings for many programming languages

• Embed Rappture in your code - or create a wrapper script around your code

• Users see standard graphical controls plus line graphs, contour plots, 3D isosurfaces, …

Page 21: RCAC Research Computing Presents: DiaGird  Overview

How can I make DiaGrid work for me?

21

How will my code run on DiaGrid?

• Needs cluster resources or parallel execution? Tap into HPC resources via “submit”

• Inputs and outputs files? Upload to remote server via sftp,

WebDAV, “importfile”

• Pulls data from external sites or databases? We can open a connection

• Relies on a graphics card for acceleration? Will function with performance

difference

• Requires Windows or Mac? Tools run in a Linux/X11

environment GUI toolkits and Wine are

available

Page 22: RCAC Research Computing Presents: DiaGird  Overview

How can I make DiaGrid work for me?

22

What can “submit” do?

• Complex job scheduling made easy• Long runners, parallel processing, parameter sweeps• Splits out sweep runs (derives param. combos)

1. Gathers files2. Transports to HPC resource3. Schedules & watches job(s)4. Returns results

submit

Page 23: RCAC Research Computing Presents: DiaGird  Overview

How can I make DiaGrid work for me?

23

How can I create something new?

Use Rappture - or a familiar development environment…

C/C++ Perl Python Ruby TCL More…

C/C++ Fortran Java MATLAB Octave

Page 24: RCAC Research Computing Presents: DiaGird  Overview

How can I make DiaGrid work for me?

24

Can you make a tool for me?

Yes! We can…

Adapt your existing code

Or, start with your choice of open source packages

Or, build a tool from scratch based on your specifications

Page 25: RCAC Research Computing Presents: DiaGird  Overview

User ExperienceProfessor Wen JiangBiological Sciences