polska infrastruktura informatycznego wspomagania nauki w europejskiej przestrzeni badawczej...

10
Polska Infrastruktura Informatycznego Polska Infrastruktura Informatycznego Wspomagania Nauki Wspomagania Nauki w Europejskiej Przestrzeni Badawczej w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH Institute of Computer Science AGH ACC Cyfronet AGH ACC Cyfronet AGH The PL-Grid Virtual The PL-Grid Virtual Laboratory in the Life Laboratory in the Life Sciences Domain Sciences Domain Maciej Malawski, Maciej Malawski, Eryk Ciepiela Eryk Ciepiela , Tomasz Gubała, , Tomasz Gubała, Piotr Nowakowski, Daniel Harężlak, Marek Piotr Nowakowski, Daniel Harężlak, Marek Kasztelnik, Joanna Kocot, Tomasz Bartyński, and Kasztelnik, Joanna Kocot, Tomasz Bartyński, and Marian Bubak Marian Bubak

Upload: zachary-davies

Post on 26-Mar-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid

Polska Infrastruktura Informatycznego Wspomagania Nauki Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczejw Europejskiej Przestrzeni Badawczej

Institute of Computer Science AGHInstitute of Computer Science AGHACC Cyfronet AGHACC Cyfronet AGH

The PL-Grid Virtual Laboratory The PL-Grid Virtual Laboratory in the Life Sciences Domainin the Life Sciences Domain

Maciej Malawski, Maciej Malawski, Eryk CiepielaEryk Ciepiela, Tomasz Gubała, , Tomasz Gubała, Piotr Nowakowski, Daniel Harężlak, Marek Kasztelnik, Piotr Nowakowski, Daniel Harężlak, Marek Kasztelnik, Joanna Kocot, Tomasz Bartyński, and Marian BubakJoanna Kocot, Tomasz Bartyński, and Marian Bubak

Page 2: Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid

2

OutlineOutline Motivation – complex scientific applications on

modern computing infrastructures In-silico experiments and Virtual Laboratory GridSpace2 as a solution

─ Architecture─ Working with GridSpace

Examples of applications─ Computational chemistry─ Bioinformatics

Conclusions

Page 3: Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid

3

MotivationMotivation Complex scientific applications on

modern computing infrastructures─ Clusters, Grids, Clouds

Diverse software packages─ Applications (Gaussian, NAMD,…)─ Web Services─ Scripts: Perl, Python, Ruby

Different users─ Chemists, biologists─ Programmers─ End users

Various data types─ Files, databases, URLs

Exploratory programming─ Unstructured, dynamic, prototyping

Collaboration─ Teams, communities

Page 4: Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid

4

ExperimentExperiment Experiment (in-silico)- a process that combines

together data with a set of activities (programs, services) that act on that data in order to produce experiment results─ Experiment plan – a specific type of software─ Experiment run – a specific execution of the experiment

Complex workflow going beyond manual simple and repeatable execution of installed programs

Combines steps realized on a range of software environments, platforms, tools, languages etc.

Developed, shared and reused collaboratively amongst ad-hoc researching teams

Composed of collaboratively owned libraries and services used (called gems) and experiment parts (called snippets)

Virtual Laboratory – environment for development, execution and sharing of experiments

Page 5: Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid

5

Working with GridSpace2Working with GridSpace2 Easy access using Web

browser Experiment Workbench

─ Constructing experiment plans from code snippets

─ Interactively run experiments Experiment Execution

Environment─ Multiple interpreters─ Access to libraries,

programs and services (gems)

Access to computing infrastructure─ Cluster, grid, cloud

Page 6: Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid

6

Exp

erim

ent

Wo

rkb

ench

Exp

erim

ent

Wo

rkb

ench

Page 7: Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid

7

Binding sites in proteinsBinding sites in proteins Comparison of Services for Predicting

Ligand Binding Sites─ Multiple services available on the Web─ Conversions between data formats─ Visualization scripts (Jmol, Gnuplot)─ Single access based on experiments

developed in Virtual Laboratory Calculation of hydrophobicity profiles

─ Multiple scales, parameters, input data─ Computed using PL-Grid resources –

easy access to Zeus cluster at Cyfronet─ Management of experiment results:

~ 1 Million output files─ Using semantic integration framework for

metadata management

Collaboration with Department of Bioinformatics and Telemedicine, Jagiellonian University, Prof. Irena Roterman-Konieczna, Katarzyna Prymula

Page 8: Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid

8

Analysis of water solutions of aminoacidsAnalysis of water solutions of aminoacids Involving multiple steps realized

with many tools, languages and libraries used for─ Packmol – molecular dynamics

simulations of packing molecules in a defined regions of space

─ Jmol – visualization of solution─ Gaussian – computing a spectrum of

the solution─ Python/CCLIB – extracting spectrum

info─ jqPlot – displaying plot

Collaboration with computational chemists of ACC Cyfronet AGH and Department of Chemistry, Jagiellonian University, Dr. Mariusz Sterzel, Klemens Noga

Page 9: Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid

9

ConclusionsConclusions Complex scientific applications need dedicated tools

and approaches. In-silico experiments are supported by Virtual

Laboratory powered by GridSpace2 technology. Applications:

─ Bioinformatics─ Computational chemistry─ More are welcome!

Virtual laboratory is open for PL-Grid users.

Page 10: Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid

10

ReferencesReferences http://wl.plgrid.pl

– open the Virtual Laboratory in your browser

http://gs2.cyfronet.pl – learn more about GridSpace2 technology

http://virolab.cyfronet.pl – see our earlier achievements

http://www.plgrid.pl – become a user of PL-Grid