cyberinfrastructure for advanced marine microbial ecology research and analysis (camera)

21
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA) Invited Talk CONNECT Investment Community Meeting Calit2@UCSD La Jolla, CA July 31, 2006 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technologies Harry E. Gruber Professor, Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD

Upload: larry-smarr

Post on 03-Jun-2015

321 views

Category:

Education


0 download

DESCRIPTION

06.07.31 Invited Talk CONNECT Investment Community Meeting Calit2@UCSD Title: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA) La Jolla, CA

TRANSCRIPT

Page 1: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

Invited Talk

CONNECT Investment Community Meeting

Calit2@UCSD

La Jolla, CA

July 31, 2006

Dr. Larry Smarr

Director, California Institute for Telecommunications and Information Technologies

Harry E. Gruber Professor,

Dept. of Computer Science and Engineering

Jacobs School of Engineering, UCSD

Page 2: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

Two New Calit2 Buildings Provide ~340,000 GSF and New Laboratories for “Living in the Future”

• Over 1000 Researchers in Two Buildings– Linked via Dedicated Optical Networks– International Conferences and Testbeds

• New Laboratories– Nanotechnology– Virtual Reality, Digital Cinema

UC Irvine

www.calit2.net

State Funded $100M in Capital for Calit2 Buildings

UC San DiegoRichard C. Atkinson Hall Dedication Oct. 28, 2005

Page 3: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

Calit2 Brings Computer Scientists and Engineers Together with Biomedical Researchers

• Some Areas of Concentration:– Metagenomics– Genomic Analysis of Organisms– Evolution of Genomes– Cancer Genomics– Human Genomic Variation and Disease– Proteomics– Mitochondrial Evolution– Computational Biology– Information Theory and Biological Systems

UC San Diego

UC Irvine

Page 4: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

The OptIPuter Project – Creating High Resolution Portals

Over Dedicated Optical Channels to Global Science Data• NSF Large Information Technology Research Proposal

– Calit2 (UCSD, UCI) and UIC Lead Campuses—Larry Smarr PI– Partnering Campuses: SDSC, USC, SDSU, NCSA, NW, TA&M, UvA,

SARA, NASA Goddard, KISTI, AIST, CRC(Canada), CICESE (Mexico)

• Industrial Partners– IBM, Sun, Telcordia, Chiaro, Calient, Glimmerglass, Lucent

• $13.5 Million Over Five Years—Now In the Fourth YearNIH Biomedical Informatics

NSF EarthScope and ORIONResearch Network

Page 5: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

fc *

Dedicated Optical Channels Makes High Performance Cyberinfrastructure Possible

(WDM)

Source: Steve Wallach, Chiaro Networks

“Lambdas”Parallel Lambdas are Driving Optical Networking

The Way Parallel Processors Drove 1990s Computing

Page 6: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

National LambdaRail (NLR) and TeraGrid Provides Cyberinfrastructure Backbone for U.S. Researchers

NLR 4 x 10Gb Lambdas Initially Capable of 40 x 10Gb wavelengths at Buildout

Links Two Dozen State and Regional Optical

Networks

DOE, NSF, & NASA

Using NLR

San Francisco Pittsburgh

Cleveland

San Diego

Los Angeles

Portland

Seattle

Pensacola

Baton Rouge

HoustonSan Antonio

Las Cruces /El Paso

Phoenix

New York City

Washington, DC

Raleigh

Jacksonville

Dallas

Tulsa

Atlanta

Kansas City

Denver

Ogden/Salt Lake City

Boise

Albuquerque

UC-TeraGridUIC/NW-Starlight

Chicago

International Collaborators

NSF’s TeraGrid Has 4 x 10Gb Lambda Backbone

Page 7: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

PI Larry Smarr

Announced January 17, 2006$24.5M Over Seven Years

Page 8: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

Paul Gilna Has Just Been Recruited from Los Alamos to Become Executive Director of CAMERA

• Formerly– Former Director of the Department of Energy’s Joint Genome

Institute (JGI) Operations at Los Alamos National Laboratory (LANL)– Group Leader of Genomic Science and Computational Biology in

LANL’s Bioscience Division

• JGI – A $70-million-per-Year Collaboration:

– Lawrence Berkeley, – Lawrence Livermore, – Los Alamos, – Oak Ridge, and – Pacific Northwest – and the Stanford Human Genome Center

– Working at The Frontiers of Genome Sequencing and Biosciences

Page 9: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

Flat FileServerFarm

W E

B P

OR

TA

L

TraditionalUser

Response

Request

DedicatedCompute Farm(1000 CPUs)

TeraGrid: Cyberinfrastructure Backplane(scheduled activities, e.g. all by all comparison)

(10000s of CPUs)

Web(other service)

Local Cluster

LocalEnvironment

DirectAccess LambdaCnxns

Data-BaseFarm

~100s TB

10 GigE Fabric

Calit2’s Direct Access Core Architecture Will Create Next Generation Metagenomics Server

Source: Phil Papadopoulos, SDSC, Calit2+

We

b S

erv

ice

s

Sargasso Sea Data

Sorcerer II Expedition (GOS)

JGI Community Sequencing Project

Moore Marine Microbial Project

NASA Goddard Satellite Data

Community Microbial Metagenomics Data

Page 10: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

The Future Home of the Moore Foundation Funded Marine Microbial Ecology Metagenomics Complex

First Implementation of the CAMERA Complex

Photo Courtesy Joe Keefe, Calit2

Major Buildout of Calit2 Server Room Underway

Page 11: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

CAMERA Tools and Workflows

• Initial set

– BLAST Server

– Clustering

– HMM/Profile

– Neighborhood Analysis

– Multiple Sequence

Alignments

– Assembly

• Proposed New Tools

– Multiple Auto Annotation

Pipelines

– Fast Sequence Lookup

– Customized Assembly

– Phylogenetic Analysis

– Clustering Tools

Page 12: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

The Bioinformatics Core of the Joint Center for Structural Genomics will be Housed in the Calit2@UCSD Building

Extremely Thermostable -- Useful for Many Industrial Processes (e.g. Chemical and Food)

173 Structures (122 from JCSG)

• Determining the Protein Structures of the Thermotoga Maritima Genome • 122 T.M. Structures Solved by JCSG (75 Unique In The PDB) • Direct Structural Coverage of 25% of the Expressed Soluble Proteins• Probably Represents the Highest Structural Coverage of Any Organism

Source: John Wooley, UCSD

Page 13: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

Interactive Visualization of Thermatoga Proteins at Calit2

Source: John Wooley, Jurgen Schulze, Calit2

Page 14: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

OptIPuter Scalable Adaptive Graphics Environment (SAGE) Allows Integration of HD Streams

Source: David Lee, NCMIR, UCSD

Page 15: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

Calit2 and the Venter Institute Will Combine Telepresence with Remote Interactive Analysis

OptIPuter Visualized

Data

HDTV Over

Lambda

Live Demonstration

of 21st Century National-Scale Team Science 25 Miles

Venter Institute

Page 16: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

www.glif.is

Created in Reykjavik, Iceland 2003

Countries are Aggressively Creating Gigabit Services:Interactive Access to CAMERA and LOOKING Systems

Visualization courtesy of Bob Patterson, NCSA.

Page 17: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

OptIPortal– Termination Device for the OptIPuter Global Backplane

• 20 Dual CPU Nodes, 20 24” Monitors, ~$50,000• 1/4 Teraflop, 5 Terabyte Storage, 45 Mega Pixels--Nice PC!• Scalable Adaptive Graphics Environment ( SAGE) Jason Leigh, EVL-UIC

Source: Phil Papadopoulos SDSC, Calit2

Page 18: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

UIC/UCSD 10GE CAVEWave on the National LambdaRailEmerging OptIPortal Sites

CAVEWave Connects Chicago to Seattle to San Diego…and Washington D.C. as of 4/1/06

and JCVI as of 5/15/06

NEW!

NEW!

SunLight

CICESE

UW

JCVI

MIT

SIO UCSD

SDSU

UIC EVL

UCI

OptIPortals

Page 19: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

First Remote Interactive High Definition Video Exploration of Deep Sea Vents

Source John Delaney & Deborah Kelley, UWash

Canadian-U.S. Collaboration

Page 20: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

High Definition Still Frame of Hydrothermal Vent Ecology 2.3 Km Deep

White Filamentous Bacteria on 'Pill Bug' Outer Carapace

1 cm.

Source: John Delaney and

Research Channel, U Washington

Page 21: Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA)

A Near Future Metagenomics Fiber Optic-Enabled Data Generator

Source John Delaney, UWash