Unified Quality-of-Service and Data-Lifecycle Definitions for Data Storage and Access … or how to manage expectations

Paul Millar, RDA Plenary 6 BoF (2015-09-25)



Why are we here?

In INDIGO-DataCloud ...

● We've identified a problem (well, two actually),

● We want to fix this problem (we hope you do too!),

● We want your help in fixing it (we hope you do too!)

Storage software: Free, Open-Source

https://github.com/dCache/dCache

mailto:[email protected]

Software running throughout the world

dCache and INDIGO-DataCloud

The problem...

Quality of Service and Data Life-Cycle

Quality of Service

Store data on disk or tape?

Now we have more media options

Replicating data

How many copies? Where are they located?

Motivation: budgets

How to make this a possibility

What are my options? How do I choose?

Bridging the gap

Attributes and islands

Combining QoS attributes

[Figure: QoS attributes classed as Independent or Dependent, and as Continuous or Discrete. Labels: “Islands of QoS”, “Free selection of QoS”]

Figure-of-merit: allowing decisions

Best available QoS
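One way to picture a figure-of-merit driving the choice of the best available QoS is sketched below. Everything in it is an assumption made for illustration: the three “islands”, their attribute values, the scoring formula and the budget constraint are hypothetical and are not taken from the talk or from dCache.

```python
from typing import Dict, List, Optional

# Hypothetical QoS "islands": fixed combinations of attribute values.
# All numbers are illustrative assumptions, not measurements.
ISLANDS: List[Dict] = [
    {"name": "tape", "copies": 1, "latency_s": 60.0, "cost": 1.0},
    {"name": "disk", "copies": 1, "latency_s": 0.01, "cost": 4.0},
    {"name": "disk+tape", "copies": 2, "latency_s": 0.01, "cost": 5.0},
]


def figure_of_merit(island: Dict) -> float:
    """Assumed score: more copies and lower latency are better (higher wins)."""
    return island["copies"] / (1.0 + island["latency_s"])


def best_available(islands: List[Dict], budget: float) -> Optional[Dict]:
    """Return the affordable island with the highest score, or None."""
    affordable = [i for i in islands if i["cost"] <= budget]
    if not affordable:
        return None
    return max(affordable, key=figure_of_merit)
```

Reducing each option to a single number is what makes an automatic decision possible: with a budget of 4.5 this sketch selects the disk-only island, and with a larger budget it selects the option that also keeps a tape copy.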


Data Life-Cycle

DLC use-cases: the story of a file

[Figure: timeline of a file's life (not to scale)]

Events along the Time axis:

● Created

● Main analysis complete

● Public embargo ends

● Anticipated end of interest

● End of life

Labels attached to the timeline: Change QoS (three times), Allow public access, Delete data, Accept/Reject, Deadline

Format for DLC rules

<trigger> <action>

(e.g., <after 6 months> <add public-access ACE>)
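The <trigger> <action> format can be sketched as a pair of callables. This is only an illustration under stated assumptions: the `Rule` class, the dictionary used to represent a file, the `"EVERYONE:read"` ACE string and the 182-day approximation of six months are all hypothetical, not part of any proposed protocol.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Callable, Dict, List


@dataclass
class Rule:
    """One DLC rule: <trigger> <action>."""
    trigger: Callable[[Dict, datetime], bool]  # does the rule fire at this time?
    action: Callable[[Dict], None]             # what to do to the file


def after(delay: timedelta) -> Callable[[Dict, datetime], bool]:
    """Trigger that holds once `delay` has elapsed since the file was created."""
    return lambda f, now: now - f["created"] >= delay


def add_public_access_ace(f: Dict) -> None:
    """Action: add a public-read ACE (hypothetical representation), only once."""
    ace = "EVERYONE:read"
    if ace not in f["acl"]:
        f["acl"].append(ace)


def apply_rules(f: Dict, rules: List[Rule], now: datetime) -> None:
    """Run the action of every rule whose trigger holds at time `now`."""
    for rule in rules:
        if rule.trigger(f, now):
            rule.action(f)


# <after 6 months> <add public-access ACE>; 6 months approximated as 182 days
rules = [Rule(after(timedelta(days=182)), add_public_access_ace)]
```

Keeping the action idempotent means the rules can be re-evaluated periodically without side effects piling up.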

The plan:

● Definition of terms

● Protocol definition

● Implementations

Photos: greeblie@flickr, Steve Jurvetson, Gorazd Božič

Proposal: RDA WG “dictionary of terms”

Photo: greeblie@flickr

“Speed”: access-latency or bandwidth?

Photo: John Holm
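Why the distinction matters can be seen with a small calculation: the time to fetch a file is roughly the access latency plus the size divided by the bandwidth. The sketch below uses made-up figures for two storage options (assumptions for illustration, not measurements of any real system).

```python
def transfer_time(size_bytes: float, latency_s: float,
                  bandwidth_bytes_per_s: float) -> float:
    """Seconds until the last byte arrives: wait for the first byte, then stream."""
    return latency_s + size_bytes / bandwidth_bytes_per_s


# Illustrative figures only (assumptions, not measurements).
TAPE = {"latency_s": 60.0, "bandwidth_bytes_per_s": 300e6}  # slow start, fast stream
DISK = {"latency_s": 0.01, "bandwidth_bytes_per_s": 100e6}  # fast start, slower stream

small_file = 1e6    # 1 MB
large_file = 1e11   # 100 GB

small_on_disk = transfer_time(small_file, **DISK)
small_on_tape = transfer_time(small_file, **TAPE)
large_on_disk = transfer_time(large_file, **DISK)
large_on_tape = transfer_time(large_file, **TAPE)
```

With these numbers the low-latency option wins for the small file and the high-bandwidth option wins for the large one, so a single word such as “speed” cannot rank the two.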

How fast is “High”?

Photo: lungstruck@flickr

Would you be able to work this?

Photo: David Pursehouse

Thanks for listening

Used to search for the Higgs boson

Feed data for HPC applications

HPC jobs on supercomputer

HPC jobs get access to dCache storage.

Research: pushing frontiers

[Photo labels: Power supply, HGST Disk, Clip Yves]

Software that scales up to tens of PiB

[Diagram labels: Pool, NFS 4.1/pNFS, HTTP/WebDAV, PoolManager, gPlazma, 1 TB]

● 700 MHz ARM

● 512 MB Memory

● 2 * USB 2

● 100 Mb/s Ethernet

… and down to a single Raspberry Pi