o ak r idge n ational l aboratory u.s. d epartment of e nergy hpss features and futures presentation...

21
OAK RIDGE NATIONAL LABORATORY U.S. DEPARTMENT OF ENERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

Upload: hope-franklin

Post on 04-Jan-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS Features and Futures

Presentation to SCICOMP4

Randy Burris

ORNL’s Storage Systems Manager

Page 2: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

Table of Contents Background – design goals and descriptions

General information Architecture How it works Infrastructure

HPSS 4.3 – current release (as of Sept. 1) HPSS 4.5 HPSS 5.1

Background Main features

Page 3: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS is… File-based storage system – software only. Extremely scalable, targeting:

Millions of files; Multiple petabyte capacity; Gigabyte/second transfer rates; Single files ranging to terabyte size.

Distributed: Multiple nodes; Multiple instances of most servers.

Winner of an R&D 100 award (1997).

Page 4: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS is … Developed by LLNL, Sandia, LANL, ORNL, NERSC, IBM Used in >40 very large installations

ASCI (Livermore, Sandia, Los Alamos Labs) High-energy physics sites (SLAC, Brookhaven, other US sites and

sites in Europe and Japan) NASA Universities

As an Examples at ORNL Archiving system ARM Backup system Backups of servers, O2000 Active repository Climate, bioinformatics, …

Page 5: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

Example of the type of configurationHPSS is designed to support

Control

Control

ParallelRAIDDiskFarm

Local Devices

HPSSServer(s)

Workstation Cluster orParallel Systems

Sequential Systems

HIPPI/GigE/ATM

Network

Parallel Tape Farm

Visualization EnginesFrame buffers

HSINFSFTPDFS

Control

Control

Secondary Server(s)

LANsInternetTo Client HostsWANs

Throuhput Scalable tothe GB/s Region

Page 6: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS Software Architecture Diagram

Communications Security

Transaction ManagerMetadata Manager

LoggingInfrastructure Services

64-bit MathLibraries

Management

Client(s)

- Client API- PFS

ApplicationsData ManagementSystem Daemons:-HSI-FTP & PFTP- NFS- DFS

Storage System

Management

(all components)

BitfileServers

StorageServers

NameServers

LocationServers

Migration/Purge Repack

Movers

NSL UniTree Migration

Other Modules

Green componentsare defined in theIEEE Mass StorageReference Model.

Common Infrastructure

HPSS Software Architecture

PhysicalVolumeLibrary

PhysicalVolumeRespositories

Installation

Page 7: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

How’s it work? User stores a file using hsi, ftp, parallel ftp or nfs. It will be sent to a particular Class of Service (COS)

depending upon user selection or defaults. Default COS specifies a hierarchy with disk at the top

level and tape below it.

So, file is first stored on disk (HPSS cache) When enough time elapses or the cache gets full

enough, the file will automatically be copied to the next level - tape - and purged from disk.

Page 8: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS Infrastructure HPSS depends upon (I.e., is layered over):

Operating system (AIX or Solaris for core servers) Distributed Computing Environment (DCE)

• Security – authentication and authorization

• Name service

• Remote Procedure Calls

Encina Structured File System – flat-file system used to store metadata such as file names, segment locations, etc. Encina is built upon DCE.

GUI – Sammi product from Kinesix Distributed File System (DFS) – for some installations. DFS is built

upon DCE

Page 9: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS 4.3 (Newest released version) Support for new hardware

StorageTek 9940 tape drives IBM Linear Tape Open (LTO) tape drives and robots Sony GY-8240 tape drives

Redundant Arrays of Independent Tapes An ASCI PathForward project contracted with

StorageTek Target is multiple tape drives striped with parity

Page 10: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS 4.3 (continued) Mass configuration

Earlier, each device or server had to be individually configured through the GUI

Could be tedious and error-prone for installations with hundreds of drives or servers

Mass configuration takes advantage of the command line interface (new with HPSS 4.2)

Allows scripted configuration of devices and various types of servers.

Page 11: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS 4.3 (continued) Support for IBM High Availability configurations

HACMP (High Availability Cluster MultiProcessor) hardware feature

HACMP supporting AIX software Handles node and network interface failures

Essentially a controlled failover to a spare node Initiated manually

Page 12: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS 4.3 (continued) Other features:

Support for Solaris 8 Client API ported to Redhat Linux Support for NFS v3

By the way In our Probe testbed, we’re running HPSS 4.3 on AIX 5L

on our S80 Not certified, just trying it to see what happens.

Page 13: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS 4.5 – target date 7/1/2002 Features

Implement an efficient, transparent interface for users to access their HPSS data

Uses HPSS as an archive Available freely for Linux (no licensing fee)

Key requirements Support HPSS access via XFS using DMAPI XFS / HPSS filesystems shall be accessible via NFS for

transparent access Support archived filesets (rename / delete) Support on Linux

Page 14: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS 4.5 (continued) Provide migration and purge from XFS based on policy Stage data from HPSS when data has been purged from

XFS Support whole and partial file migration Support utilities for the following:

• Create / Delete XFS fileset metadata in HPSS• List HPSS filenames in archived fileset• List XFS names of files• Compare archive dumps from HPSS and XFS• Delete all files from HPSS side of XFS fileset• Delete files older than a specified age from HPSS side• Recover files deleted from XFS filesets not yet deleted from HPSS

Page 15: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS 5.1- release date Jan. 2003 Background

HPSS was designed in 1992/1993 as a total rewrite of NSL UniTree.

Goal – achieve speed using many parallel servers. The Distributed Computing Environment (DCE) was a

prominent and promising infrastructure product Encina’s Structured File System (SFS) was the only

product supporting distributed nested transactions. Management GUI mandated to be Sammi, from Kinesix,

because of anticipated reuse of NSL UniTree screens.

Page 16: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS 5.1 Background (continued) Today:

DCE – future in doubt Encina’s Structured File System

• Future in doubt• Performance problems• No longer need nested transactions• Or distributed transactions

Sammi relatively expensive and feature poor

Page 17: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS 5.1 Features New basic structure

DCE still used – still no alternative Designing a “core” server combining the name server, the bitfile

server, the storage server and parts of the Client API Replacing SFS with a commercial DBMS – DB2 – but design and

coding goal is easy replacement of the DBMS

Expect considerable speed improvement Oracle and DB2 were both ~10 times faster than SFS in a model run

in ORNL’s Probe testbed There is reduced communication between servers

Page 18: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

HPSS Software Architecture Diagram

Communications Security

Transaction ManagerMetadata Manager

LoggingInfrastructure Services

64-bit MathLibraries

Management

Client(s)

- Client API- PFS

ApplicationsData ManagementSystem Daemons:-HSI-FTP & PFTP- NFS- DFS

Storage System

Management

(all components)

BitfileServers

StorageServers

NameServers

LocationServers

Migration/Purge Repack

Movers

NSL UniTree Migration

Other Modules

Green componentsare defined in theIEEE Mass StorageReference Model.

Common Infrastructure

HPSS Software Architecture

PhysicalVolumeLibrary

PhysicalVolumeRespositories

Installation

Page 19: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

New Java Admin Interface User benefits:

Fast Immediately portable to Unix, Windows, Macintosh Picking up various manageability improvements

Developer benefits Object oriented Much code sharing

• Central communication and processing engine• Different presentation engines

• GUI• ASCII for the command-line interface• A third one, a Web interface, would be easy to add later

Overall maintenance much easier - code generated from HPSS C structures

Page 20: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

Future futures These topics are under discussion; no guarantees In each case, a gating function is the availability of

staff to do the development. Modification to HPSS’s parallel ftp to comply with

specs for GridFTP. Interest from ASCI, Argonne and others.

GPFS/HPSS interface Participants - LLNL, LBNL, Indiana University and IBM Seeking further help

SAN exploitation – gleam in the eye right now

Page 21: O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY HPSS Features and Futures Presentation to SCICOMP4 Randy Burris ORNL’s Storage Systems Manager

OAK RIDGE NATIONAL LABORATORYU.S. DEPARTMENT OF ENERGY

Questions?

http://www4.clearlake.ibm.com/hpss/ HPSS home pagehttp://www.sdsc.edu/hpss/hpss1.html HPSS tutorialhttp://www.ccs.ornl.gov Center for Comp. Sci.http://www.csm.ornl.gov Computer Sci and Math Divhttp://www.csm.ornl.gov/PROBE Testbed