Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utility



Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utility

Antony Rowstron, Peter Druschel


Introduction

PAST is an Internet-based, peer-to-peer global storage utility.

It aims to provide strong persistence, high availability, scalability and security.

PAST uses Pastry as its routing scheme.

Pastry is an efficient routing scheme that reliably routes client requests to the appropriate nodes.


PAST Operations

fileId = Insert(name, owner-credentials, k, file)

The fileId is 160 bits long, while a nodeId is 128 bits long.

[Diagram: the fileId is drawn as a 128-bit field followed by a 32-bit field, labeled M and L; the nodeId is drawn as a single 128-bit field.]

file = Lookup(fileId)

Reliably retrieves a copy of the file.

Reclaim(fileId, owner-credentials)

Note: Reclaim does not guarantee that the file is no longer available after it has been reclaimed.
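The relationship between the 160-bit fileId and the 128-bit nodeId can be illustrated with a minimal Python sketch. It assumes, beyond what the slide states, that the fileId is a SHA-1 hash over the file name, the owner's key and a salt, and that a file is placed on the k nodes whose nodeIds are numerically closest (on a circular id space) to the 128 most significant bits of the fileId. The function names `make_file_id`, `routing_key` and `k_closest_nodes` are hypothetical.

```python
import hashlib

FILE_ID_BITS = 160
NODE_ID_BITS = 128


def make_file_id(name: str, owner_key: bytes, salt: bytes) -> int:
    """Assumed construction: a 160-bit SHA-1 digest of name, owner key and salt."""
    digest = hashlib.sha1(name.encode("utf-8") + owner_key + salt).digest()
    return int.from_bytes(digest, "big")


def routing_key(file_id: int) -> int:
    """Keep the 128 most significant bits of the 160-bit fileId."""
    return file_id >> (FILE_ID_BITS - NODE_ID_BITS)


def circular_distance(a: int, b: int) -> int:
    """Distance between two ids on a ring of size 2**128 (an assumption)."""
    d = abs(a - b)
    return min(d, (1 << NODE_ID_BITS) - d)


def k_closest_nodes(file_id: int, node_ids: list[int], k: int) -> list[int]:
    """Return the k nodeIds numerically closest to the fileId's routing key."""
    key = routing_key(file_id)
    return sorted(node_ids, key=lambda n: circular_distance(n, key))[:k]
```

In a real deployment Pastry would locate these nodes by routing rather than by sorting a global list; the sort here only makes the "k closest" relation concrete.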


Storage Management

Design goals:

Allow high global storage utilization.

Degrade gracefully as the system approaches the maximal utilization.

Load balancing:

Replica diversion

File diversion


Replica Diversion

Replica diversion accommodates differences in the storage capacity and utilization of nodes within a leaf set.

[Diagram: three nodes A, B and C.]

A is the right node to store the file, but it has insufficient space.

The file is stored on B instead, because A lacks the capacity.

A keeps an entry that points to B.

C is the (k+1)th closest node; it also stores the pointer in case node A fails.
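The roles of A, B and C can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' implementation: the `Node` class, the `store_replica` function, and the rule of picking the leaf-set node with the most free space that is not among the k closest nodes are all hypothetical choices made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    free: int                                      # free storage in bytes
    files: dict = field(default_factory=dict)      # fileId -> size held locally
    pointers: dict = field(default_factory=dict)   # fileId -> name of node holding a diverted replica


def store_replica(a, leaf_set, k_closest, backup, file_id, size):
    """Store a replica on node a, or divert it to a node b in a's leaf set.

    a         : the node that should hold the replica (A on the slide)
    leaf_set  : candidate nodes for diversion
    k_closest : the k nodes closest to the fileId (excluded as targets here)
    backup    : the (k+1)th closest node (C on the slide)
    Returns the name of the node that holds the replica, or None on failure.
    """
    if a.free >= size:
        a.files[file_id] = size
        a.free -= size
        return a.name

    excluded = {n.name for n in k_closest}
    candidates = [n for n in leaf_set if n.name not in excluded and n.free >= size]
    if not candidates:
        return None  # replica diversion failed

    # Assumed selection rule: the candidate with the most free space.
    b = max(candidates, key=lambda n: n.free)
    b.files[file_id] = size
    b.free -= size
    a.pointers[file_id] = b.name       # A keeps an entry pointing to B
    backup.pointers[file_id] = b.name  # C keeps the pointer in case A fails
    return b.name
```

For example, with `a = Node("A", 10)`, `b = Node("B", 100)` and `c = Node("C", 50)`, calling `store_replica(a, [b, c], [a], c, "f1", 40)` places the replica on B and records pointers on both A and C.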


File Diversion

If the k nodes closest to the fileId can neither accommodate the file nor divert the replicas, file diversion is invoked.

The client generates a new fileId and retries the insert.

This second step is repeated up to 3 times. If the insert still fails, an error is reported to the application.
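The retry procedure above can be sketched as a small Python loop. The sketch assumes that a new fileId is obtained by hashing the same name and owner key with a fresh random salt, and it reads "up to 3 times" as one initial attempt plus at most three retries; both are assumptions, and `try_insert`, `InsertError` and `insert_with_file_diversion` are hypothetical names.

```python
import hashlib
import os
from typing import Callable

MAX_DIVERSIONS = 3  # retries after the initial attempt (assumed reading of the slide)


class InsertError(Exception):
    """Reported to the application when every attempt has failed."""


def make_file_id(name: str, owner_key: bytes, salt: bytes) -> int:
    """Assumed construction: a 160-bit SHA-1 digest of name, owner key and salt."""
    digest = hashlib.sha1(name.encode("utf-8") + owner_key + salt).digest()
    return int.from_bytes(digest, "big")


def insert_with_file_diversion(
    name: str,
    owner_key: bytes,
    try_insert: Callable[[int], bool],
) -> int:
    """Try to insert the file; on failure, pick a new fileId and retry.

    try_insert(file_id) stands in for the network insert and returns True
    when the k closest nodes accepted (or diverted) all replicas.
    """
    for _ in range(1 + MAX_DIVERSIONS):
        file_id = make_file_id(name, owner_key, os.urandom(20))  # fresh salt
        if try_insert(file_id):
            return file_id
    raise InsertError(f"insert of {name!r} failed after {1 + MAX_DIVERSIONS} attempts")
```

Because each new salt yields a different fileId, every retry targets a different region of the id space, which is what lets file diversion balance load across the whole system rather than within one leaf set.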


Caching

PAST nodes use the “unused” portion of their advertised disk space to cache files.

Caching in PAST is based on the GreedyDual-Size (GD-S) policy, which was originally developed for caching in Web proxies.
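To make the policy concrete, here is a minimal Python sketch of a GreedyDual-Size cache. Each cached file d carries a weight H(d) = L + c(d)/s(d), where s(d) is the file size, c(d) is a cost and L is an inflation value; the file with the smallest weight is evicted first, and L is then raised to the evicted weight. The sketch assumes c(d) = 1 for every file, which is a choice made for the example, and the class and method names are hypothetical.

```python
class GreedyDualSizeCache:
    """A small GreedyDual-Size cache with cost c(d) = 1 (an assumption)."""

    def __init__(self, capacity: int):
        self.capacity = capacity   # bytes available for caching
        self.used = 0
        self.inflation = 0.0       # the running value L
        self.entries = {}          # file_id -> (size, weight H)

    def _weight(self, size: int) -> float:
        return self.inflation + 1.0 / size

    def lookup(self, file_id) -> bool:
        """On a hit, refresh the file's weight to L + 1/size."""
        if file_id not in self.entries:
            return False
        size, _ = self.entries[file_id]
        self.entries[file_id] = (size, self._weight(size))
        return True

    def insert(self, file_id, size: int) -> bool:
        """Cache a file, evicting the lowest-weight files until it fits."""
        if size <= 0 or size > self.capacity:
            return False
        if file_id in self.entries:
            return self.lookup(file_id)
        while self.used + size > self.capacity:
            victim = min(self.entries, key=lambda f: self.entries[f][1])
            v_size, v_weight = self.entries.pop(victim)
            self.used -= v_size
            self.inflation = v_weight  # later weights start from the evicted H
        self.entries[file_id] = (size, self._weight(size))
        self.used += size
        return True
```

With a uniform cost, large files receive small weights and are evicted first, so the policy favors keeping many small files in the cache.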


Experimental Results

[Graph not reproduced in this transcript. The graph is taken from the paper.]


Questions?