locality characteristics of web streams revisited (spects 2005)

Upload: asicsnew

Post on 08-Apr-2018

224 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    1/28

    Locality Characteristics

    of Web Streams Revisited

    Aniket Mahanti, Anirban Mahanti, andCarey Williamson

    University of Calgary, Canada

    International Symposium on Performance Evaluation of Computer and

    Telecommunication Systems, Philadelphia, 2005

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    2/28

    Introduction

    Motivation

    ADF framework

    Objectives

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    3/28

    3

    Introduction

    Web caching proxies are an effectivemeans of reducing network traffic

    Web caches are widely deployed by ISPs

    Caches improve performance byexploiting workload characteristicssuch as locality of reference

    Workload characterisation of localitystructure can provide insight into thedesign and performance of the Web

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    4/28

    4

    Motivation

    Locality characteristics can be used bycaching policies when making decisionsto evict or retain documents in the cache

    Most prior Web caching work focused onanalysing Web streams in isolation

    [Fonseca et al. 2005] proposed a

    system level view called the ADFframework for analysing transformationsof a Web reference stream

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    5/28

    5

    ADFFramework [Fonseca, 2005]

    Reference: R. Fonseca et al. (2005), Locality in a Web of Streams, In: Communications of the ACM, 48(1):8288.

    A:AggregationD:Disaggregation

    F:Filtering

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    6/28

    6

    Research Objectives

    Study locality properties in Web request

    streams using the ADF framework:

    What impact do locality characteristicshave on caching performance?

    What are the locality characteristics ofWeb request streams after the

    aggregation of filtered streams?

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    7/28

    Background

    Flow of requests

    Locality of reference

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    8/28

    8

    Flow of Requests

    Image reproduced from: R. Fonseca et al. (2003), Locality in a Web of Streams, Technical Report, Department of Computer Science, Boston University.

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    9/28

    9

    Locality of Reference

    Popularity: An object is simply more

    popular than other objects

    .XABXXCXDXXXEFXX.

    Temporal locality: References to an

    object occur in a correlated manner.AAHIJAAAUOLYPJKAA.

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    10/28

    Metrics Used

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    11/28

    11

    Performance Metrics

    Document hit ratio: Percentage of totalrequests satisfied by Web proxy cache

    Byte hit ratio: Percentage of total byte volumeof data satisfied by Web proxy cache

    Cumulative reference measure: Fraction oftotal requests accounted for by the top 10%of the most popular documents

    Inter-reference measure: Probability ofreferencing document again within Mintervening requests (e.g., M=1000)

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    12/28

    FilteringModel

    Model description

    Simulation results

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    13/28

    13

    FilteringModel

    Filtering

    Input stream

    Filtered stream (misses)

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    14/28

    14

    Workload and System Parameters

    WebTraff: synthetic Web proxy workloads Two traces differing only in temporal locality

    Trace1 (weak) and Trace 2 (strong)

    Trace characteristics:

    1.5 million requests

    495,000 unique documents

    14 GB total bytes of Web content

    Cache replacement policies:LRU, LFU-Aging, GDS, RAND, FIFO

    Cache size:

    1 MB 16 GB

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    15/28

    15

    Caching Performance (1 of 3)

    Trace1: Weak temporal locality

    Document Hit Ratio

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    16/28

    16

    Caching Performance (2 of 3)

    Trace2: Strong temporal locality

    Document Hit Ratio

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    17/28

    17

    Caching Performance (3 of 3)

    Trace1: Weak temporal locality Trace2: Strong temporal locality

    Document Hit Ratio

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    18/28

    18

    Popularity Characteristics

    Trace1: Weak temporal locality Trace2: Strong temporal locality

    Cumulative Reference Measure

    forFiltered Request Stream

    (after the cache)

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    19/28

    19

    Temporal Locality Characteristics

    Trace1: Weak temporal locality Trace2: Strong temporal locality

    Inter-referenceMeasure

    forFiltered Request Stream

    (after the cache)

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    20/28

    Aggregation Model

    Model description

    Simulation results

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    21/28

    21

    Aggregation Model

    Aggregated

    filtered stream

    Filtering

    Input stream

    Filtered stream

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    22/28

    22

    System Model and Parameters

    Two-level hierarchal web proxy configuration

    Aggregated streams from:N = 1, 2, 4, 8 child proxies

    Caching policy:LRU at child proxies

    Cache size:

    1 MB 256 MB Degree of overlap:No overlap, partial overlap

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    23/28

    23

    Popularity Characteristics

    No Overlap Partial Overlap

    Cumulative Reference Measure

    (Strong temporal locality)

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    24/28

    24

    Temporal Locality Characteristics

    No Overlap Partial Overlap

    Inter-referenceMeasure

    (Strong temporal locality)

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    25/28

    25

    No Overlap: Temporal Locality

    Temporal locality decreases with increasingN Phenomenon consistent over various cache

    sizes and degree of temporal locality

    Design of the no overlap scenario

    New stream has twice as many documentsbetween 1A1 and

    2A1

    N=2

    Child Proxy 1: 1A1,1U1 ,

    1U2,.,1U50,

    1A1

    Child Proxy 2: 2A1,2U1 ,

    2U2 ,.,2U50,

    2A1

    Aggregated filtered stream:1

    A1,2

    A1,1

    U1,2

    U1,,1

    U50,2

    U50,1

    A1,2

    A1,

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    26/28

    26

    Partial Overlap: Temporal Locality

    Temporal locality increases with increasingN Due to 50% overlap among all traces

    References ofA from other proxies clustered

    N=4

    Child Proxy 1:A, B,1U1 ,.,1U50,A, B

    Child Proxy 2:A, B,2U1 ,.,2U50,A, B

    Child Proxy 3:A, B,3U1 ,.,3U50,A, B

    Child Proxy 4:A, B,4U1 ,.,4U50,A, B

    Aggregated filtered stream:A, A, A, A, B, B, B, B,1U1 ,2U1 ,

    3U1 ,4U1

    ,.,1U50,2U50,

    3U50 ,4U50,A, A, A, A, B, B, B, B

    ,

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    27/28

    Conclusions

  • 8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)

    28/28

    28

    Conclusions

    Caching policies should exploit temporalcorrelation andpopularityof documents

    LRU and FIFO exploit temporal locality

    GDS insensitive to changes in temporal locality Structural change in temporal locality for

    aggregated streams depends on the degree

    of overlap in the workloads

    These results imply limited advantages of

    using caching hierarchies