locality characteristics of web streams revisited (spects 2005)
TRANSCRIPT
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
1/28
Locality Characteristics
of Web Streams Revisited
Aniket Mahanti, Anirban Mahanti, andCarey Williamson
University of Calgary, Canada
International Symposium on Performance Evaluation of Computer and
Telecommunication Systems, Philadelphia, 2005
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
2/28
Introduction
Motivation
ADF framework
Objectives
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
3/28
3
Introduction
Web caching proxies are an effectivemeans of reducing network traffic
Web caches are widely deployed by ISPs
Caches improve performance byexploiting workload characteristicssuch as locality of reference
Workload characterisation of localitystructure can provide insight into thedesign and performance of the Web
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
4/28
4
Motivation
Locality characteristics can be used bycaching policies when making decisionsto evict or retain documents in the cache
Most prior Web caching work focused onanalysing Web streams in isolation
[Fonseca et al. 2005] proposed a
system level view called the ADFframework for analysing transformationsof a Web reference stream
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
5/28
5
ADFFramework [Fonseca, 2005]
Reference: R. Fonseca et al. (2005), Locality in a Web of Streams, In: Communications of the ACM, 48(1):8288.
A:AggregationD:Disaggregation
F:Filtering
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
6/28
6
Research Objectives
Study locality properties in Web request
streams using the ADF framework:
What impact do locality characteristicshave on caching performance?
What are the locality characteristics ofWeb request streams after the
aggregation of filtered streams?
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
7/28
Background
Flow of requests
Locality of reference
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
8/28
8
Flow of Requests
Image reproduced from: R. Fonseca et al. (2003), Locality in a Web of Streams, Technical Report, Department of Computer Science, Boston University.
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
9/28
9
Locality of Reference
Popularity: An object is simply more
popular than other objects
.XABXXCXDXXXEFXX.
Temporal locality: References to an
object occur in a correlated manner.AAHIJAAAUOLYPJKAA.
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
10/28
Metrics Used
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
11/28
11
Performance Metrics
Document hit ratio: Percentage of totalrequests satisfied by Web proxy cache
Byte hit ratio: Percentage of total byte volumeof data satisfied by Web proxy cache
Cumulative reference measure: Fraction oftotal requests accounted for by the top 10%of the most popular documents
Inter-reference measure: Probability ofreferencing document again within Mintervening requests (e.g., M=1000)
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
12/28
FilteringModel
Model description
Simulation results
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
13/28
13
FilteringModel
Filtering
Input stream
Filtered stream (misses)
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
14/28
14
Workload and System Parameters
WebTraff: synthetic Web proxy workloads Two traces differing only in temporal locality
Trace1 (weak) and Trace 2 (strong)
Trace characteristics:
1.5 million requests
495,000 unique documents
14 GB total bytes of Web content
Cache replacement policies:LRU, LFU-Aging, GDS, RAND, FIFO
Cache size:
1 MB 16 GB
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
15/28
15
Caching Performance (1 of 3)
Trace1: Weak temporal locality
Document Hit Ratio
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
16/28
16
Caching Performance (2 of 3)
Trace2: Strong temporal locality
Document Hit Ratio
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
17/28
17
Caching Performance (3 of 3)
Trace1: Weak temporal locality Trace2: Strong temporal locality
Document Hit Ratio
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
18/28
18
Popularity Characteristics
Trace1: Weak temporal locality Trace2: Strong temporal locality
Cumulative Reference Measure
forFiltered Request Stream
(after the cache)
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
19/28
19
Temporal Locality Characteristics
Trace1: Weak temporal locality Trace2: Strong temporal locality
Inter-referenceMeasure
forFiltered Request Stream
(after the cache)
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
20/28
Aggregation Model
Model description
Simulation results
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
21/28
21
Aggregation Model
Aggregated
filtered stream
Filtering
Input stream
Filtered stream
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
22/28
22
System Model and Parameters
Two-level hierarchal web proxy configuration
Aggregated streams from:N = 1, 2, 4, 8 child proxies
Caching policy:LRU at child proxies
Cache size:
1 MB 256 MB Degree of overlap:No overlap, partial overlap
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
23/28
23
Popularity Characteristics
No Overlap Partial Overlap
Cumulative Reference Measure
(Strong temporal locality)
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
24/28
24
Temporal Locality Characteristics
No Overlap Partial Overlap
Inter-referenceMeasure
(Strong temporal locality)
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
25/28
25
No Overlap: Temporal Locality
Temporal locality decreases with increasingN Phenomenon consistent over various cache
sizes and degree of temporal locality
Design of the no overlap scenario
New stream has twice as many documentsbetween 1A1 and
2A1
N=2
Child Proxy 1: 1A1,1U1 ,
1U2,.,1U50,
1A1
Child Proxy 2: 2A1,2U1 ,
2U2 ,.,2U50,
2A1
Aggregated filtered stream:1
A1,2
A1,1
U1,2
U1,,1
U50,2
U50,1
A1,2
A1,
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
26/28
26
Partial Overlap: Temporal Locality
Temporal locality increases with increasingN Due to 50% overlap among all traces
References ofA from other proxies clustered
N=4
Child Proxy 1:A, B,1U1 ,.,1U50,A, B
Child Proxy 2:A, B,2U1 ,.,2U50,A, B
Child Proxy 3:A, B,3U1 ,.,3U50,A, B
Child Proxy 4:A, B,4U1 ,.,4U50,A, B
Aggregated filtered stream:A, A, A, A, B, B, B, B,1U1 ,2U1 ,
3U1 ,4U1
,.,1U50,2U50,
3U50 ,4U50,A, A, A, A, B, B, B, B
,
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
27/28
Conclusions
-
8/6/2019 Locality Characteristics of Web Streams Revisited (SPECTS 2005)
28/28
28
Conclusions
Caching policies should exploit temporalcorrelation andpopularityof documents
LRU and FIFO exploit temporal locality
GDS insensitive to changes in temporal locality Structural change in temporal locality for
aggregated streams depends on the degree
of overlap in the workloads
These results imply limited advantages of
using caching hierarchies