april 2008 - infolab · 2016. 5. 21. · step 1 – query decomposition . 2008, 28 april csci585 -...
TRANSCRIPT
2008, 28 April CSCI585 - Distributed Databases
Distributed Databases
by Farnoush Banaei-Kashani
Excerpt from “Principles of Distributed Database Systems”by M. Tamer Özsu and Patrick Valduriez
2008, 28 April CSCI585 - Distributed Databases 2
Topics
2008, 28 April CSCI585 - Distributed Databases 3
Outline
� Problem Definition � Issues to Consider� Methodology
� Step 1: Query Decomposition� Step 2: Data Localization� Step 3: Global Optimization� Step 4: Local Optimization
2008, 28 April CSCI585 - Distributed Databases 4
Query Processing
2008, 28 April CSCI585 - Distributed Databases 5
Problem?
2008, 28 April CSCI585 - Distributed Databases 6
Problem in DDBS?
2008, 28 April CSCI585 - Distributed Databases 7
Problem in DDBS?
2008, 28 April CSCI585 - Distributed Databases 8
Query Optimization Objectives
2008, 28 April CSCI585 - Distributed Databases 9
Complexity of Relational Operations
2008, 28 April CSCI585 - Distributed Databases 10
Outline
� Problem Definition � Issues to Consider� Methodology
� Step 1: Query Decomposition� Step 2: Data Localization� Step 3: Global Optimization� Step 4: Local Optimization
2008, 28 April CSCI585 - Distributed Databases 11
Query Processing Issues – Types of Optimizers
2008, 28 April CSCI585 - Distributed Databases 12
Query Processing Issues – Optimization Granularity
2008, 28 April CSCI585 - Distributed Databases 13
Query Processing Issues – Optimization Timing
2008, 28 April CSCI585 - Distributed Databases 14
Query Processing Issues – Statistics
2008, 28 April CSCI585 - Distributed Databases 15
Query Processing Issues – Decision Sites
2008, 28 April CSCI585 - Distributed Databases 16
Query Processing Issues – Network Topology
2008, 28 April CSCI585 - Distributed Databases 17
Outline
� Problem Definition � Issues to Consider� Methodology
� Step 1: Query Decomposition� Step 2: Data Localization� Step 3: Global Optimization� Step 4: Local Optimization
2008, 28 April CSCI585 - Distributed Databases 18
Distributed Query Processing Methodology
2008, 28 April CSCI585 - Distributed Databases 19
Step 1 – Query Decomposition
2008, 28 April CSCI585 - Distributed Databases 20
Normalization
2008, 28 April CSCI585 - Distributed Databases 21
Analysis
2008, 28 April CSCI585 - Distributed Databases 22
Analysis - Example
2008, 28 April CSCI585 - Distributed Databases 23
Analysis - Example
2008, 28 April CSCI585 - Distributed Databases 24
Simplification
2008, 28 April CSCI585 - Distributed Databases 25
Simplification - Example
2008, 28 April CSCI585 - Distributed Databases 26
Restructuring
2008, 28 April CSCI585 - Distributed Databases 27
Restructuring - Transformation Rules
2008, 28 April CSCI585 - Distributed Databases 28
Restructuring - Transformation Rules
2008, 28 April CSCI585 - Distributed Databases 29
Example
2008, 28 April CSCI585 - Distributed Databases 30
Equivalent Query
2008, 28 April CSCI585 - Distributed Databases 31
Restructuring
2008, 28 April CSCI585 - Distributed Databases 32
Distributed Query Processing Methodology
2008, 28 April CSCI585 - Distributed Databases 33
Step 2 – Data Localization
2008, 28 April CSCI585 - Distributed Databases 34
Example
2008, 28 April CSCI585 - Distributed Databases 35
Provides Parallelism
2008, 28 April CSCI585 - Distributed Databases 36
Eliminates Unnecessary Work
2008, 28 April CSCI585 - Distributed Databases 37
Reduction for PHF
2008, 28 April CSCI585 - Distributed Databases 38
Reduction for PHF
2008, 28 April CSCI585 - Distributed Databases 39
Reduction for PHF
2008, 28 April CSCI585 - Distributed Databases 40
Reduction for PHF
2008, 28 April CSCI585 - Distributed Databases 41
Reduction for VF
2008, 28 April CSCI585 - Distributed Databases 42
Reduction for DHF
≠
2008, 28 April CSCI585 - Distributed Databases 43
Reduction for DHF
2008, 28 April CSCI585 - Distributed Databases 44
Reduction for DHF
2008, 28 April CSCI585 - Distributed Databases 45
Reduction for HF
2008, 28 April CSCI585 - Distributed Databases 46
Reduction for HF