copy of parallel database systems

17
Technical Seminar Presentation AMRITA MISHRA CS200198086 1 Parallel Database Systems PARALLEL DATABASE SYSTEMS PARALLEL DATABASE SYSTEMS Under the guidance of : Mr. Anisur Rahman

Upload: lipika008

Post on 16-Nov-2014

203 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 1

Parallel Database Systems

PARALLEL DATABASE PARALLEL DATABASE SYSTEMSSYSTEMS

Under the guidance of :

Mr. Anisur Rahman

Page 2: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 2

Parallel Database Systems

INTRODUCTIONA parallel database system (PDBS) is a DBMS implemented on a parallel computer which is made of a number of nodes (processors and memories) connected by a fast network within a cabinet.

It strives to exploit modern multiprocessor architectures using software-oriented solutions for data management.

Page 3: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 3

Parallel Database Systems

OBJECTIVEProblems of conventional DBMS

- high disk access time.

- very large databases cant be supportedwithin a single system.

PDBS is the only viable solution for increasing the I/O bandwidth through parallelism & for storing huge databases in a single system.

Page 4: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 4

Parallel Database Systems

ADVANTAGES OF PDBS High Performance – Increased throughput

(inter-query parallelism) & decreased response time (intra-query parallelism).

High Availability – Using data replication.

Extensibility – Linear scaleup and Linear speedup.

Page 5: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 5

Parallel Database Systems

PARALLEL DBMS ARCHITECTURE

Shared Memory Advantages – Simplicity, Load Balancing. Problems – Cost, Limited Extensibility, Low

Availability.

Page 6: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 6

Parallel Database Systems

Shared Disk Advantages – Cost, Extensibility, Load Balancing,

Availability.Problems – Higher Complexity, Potential Coherence

Problems.

Page 7: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 7

Parallel Database Systems

Shared NothingAdvantages – Cost, Extensibility, Availability.Problems – Complex, Addition of new nodes requires

reorganizing the database.

Page 8: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 8

Parallel Database Systems

PARALLEL DBMS TECHNIQUESDATA ALLOCATION – Methods that spread the database across the system’s disks to ensure efficient parallel I/O. Partitioning (Fragmentation) – 3 strategies # Round Robin – i th tuple to partition (i mod n)

for n partitions. # Hashing – Apply hash function to some attribute to give partition no. # Range Partitioning – Distribute tuples based

on value(ranges) of some attribute.

 

 

Page 9: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 9

Parallel Database Systems

elUSES OF DATA FRAGMENTATION Maximize system performance.

Minimize response time (through intra-queryparallelism).

Maximize throughput (through inter-queryparallelism).

Problems: Skewed data distributions lead tonon-uniform partitioning & hurt load balancing.

Page 10: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 10

Parallel Database Systems

  

QUERY PARALLELISM Inter query parallelism – parallel execution of multiple queries generated by concurrent transactions.

Intra query parallelism – Two kinds:#Inter-operator – Executing several operators of

the query tree on several processors in parallel.#Intra-operator – Same operator is executed by

many processors, each one working on a subset of the data.

Page 11: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 11

Parallel Database Systems

PARALLEL ALGORITHMS  

Page 12: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 12

Parallel Database Systems

  

Page 13: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 13

Parallel Database Systems

 

Page 14: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 14

Parallel Database Systems

PARALLEL JOIN EXAMPLE

 Insert into C Select * from A , B where A.x = B.y ;

Page 15: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 15

Parallel Database Systems

Page 16: Copy of Parallel Database Systems

T

echn

ical S

emin

ar P

rese

ntat

ion

AMRITA MISHRA CS200198086 16

Parallel Database Systems

CONCLUSIONThe availability of fast microprocessors &

small inexpensive disks is an ideal platform for parallel database systems.

Employing data partitioning, intra-operator parallelism etc., converting an existing DBMS into a highly parallel one is relatively straightforward.

Page 17: Copy of Parallel Database Systems

Tech

nica

l Sem

inar

Pre

sent

atio

nat

iona

l Ins

titut

e of

Scie

nce

& Te

chno

logy

AMRITA MISHRA CS200198086

Parallel Database Systems

17

THANK YOU!!!