120412 oracle big data summit

18
De praktijk van Big Data En waarom de huidige technologie niet (altijd) voldoet Friso van Vollenhoven [email protected]

Upload: xebia-nederland-bv

Post on 10-Jun-2015

499 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: 120412 oracle big data summit

De prakti jk van Big Data

En waarom de huidige technologie niet (alt i jd) voldoet

Friso van [email protected]

Page 2: 120412 oracle big data summit

Big Data

Page 3: 120412 oracle big data summit

Big Data

Page 4: 120412 oracle big data summit

Big Data

Page 5: 120412 oracle big data summit

Big Data

Requirement:Full table scan, 200GB table

Page 6: 120412 oracle big data summit

Big Data

Page 7: 120412 oracle big data summit

Big Data

Egypte, 27 januari 2011

Page 8: 120412 oracle big data summit

Big Data

Requirement:40.000 updates per seconde, 24/7.

Page 9: 120412 oracle big data summit

Databases

++=

Page 10: 120412 oracle big data summit

Databases

++=

SANstorage

network

Page 11: 120412 oracle big data summit

HDFS en MapReduce

storagenetwork

CLIENT

bottleneck

SELECT SESSION, COUNT(*) FROM WEB_CLICKS GROUP BY SESSION;

Page 12: 120412 oracle big data summit

HDFS en MapReduce

storagenetwork

bottleneck

CLIENT

SELECT SESSION, COUNT(*) FROM WEB_CLICKS GROUP BY SESSION;

Page 13: 120412 oracle big data summit

HDFS en MapReduce

Page 14: 120412 oracle big data summit

HDFS en MapReduce

SELECT * FROMWEB_CLICKS;

SELECT * FROMWEB_CLICKS;

SELECT * FROMWEB_CLICKS;

Page 15: 120412 oracle big data summit

HDFS en MapReduce

GROUP BY SESSION

Page 16: 120412 oracle big data summit

HDFS en MapReduce

COUNT(*)

COUNT(*)

COUNT(*)

Page 17: 120412 oracle big data summit

HDFS en MapReduce

SELECT * FROMWEB_CLICKS;

SELECT * FROMWEB_CLICKS;

SELECT * FROMWEB_CLICKS; COUNT(*)

COUNT(*)

COUNT(*)

GROUP BY SESSION

MAP REDUCE

MAP REDUCE

MAP REDUCE

SORT/SHUFFLE

Page 18: 120412 oracle big data summit

NoSQL

A B C D E F G H I J K L M N O P Q R S T U V W X Y Zindex