Data Days 2014 - Mikio Braun

WHAT IS REAL-TIME? Dr. Mikio L. Braun / streamdrill

Post on 25-Jun-2015

TRANSCRIPT

Page 1: Data Days 2014 - Mikio Braun

WHAT IS REAL-TIME?

Dr. Mikio L. Braun / streamdrill

Page 2: Data Days 2014 - Mikio Braun

WHAT IS REAL-TIME?

Page 3: Data Days 2014 - Mikio Braun

VELOCITY VS. DIVERSITY

• 100 events/second
• 360k per hour
• 8.6M per day
• 260M per month
• 3.2B per year
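The figures above follow from simple multiplication of the per-second rate. A minimal sketch of that arithmetic, assuming a 30-day month and a 365-day year (the slide does not state which conventions it uses, and the function name `event_volumes` is only illustrative):

```python
SECONDS_PER_HOUR = 60 * 60
HOURS_PER_DAY = 24


def event_volumes(events_per_second, days_per_month=30, days_per_year=365):
    """Scale a steady per-second event rate up to longer time windows.

    days_per_month and days_per_year are assumptions, not taken from the slide.
    """
    per_hour = events_per_second * SECONDS_PER_HOUR
    per_day = per_hour * HOURS_PER_DAY
    return {
        "hour": per_hour,
        "day": per_day,
        "month": per_day * days_per_month,
        "year": per_day * days_per_year,
    }


volumes = event_volumes(100)
for window, count in volumes.items():
    print(f"{count:,} events per {window}")
```

Under these assumptions the exact values are 360,000 per hour, 8,640,000 per day, 259,200,000 per month, and 3,153,600,000 per year, which the slide rounds to 360k, 8.6M, 260M, and 3.2B.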

http://www.flickr.com/photos/arenamontanus/269158554/

Page 4: Data Days 2014 - Mikio Braun

LATENCY VS. ACTION

Page 5: Data Days 2014 - Mikio Braun

BATCH VS. STREAM

Page 6: Data Days 2014 - Mikio Braun

APPROXIMATE VS. EXACT

Exact

Fast

Big Data

Approximate! Parallelize!

First seen here: http://www.slideshare.net/acunu/realtime-analytics-with-apache-cassandra

Page 7: Data Days 2014 - Mikio Braun

LESSONS LEARNED

• Process data as it comes in.
• In-memory, disk too slow.
• Focus on relevant data and approximate.
• 20k/s on single machine.
• 1M objects tracked per 1GB RAM.
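One way to picture "process data as it comes in" and "focus on relevant data and approximate" is a counter that keeps only a fixed number of keys in memory and evicts the least active one when it is full. The sketch below uses the well-known Space-Saving idea for this. The slides do not say which algorithm streamdrill uses, so the class name `SpaceSavingCounter`, its methods, and the capacity are all illustrative assumptions. For scale, 1M objects in 1GB of RAM works out to roughly 1 KB per tracked object.

```python
class SpaceSavingCounter:
    """Approximate per-key event counts using a bounded number of slots.

    Illustrative sketch of the Space-Saving technique, not streamdrill's code.
    Reported counts never underestimate the true count of a tracked key.
    """

    def __init__(self, capacity):
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self.counts = {}

    def add(self, key):
        """Process one event as it arrives; memory stays bounded by capacity."""
        if key in self.counts:
            self.counts[key] += 1
        elif len(self.counts) < self.capacity:
            self.counts[key] = 1
        else:
            # Evict the key with the smallest count; the newcomer inherits
            # that count plus one, so it may be overestimated by that amount.
            # The linear scan keeps the sketch short; a heap would be faster.
            victim = min(self.counts, key=self.counts.get)
            floor = self.counts.pop(victim)
            self.counts[key] = floor + 1

    def top(self, n):
        """Return the n keys with the largest (approximate) counts."""
        ranked = sorted(self.counts.items(), key=lambda kv: kv[1], reverse=True)
        return ranked[:n]


counter = SpaceSavingCounter(capacity=3)
for event in ["a"] * 5 + ["b"] * 3 + ["c", "d", "e"]:
    counter.add(event)
print(counter.top(1))
```

Here five distinct keys pass through a counter with only three slots. The frequent key "a" keeps its exact count, while rarely seen keys are evicted or overestimated, which is the trade of exactness for bounded memory that the slide describes.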

Page 8: Data Days 2014 - Mikio Braun

THANK YOU VERY MUCH FOR YOUR ATTENTION!

Dr. Mikio L. Braun / streamdrill