BDT303: Data Science with Elastic MapReduce - AWS re:Invent 2012
Post on 05-Dec-2014
What is Netflix’s data warehouse?
a) Cassandra
b) Teradata
c) Hive
d) S3
DSE Platform
S3
Chukwa
Aegisthus
Sting
What is Netflix’s data warehouse?
a) Cassandra
b) Teradata
c) Hive
d) S3
S3
99.999999999% durability
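As a rough illustration of what the eleven-nines figure implies, assuming it is read as S3's designed annual durability per object (the slide gives only the number), the expected loss rate can be worked out exactly. The function name and the 10,000,000-object example are illustrative and do not come from the talk.

```python
from fractions import Fraction


def years_per_expected_loss(durability: Fraction, objects: int) -> Fraction:
    """Average number of years between single-object losses.

    Assumes `durability` is the annual probability that one object survives
    and that losses are independent across objects (a simplification).
    """
    annual_loss_per_object = 1 - durability
    expected_losses_per_year = annual_loss_per_object * objects
    return 1 / expected_losses_per_year


# Eleven nines, kept as an exact fraction to avoid floating-point rounding.
ELEVEN_NINES = Fraction(99_999_999_999, 100_000_000_000)

# Hypothetical bucket holding ten million objects.
print(years_per_expected_loss(ELEVEN_NINES, 10_000_000))  # prints 10000
```

Under these assumptions, a store of ten million objects would expect to lose about one object every 10,000 years.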
High SLA
Query
HDFS?
“Data Science as a Service”
• Execution Service / Genie
• Event Service
• Metadata Service
High SLA Cluster Job
High SLA
S3
Query Cluster Job
Query
Super SLA Cluster Job
Super SLA
High SLA Cluster Job
High SLA
S3
Query Cluster Job
Query
Super SLA Cluster Job
Questions?
http://jobs.netflix.com
kurtbrown@netflix.com