sujal and scott fina lb
DESCRIPTION
2011 EMC worldTRANSCRIPT
© Copyright 2010 EMC Corporation. All rights reserved. 1
EMC WORLDLas Vegas 2011
2© Copyright 2011 EMC Corporation. All rights reserved.
Big Data, Big OpportunitySujal PatelPresident, Isilon Storage Division
May 9, 2011
3© Copyright 2011 EMC Corporation. All rights reserved.
!!!
!!!
!!!
!!!
!!!
“Big Data Is Less About Size, And More About Freedom”―Techcrunch
!!!
!!!
!!!“Findings: ‘Big Data’ Is More Extreme Than Volume”― Gartner
“Big Data! It’s Real, It’s Real-time, and It’s Already Changing Your World” ―IDB
“Total data: ‘bigger’ than big data”
― 451 Group
THE ERA OF
BIG DATA
IS HERE…
4© Copyright 2011 EMC Corporation. All rights reserved.
ActEMC Documentum xCP
The EMC Big Data “Stack”
AnalyzeEMC Greenplum + Hadoop
StoreEMC Isilon + Atmos
?
PetabyteScale1
Structured &
Unstructured
2
Real Time3
Collaborative4
5© Copyright 2011 EMC Corporation. All rights reserved.
Big Data Is Changing Enterprise Storage
2009 2010 2011 2012 2013 20140
10
20
30
40
50
60
70
80
90
Source: IDC
File Based: 60.7% CAGR Block Based: 21.8% CAGREX
ABYTES
By 2012, 80% of all storage capacity sold will be for file-based data
BigData
Sources
6© Copyright 2011 EMC Corporation. All rights reserved.
Scale-UP Architectures NOT Ideal for Big Data
ScalabilityPerformanceManagementAvailabilityCost
SCALE UPSCALE UP
Sto
rage
Netw
ork
Serv
er
7© Copyright 2011 EMC Corporation. All rights reserved.
Architectural Transition - Scale-OUT for Big Data
ScalabilityPerformanceManagementAvailabilityCost
10 GbENETWORK 10X
PERFORMANCE
SCALE OUTSCALE OUT
CLUSTERED COMPUTING&
SERVER VIRTUALIZATION
SCALE-OUTSTORAGEARCHITECTURESS
tora
ge
Netw
ork
Serv
er
8© Copyright 2011 EMC Corporation. All rights reserved.
It’s Day and Night Different
Scale-UpScale-Out
Shared StorageShared Nothing
ManualAutomated
Performance BottlenecksLinear Scalability
Increasing ComplexityOperational Efficiency
9© Copyright 2011 EMC Corporation. All rights reserved.
Isilon: Scale-Out NAS Innovation
Perf
orm
an
ce
Massive Scalability15+ PB in a single file system
Scalability
App
Management Simplicity
Industry-leading Reliability and Self-Healing
Application and Workflow Consolidation
Unmatched PerformanceUp to 85 GB/s of throughput and 1.2M+ IOPS
10© Copyright 2011 EMC Corporation. All rights reserved.
Big Data Requires:
Tremendous Scalability of Capacity and Performance.
11© Copyright 2011 EMC Corporation. All rights reserved.
VIDEO144-Node Cluster Build
12© Copyright 2011 EMC Corporation. All rights reserved.
Core Innovation…Value to CustomersIsilon’s OneFS Scale-Out Operating System
Single file system, single volume...up to 15+ PBs
>80% raw storage utilization
Highest performance, fully symmetric cluster
Easy to manage and grow
Multi-tier single file system/single cluster
Single unified platform across all products
13© Copyright 2011 EMC Corporation. All rights reserved.
Extension NodesPlatform Nodes
Isilon Scale-out NAS Product Portfolio
S-SERIES X-SERIES NL-SERIESACCELERATOR
& BACKUP ACCELERATOR
ISILON EX NODES
HA
RD
WA
RE
AP
PL
ICA
TIO
NS
OF
TW
AR
EO
PE
RA
TIN
GS
YS
TE
M
14© Copyright 2011 EMC Corporation. All rights reserved.
Isilon Scale-Out NAS -- Benefits
Utilization
ProductivityCost
ROI
15© Copyright 2011 EMC Corporation. All rights reserved.
• Large-scale home directories
• Large-Scale File-Archives
• Disaster Recovery & Business Continuance
Enterprise IT Workflows @ Scale
• Private Cloud
• Tier-3 Server Virtualization
• Storage Consolidation
Enterprise Shared Infrastructure
• Quantitative Finance
• Seismic Processing
• Research & Analysis
• Bioinformatics
High Performance Computing
• Media & Entertainment
• Life Sciences
• Internet & Web 2.0
• EDA & Software Development
IndustrySolutions
Isilon Solutions For…
16© Copyright 2011 EMC Corporation. All rights reserved.
Isilon Enables Success For…
16
17© Copyright 2011 EMC Corporation. All rights reserved.
Why Isilon?Because Big Data Demands New Thinking
Product Families Purpose-built to Optimize for IOps, Throughput and/or $/TB.
Record-breaking Scaling of Capacity and Performance.
Remarkable Simplicity.
© Copyright 2010 EMC Corporation. All rights reserved. 18
EMC WORLDLas Vegas 2011
19© Copyright 2011 EMC Corporation. All rights reserved.
May 9, 2011
Big Data, Big OpportunityScott YaraCo-founder Greenplum and VP of Products, EMC Data Computing Division
20© Copyright 2011 EMC Corporation. All rights reserved.
J U LY 2 0 1 0 - E M C A C Q U I R E S G R E E N P L U M
“For three years, Gartner has identified Greenplum as the most advanced vendor in the visionary
quadrant of its data warehouse DBMS Magic Quadrant….”
– Gartner
Background:Greenplum Joins EMC in July, 2010…
21© Copyright 2011 EMC Corporation. All rights reserved.
Background: EMC + Greenplum, A Fast Track to Innovation• EMC Leverage:
– Established new Data Computing Products Division• Reports directly to Pat Gelsinger, President & COO
– Investing to grow from 150 employees to +600 in 2011• Increase R&D organization by more than 3X
– Launched new Greenplum Data Computing Appliance • Built by EMC manufacturing, single-call support
• Simple integration with complimentary EMC products
• Available globally, serviced by EMC
– Established joint R&D with VMware around Enterprise Data Cloud– Building disruptive solutions with EMC’s global, tier-1 partners
22© Copyright 2011 EMC Corporation. All rights reserved.
Current Success and Market Momentum
• 22
• Leaders Quadrant in Gartner DW 2011
• Mission critical deployments across multiple industries
• Installations from small (TBs) to very large (PBs)
• Scalable analytics platform to complement EDW
23© Copyright 2011 EMC Corporation. All rights reserved.
Steve Jobs, 1995
23© Copyright 2011 EMC Corporation. All rights reserved.
To make step-function changes, revolutionary changes, seems to take a very unique combination of timing, technology, talent…and luck to make significant change in our industry. It hasn't happened that often.
24© Copyright 2011 EMC Corporation. All rights reserved.
DATADATAINTERNETINTERNETPERSONAL COMPUTER
Data Is A REVOLUTIONARY Change
PERSONAL COMPUTER
25© Copyright 2011 EMC Corporation. All rights reserved.
RetailPhone/TV
Government Internet
Medical
Financial
New Realities. The New Normal.
© Copyright 2011 EMC Corporation. All rights reserved.
DataCol lectors
26© Copyright 2011 EMC Corporation. All rights reserved.
New Realities. The New Normal.
© Copyright 2011 EMC Corporation. All rights reserved.
DataCol lectors
Internet
Phone/TV Retail
GovernmentLaw EnforcementPublic Education
Medical
Financial
DataDevices
27© Copyright 2011 EMC Corporation. All rights reserved.
New Realities. The New Normal.
© Copyright 2011 EMC Corporation. All rights reserved.
AnalyticServices
Information Brokers
Advertising
Websites
CatalogCo-Ops
Credit Bureaus
ListBrokers
MediaArchives
DataAggregators
RetailPhone/TV
Government Internet
Medical
Financial
DataCol lectors
DataDevices
28© Copyright 2011 EMC Corporation. All rights reserved.
New Realities. The New Normal.
© Copyright 2011 EMC Corporation. All rights reserved.
AnalyticServices
Advertising
DataAggregators
LawEnforceme
nt
Media
BanksGovernment
DeliveryService
PrivateInvestigators
/Lawyers
Marketers Employers
Individual
DataUsers /Buyers
Websites
Information Brokers
MediaArchives
Credit Bureaus
ListBrokers
CatalogCo-Ops
RetailPhone/TV
Government Internet
Medical
Financial
DataCol lectors
DataDevices
29© Copyright 2011 EMC Corporation. All rights reserved.
But Why Now?
Inn
ovati
on
Time
X86
Storage
Virtualization
Networking
WebConvergen
ce(aka “cloud”)
30© Copyright 2011 EMC Corporation. All rights reserved.
100-1000XFASTER AND CHEAPER
Processing data is now
31© Copyright 2011 EMC Corporation. All rights reserved.
What do we need?
32© Copyright 2011 EMC Corporation. All rights reserved.
We Need…
a complete big data analytics stackAnd
Data Scientists
Innovation
Community
33© Copyright 2011 EMC Corporation. All rights reserved.
34© Copyright 2011 EMC Corporation. All rights reserved.
EMC HADOOPUnstructured.Real-time.Enterprise-Ready.
35© Copyright 2011 EMC Corporation. All rights reserved.
What Is Hadoop?• Apache Hadoop is an open-source technology
inspired by Google’s MapReduce and Google File System papers
• It is a software framework that supports data-intensive distributed applications and is effective for analyzing and storing massive amounts of data
• Leading internet companies like Yahoo!, Facebook, eHarmony, Twitter, and eBay, have pioneered the use of Hadoop
36© Copyright 2011 EMC Corporation. All rights reserved.
Greenplum HD Product Family• Greenplum HD Community Edition:
– Certified Full-Stack, 100% Open Source– Virtual Machine Appliance– All core feature development contributed back to Apache Hadoop
• Greenplum HD Enterprise Edition:– Differentiated, hybrid distribution, advanced features– Integrated, tested, hardened– 100% Hadoop, HBase, HDFS API compatible
• Greenplum HD Data Computing Appliance:– Optimized appliance configuration– Eliminates complexity, simplifies deployment and management– Seamless integration with Greenplum Database
37© Copyright 2011 EMC Corporation. All rights reserved.
Greenplum HD InnovationsMajor Technical Innovations for Hadoop
Pluggable I/O
• Isilon OneFS
• Atmos
• Cassandra
• MapR
• Enables greater efficiency and performance
Real-time Processing
• Low latency read/write operations
• Realtime data interaction and analytic processing
• Integration with Cassandra and MapR
Fault-Tolerance
• Eliminate SPOF for Name-Node
• Job Tracker and other key components
38© Copyright 2011 EMC Corporation. All rights reserved.
GREENPLUM HDDATA COMPUTING APPLIANCEThe Powerful Combination of Greenplum Database and Apache Hadoop
39© Copyright 2011 EMC Corporation. All rights reserved.
Building a Complete Big Data Analytics Stack
Analytic Toolsets(Business Analytics, BI, Statistics, etc.)
Greenplum ChorusEnterprise Collaboration Platform for Data
Greenplum Data Computing Appliances
Purpose-built for Big Data Analytics
Greenplum DatabaseEnterprise & Community EditionsWorld’s Most Scalable MPP Database
Platform
Greenplum HDHadoop Enterprise & Community Editions
Enterprise Analytics Platform for Unstructured Data
40© Copyright 2011 EMC Corporation. All rights reserved.
Celebrating Big Data Innovatorswww.DataHeroAwards.com
41© Copyright 2011 EMC Corporation. All rights reserved.
Data Hero Award WinnersSilver Spring Networks – Energy Category
42© Copyright 2011 EMC Corporation. All rights reserved.
Data Hero Award WinnersBroad Institute of MIT and Harvard – Life Sciences Category
43© Copyright 2011 EMC Corporation. All rights reserved.
Data Hero Award WinnersVivek Kundra, U.S. CIO – Visionary Award
44© Copyright 2011 EMC Corporation. All rights reserved.
VIDEO
45© Copyright 2011 EMC Corporation. All rights reserved.
Big Data = Big Opportunity
46© Copyright 2011 EMC Corporation. All rights reserved.
Thank you
47© Copyright 2011 EMC Corporation. All rights reserved.
EMC WORLDLas Vegas 2011