Big Data. Small Data. All Data.
Stéphane D’Avril-Favreau
Technology Specialist, Data Platform & BI
Enabling Familiar, Powerful Business Intelligence
Big Data. Small Data. All Data
Data sources
Non-Relational Data
Microsoft’s technology can uniquely speed people to insights and action
Scale out technologies in SQL Server Parallel Data Warehouse
PDW 2012 Appliance design
Server types
– Control
– Passive Failover
– Active Compute
Representative ¼ Rack Configuration: Control Server, Passive Failover Server, two Active Compute Servers, Storage JBOD
Each server is configured with two 8-core processors
The Parallel Data Warehouse (PDW) appliance is a single unit made up of two or more compute nodes, all controlled by a single PDW control machine.
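The control/compute split described above can be illustrated with a minimal scatter-gather sketch. It is not PDW's actual implementation: the hash placement of rows, the function names (`node_for`, `distribute`, `compute_partial`, `control_query`) and the sample `sales` rows are all assumptions made for illustration, since the slides do not say how data is placed on the compute nodes.

```python
import zlib

def node_for(key, num_nodes):
    """Pick a compute node for a key (assumed hash placement, not from the slides)."""
    return zlib.crc32(str(key).encode("utf-8")) % num_nodes

def distribute(rows, key, num_nodes):
    """Spread rows across compute nodes by hashing the distribution key."""
    nodes = [[] for _ in range(num_nodes)]
    for row in rows:
        nodes[node_for(row[key], num_nodes)].append(row)
    return nodes

def compute_partial(node_rows, group_key, value_key):
    """Work done on one compute node: a local SUM ... GROUP BY."""
    partial = {}
    for row in node_rows:
        group = row[group_key]
        partial[group] = partial.get(group, 0) + row[value_key]
    return partial

def control_query(nodes, group_key, value_key):
    """Work done on the control node: merge the partial results."""
    total = {}
    for node_rows in nodes:
        for group, subtotal in compute_partial(node_rows, group_key, value_key).items():
            total[group] = total.get(group, 0) + subtotal
    return total

# Hypothetical sample data.
sales = [
    {"order_id": 1, "region": "East", "amount": 100},
    {"order_id": 2, "region": "West", "amount": 40},
    {"order_id": 3, "region": "East", "amount": 50},
    {"order_id": 4, "region": "West", "amount": 30},
]

nodes = distribute(sales, "order_id", num_nodes=2)
totals = control_query(nodes, "region", "amount")
print(totals)
```

In this sketch, adding a compute node only changes `num_nodes`; the merge step on the control side stays the same, which is the idea behind scaling out rather than up.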
Smallest (0TB) To Largest (5PB)
• Start small with a warehouse of a few terabytes
• Add capacity up to 5 petabytes
Start Small And Grow
No Downtime
Start Small With a Few TB and Scale Out Linearly
Scale out non-relational data in HDInsight (for Azure or PDW)
PolyBase
SQL Result set
A Definition of Big Data – 4Vs
Big data: techniques and technologies that make handling data at extreme scale economical.
Volume
Velocity
Variety
Variability
MapReduce (Job Scheduling/Execution System)
HDFS (Hadoop Distributed File System)
HBase (Column DB)
Hive
Mahout
Oozie
Sqoop
HBase/Cassandra/Couch/MongoDB
Avro
ZooKeeper
Pig
Hadoop = MapReduce + HDFS
Flume
Cascading
R
Ambari
HCatalog
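The "Hadoop = MapReduce + HDFS" line can be made concrete with a small in-memory sketch of the MapReduce pattern: a word count. It does not use Hadoop itself. The `blocks` list is a hypothetical stand-in for file blocks that HDFS would store across machines, and the function names are illustrative only.

```python
from collections import defaultdict

def map_phase(block):
    """Map: emit a (word, 1) pair for every word in one block of text."""
    for word in block.lower().split():
        yield word, 1

def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single count."""
    return key, sum(values)

def word_count(blocks):
    """Run map, shuffle and reduce over every block, in one process."""
    pairs = [pair for block in blocks for pair in map_phase(block)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())

# Hypothetical input: each string plays the role of one stored block.
blocks = ["big data small data", "all data"]
counts = word_count(blocks)
print(counts)
```

On a real cluster the map calls would run in parallel close to where each block is stored, and the shuffle would move data between machines. Here everything runs sequentially so the three phases stay visible.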
Web app optimization
Smart meter monitoring
Equipment monitoring
Advertising analysis
Life sciences research
Fraud detection
Healthcare outcomes
Weather forecasting
Social network analysis
Churn analysis
Traffic flow optimization
IT infrastructure optimization
Legal discovery
Natural resource exploration
Windows Azure
Easy Access to Data, Big and Small
Freedom of deployment options and hybrid solutions