2015 Hortonworks MDA Roadshow Presentation
TRANSCRIPT
Copyright © 2015, SAS Institute Inc. All rights reserved.
Big Data Analytics with SAS and Hadoop
Felix Liao
Business Solutions Manager
SAS Australia/New Zealand
Copyright © 2014, SAS Institute Inc. All rights reserved.
Agenda
5 things you didn’t know about SAS (and Hadoop)
#1 SAS is the largest private software company in the world
1000+ customer sites in Australia & New Zealand
A market leader in the areas of Data Management, Reporting and Advanced Analytics
23% annual re-investment in R&D
#2 SAS has been doing machine learning for 39 years
SAS is the "800-pound gorilla" in the analytics space
- Gartner
Breadth and Depth of Analytical Capabilities
SAMPLE: Append, Data Partition, File Import, Filter, Input Data, Merge, Sample
EXPLORE: Association, Cluster, DMDB, Graph Explore, Link Analysis, Market Basket, MultiPlot, Path Analysis, SOM/Kohonen, StatExplore, Variable Clustering, Variable Selection
MODIFY: Drop, Impute, Interactive Binning, Principal Components, Replacement, Rules Builder, Transform Variables
MODEL: AutoNeural, Decision Tree, Dmine Regression, DM Neural, Ensemble, Gradient Boosting, LARS, MBR, Model Import, Neural Network, Partial Least Squares, Regression, Rule Induction, Two Stage
Incremental Response, Survival Analysis, Credit Scoring*
TS: TS Correlation, TS Data Prep, TS Decomp., TS Dimension Reduction, TS Exponential Smoothing, TS Similarity
HP: HP Cluster, HP Data Partition, HP Decision Tree, HP Explore, HP Forest, HP GLM, HP Impute, HP Neural, HP Principal Components, HP Regression, HP SVM, HP Transform, HP Variable Selection
ASSESS: Cutoff, Decisions, Model Comparison, Score, Segment Profile
UTILITY: Control Point, End Groups, Ext Demo, Metadata, Open Source Integration, Register Metadata, Reporter, SAS Code, Save Data, Score Code Export, Start Groups
#3 SAS is serious about and committed to Hadoop
Hadoop as catalyst for big data analytics
Bringing SAS analytics to Hadoop
Joint R&D effort with leading Hadoop vendors
Open Data Platform Initiative
SAS is a founding member of the Open Data Platform (ODP) initiative
Accelerate innovations around a stable common core platform
Maximize big data adoption and productivity
#4 SAS is a certified workload engine on YARN
We are very excited today to announce the next step in our joint journey achieved by integrating SAS HPA and LASR with the YARN resource manager so it will run as a first class citizen in the Hadoop cluster, co-existing and sharing cluster resources with other YARN enabled workloads running Hadoop and third-party YARN enabled applications.
- Arun C. Murthy
SAS & Hadoop Accelerating the Analytical Life Cycle
Prepare data IN Hadoop for analytics
Deploy and manage model score code IN Hadoop
Lift data IN to memory for analytics at scale
Model data at scale in-memory WITH advanced modeling tools
Explore data at scale, in-memory WITH data visualization
Prepare Hadoop Data: SAS Data Loader for Hadoop
Hadoop Data Discovery: SAS Visual Analytics
Model Development: SAS In-Memory Statistics for Hadoop
#5 SAS is delivering big data analytics today!
Now we can run hundreds and thousands of models at the product level - at the SKU level - because you have the big data and analytics to support those models at that level.
- Kerem Tomak (VP of Analytics)
We have a lot of data, but now we can start unleashing the power of that information
- Joanna Gurry (Head of Information)
SAS and Hortonworks - Rogers Media
40 million records per month in Hortonworks HDP
More than 600 relevant web characteristics
Processing data on 12 million customers
SAS High Performance Analytics to place better targeted ads
“Several of us from Rogers in the room looked at each other, and said ‘That is really wicked; that’s cool.’”
Chris Dingle
Senior Director of Audience Solutions
Rogers Communications
https://www.youtube.com/watch?v=YFtrK02VaM4
Five things you now know about SAS and Hadoop!
#1 SAS is the largest private software company in the world
#2 SAS has been doing machine learning for 39 years
#3 SAS is serious about and committed to Hadoop
#4 SAS is a certified workload engine on YARN
#5 SAS is delivering big data analytics today
http://www.sas.com/au/sashadoop
@felixliao
felixliao
Thank You!
http://www.sas.com/au/sashadoop