© 2010 Netezza, Inc. All rights reserved
Analytics Without Constraints
Netezza Integration with SAS Software
Data Analysis before Netezza
Where were we before Netezza?
• Extremely long processing cycles for data analysis (long waits)
• Working with small datasets (subsets), which is very restrictive
• Analytic models restricted to tests against small datasets
• Too much data being moved between servers
• Data models run for long periods against the full database, severely impacting other jobs
• Data models sometimes recoded in C/C++ for better performance
• Low productivity of the data analysts
SAS Software and Netezza DWH Appliance
[Diagram labels: SAS; SAS Score Code; SQL; SAS/Access to Netezza; SAS for Customer Scoring; SAS for Fraud Detection]
Confidential Briefing for Customer
Big Data Meets Big Math – Netezza + SAS Software
SAS Support on Netezza for Analytics Processing
Data Extraction
• Tools: Base SAS (DATA step), Base SAS (PROC SQL), SAS/Access via ODBC driver to Netezza
• Use case: build and score the model on the SAS server
• Method: create and score the model using SAS tools; extract data from Netezza into SAS datasets using PROC SQL, the DATA step, or the SAS/Access to ODBC connector
• Issues: too much data movement, redundant databases, slow processing
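A minimal sketch of the extraction pattern described above, written in Python with the standard-library sqlite3 module standing in for Netezza. This is an assumption made only so the example runs anywhere; the customers table, its columns, and the scoring formula are hypothetical. Every row is copied to the client before it is scored, which is the data movement listed as an issue.

```python
import sqlite3

# In-memory SQLite database used as a stand-in for the warehouse (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, spend REAL, visits INTEGER)")
conn.executemany(
    "INSERT INTO customers (id, spend, visits) VALUES (?, ?, ?)",
    [(1, 120.0, 3), (2, 40.0, 1), (3, 300.0, 8), (4, 15.0, 1)],
)


def score(spend, visits):
    """Hypothetical scoring formula, evaluated on the client side."""
    return 0.01 * spend + 0.5 * visits


# Extraction: every row leaves the database before any scoring happens.
rows = conn.execute("SELECT id, spend, visits FROM customers").fetchall()
rows_moved = len(rows)

# Scoring runs on the client against the extracted copy of the data.
scores = {cid: score(spend, visits) for cid, spend, visits in rows}
```

Here rows_moved equals the size of the whole table, so the transfer cost grows with the data rather than with the size of the result.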
Database Connector
• Tools: Base SAS, SAS/Access to Netezza
• Use case: build and score the model on the analytics server
• Method: create and score the model using SAS tools; connect to data in Netezza with SAS/Access to Netezza
• Benefits: high-speed connector; more flexible SQL allowed; support for native SAS formats and quicker data type transforms; SQL can be passed to the database (the "SQL Pass-Thru" option)
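The idea behind passing SQL through to the database can be sketched as follows. This is an illustration in Python, with sqlite3 standing in for Netezza (an assumption), and a hypothetical sales table. The complete query, including the aggregation, is handed to the database, so only the summarized rows come back to the client.

```python
import sqlite3

# In-memory SQLite database used as a stand-in for the warehouse (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales (region, amount) VALUES (?, ?)",
    [("east", 10.0), ("east", 25.0), ("west", 5.0), ("west", 7.5), ("west", 2.5)],
)

# The whole statement is sent to the database as written; the GROUP BY runs
# where the data lives instead of on the client.
passthru_sql = (
    "SELECT region, SUM(amount) AS total "
    "FROM sales GROUP BY region ORDER BY region"
)
totals = conn.execute(passthru_sql).fetchall()

# Compare the number of stored rows with the number of rows returned.
stored_rows = conn.execute("SELECT COUNT(*) FROM sales").fetchone()[0]
returned_rows = len(totals)
```

Five rows are stored but only two are returned, which is the reduction in data movement that a pass-through query aims for.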
In-Database Analytics
• Tools: SAS Enterprise Miner (EM), SAS Scoring Accelerator for Netezza
• Use case: build the model on the analytics server; score the model on the Netezza server
• Method: create models using SAS EM software; export the model from SAS EM into Netezza (SQL/UDF code is generated automatically for Netezza)
• Benefits: analyst productivity increases; less data movement; parallelized scoring on Netezza; faster data processing; users can publish scoring and user-defined formats to Netezza
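The publish-then-score idea above can be illustrated with a small Python sketch. It does not reproduce the code that SAS generates for Netezza: sqlite3 stands in for the database, and the function name score_model, the customers table, and the scoring formula are all hypothetical. The scoring logic is registered with the database as a user-defined function, and the scoring query then runs in the database so that only the scored result is returned.

```python
import sqlite3


def score_logic(spend, visits):
    """Hypothetical stand-in for exported model score code."""
    return 0.01 * spend + 0.5 * visits


# In-memory SQLite database used as a stand-in for the warehouse (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, spend REAL, visits INTEGER)")
conn.executemany(
    "INSERT INTO customers (id, spend, visits) VALUES (?, ?, ?)",
    [(1, 120.0, 3), (2, 40.0, 1), (3, 300.0, 8), (4, 15.0, 1)],
)

# "Publish" the scoring logic: register it as a two-argument SQL function.
conn.create_function("score_model", 2, score_logic)

# Score inside the database and return only customers above a threshold.
high_scores = conn.execute(
    "SELECT id, score_model(spend, visits) AS score "
    "FROM customers "
    "WHERE score_model(spend, visits) > 2.0 "
    "ORDER BY id"
).fetchall()
```

Unlike the extraction pattern, the raw rows never leave the database; the client receives only the ids and scores that pass the filter.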
Netezza Enabled with SAS Software
[Diagram: three integration approaches between the analyst and the data warehouse]
• Data Extraction: lots of data movement; scoring algorithms and transforms are pulled from the analyst and recoded as scoring processes by a DW developer
• Database Connector: less data and faster, with more transforms in-database; scoring algorithms and transforms are pulled from the analyst and recoded as scoring processes by a DW developer
• In-Database Analytics: process automation; scoring algorithms and transforms are pushed down by the analyst and published to the database as scoring processes
Netezza and SAS Connected
[Diagram: connection paths between the analytics server and the data warehouse]
• Data Extraction (ODBC). Analytics server: Base SAS DATA step, Base SAS PROC SQL, SAS/Access to ODBC. Data warehouse: ODBC, Netezza database
• Database Connector (fast). Analytics server: Base SAS PROC SQL, SQL Pass-Thru option, SAS/Access to Netezza. Data warehouse: SAS/Access to Netezza, Netezza database
• In-Database Analytics (faster). Analytics server: SAS Enterprise Miner, SAS Scoring Accelerator for Netezza. Data warehouse: SAS Scoring Accelerator for Netezza, Netezza database, in-database analytics
Netezza in the SAS Ecosystem
Open standards platform to enable actionable intelligence "out of the box"
[Diagram labels]
• Enterprise applications: HR, ERP, CRM
• Additional sources shown: SCM, RFID, Billing
• External source systems: Supplier Data, E-Commerce
• Transactional databases
• ETL: SAS DI Studio, or other
• Netezza Performance Server (NPS)
• Business analytics applications (representative vendors)
• Users: employees, customers, suppliers
• Key advantages: performance, scalability, low TCO
Catalina Marketing Case Study
Value Proposition
• Supercharge SAS with Netezza’s high performance, scalable scoring
• Effective resource utilization via automatic code generation
• Reduced end-to-end processing time
• Reduced time-to-model implementation
• Simplified infrastructure to maintain and administer
• SAS Enterprise Miner model development process is simple and easy to manage
• Leverage existing SAS knowledge and Netezza's high performance
• Increase Analyst Productivity
• Score Database more frequently for better results
Model Scoring In-Database
SAS Scoring Accelerator for Netezza
Copyright © 2009, SAS Institute Inc. All rights reserved.
SAS® In-Database
[Diagram comparing the traditional architecture with the in-database architecture on Netezza TwinFin™. Labels: Data Preparation, Analytic Modeling, SAS Modeling, SAS Scoring, SAS C & PMML Scoring]
SAS® Scoring Accelerator for Netezza 1.6
[Diagram labels: SAS Enterprise Miner; Model Export; SAS Model Manager or SAS publishing agent; Publishing Macro; Score as DATA Step code; Score Definitions; Format Definitions; SAS Format Library; SAS DATA Step Engine; sas_score(); NZ TwinFin™; BI Client; SAS 9.2]
Benefits
• Achieve higher model-scoring performance and faster time to results
• Improve accuracy and effectiveness of analytic models
• Reduce data movement and latency
• Eliminate model score code rewrite and model re-validation efforts (which are labor-intensive and error-prone)
• Consolidate data to improve regulatory compliance
• Better manage, provision, and govern data