microsoft data platform airlift 2017 rui quintino machine learning with sql server 2016 and r...

33
Data Platform Airlift Rui Quintino Data Research, DevScope [email protected] Machine Learning with SQL Server 2016 and R Services 24 de fevereiro \\ Microsoft Lisbon Experience

Upload: rui-quintino

Post on 11-Apr-2017

39 views

Category:

Data & Analytics


1 download

TRANSCRIPT

Page 1: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

 

Data PlatformAirlift

Rui QuintinoData Research, [email protected]

Machine Learning withSQL Server 2016 and R Services

24 de fevereiro \\ Microsoft Lisbon Experience

Page 2: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Your feedback is important!

Page 3: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Keep in touch with Microsoft Azure

Try Azure for FREE now: https://azure.microsoft.com/free/

Step by step guide to setting up Azure SQLhttps://aka.ms/adc_guidetosettingupazuresql

Web Apps on Microsoft Azurehttps://aka.ms/adc_webappsazure

E-book | Azure Overviewhttps://aka.ms/adc_azureoverview

Free Manual | Microsoft Azure https://aka.ms/adc_manualgratisazure

Understanding Data Technologies on the Cloud

https://aka.ms/adc_datatechnologiescloud

E-book | Microsoft Loves Linuxhttps://aka.ms/adc_microsoftloveslinux

Page 4: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Agenda•Machine Learning•R – What & Why?•R, Microsoft & SQL Server•SQL 2016 R Services•Q&A

Page 5: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Machine Learning ?Using existing structured or unstructured data to:

1. Predict unknown/future data

2. Create intelligent & automated agents/services

3. Advanced Data Insights -> Why? Drivers? Root causes?

4. Content producers & “creative” agents

Page 6: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Agent allocation

Warehouse efficiency

Smart buildings

Predictive maintenance

Supply chain optimization

Churn/segmentation

Personalized offers

Product recommendation

Fraud detection

Risk management

Sales forecasting

Demand forecasting

Sales lead scoring

Marketing mix optimization

Sales and marketing

Finance and risk

Customer and channel

Operations and workforce

Investment modelling $$$

Example use cases (classic)Revenue Forecasting$$$Portfolio optimization

Page 7: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

What is

• A statistics programming language• A data visualization tool• Open source

• 2.5+M users • Taught in most universities

• Thriving user groups worldwide

• 10.000+ free algorithms in CRAN• Scalable to big data

• New and recent grad’s use it

Language Platform

Community

Ecosystem• Rich application & platform integration

Page 8: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

R & R Studio

Demo

Page 9: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services
Page 11: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

https://www.r-bloggers.com/10000-cran-packages/

Page 12: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

http://blog.revolutionanalytics.com/2017/01/cran-10000.html

Page 13: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

??

Lack of Commercial

Support

Inadequate/Limited

Performance

Complex Deploymen

tProcesses

Limited

Data Scale

Challenges posed by open source R

Page 14: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

• R & SQL Server• SQL Server is one of the most widely used SQL databases• R is the most widely used statistical and advanced

analytical language • Complications From Using R with SQL Databases• Requires Data Extraction• Bottlenecks in Performance• Data Sizes Limitations• Increases Security Risks• Increases Duplication Costs• Poor operationalization support

Before SQL Server 2016 & R Services

Page 15: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

April 6, 2015

Page 16: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

SQL Server 2016 EESQL Server 2016 SE

Growing Beyond Revolution Analytics

Red HatSUSE

Pre Acquisition

Microsoft R Server

Azure HDInsights

Azure

Expanding Product Family

SQL Server R Services

Post Acquisition

Continued Support of Enterprise R Solutions

Expanding Support for Open Source R

Cortana Analytics Suite

Open

Page 17: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Introducing Microsoft R Server

High-performance, Scalable R100% open source RCRAN, Bioconductor, MRAN, GitHub compatibilityBig-data connectivityScalable analyticsMulti-platform In-database, in-cluster scalabilityChoice of IDE (R Studio, R Tools for Visual Studio)

Linux, Windows, SQL Server, Hadoop & Teradata

Open Source Components

Licensed Components

CRAN

Micr

osof

t R

Open

DistributedR

ScaleR

ConnectR

DeployR IDE

R Server Technology

Page 18: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Introducing SQL Server 2016 R services

Included in SQL Server 2016

Reuse and optimize existing R codeEliminate data

movement

In-database deployment

Memory and disk scalability

No R memory limitsWrite once, deploy

anywhere

Enterprise speed and scale

Near-DB analytics Parallel threading and

processing Reuse SQL skills for data engineering

Cost effectiveness

Scalability and choice

Simplicity and agility

Page 19: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

SQL Server 2016SQL Server R ServicesIntegration Facilities:• Component

Integration• Launchers• Parameter

Passing• Results Return• Console Output

Return• Parallel Data

Exchange (RTM)• Stored Procedures• Package

Administration

SQL Server 2016 & SQL R Services

SQL ServerQuery

Processor

Algorithm Library

• Data Prep• Descriptive Stats• Sampling• Statistical Tests• Predictive Models

• Variable Selection• Clustering• Classification• Custom APIs for R +

CRAN• Parallel Scoring

Fast, Parallel, Storage Efficient Algorithms

Revolution R Open• 100% Open Source R• Fully CRAN

Compatible• Accelerated Math

Open Source R Interpreter

Page 20: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Using Parallel Algorithms, Remote Contexts

Ingest

Set Context

Use

Page 21: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Scalable algorithms

• 5+ hours to 40 seconds:

Page 22: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

R - Portability & investment assurance

Write Once – Deploy Anywhere

R Server portfolio

Cloud• Windows• Linux• HDInsight

• SQL Server 2016 EE• SQL Server 2016 SERDBMS• Windows• LinuxDesktops & Servers

Hadoop & Spark • Hortonworks• Cloudera• MapR

EDW • SQL Server 2016• Teradata Database

CRAN

Micr

osof

t R

Open

DistributedR

ScaleR

ConnectR

DeployR IDe

R Server Technology

Page 23: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services
Page 24: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

SQL Server 2016 R Services

Demo

Page 25: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Additional Notes/References•Azure Data Science Virtual Machine

Page 26: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Additional Notes/References•SQL 2016 R Services Virtual Labs

Page 27: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Additional Notes/References•Free ebook: Data Science with Microsoft SQL Server 2016

Page 28: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Additional Notes/References•New R packages•olapR•MicrosoftML •LightGBM (“xgboost” by Msft)

Page 30: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

SQL Server 2016 R Services

Q&A

Page 32: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

Free AzureTrial

Try SQL Server 2016http://aka.ms/trysql2016

http://aka.ms/tryazure

Try Power BIhttp://powerbi.com

Cortana Intelligence Serviceshttp://aka.ms/cortanaintelligence

Page 33: Microsoft Data Platform Airlift 2017 Rui Quintino Machine Learning with SQL Server 2016 and R Services

© 2016 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.