arno candel aibythebay 030617

Post on 19-Mar-2017

722 Views

Category:

Technology

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

DeepLearningisLikeWater:It'sEverywhere!

ArnoCandel,PhDCTO,H2O.ai@ArnoCandel

AIByTheBay,SanFrancisco

March6,2017

MeettheH2OMakers

3

SoftwareProduct:H2O-AIforBusinessTransformation• DistributedDataFramewithScalableExecutionEngine• DistributedAlgorithmsDeepLearning,GBM,RF,GLM,K-Means,PCA,…• Apachev2OpenSource(github.com/h2oai)

H2OisEasytoUseandDeploy• h2o.ai/downloadandrunanywhere,immediately• ClientAPIs:R,Python,Java,Scala,REST,FlowGUI• Spark(cf.SparklingWater),Hadoop,BareMetal• Productionizewithauto-generatedJava/C++scoringcode

H2O.ai-MakersofH2O,SparklingWater,DeepWater,…

https://www.cbinsights.com

H2O.ai-AttheCoreofAI

H2O.ai-LovedByTheBest

H2O.ai-Visionaryin2017GartnerMQforDataScience

H2ODeepWatergot(Tech)Crunch’ed

H2O.ai-HighlightedbyFortuneMagazine

http://fortune.com/2017/02/23/artificial-intelligence-companies/

Powerful,ScalableTechniquesforDeepLearningandAI

brandnew:Dec2016

H2OBook-WrittenbytheCommunity

H2O.aiCustomerLove

10http://www.h2o.ai/customers/

11http://www.h2o.ai/customers/

H2O.aiCustomerLove

12

HighLevelArchitectureofH2O

HDFS

S3

NFS

DistributedIn-Memory

ParallelParser

LosslessCompression

H2OComputeEngine

ProductionScoringEnvironment

Exploratory&DescriptiveAnalysis

FeatureEngineering&Selection

Supervised&UnsupervisedModeling

ModelEvaluation&Selection

Predict

Data&ModelStorage

ModelExport: StandaloneScoringCode

C/C++/Java R/Py/etc.

DataPrepExport:PlainOldJavaObject

Local

SQL

LDAP Kerberos SSL HTTPS

HTTP

NativeAPIs:Java,Scala—RESTAPIs:R,Python,Flow,JavaScript,Java

13

library(h2o)h2o.init()h2o.deeplearning(x=1:4,y=5,as.h2o(iris))

importh2o

fromh2o.estimators.deeplearningimportH2ODeepLearningEstimator

h2o.init()

dl=H2ODeepLearningEstimator()

dl.train(x=list(range(1,4)),y="Species",training_frame=iris.hex)

import_root_.hex.deeplearning.DeepLearningimport_root_.hex.deeplearning.DeepLearningParametersvaldlParams=newDeepLearningParameters()dlParams._train=iris.hexdlParams._response_column=‘Speciesvaldl=newDeepLearning(dlParams)valdlModel=dl.trainModel.get

Allheavyliftingisdonebythebackend!

Built-ininteractiveGUIandnotebook-nocodingneeded!

LiveDemoofDistributedDeepLearninginH2O

Airline dataset: 116M flights in the U.S. over 20 years

10xin-memorycompressionvsCSV AllclusterCPUcoresarebusy

Brand-new:H2OXGBoostIntegration(GradientBoosting)

WhyXGBoost?Competitiveaccuracyandspeed(greatforKaggle)

GPUsupport(forsmall/mediumdata)Efficientonsparsedata

WhyintegrateintoH2O?Easeofuse(FlowGUI,R/PyAPIs)

Real-timemodelstatus(varimp,metrics)Efficientdatapreprocessing(sparse,categorical)

IntegrationintoH2Oecosystem(modeling,deployment,support)

LiveDemoofGPUGradientBoostinginH2O

Kaggle dataset: BNP Paribas Cardif Claims Management

DeepWater=THEDeepLearningPlatform

H2Ointegratesthetop open-sourceDLtools

NativeGPUsupport isupto100xfasterthan

EnterpriseReadyEasytotrainanddeploy,interactive,scalable,etc.Flow,R,Python,Spark/Scala,Java,REST,POJO,Steam

NewBigDataUseCases(previouslyimpossibleordifficultinH2O)

Image-socialmedia,manufacturing,healthcare,…Video-UX/UI,security,automotive,socialmedia,…Sound-automotive,security,callcenters,healthcare,…Text-NLP,sentiment,security,finance,fraud,…TimeSeries-security,IoT,finance,e-commerce,…

DeepWater:BestOpen-SourceDeepLearning

EnterpriseDeepLearningforBusinessTransformation

DeepWaterBringsState-Of-The-ArtDeepLearningonGPUstoH2O

H2ODeepLearning:simplemulti-layernetworks,CPUs

H2ODeepWater:arbitrarynetworks,CPUsorGPUs

Limitedtobusinessanalytics,statisticalmodels(CSVdata)

Largenetworksforbigdata(e.g.image1000x1000x3->3minputsperobservation)

1-5layersMBs/GBsofdata

1-1000layersGBs/TBsofdata

Open-Source-LeverageCommunityCode,DataandModels

BestImageClassifierasofAug2016:Google+MicrosoftHybridArchitecture

https://research.googleblog.com/2016/08/improving-inception-and-image.html

open-sourceimplementation

H2OtakesDLmodeldefinitionasinput

BuildyourownmodelswithDeepWaterToday!https://github.com/h2oai/h2o-3/blob/master/h2o-py/tests/testdir_algos/deepwater/pyunit_inception_resnet_v2_deepwater.py

LiveDemoofDeepWaterforImageClassificationonGPUs

Yesterday:SmallData(<GB) Today:BigData(TeraBytes,ExaBytes)

Data+Skillsaregoodforbusiness

Data+MachineLearningAREthebusiness

ThingsareChangingQuickly

ChallengesWithAIandDeepLearning

CEO:“WewilltransformourbusinesswithAI”Management:“HiresomeonetogiveusAI”SeniorDataScientist:“IshouldlookintoAI”JuniorDataScientist:“IuseTensorFlowallthetime”HighSchoolKid:“IdidmyinternshiponDeepLearning”AverageJoe:“Iwantaself-drivingcar(andkeepmyjob)”

StanfordProfessors:“focusoninterpretability,startwithsimplemodels!”

TheHypeandRealityofAI

H2O.aiStanfordAdvisors

stankrdpic

Sri/CEOBoydHastieTibshirani

inaconferenceroomnotsofaraway

me

WhichOpen-SourceAIPlatformtoUse?

WhichProgrammingLanguageToUse?

WhichoneforDevelopmentvsProduction?

WhichHardwareToUse?

WhichoneforDevelopmentvsProduction?

Analog/Neuromorphic

WhoDoestheWorkandonWhatInfrastructure?

WhichoneforDevelopmentvsProduction?

Cloud?Which? OnPremise?

DataLake?Micro-Services?

WhichoneforDevelopmentvsProduction?

WhenistheModelGoodEnough?

Crowdsourcing? TrustaGenius? InternalBake-Off?

Whatproblemareyousolvinginthefirstplace?

Whatproblemshould/couldyoubesolvinginstead?

Whatcanyoulearnfromthemodel?

Howcanyouimprovethemodels?More,betterdata?

Howcanyoucharacterizethemodel?

DoyouneedAI,DeepLearningorjustasimplemodel?

BacktotheDrawingBoard!

GradientBoosting Machine

Generalized LinearModeling

DeepLearning

Distributed RandomForest

DoyouneedAI,DeepLearningorjustaSimpleModel?

FutureOfAI:OrWhat’sLeftforHumanstoDo?

CharlieChaplin-ModernTimes1936

top related