arno candel aibythebay 030617
Post on 19-Mar-2017
722 Views
Preview:
TRANSCRIPT
DeepLearningisLikeWater:It'sEverywhere!
ArnoCandel,PhDCTO,H2O.ai@ArnoCandel
AIByTheBay,SanFrancisco
March6,2017
MeettheH2OMakers
3
SoftwareProduct:H2O-AIforBusinessTransformation• DistributedDataFramewithScalableExecutionEngine• DistributedAlgorithmsDeepLearning,GBM,RF,GLM,K-Means,PCA,…• Apachev2OpenSource(github.com/h2oai)
H2OisEasytoUseandDeploy• h2o.ai/downloadandrunanywhere,immediately• ClientAPIs:R,Python,Java,Scala,REST,FlowGUI• Spark(cf.SparklingWater),Hadoop,BareMetal• Productionizewithauto-generatedJava/C++scoringcode
H2O.ai-MakersofH2O,SparklingWater,DeepWater,…
H2O.ai-LovedByTheBest
H2O.ai-Visionaryin2017GartnerMQforDataScience
H2ODeepWatergot(Tech)Crunch’ed
H2O.ai-HighlightedbyFortuneMagazine
http://fortune.com/2017/02/23/artificial-intelligence-companies/
Powerful,ScalableTechniquesforDeepLearningandAI
brandnew:Dec2016
H2OBook-WrittenbytheCommunity
12
HighLevelArchitectureofH2O
HDFS
S3
NFS
DistributedIn-Memory
ParallelParser
LosslessCompression
H2OComputeEngine
ProductionScoringEnvironment
Exploratory&DescriptiveAnalysis
FeatureEngineering&Selection
Supervised&UnsupervisedModeling
ModelEvaluation&Selection
Predict
Data&ModelStorage
ModelExport: StandaloneScoringCode
C/C++/Java R/Py/etc.
DataPrepExport:PlainOldJavaObject
Local
SQL
LDAP Kerberos SSL HTTPS
HTTP
NativeAPIs:Java,Scala—RESTAPIs:R,Python,Flow,JavaScript,Java
13
library(h2o)h2o.init()h2o.deeplearning(x=1:4,y=5,as.h2o(iris))
importh2o
fromh2o.estimators.deeplearningimportH2ODeepLearningEstimator
h2o.init()
dl=H2ODeepLearningEstimator()
dl.train(x=list(range(1,4)),y="Species",training_frame=iris.hex)
import_root_.hex.deeplearning.DeepLearningimport_root_.hex.deeplearning.DeepLearningParametersvaldlParams=newDeepLearningParameters()dlParams._train=iris.hexdlParams._response_column=‘Speciesvaldl=newDeepLearning(dlParams)valdlModel=dl.trainModel.get
Allheavyliftingisdonebythebackend!
Built-ininteractiveGUIandnotebook-nocodingneeded!
LiveDemoofDistributedDeepLearninginH2O
Airline dataset: 116M flights in the U.S. over 20 years
10xin-memorycompressionvsCSV AllclusterCPUcoresarebusy
Brand-new:H2OXGBoostIntegration(GradientBoosting)
WhyXGBoost?Competitiveaccuracyandspeed(greatforKaggle)
GPUsupport(forsmall/mediumdata)Efficientonsparsedata
WhyintegrateintoH2O?Easeofuse(FlowGUI,R/PyAPIs)
Real-timemodelstatus(varimp,metrics)Efficientdatapreprocessing(sparse,categorical)
IntegrationintoH2Oecosystem(modeling,deployment,support)
LiveDemoofGPUGradientBoostinginH2O
Kaggle dataset: BNP Paribas Cardif Claims Management
DeepWater=THEDeepLearningPlatform
H2Ointegratesthetop open-sourceDLtools
NativeGPUsupport isupto100xfasterthan
EnterpriseReadyEasytotrainanddeploy,interactive,scalable,etc.Flow,R,Python,Spark/Scala,Java,REST,POJO,Steam
NewBigDataUseCases(previouslyimpossibleordifficultinH2O)
Image-socialmedia,manufacturing,healthcare,…Video-UX/UI,security,automotive,socialmedia,…Sound-automotive,security,callcenters,healthcare,…Text-NLP,sentiment,security,finance,fraud,…TimeSeries-security,IoT,finance,e-commerce,…
DeepWater:BestOpen-SourceDeepLearning
EnterpriseDeepLearningforBusinessTransformation
DeepWaterBringsState-Of-The-ArtDeepLearningonGPUstoH2O
H2ODeepLearning:simplemulti-layernetworks,CPUs
H2ODeepWater:arbitrarynetworks,CPUsorGPUs
Limitedtobusinessanalytics,statisticalmodels(CSVdata)
Largenetworksforbigdata(e.g.image1000x1000x3->3minputsperobservation)
1-5layersMBs/GBsofdata
1-1000layersGBs/TBsofdata
Open-Source-LeverageCommunityCode,DataandModels
BestImageClassifierasofAug2016:Google+MicrosoftHybridArchitecture
https://research.googleblog.com/2016/08/improving-inception-and-image.html
open-sourceimplementation
H2OtakesDLmodeldefinitionasinput
BuildyourownmodelswithDeepWaterToday!https://github.com/h2oai/h2o-3/blob/master/h2o-py/tests/testdir_algos/deepwater/pyunit_inception_resnet_v2_deepwater.py
LiveDemoofDeepWaterforImageClassificationonGPUs
Yesterday:SmallData(<GB) Today:BigData(TeraBytes,ExaBytes)
Data+Skillsaregoodforbusiness
Data+MachineLearningAREthebusiness
ThingsareChangingQuickly
ChallengesWithAIandDeepLearning
CEO:“WewilltransformourbusinesswithAI”Management:“HiresomeonetogiveusAI”SeniorDataScientist:“IshouldlookintoAI”JuniorDataScientist:“IuseTensorFlowallthetime”HighSchoolKid:“IdidmyinternshiponDeepLearning”AverageJoe:“Iwantaself-drivingcar(andkeepmyjob)”
StanfordProfessors:“focusoninterpretability,startwithsimplemodels!”
TheHypeandRealityofAI
H2O.aiStanfordAdvisors
stankrdpic
Sri/CEOBoydHastieTibshirani
inaconferenceroomnotsofaraway
me
WhichOpen-SourceAIPlatformtoUse?
WhichProgrammingLanguageToUse?
WhichoneforDevelopmentvsProduction?
WhichHardwareToUse?
WhichoneforDevelopmentvsProduction?
Analog/Neuromorphic
WhoDoestheWorkandonWhatInfrastructure?
WhichoneforDevelopmentvsProduction?
Cloud?Which? OnPremise?
DataLake?Micro-Services?
WhichoneforDevelopmentvsProduction?
WhenistheModelGoodEnough?
Crowdsourcing? TrustaGenius? InternalBake-Off?
Whatproblemareyousolvinginthefirstplace?
Whatproblemshould/couldyoubesolvinginstead?
Whatcanyoulearnfromthemodel?
Howcanyouimprovethemodels?More,betterdata?
Howcanyoucharacterizethemodel?
DoyouneedAI,DeepLearningorjustasimplemodel?
BacktotheDrawingBoard!
GradientBoosting Machine
Generalized LinearModeling
DeepLearning
Distributed RandomForest
DoyouneedAI,DeepLearningorjustaSimpleModel?
FutureOfAI:OrWhat’sLeftforHumanstoDo?
CharlieChaplin-ModernTimes1936
top related