machine learning in wan research - internet2 · • convolutional neural networks recurrent neural...

24
Machine Learning in WAN Research Mariam Kiran [email protected] Energy Sciences Network (ESnet) Lawrence Berkeley National Lab Oct 2017 Presented at Internet2 TechEx 2017

Upload: others

Post on 28-May-2020

14 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

[email protected]

EnergySciencesNetwork(ESnet)LawrenceBerkeleyNationalLab

Oct2017

PresentedatInternet2TechEx 2017

Page 2: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

Outline

• MLingeneral

• MLinnetworkresearch– LiteratureReviewofresearchfrom[2010- Sept2017]ofMLalgorithmsinWANs

– Commonareas,datainvolved,whatproblemssolved

• RoadAhead(unexploredareas)

Page 3: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

AI,ML,DL– What’stheDifference?

• Turing“CanMachinesThink”– TuringTest:Exhibithuman-likeintelligence• MachinelearningiscollectionofalgorithmsthatcanhelpachieveAI

• e.g spamfilters,HRhiring,etc• DeeplearningisoneoftheseMLtechniques

• RecentadvancesduetoGPUandHPCprocessing(previouslyveryslow,toomuchdata,needtrainingtowork)

• Mainlyforimageandspeechrecognition– commercialapps

CourtesyNvidia Blog

Page 4: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

AITree(exampletechniques)onlyasubsetareMLalgorithms

AI

Optimizationtechnique

Manymore….

Expertsystems

Fuzzysystems

NeuralNetworks

Evolutionaryalgorithms(Geneticalgorithms,evolutionarystrategies,etc)

Swarmintelligence(antcolony,etc)

Deepbeliefnetworks

Deepboltzman networks

Convolutionalnetworks

Stackedautoencoders

Networks:graphalgorithm(routing–shortestpath)

ML:Whereevertrainingor‘learning’onstatisticaldata

RandomForrest,Clustering,etc

Page 5: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

5

Algorithmschosendependingon- dataavailable- problembeingsolved- combiningmultipletechniques(some50%accuracy,others80%accuracy)

Page 6: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

Example:ChoosingAlgorithmsforProblems(e.g.DNNs)

Deepneuralnetwork

InputData Appliedfor Variants

Feedforwardneuralnetwork

Hierarchicaldatarepresentations

• Generalclassification• Clustering• Anomalyfinding• Featureextraction

• Deepbeliefnetworks(usesrestrictedboltzmanmachineforactivationfunction)

• Convolutionalneuralnetworks

Recurrentneuralnetwork

Sequentialdatarepresentation(i.e.timeseriesdata)

Sequentiallearning (whentimerelationshipexists)

Longshorttermmemory(LTSM)usedforspeechtranslation

• There are many variants of DNNs. Papers and researchers in each specific DNN.

• DeepMind used Deep Q-learning for Attari and Go• Action-pairs based on learned data.

Page 7: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

MultipleToolsAvailable(DLLibraries)

• Google’s DNN platform TensorFlow used to tag unlabeled videos, recognize images with 70% accuracy and predict Gmail replies

• Scikit-learn good for learning, python library• Mostly used in image analysis

• HPC innovation: analyze massive data sets, quick training • Model and data parallelism to reduce the training time

Toolkit Language Use Processing capabilityCaffe C++ Images and video Distributed

(HPC, GPU)TensorFlow Python Images, regression, video, text, speech Distributed

(HPC, GPU)Theano Python Images Distributed

(HPC, GPU)Torch Lua Images and speech Distributed

(HPC, GPU)

Page 8: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

BringingitbacktoNetworks…(Reviewingpaperssince2010)

Page 9: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

MachinelearningUsecases(IETFforums)

• NetworkSecurity– Normalandoutlierbehaviorsintraffic

• Changeorpredictpossiblebehavior– This<QoS value>willcausethis<eventY>withprobability<P>

• Bugdetection– Softwareorhardwarefaults

• WANpathoptimization– Anticipatecongestion– Diverttraffictoalternatepaths

Page 10: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

ConductedaSystematicLiteratureReview

• Step1:Identifyresearchquestions

• Step2:Identifyasearchstring– “Wideareanetworks”AND(estimateORpredict)AND(learningOR‘‘datamining’’OR‘‘artificialintelligence’’OR‘‘patternrecognition’’ORregressionORclassificationORoptimization)

• Step3:Identifyrelevantlibraries,journals,papers– IEEEXplore,ACMDigitalLibrary,ScienceDirect,WebofScience,EICompendex,andGoogleScholar

Step1:Researchquestions

Step2:Searchstrategy

Step3:Studyselectioncriteria

Step3:Quality

assessment

Relevantpapers

Page 11: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

But…toomanypapersfound

• Spacewastoolarge:

• WANarecompletesystems

• Havemultiplelayers(e.g.seepicture)

• MultipleWANproblems

• Solution“Letsorganizetheresultsbasedon”:

• Createcategoriesofsimilarproblems

• ExploreMLandnon-MLsolutions

• Whichdatasetswereused

Page 12: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

CategorizingsimilarProblems

Usertrafficdata Usertraffic(directedflows)

12

WANTopology(trafficengineering)

(flow-level,trafficprediction,adaptation,pathoptimization,linkfailure)Infrastructuretrafficdata

(Packet-level,queues,TCP,UDP)

Infrastructure-levelmodifications(Switches,deployment,etc)

Page 13: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

MachinelearningapproachesinWAN

networks

2)TopologyEngineering

Trafficprediction

Trafficadaptation

Pathoptimization

Faultfinding

Multipledatacenter

connectivity

4)Infrastructureoptimization

1)Usertrafficoptimization

3)Packetleveloptimizations

TCPspecificproblems

Controllerplacements

Scheduling,congestion

Switchconfigurations

Note:SDNrelatedin(2,3,4)

Actual‘Actions’ontheWAN

Page 14: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

Results

Page 15: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

RelevantPapers:Statistics

IEEEExplore

ACMpub

ScienceDirect

WebofScience

#188

• Removeduplications

• Applyselectioncriteria

• Searchadditionalrelevancethroughreferences

• Removesurveys

• Applyqualityassessment

#3

#10

#532

#25

#223

Note:Googlescholargavemanyirrelevantresultsandisnotregardedasagoodpublicationsearchtool.

Page 16: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

Results– peryear(1)

• RiseofMLtechniquesin2017(WorkshopsatSigComm,HotNets,etc)

0

5

10

15

20

25

30

2010 2011 2012 2013 2014 2015 2016 2017

ML Non-ML

No.ofpapers

Page 17: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

Results– percategory(2)

• Non-MLstilllargelyfavored– problemsolving

• MostMLtechniquesareusedforclassification(oftraffic)andprediction(failures)– TechniquescoupledwithOpenFlow:Performclassificationandconfigurepackets

• SometoolsareenhancedbyMLembeddingfordecisionmaking:– Trafficawarenessandsecurityproblems– Formingtopologies,optimumpathfinding– Improvepathutilizationsdependingonarrivingtraffic

0

10

20

30

40

50

60

UserTraffic TrafficEngineering Packet-levelimprovements

Optimizinginfrastructure

ML Non-ML

No.ofpapers

Page 18: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

Techniquesused

Cat1:Usertrafficanalysis

Cat2:Trafficengineering

Cat3:Packetoptimization

Cat4:Optimizeinfrastructure

ML NaïveBayestheorem,decisiontrees,SVM,RandomForest,ANN

Regression andclassificationtechniques

SVR,decisiontrees, naïve-bayes

Regressionandclassificationtechniques

Non-ML Rule-basedlearning,statisticalanalysistechniques

Graphopt–mincost,greedysearch,SPF

Fairnesscomputations,pathfindinggametheory,Markovmodels,simulations

Simulation,greedyalgorithmsforresourceallocation

Classification,Regression

Page 19: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

Cat1:Usertrafficanalysis

Cat2:Trafficengineering

Cat3:Packetoptimization

Cat4:Optimizeinfrastructure

Usecases • Intrusiondetection

• Trafficprofiling

Classifyflowstoformoptimumtopologies

Pathperformance

Optimumconnectionsbetweendatacenters

Classification X X X X

Regression X X X

Clustering XDimensionreduction

Anomalydetection

X X

Featurelearning

Couplingwithdevices

XDemousingsimulations

Page 20: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

DataInvolved

• Rangefrompacketdata,pathproperties,IPaddresses,QoS,TCP/UDPtraces,etc…

– E.g.Google’sB4optimizestopologytoSDWAN(basedondemand,packetloss,utilization)

Usecases Focus DatasetusedCategory 3:Packet-leveloptimization

VMresources Fairnessschemes,MTTF,MTTR,Netflow

Category 4:Infrastructureoptimization

Flowtables,controllerplacements

No.ofjobsrunning,VMdata,CPUusage,Applicationdata

Page 21: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

RoadAhead…

Page 22: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

LostofAreasstillUnder-developed• Mostlygraphoptimizationproblems(MLislessapplied)

• Identifywhatwewanttoachievealongthepipeline:

Understanding(Classification) Prediction Action

Linkwithdevices(SDN,NFV,etc),butwhatarethe‘knobs’wecanalter?

Short-termgoals Long-termgoalsWhichMLalgorithmtouse?• No‘One-Pill-Solution’• Fast versusOK response,dataused

Toomanydatasets• Costoftrainingmodels• Dynamicenvironments

Dimension reductionandfeaturelearning Notonlydeeplearningbutother‘distributed’MLapproaches

Andthelistgoeson….

Page 23: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

MLforDistributedNetworks

ReinforcementLearningAgent

State s

DeepNeuralNetwork

parameterθ

policyπθ(s,a)

Takeactiona

• Whatifwedon’thaveacentralcloudorHPCtotrain?• Localizedlearningversusgloballearning• Learningindynamicenvironments(e.g.changingtrafficdemands)• MLresearchfocusesongamestrategies.Wedon’thavesimilar

“strategies”innetworks

• Learn,Try,Fail,Learn,Try,Succeed!

DQN

Page 24: Machine Learning in WAN Research - Internet2 · • Convolutional neural networks Recurrent neural network Sequential data representation (i.e. time series data) Sequential learning

ConclusionsandContact

• AIshowssomepromise

• Networks+ML+HPC+(complexworkflows)

• Opendatasetsforresearch

• Combiningtechniques(andalgos)toadvanceresearchinexplored:– Newareasinnetworkandperhapsevenmore

Thankyou!

<[email protected]>

FundedunderDOEPanoramaProject(2017-2019),DOEASCR(2017-2022)