+ All Categories
Home > Documents > Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and...

Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and...

Date post: 17-Aug-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
25
Machine Learning in WAN Research Mariam Kiran [email protected] Energy Sciences Network (ESnet) Lawrence Berkeley National Lab Oct 2017 Presented at Internet2 TechEx 2017
Transcript
Page 1: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

[email protected]

EnergySciencesNetwork(ESnet)LawrenceBerkeleyNationalLab

Oct2017

PresentedatInternet2TechEx 2017

Page 2: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

Outline

• MLingeneral

• MLinnetworkresearch– LiteratureReviewofresearchfrom[2010- Sept2017]ofMLalgorithmsinWANs

– Commonareas,datainvolved,whatproblemssolved

• RoadAhead(unexploredareas)

Page 3: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

AI,ML,DL– What’stheDifference?

• Turing“CanMachinesThink”– TuringTest:Exhibithuman-likeintelligence• MachinelearningiscollectionofalgorithmsthatcanhelpachieveAI

• e.g spamfilters,HRhiring,etc• DeeplearningisoneoftheseMLtechniques

• RecentadvancesduetoGPUandHPCprocessing(previouslyveryslow,toomuchdata,needtrainingtowork)

• Mainlyforimageandspeechrecognition– commercialapps

CourtesyNvidia Blog

Page 4: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

AITree(exampletechniques)onlyasubsetareMLalgorithms

AI

Optimizationtechnique

Manymore….

Expertsystems

Fuzzysystems

NeuralNetworks

Evolutionaryalgorithms(Geneticalgorithms,evolutionarystrategies,etc)

Swarmintelligence(antcolony,etc)

Deepbeliefnetworks

Deepboltzman networks

Convolutionalnetworks

Stackedautoencoders

Networks:graphalgorithm(routing–shortestpath)

ML:Whereevertrainingor‘learning’onstatisticaldata

RandomForrest,Clustering,etc

Page 5: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

5

Algorithmschosendependingon- dataavailable- problembeingsolved- combiningmultipletechniques(some50%accuracy,others80%accuracy)

Page 6: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

Example:ChoosingAlgorithmsforProblems(e.g.deeplearningorDNNs)

Deepneuralnetwork

InputData Appliedfor Variants

Feedforwardneuralnetwork

Hierarchicaldatarepresentations

• Generalclassification• Clustering• Anomalyfinding• Featureextraction

• Deepbeliefnetworks(usesrestrictedboltzmanmachineforactivationfunction)

• Convolutionalneuralnetworks

Recurrentneuralnetwork

Sequentialdatarepresentation(i.e.timeseriesdata)

Sequentiallearning (whentimerelationshipexists)

Longshorttermmemory(LTSM)usedforspeechtranslation

• There are many variants of DNNs. Papers and researchers in each specific DNN.

• DeepMind used Deep Q-learning for Attari and Go• Action-pairs based on learned data.

Page 7: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

MultipleToolsAvailable(DNNLibraries)

• Google’s DNN platform TensorFlow used to tag unlabeled videos, recognize images with 70% accuracy and predict Gmail replies

• Scikit-learn good for learning, python library

• HPC innovation: analyze massive data sets • Model and data parallelism to reduce the training time• DNNs mostly used in image analysis

Toolkit Language Use Processing capabilityCaffe C++ Images and video Distributed

(HPC, GPU)TensorFlow Python Images, regression, video, text, speech Distributed

(HPC, GPU)Theano Python Images Distributed

(HPC, GPU)Torch Lua Images and speech Distributed

(HPC, GPU)

Page 8: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

BringingitbacktoNetworks…(Reviewingpaperssince2010)

Page 9: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

RecommendedMachinelearningUsecases(IETFforums)

• NetworkSecurity– Normalandoutlierbehaviorsintraffic

• Changeorpredictpossiblebehavior– This<QoS value>willcausethis<eventY>withprobability<P>

• Bugdetection– Softwareorhardwarefaults

• WANpathoptimization– Anticipatecongestion– Diverttraffictoalternatepaths

Page 10: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

ConductedaSystematicLiteratureReview

• Step1:Identifyresearchquestions

• Step2:Identifyasearchstring– “Wideareanetworks”AND(estimateORpredict)AND(learningOR‘‘datamining’’OR‘‘artificialintelligence’’OR‘‘patternrecognition’’ORregressionORclassificationORoptimization)

• Step3:Identifyrelevantlibraries,journals,papers– IEEEXplore,ACMDigitalLibrary,ScienceDirect,WebofScience,EICompendex,andGoogleScholar

Step1:Researchquestions

Step2:Searchstrategy

Step3:Studyselectioncriteria

Step3:Quality

assessment

Relevantpapers

Page 11: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

But…toomanypapersfound

• Spacewastoolarge:

• WANarecompletesystems

• Havemultiplelayers(e.g.seepicture)

• MultipleWANproblems

• Solution“Letsorganizetheresultsbasedon”:

• Createcategoriesofsimilarproblems

• ExploreMLandnon-MLsolutions

• Whichdatasetswereused

Page 12: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

GroupingProblemsinto4Categories

Usertrafficdata Usertraffic(directedflows)

12

WANTopology(trafficengineering)

(flow-level,trafficprediction,adaptation,pathoptimization,linkfailure)Infrastructuretrafficdata

(Packet-level,queues,TCP,UDP)

Infrastructure-levelmodifications(Switches,deployment,etc)

Page 13: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

MachinelearningapproachesinWAN

networks

2)TopologyEngineering

Trafficprediction

Trafficadaptation

Pathoptimization

Faultfinding

Multipledatacenter

connectivity

4)Infrastructureoptimization

1)Usertrafficoptimization

3)Packetleveloptimizations

TCPspecificproblems

Controllerplacements

Scheduling,congestion

Switchconfigurations

Note:SDNrelatedin(2,3,4)

Actual‘Actions’ontheWAN

Page 14: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

Results

Page 15: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

RelevantPapers:Statistics

IEEEExplore

ACMpub

ScienceDirect

WebofScience

#188

• Removeduplications

• Applyselectioncriteria

• Searchadditionalrelevancethroughreferences

• Removesurveys

• Applyqualityassessment

#3

#10

#532

#25

#223

Note:Googlescholargavemanyirrelevantresultsandisnotregardedasagoodpublicationsearchtool.

Page 16: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

Results– peryear(1)

• RiseofMLtechniquesin2017(WorkshopsatSigComm,HotNets,etc)

0

5

10

15

20

25

30

2010 2011 2012 2013 2014 2015 2016 2017

ML Non-ML

No.ofpapers

Page 17: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

Results– percategory(2)

• Non-MLstilllargelyfavored

• MostMLtechniquesareusedforclassification(oftraffic)andprediction(failures)– TechniquescoupledwithOpenFlow:Performclassificationandconfigurepackets

• SometoolsareenhancedbyMLembeddingfordecisionmaking:– Trafficawarenessandsecurityproblems– Formingtopologies,optimumpathfinding– Improvepathutilizationsdependingonarrivingtraffic

0

10

20

30

40

50

60

UserTraffic TrafficEngineering Packet-levelimprovements

Optimizinginfrastructure

ML Non-ML

No.ofpapers

Page 18: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

Techniquesused

Cat1:Usertrafficanalysis

Cat2:Trafficengineering

Cat3:Packetoptimization

Cat4:Optimizeinfrastructure

ML NaïveBayestheorem,decisiontrees,SVM,RandomForest,ANN

Regression andclassificationtechniques

SVR,decisiontrees, naïve-bayes

Regressionandclassificationtechniques

Non-ML Rule-basedlearning,statisticalanalysistechniques

Graphopt–mincost,greedysearch,SPF

Fairnesscomputations,pathfindinggametheory,Markovmodels,simulations

Simulation,greedyalgorithmsforresourceallocation

Classification,Regression

Page 19: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

Cat1:Usertrafficanalysis

Cat2:Trafficengineering

Cat3:Packetoptimization

Cat4:Optimizeinfrastructure

Usecases • Intrusiondetection

• Trafficprofiling

Classifyflowstoformoptimumtopologies

Pathperformance

Optimumconnectionsbetweendatacenters

Classification X X X X

Regression X X X

Clustering XDimensionreduction

Anomalydetection

X X

Featurelearning

Couplingwithdevices

XDemousingsimulations

Page 20: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

DataInvolved

• Rangefrompacketdata,pathproperties,IPaddresses,QoS,TCP/UDPtraces,etc…

– E.g.Google’sB4optimizestopologytoSDWAN(basedondemand,packetloss,utilization)

Usecases Focus DatasetusedCategory 3:Packet-leveloptimization

VMresources Fairnessschemes,MTTF,MTTR,Netflow

Category 4:Infrastructureoptimization

Flowtables,controllerplacements

No.ofjobsrunning,VMdata,CPUusage,Applicationdata

Page 21: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

RoadAhead…

Page 22: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

LostofAreasstillUnder-developed

• Networksaremostlygraphoptimizationproblems:ApplyingMLtechniquesisunique

• Identifywhatwewanttoachievealongthepipeline:

Understanding(Classification) Prediction Action

Linkwithdevices(SDN,NFV,etc),butwhatarethe‘knobs’wecanalter?

MLresearchfocusesongamestrategies.Wedon’thavesimilar“strategies”innetworks!

ReinforcementLearningAgent

State s

parameterθ

policyπθ(s,a)

Takeactiona

Page 23: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

• Ratherthan‘One’havemultiplealgorithms

• Workingwithheterogeneousdatasets– featurelearning

• Computationalcostsofdataprocessingandmodeltraining

• UsingHPC/GPUtoallmodelstolearn

• NottotreatMLasablackbox,butunderstandwhy

BreakingDown– MLBlackbox

Page 24: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

Conclusions

• AIshowssomepromise:Learn,Try,Fail,Learn,Try,Succeed!

• MixofSkills:Networks+ML+HPC+(complexworkflows)

• Combiningtechniques(andalgos)toadvanceresearchinexplored:• Newareasinnetworkandperhapsevenmore

• Openingandsharingdatasets/techniquesforresearch(R&Enetworkcommunity)

Page 25: Machine Learning in WAN Research - Internet2...Oct 18, 2017  · • Network Security – Normal and outlier behaviors in traffic • Change or predict possible behavior – This

• Anyquestions/comments?

• Lookingtobecomeapostdoc,pleasecontact…

Thankyou!

[email protected]

FundedunderDOEPanoramaProject(2017-2019),DOEASCR(2017-2022)


Recommended