+ All Categories
Home > Documents > Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network...

Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network...

Date post: 23-Jun-2020
Category:
Upload: others
View: 8 times
Download: 0 times
Share this document with a friend
25
Experimental Design for Experimental Design for Practical Network Diagnosis Practical Network Diagnosis Yin Zhang Yin Zhang University of Texas at Austin University of Texas at Austin [email protected] [email protected] Joint work with Han Joint work with Han Hee Hee Song and Song and Lili Lili Qiu Qiu MSR MSR EdgeNet EdgeNet Summit Summit June 2, 2006 June 2, 2006
Transcript
Page 1: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

Experimental Design for Experimental Design for Practical Network DiagnosisPractical Network Diagnosis

Yin ZhangYin ZhangUniversity of Texas at AustinUniversity of Texas at Austin

[email protected]@cs.utexas.edu

Joint work with Han Joint work with Han HeeHee Song and Song and LiliLili QiuQiuMSR MSR EdgeNetEdgeNet SummitSummit

June 2, 2006June 2, 2006

Page 2: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

2

Practical Network DiagnosisPractical Network Diagnosis• Ideal

– Every network element is self-monitoring, self-reporting, self-…, there is no silent failures …

– Oracle walks through the haystack of data, accurately pinpoints root causes, and suggests response actions

• Reality– Finite resources (CPU, BW, human cycles, …) � cannot afford to instrument/monitor every element

– Decentralized, autonomous nature of the Internet � infeasible to instrument/monitor every organization

– Protocol layering minimizes information exposure� difficult to obtain complete information at every layer

Practical network diagnosis: Maximize diagnosis accuracy under given resource constraint and information availability

Page 3: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

3

Design of Diagnosis ExperimentsDesign of Diagnosis Experiments• Input

– A candidate set of diagnosis experiments• Reflects infrastructure constraints

– Information availability• Existing information already available• Information provided by each new experiment

– Resource constraint• E.g., number of experiments to conduct (per hour), number of monitors available

• Output: A diagnosis experimental plan– A subset of experiments to conduct– Configuration of various control parameters

• E.g., frequency, duration, sampling ratio, …

Page 4: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

4

Example: Example: Network BenchmarkingNetwork Benchmarking• 1000s of virtual networks over the same physical network• Wants to summarize the performance of each virtual net

– E.g. traffic-weighted average of individual virtual path performance (loss, delay, jitter, …)– Similar problem exists for monitoring per-application/customer performance

• Challenge: Cannot afford to monitor all individual virtual paths– N2 explosion times 1000s of virtual nets

• Solution: monitor a subset of virtual paths and infer the rest• Q: which subset of virtual paths to monitor?

R

R

R

R

R

R

R

Page 5: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

5

Example: ClientExample: Client--based Diagnosisbased Diagnosis• Clients probe each other• Use tomography/inference to

localize trouble spot– E.g. links/regions with high loss

rate, delay jitter, etc.• Challenge: Pair-wise probing too

expensive due to N2 explosion• Solution: monitor a subset of

paths and infer the link performance

• Q: which subset of paths to probe?

C&W

UUNet

AOL Sprint

Qwest

AT&T

Why is itso slow?

Page 6: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

6

More ExamplesMore Examples• Wireless sniffer placement

– Input:• A set of locations to place wireless sniffers

– Not all locations possible – some people hate to be surrounded by sniffers

• Monitoring quality at each candidate location– E.g. probabilities for capturing packets from different APs

• Expected workload of different APs• Locations of existing sniffers

– Output:• K additional locations for placing sniffers

• Cross-layer diagnosis– Infer layer-2 properties based on layer-3 performance– Which subset of layer-3 paths to probe?

Page 7: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

7

Beyond NetworkingBeyond Networking• Software debugging– Select a given number of tests to maximize the coverage of corner cases

• Car crash test– Crash a given number of cars to find a maximal number of defects

• Medicine design– Conducting a given number of tests to maximize the chance of finding an effective ingredient

• Many more …

Page 8: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

8

Need Common Solution FrameworkNeed Common Solution Framework• Can we have a framework that solves them all?

– As opposed to ad hoc solutions for individual problems

• Key requirements:– Scalable: work for large networks (e.g. 10000 nodes)– Flexible: accommodate different applications

• Differentiated design – Different quantities have different importance, e.g., a subset of paths belong to a major customer

• Augmented design– Conduct additional experiments given existing observations, e.g., after measurement failures

• Multi-user design– Multiple users interested in different parts of network or have different objective functions

Page 9: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

9

NetQuestNetQuest• A baby step towards such a framework

– “NetQuest: A flexible framework for large-scale network measurement”, Han Hee Song, Lili Qiu and Yin Zhang. ACM SIGMETRICS 2006.

• Achieves scalability and flexibility by combining – Bayesian experimental design– Statistical inference

• Developed in the context of e2e performance monitoring

• Can extend to other network monitoring/ diagnosis problems

Page 10: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

10

What We WantWhat We WantA function f(x) of link performance x

– We use a linear function f(x)=F*x in this talk

2

51 4

76

x1

x11

x4x5

x10

x9

x7

x6

x8

3x2

x3

Ex. 1: average link delayf(x) = (x1+…+x11)/11Ex. 2: end-to-end delays

Apply to any additive metric, eg. Log (1 – loss rate)

=

11

2

1

:

10...0

....

0...011

0....01

)(

x

x

x

xf

Page 11: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

11

Problem FormulationProblem FormulationWhat we can measure: e2e performanceNetwork performance estimation– Goal: e2e performance on some paths � f(x)– Design of experiments

• Select a subset of paths S to probe such that we can estimate f(x) based on the observed performance yS, AS, and yS=ASx– Network inference

• Given e2e performance, infer link performance• Infer x based on y=F*x, y, and F

Page 12: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

12

Design of ExperimentsDesign of Experiments• State of the art

– Probe every path (e.g., RON)• Not scalable since # paths grow quadratically with #nodes

– Rank-based approach [sigcomm04]• Let A denote routing matrix• Monitor rank(A) paths that are linearly independent to exactly reconstruct end-to-end path properties

• Still very expensive

• Select a “best” subset of paths to probe so that we can accurately infer f(x)

• How to quantify goodness of a subset of paths?

Page 13: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

13

Bayesian Experimental DesignBayesian Experimental Design• A good design maximizes the expected utility under the optimal inference algorithm

• Different utility functions yield different design criteria– Let , where is covariance

matrix of x– Bayesian A-optimality

• Goal: minimize the squared error

– Bayesian D-optimality• Goal: maximize the expected gain in Shannon information

1)()( −+= RAAD sTSη 12 −Rσ

22|||| SFxFx −

})({trace)( TA FFD ηηφ =

})(det{)( TD FFD ηηφ =

Page 14: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

14

Search AlgorithmSearch Algorithm• Given a design criterion , next step is to find s rows of A to optimize – This problem is NP-hard– We use a sequential search algorithm to greedily

select the row that results in the largest improvement in

– Better search algorithms?

)(ηφ

)(ηφ

)(ηφ

Page 15: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

15

FlexibilityFlexibilityDifferentiated design– Give higher weights to the important rowsof matrix F

Augmented design– Ensure the newly selected paths in conjunction with previously monitored paths maximize the utility

Multi-user design– New design criteria: a linear combination of different users’ design criteria

Page 16: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

16

Network InferenceNetwork InferenceGoal: find x s.t. Y=AxMain challenge: under-constrained problemL2-norm minimization

L1-norm minimization

Maximum entropy estimation

222

2 ||||||||min Axyx −+− µλ

11 ||||||||min Axyx −+− µλ

222 ||||logmin Axy

xx

i i

ii −+∑ µ

Page 17: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

17

Evaluation MethodologyEvaluation MethodologyData sets

Accuracy metric ∑∑ −

=i i

i ii

actual

actualMAEnormalized

|infer|

9729146983594006005000Brite-n5000-o60020512883398002001000Brite-n1000-o20069046283270601795Planetlab-loss76954673657612514PlanetLab-RTT

Rank# links# paths# overlay nodes

# nodes

Page 18: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

18Comparison of DOE Algorithms: Comparison of DOE Algorithms: Estimating NetworkEstimating Network--Wide Mean RTTWide Mean RTT

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

100 200 300 400 500 600 700 800

normalized MAE

# monitored paths

RandomQRSVDA-opt.

A-optimal yields the lowest error.

Page 19: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

19Comparison of DOE Algorithms: Comparison of DOE Algorithms: Estimating PerEstimating Per--Path RTTPath RTT

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

100 200 300 400 500 600 700 800

normalized MAE

# monitored paths

RandomQRSVDA-opt.D-opt.

A-optimal yields the lowest error.

Page 20: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

20Differentiated Design: Differentiated Design: Inference Error on Preferred PathsInference Error on Preferred Paths

0

0.02

0.04

0.06

0.08

0.1

0.12

0.14

0.16

0 2 4 6 8 10 12 14 16

normalized MAE

weight

20 weighted40 weighted60 weighted100 weighted120 weighted160 weighted

Lower error on the paths with higher weights.

Page 21: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

21Differentiated Design: Differentiated Design: Inference Error on the Remaining PathsInference Error on the Remaining Paths

0

0.05

0.1

0.15

0.2

0.25

0 2 4 6 8 10 12 14 16

normalized MAE

weight

20 weighted40 weighted60 weighted100 weighted120 weighted160 weighted

Error on the remaining paths increases slightly.

Page 22: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

22

Augmented DesignAugmented Design

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0 10 20 30 40 50 60 70 80

normalized MAE

# failed paths

RandomSVDQRA-opt.

A-optimal is most effective in augmenting an existing design.

Page 23: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

23

MultiMulti--user Designuser Design

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0 50 100 150 200 250 300 350 400

normalized MAE

# monitored paths

QRSVDA-opt.

A-optimal yields the lowest error.

Page 24: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

24

SummarySummaryOur contributions

– Bring Bayesian experimental design to network measurement and diagnosis

– Develop a flexible framework to accommodate different design requirements

– Experimentally show its effectiveness

Future work– Making measurement design fault tolerant– Applying our technique to other diagnosis problems– Extend our framework to incorporate additional design

constraints

Page 25: Experimental Design for Practical Network Diagnosis · Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin yzhang@cs.utexas.edu Joint work

25

Thank you!Thank you!


Recommended