(Paper) A Method for Overlay Network Latency Estimation from Previous Observation

A Method for Overlay Network Latency Estimationfrom Previous Observation

Weihua SunNara Institute of

Science and TechnologyIkoma, Japan 630–[email protected]

Naoki ShibataNara Institute of

Science and TechnologyIkoma, Japan 630–0192

[email protected]

Keiichi YasumotoNara Institute of

Science and TechnologyIkoma, Japan 630–0192

[email protected]

Masaaki MoriShiga University

1-1-1 Banba, Hikone,Shiga, Japan 522–8522

[email protected]

Abstract—Estimation of the qualities of overlay links is usefulfor optimizing overlay networks on the Internet. Existing estima-tion methods requires sending large quantities of probe packetsbetween two nodes, and the software for measurements have tobe executed at both of the end nodes. Accurate measurementsrequire many probe packets to be sent, and other communicationcan be disrupted by significantly increased network traffic. Inthis paper, we propose a link quality estimation method basedon supervised learning from the previous observation of othersimilar links. Our method does not need to exchange probepackets, estimation can be quickly made to know qualities ofmany overlay links without wasting bandwidth and processingtime on many nodes. We conducted evaluation of our methodon PlanetLab, and our method showed better performance onpath latency estimation than estimating results from geographicaldistance between the two end nodes.

I. I NTRODUCTION

In order to construct an efficient Peer-to-peer (P2P) overlaynetwork, we need to know the link quality of overlay links, andseveral methods for estimating link qualities such as availablebandwidth, packet-loss rate and latency between peers on theInternet have been proposed. This kind of estimation methodsare also useful in client-server applications.

Existing estimation methods requires sending large quan-tities of probe packets between two nodes. Pathload[1] as-sumes that a periodic packet stream shows an increasingtrend when the stream’s transmission rate is higher than theavailable bandwidth, and it measures the available bandwidthbetween two nodes. Abing [2] estimates the capacity of a path(bottleneck bandwidth) based on the observed the dispersionexperienced by two back-to-back packets. These methodsrequire measurement software to be executed at both of theend nodes. Since accurate measurements require many probepackets to be sent, other communication can be disrupted bysignificantly increased network traffic. Moreover, in order tomake more accurate measurement of link qualities, more probepackets need to be sent into the network. If we could estimatelink qualities between each pair of nodes on the Internet.Estimation of link qualities is useful for optimizing overlaynetworks on the Internet. However, the number of overlaylinks is the square of the number of peers, it is difficult toestimate all the link qualities using the tools discussed above,since the packets for estimation between a pair of nodes can

disrupt measurements between other nodes. In general, thenetwork delay is considered to increase as the geographicaldistance or the number of routers in the route increases.However, due to the disproportionate data flow, large delayoccurs at some specified routers. Also, there are detour ofphysical communication links by geographical or politicalreasons. Because of these reasons, link qualities are consideredto be attributed to the geographical positions of the two endnodes, rather than just the geographical distance betweenthe nodes. We also need to consider the varying conditionsof congestion, and that the situation can suddenly change.However our observation tells that most of the links usuallyhave relatively stable available bandwidths and delays. Sincemost people use the Internet in the daytime, there should beconstant periodical changes of link qualities. Thus, we assumethat we can estimate the degree of congestion of an overlaylink from periodical observation of the link qualities in thepast.

We first discuss these assumptions by conducting exper-iments on observing link qualities of PlanetLab nodes, andshow that the assumptions stated above are probable. Then,we explain our proposed method based on supervised learningfor estimation of overlay link qualities from qualities observedin the past. Our method takes account of the geographicallocations of end nodes to estimate the link qualities. Ourexperiments on PlanetLab showed that our method has goodperformance on path latency estimation. The estimation basedon just geographical distance showed large error, especiallywhen the distance is shorter than 2000 km. The proposedmethod achieved high estimation accuracy in that range. Wehave shown a part of our results in a work-in-progress paper[3], and we show detailed experimental results and discussionin this paper.

In Sect. II, we provide an overview of related works, whilein Sect. III we present a preliminary discussion on howaccurately we are able to estimate link qualities from thosepreviously observed. We propose a method for estimating linkqualities based on a supervised learning algorithm in IV andpresent the results of experiments on PlanetLab to demonstratethe accuracy of the proposed method in Sect. V. Finally, ourconclusions are given in Sect. VI.

II. RELATED RESEARCH ANDCONTRIBUTION

Previously, in the field of wide area networks, many ap-proaches were proposed to measure and estimate the delayand bandwidth between end nodes. Accurate estimation of theavailable bandwidth is important for throughput optimizationbetween end nodes, overlay network routing, peer-to-peer filedistribution, traffic engineering, and capacity planning. In thissection, we discuss the measurement and prediction methodswith respect to the available path bandwidth between endnodes.

A. Bandwidth Measurement Method

There are three different metrics for path bandwidth betweenend nodes: (1) capacity (maximum bandwidth), (2) availablebandwidth (maximum unused bandwidth), (3) TCP through-put/bulk transfer capacity (maximum achievable bandwidth).The existing four measurement methodologies are:

• VPS (Variable Packet Size probing) is a method toestimate link capacity by measuring the round-trip time;that is, calculating the serialization delay of various sizedpackets sent from a sender node to a receiver node.

• PPTD (Packet Pair/Train Dispersion) is a method formeasuring the capacity of the path between end nodes.Letting a sender node continuously send uniform sizedpacket pairs or trains to a receiver node, this approachcalculates the maximum link serialization delay in thepath to estimate the minimum link capacity (bottleneck)by measuring the dispersion of the received packet times.

• SLoPS (Self-Loading Periodic Streams)is a methodfor measuring available bandwidth. While a sender nodecontinuously sends uniform sized packets to a receivernode with transmission rateR, SLoPS observes thevariation in delay for each packet at the destination node,and measures whetherR is greater thanA. By adjustingthe transmission rateR, SLoPS estimates the availablebandwidthA.

• TOPP (Trains of Packet Pairs) measures capacity andavailable bandwidth by transmitting data at a particulartransmission rate for a specified number of packet pairs.Unlike SLoPS, TOPP estimates the available bandwidthby increasing the transmission rate linearly and observingthe arrival delay.

Other tools that have been proposed and implemented arePathchar, Clink, and Pchar for measuring link capacity,Brpobe, Nettimer, Pathrate andSprobe for measuring pathcapacity,Cprobe, Pathload, IGI, andpathChirp for measur-ing available bandwidth, andTreno, Cap, TTCP, NetPerf,Iperf for measuring TCP throughput. As reported in [4],Pathload and pathChirp showed better performance thanAbing, Spruce, and Iperf on a high-speed network testbed.

Most of the above tools focus on measuring the averageavailable bandwidth, but do not consider bandwidth variation.Therefore, the authors in [6] proposed a method to measurebandwidth variation. Moreover, with the goal of estimatingthe bandwidth without causing excessive traffic, a method was

proposed in [5] to estimate capacity and available bandwidthwithout congesting the minimum capacity link in the path.

Most of the existing bandwidth measurement methods andtools work by exchanging probe packets between sender andreceiver nodes. Although these methods are useful for accuratebandwidth measurement, they generate traffic while measuringbandwidth. SLoPS and TOPP, in particular, cause temporarycongestion of the minimum capacity link. Consequently, in alarge scale P2P network with millions of nodes, these methodsmay cause serious deterioration in the network performance.

B. Bandwidth/Latency Prediction Method

Various network traffic prediction models have been pro-posed. In networks, similar traffic patterns with long timeintervals are said to be self-similar, while those patternswith short time intervals are called multi-fractal. In [8], amethod was proposed to predict network traffic at several timesteps in advance, based on past measured traffic information.Moreover, the authors in [7] improved the method in [8], byproposing a new ARIMA/GARCH model that predicts net-work traffic with higher accuracy. In this model, self-similarityand multi-fractals can be predicted by utilizing short-range andlong-range dependencies. Through comparison experimentswith real network traffic, the authors showed that networktraffic can be predicted with reasonable accuracy.

These models aim to predict future traffic from previousdetailed measurements. Moreover, the models can be used topredict the available bandwidth and latency by separately mea-suring the capacity of the path between the end nodes. Similarto the above methods, the method in [9] accurately estimatesthe latency of the path between end nodes based on trafficmeasurement. However, because detailed measurements areneeded in advance, the models are not suitable for estimatingbandwidth/latency at low cost owing to the additional trafficgenerated.

C. Contribution

The traffic prediction model makes use of the self-similarityand multi-fractal properties of traffic. By applying these char-acteristics to the different nature of similar paths, link qualities(including end to end delay, available bandwidth, and so on)can be predicted using fewer a priori measurement results.

In this paper, by considering the similarity of paths, wepropose an overlay link quality prediction method, whichassumes that similar paths have similar characteristics. To thebest of the authors’ knowledge, there is no other predictionmethod that, like ours, does not require much bandwidth.Moreover, we have implemented the proposed method inPlanetLab and evaluated the performance thereof.

III. PRELIMINARY EXPERIMENTSAND OBSERVATION

In this section, we first describe the results of two prelim-inary experiments. In the first experiment, we observed thefluctuations in link quality over time, while in the second, weinvestigated the relation between route (overlay link) similarityand the difference in link qualities. The amount of traffic on the

��

��

��

��

��

��

��

��

��

� ��

Fig. 1. Observed fluctuation of latency (X axis = time )

��

��

��

��

��

��

��

��

��

��

��

� ��

Fig. 2. Observed fluctuation of available BW (X axis = time)

Internet changes continuously, influenced both by the day ofthe week and the season. We observed the actual fluctuationsin link quality on PlanetLab. In the subsequent subsections, wedescribe the configuration of the experiments, the definition ofroute similarity, and the results of these experiments.

Observation of PlanetLab:We observed the fluctuations inavailable bandwidth and latency between nodes in PlanetLabover 7 days starting on 20th January 2011. We created 500random pairs among the nodes in PlanetLab and measuredthe available bandwidth and latency using Pathload and pingevery hour. About 63000 valid data records were obtained.

Figure 1 shows a stacked bar graph of the observed latencyat each time divided by the latency observed at the beginning.The bottom series indicates the ratio of routes where the ob-served latency divided by the latency observed at the beginningwas between 0.91 and 1.1. The second series indicates theratio of overlay links with latencies between 0.83 and 1.21times. Figure 2 shows the results for bandwidth. From Fig.1 it is clear that for 80% of the routes, the fluctuation inlatency was less than 10%, and this ratio did not change forthe whole week. Fig. 2 shows that for 70% of the routes, thefluctuation in bandwidth was less than 10% for 20 hours fromthe beginning of the experiment. It also shows that for half theroutes, bandwidth fluctuation was less than 10% for the week.We did not observe daily periodic fluctuations in bandwidthor latency.

A. Relation between Route Similarity and the Difference inLink Quality

It would be convenient if we could estimate the link qualityof an unknown overlay link on which no link quality observa-tions have been made. To realize such a method, we first define

��

��

��

��

��

��

Fig. 3. Similarity Definition

(a) After 1 hour

(b) After 6 daysFig. 4. Estimated latency by Proposed method, X axis = distance(Km)

similarity between two overlay links based on geographicaldistance. There are free databases from which we can find thegeographical location of nodes from their IP addresses, andthus it is easy to locate the geographical position of nodes onthe Internet. We also show the measurement results for linksimilarity and the difference in link quality.

Route Similarity: As shown in Fig. 3, the route sim-ilarity geo(v0, v1, v2, v3) between two routes is defined asthe minimum value betweendist(v0, v2) + dist(v1, v3) anddist(v0, v3) + dist(v1, v2), where dist(v0, v1) denotes thegeographical distance betweenv0 andv1.

Measurements on PlanetLab: We created 500 randompairs of nodes on PlanetLab, and investigated the relationbetween similarity as defined above and the observed latency.Figure 4(a) shows the relation between link similarity andlatency fluctuation one hour after the first measurement wasmade, while Fig. 4(b) shows the results obtained six days afterthe first measurement. We can see that these two graphs arealmost identical, and that there is almost no change in thefluctuation over time. We can also see that the amount offluctuation decreases with more similar routes. With the sumof the distance less than 600 km, the fluctuation is within 50%

to 200% for 80% of the routes.We also performed similar experiments on bandwidth, but

did not observe any relation between route similarity andfluctuation. This seems to be due to the fact that the availablebandwidth is usually limited by the bandwidth for the lasthop rather than that for the entire backbone. However, we arestill investigating finding an appropriate similarity definitionfor estimating the correct bandwidth.

IV. OVERLAY L INK QUALITY ESTIMATION METHOD

In this section, we propose an overlay link quality estimationmethod based on the results of the preliminary experimentsin Sect. III. In the proposed method, (1) a centralized serverperiodically collects, from various peers in the P2P network,quality information of overlay links they have observed, and(2) the quality of a given overlay link is estimated from theinformation of previously observed overlay links based on theweighted k-nearest neighbor (WKNN ) algorithm, which isone of the supervised learning techniques.

A. Preliminaries

1) Weighted k-nearest neighbor algorithm:The WKNNmethod uses training samples expressed as pairs of an objectand a real number, and learns a function that maps an arbitraryobject to a real number. In our proposed method, the objectand real number correspond to an overlay link and latency,respectively.

To use the WKNN algorithm, the following two functionsmust be given: (1) a function to calculate the distance betweentwo objects; and (2) a function that assigns a weight to eachobject.

In the WKNN algorithm, learning is carried out using alltraining samples (the training set) stored in memory. Whenestimating a real number for an input object, WKNN selectsthe k samples in the training set geographically closest to theinput object, and estimates a real number by calculating theweighted average of the k samples with their weights.

2) Assumptions, estimation target, and algorithm outline:We aim to apply the proposed method to estimate overlaylink quality in a P2P application such as video streaming. Weassume that the P2P application consists of a central serverand many peers (users). In the application, each peer observesthe quality of the overlay links directly connected to otherpeers and periodically sends the observed information to theserver. In this study, we have designed the learning algorithmas a centralized one, but it could easily be implemented as adistributed algorithm using, e.g., a distributed hash table.

The proposed algorithm is executed on the server andestimates the quality of a given overlay link by applying theWKNN method to the previously observed quality informationcollected by the server. As described in Sect. III-A, wecould not find any correlation between link similarity andthe observed available bandwidth. Thus, we focus mainly onoverlay link latency as the quality estimation target in thisstudy.

Each peer sends the server a query to estimate the quality ofthe specified overlay links. When the server receives a query, itestimates the quality of the given links based on the proposedalgorithm and sends the estimated result back to the peer.

The server carries out learning and estimation. In theWKNN algorithm, the server performs learning using alltraining samples stored in its memory. As time progresses,the number of training samples increases and more memoryspace is required. To limit the required memory size, when thenumber of training samples exceeds a predefined threshold, theoldest samples are deleted from memory.

The size of a message that a peer exchanges with the server(to upload the observed link quality, send a query for linkquality estimation, or receive the estimation result) is at most200 bytes since it contains only an overlay link together withthe associated quality.

The server has a table that maps IP addresses to geographiccoordinates as explained in Sect. III-A.

B. Learning algorithm

The proposed algorithm consists of two phases: (i) a learn-ing phase, and (ii) an estimation phase. We describe thesephases in detail below.

1) Learning phase:We assume that each peer participatingin a target application communicates frequently with otherpeers participating in the same application, e.g., to realizevideo P2P streaming.

In the proposed algorithm, each peer performs the followingsteps:

• When the peer communicates with other peers, it mea-sures the quality of the overlay links to those peers.

• The peer periodically sends the quality of overlay linksobserved during the current period to the server. Themessage contains the IP addresses of both ends of eachoverlay link and the measured latency.

When the server receives the observed quality of an overlaylink from a peer, it stores the data –that is, the IP addressesof the end nodes of the overlay link and the latency, in itsmemory.

When the amount of data exceeds a predefined threshold,the server removes the oldest data from its memory.

2) Estimation phase:When a peer wishes to know thequality of an overlay link, it sends the server a query specifyingthe IP addresses of the end points of the link. When the serverreceives the query, it estimates the quality of the specified linkas follows:

• The server selects the k closest training samples from thetraining set.

• It calculates the weight of each selected sample as ex-plained in Sect. IV-B4.

• It calculates the weighted average of the latency of theselected k samples.

• It sends the calculated result to the peer that originallysent the query.

3) Estimation example:Let us suppose that peern0 hassent the server a query regarding the overlay link betweenitself and peern1. When the server receives the query, it selectsthe k training samples closest to the overlay link betweenn0

and n1 based on the distance function defined in Sect. III-A. Let us suppose thatk = 2 and overlay linksr1 and r2have been selected. Then, the server calculates the weights ofr1 and r2 according to the method in Sect. IV-B4. Let theweights for r1 and r2 be 1 and 2, respectively. Let us alsosuppose that the previously observed latencies ofr1 and r2are3 and4, respectively. Finally, the server obtains the value(1× 3 + 2× 4)/(3 + 4) = 1.57 as the weighted average andsends this value as a reply to peern0.

4) Weight function:In Sect. III-A, we defined the similaritybetween two overlay links observed at the same time. Ingeneral, this similarity should be defined between two linksobserved at different times. However, as explained in Sect.III-A, the variation in latency with time is rather small. Thus,we use the similarity function defined for two links observedat the same time in the proposed algorithm.

According to the measurement results presented in Sect. III,more than 80% of overlay links experience a latency variationbetween 0.71 and 1.41 times the initial measured latency.Thus, we define the weight function as follows:

Weight(us, ud, vs, vd)0.7−0.3

5000· geo(us, ud, vs, vd) (1)

where(us, ud) and (vs, vd) are the geographic coordinatesof the target overlay link and the training sample, respectively.

V. EVALUATION

In this section, we evaluate the estimation accuracy of theproposed method. According to the underlying principle ofthe proposed method, the estimation accuracy depends on thedistance and time from the measured path. The greater thedifference in time or distance is, the worse is the estimationaccuracy. With respect to available bandwidth, we investigatedthe relationship over time and estimation accuracy. Withrespect to latency, we investigated the relationship over thedistance between paths and estimation accuracy.

A. Evaluation of Available Bandwidth

As described above, despite the paths being similar, nocorrelation with available bandwidth was observed. In thisexperiment, using thek measured results of both cases ofone measurement per day and one measurement per hour ona certain path, we investigated the estimation accuracy whenvaryingk and the elapsed time from the last measurement. Theresults are shown in Figs. 5(a)-5(c). According to these figures,the observed estimation accuracy corresponds to the results ofthe preliminary experiments. However, the estimation accuracydid not improve even when increasingk.

(a) k=1

��

��

��

��

� ��

��

��

��

��

��

��

��

(b) k=3

(c) k=5

Fig. 5. Estimated bandwidth, X axis = time

��

��

��

��

��

��

��

��

� ��

Com

munic

ation D

ela

y (

ms)

End to End Distance (km)

Fig. 6. The average, maximum and minimum path delay

��

��

��

��

��

��

� ��

��

��

��

��

��

Fig. 7. Accuracy of Estimated delay based on path length (X axis = distance(KM))

1) Estimation based on link distance:For comparison withthe proposed method, we used a delay estimation methodbased on link distance. Figure 6 shows the relationship be-tween link length and delay. According to this result, theaverage delay of a path increases roughly in proportion tothe distance. However, the maximum and minimum delays donot follow this trend. The delay calculated from the averagedelay is 0.019 ms/km. The results of applying this value tothe delay estimation method are shown in Fig. 7. Obviously,the estimation accuracy is low when the link distance is lessthan 2000 km.

2) The proposed method:In this experiment, we inves-tigated the accuracy of measuring path latency based onthe measured latency results of k different paths six dayspreviously. We investigated the estimation accuracy for anumber of distance functions by varyingk. The results areshown in Figs. 8(a) – 8(c).

According to these figures, accurate estimation was ob-served. The estimation accuracy improved ask increased. Inparticular, we confirmed that the estimation accuracy (0.71–1.41 and 0.5–2.0) is very high for medium and short distances,respectively.

VI. CONCLUSION

We proposed a learning-based overlay link quality estima-tion method that uses the quality observed for other links in thepast. With respect to latency, by defining geographical similar-ity between overlay links, the proposed method achieves goodestimation accuracy. With respect to bandwidth, we foundthat there is no correlation between overlay links with closegeographical similarity. In the future, we intend devising a newsimilarity metric to accurately estimate overlay link bandwidthtaking into account domain type, connecting ISP, and so on.

REFERENCES

[1] Jain, M. and Dovrolis, C. : “End-to-end available bandwidth: measure-ment methodology, dynamics, and relation with TCP throughput,” inIEEE/ACM Transactions on Networking, vol. 11, Issue 4, pp. 537–549,2003.

[2] Jiri Navratil and R. Les Cottrell : “ABwE:A Practical Approach to Avail-able Bandwidth Estimation,” in Proc. of Passive and Active MeasurementWorkshop (PAM’03), 2003.

[3] Weihua Sun, Naoki Shibata, Keiichi Yasumoto and Masaaki Mori :“Estimation of Overlay Link Quality from Previously Observed LinkQualities,” to appear in The 10th Annual IEEE Consumer Communi-cations Networking Conference (CCNC’13), 2013.

��

��

��

��

� ��

��

��

��

��

��

��

��

(a) k=1

��

��

��

��

��

��

��

��

��

��

��

(b) k=2

��

��

��

��

��

��

��

��

��

��

��

(c) k=3Fig. 8. Estimated latency by Proposed method, X axis = distance(Km)

[4] Alok Shriram and Margaret Murray and Young Hyun and Nevil Brownleeand Andre Broido and Marina Fomenkov and Kc Claffy : “Comparison ofPublic End-to-End Bandwidth Estimation Tools on High-Speed Links,”in Proc. of of Passive and Active Measurement Workshop (PAM’05),pp.306–320, 2005.

[5] Seong-ryong Kang and Xiliang Liu and Min Dai and and Dmitri Loguinovand Dmitri Loguinov : “Packet-Pair Bandwidth Estimation: StochasticAnalysis Of a Single Congested Node,” in Proc. of 20th IEEE Interna-tional Conference on Network Protocols (ICNP’04), pp.316–325, 2004.

[6] Manish Jain and Constantinos Dovrolis : “End-to-end Estimation ofthe Available Bandwidth Variation Range,” in Proc. of the 2005 ACMSIGMETRICS international conference on Measurement and modelingof computer systems, pp.265-276, 2005.

[7] Bo Zhou and Dan He and Zhili Sun and Wee Hock Ng : “NetworkTraffic Modeling and Prediction with ARIMA/GARCH,” In 3rd inter-national working conference: performance modelling and evaluation ofheterogeneous networks (HET-NETs’05), 2005.

[8] Aimin Sang and San-qi Li : “A Predictability Analysis of NetworkTraffic,” Proc. of IEEE INFOCOM 2000, pp.342–351, 2000.

[9] Hariri, N. Hariri, B. and Shirmohammadi, S.: “A Distributed MeasurementScheme for Internet Latency Estimation,” IEEE Trans. on Instrumentationand Measurement, 60 (5): pp.1594–1603, 2011

Date post:	18-Nov-2014
Category:	Technology
Upload:	naoki-shibata-
View:	490 times
Download:	0 times