+ All Categories
Home > Documents > 12 Fischer Masi Gross Shortle Tr2005

12 Fischer Masi Gross Shortle Tr2005

Date post: 05-Apr-2018
Category:
Upload: saeed410
View: 216 times
Download: 0 times
Share this document with a friend

of 14

Transcript
  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    1/14

    The Telecommunications Review 2005 79

    One-Parameter Pareto, Two-Parameter Pareto, Three-

    Parameter Pareto: Is There a Modeling Difference?

    Martin J. Fischer, Ph.D.

    Denise M. Bevilacqua Masi, Ph.D.

    Donald Gross, Ph.D.

    John F. Shortle, Ph.D.

    The Pareto distribution was first formulated in the late 1800s by the Italian economist Vilfredo Pareto. He

    presented the argument that in all countries and times, the distribution of income and wealth could be

    described by the formula log(N) = log(A) + mlog(x), where N is the proportion of income earners who receive

    incomes higher than x, andA andm are constants. Over the years, Paretos Law has held up in empirical

    studies. The Pareto distribution has recently been used as a model for file sizes on the Internet, insurance

    losses, financial behavior of the stock market, and in telecommunications systems. It has various forms; here,

    we consider a one-parameter form and a two-parameter form. Thus, we question if using one form of the

    Pareto gives different results than using another form. In this paper, we numerically address this question bystudying queueing systems with either Pareto arrivals or service times. The two Pareto forms are studied in

    detail: Case 1, both Pareto forms have equal means and variances; and Case 2, both Pareto forms have equal

    mean and shape parameters. For both cases, our numerical results, substantiated by simulation studies, show

    that using the two-parameter Pareto results in lower congestion than the comparable one-parameter Pareto.

    IntroductionPareto distributions play an important role in queueing

    models of Internet traffic and financial insurance

    claims. The Pareto distribution is a power-tailed dis-

    tribution which is a special case of a heavy-tailed dis-tribution. Heavy-tailed distributions have tails that go

    to zero more slowly than exponential. A cumulativedistribution function, F(x), has a power tail if thereexist positive constants c and a such that

    for )(1)( xFxF =

    lim[ ( )] .ax

    x F x c

    =

    That is, the tail decays geometrically in the limit.

    Sometimes these distributions are also said to be fat-tailed, heavy-tailed, or long-tailed. But, the latter

    terms are used to describe the larger class of distribu-

    tions in which the tail probabilities satisfy

    (x)Feax

    x

    =

    lim

    for every a > 0. That is, their survival functions go to0 more slowly than any exponential, but not necessar-

    ily as slowly as a power-tailed function. A power-

    tailed distribution is also a heavy-tailed distribution,

    but not necessarily the reverse.

    Application in Queueing ModelingWith the growth of the Internet and the World Wide

    Web (WWW), heavy-tailed distributions have played

    an important role in characterizing many of the traffic

    invariants. [1, 2, 3] In addition, the application ofthese distributions has also been seen in the financial

    and insurance communities. [4, 5, 6, 7]Figure 1 shows the distribution (complementary

    cumulative distribution function [CCDF]) of file

    transmission times on the Web. The tail of the distri-

    bution is approximately linear on a log-log scale. This

    corresponds to a CCDF which decays as a power law.For contrast, we have also plotted the CCDF of an

    exponential distribution, which is often used to model

    the holding times of voice traffic. While the voice

    holding time distribution drops off quickly to zero, the

    file transmission time distribution decays linearly inthis scale. From a queueing point of view, this says

    that every once in a while there is a request for an ex-

    tremely large size file that occupies the outgoing linkto the Web for an extraordinary length of time.

    Power tails are also observed in the distribution of

    insurance claims. Figure 2 shows the 30 most costly

    insurance losses, worldwide, from 1970 to 1995. [8]We have fitted the distribution with both a power tail

    and an exponential tail. Visually, the power tail is a

    much better fit. Again, the implication is that there is

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    2/14

    The Telecommunications Review 2005 80

    Heavy-Tailed

    Traffic: Files on

    the Web

    Light-Tailed

    Traffic (e.g.,

    Voice)

    0

    -0.5

    -1

    -1.5

    -2-2.5

    -3

    -3.5

    -4

    -4.5

    -5

    -1.5 -1 -0.5 0 0.5 1.5 2 2.5 3 3.5

    Log10(Pr(time>x))

    Log10 (Transmission Time in Seconds)

    Heavy-Tailed

    Traffic: Files on

    the Web

    Light-Tailed

    Traffic (e.g.,

    Voice)

    0

    -0.5

    -1

    -1.5

    -2-2.5

    -3

    -3.5

    -4

    -4.5

    -5

    -1.5 -1 -0.5 0 0.5 1.5 2 2.5 3 3.5

    Log10(Pr(time>x))

    Log10 (Transmission Time in Seconds)

    Figure 1. Log-Log Plot of the Web [1] and Voice Transmission Time

    0

    5

    10

    15

    20

    25

    30

    $0 $4,000 $8,000 $12,000 $16,000

    Rank

    Loss (Millions of $US, 1992 Prices)

    Exponential

    Fit

    Pareto

    Fit Hurricane

    Andrew,

    1992

    Northridge

    Earthquake

    (CA),

    1994

    0

    5

    10

    15

    20

    25

    30

    $0 $4,000 $8,000 $12,000 $16,000

    Rank

    Loss (Millions of $US, 1992 Prices)

    Exponential

    Fit

    Pareto

    Fit Hurricane

    Andrew,

    1992

    Northridge

    Earthquake

    (CA),

    1994

    Figure 2. The 30 Most Costly Insurance Losses, 1970 to 1995 [7]

    a non-trivial probability of an extremely large insur-

    ance loss. From a queueing perspective, one can show

    that the probability of ruin for an insurance company

    with initial cash reserve u is the same as the steady-

    state queue wait probability P(Wq > u) for a G/G/1queue. The service distribution G corresponds to the

    distribution of insurance losses (power-tailed in this

    case) and the arrival distribution G corresponds to thearrival process of claims.

    In this paper, we seek to determine how the

    choice of arrival or service distributionspecifically,

    how different forms of the Pareto distributionaffectsqueueing performance. Suppose one is using the

    M/G/1 queue to model the previous applications. The

    Pollaczek-Khintchine (P-K) formula [9] implies that

    the expected delay is the same for all service distribu-tions with equal mean and variance. But, the delay

    quantiles vary with the form of the service distribu-

    tion. We compared the heavy-tailed LogNormal,

    Pareto (single parameter), and Weibull (with shape

    parameter less than 1) service distributions with equal

    mean and variance. [10] The Pareto yielded thesmallest quantiles and the Weibull the largest for wait

    in queue. Thus, the selection of the service distribu-

    tion can make a difference in modeling the congestionof the systems of interest.

    Paxson and Floyd state that heavy-tailed distribu-

    tions (in particular the Pareto) can serve as models for

    packet interarrival times. [2] We compare the use ofthe Pareto, LogNormal, and Weibull with the same

    mean and variance as arrival distributions for loss and

    delay queueing systems. [11, 12] Based on our nu-

    merical investigations, we found that the blockingprobability for loss systems and the expected queueing

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    3/14

    The Telecommunications Review 2005 81

    delay for delay systems were again ordered with the

    Pareto arrival distribution resulting in smallest meas-

    ures of performance and the Weibull the largest.

    Our analysis to date has shown that modelingresults can vary when one uses a Pareto, LogNormal,

    or Weibull for arrival or service distributions in M/G/1

    or G/M/1 queueing systems. In these analyses, we

    kept the mean and variance of the distribution thesame. Here, we extend our M/G/1 and G/M/1

    comparisons to various forms of the Pareto.

    Forms of the Pareto DistributionThere are several forms of the Pareto distribution and

    we study here how the particular form of the Pareto

    might influence the results of the particular queueingmodel in question. The distribution was named after

    Italian economist Vilfredo Pareto (18481923). In

    Cours DEconomie Politique Professe a lUniversite

    de Lausanne, Volumes I, II, and III, 18961897,

    Pareto presented the argument that in all countries andtimes the distribution of income and wealth could be

    described by the formula (called Paretos Law)

    log(N) = log(A) + mlog(x),

    where N is the proportion of income earners who re-

    ceive incomes higher than x, and A and m are con-stants. Paretos Law is equivalent to the probability

    statement Pr{X>x} = 1- F(x) =Axm . Note that for this

    to be a valid probability distribution, m must be nega-tive since the CDF F(x), must go to 1 as x goes to in-

    finity, hence Axm must go to zero. Over the years,

    Paretos Law has held up in empirical studies.

    There are many forms of the Pareto distribution asshown in Table 1, where F(x) is the CDF and Fc(x)

    denotes the complement. Some have three parameters

    (shape, scale, and location (shift); others have just two

    parameters; and one form of the Pareto has only a sin-gle (scale) parameter. The two and single parameter

    versions can be obtained from the three-parameter

    version (the most general) by setting some parametersto specific values as shown in Table 1.

    All forms above are valid Pareto distributions.

    We note that Form Reference Number 2c seems to be

    the most popular one in use (many texts use this form,as does the popular distribution fitting package, Ex-

    pertFit). This is also the form that directly fitsParetos citation given above, where A = and m =-, and is used in many papers dealing with the appli-cations of heavy-tailed distributions to Internet traffic[1, 2, 13], and to insurance claims processing. [4, 14]

    In our previous work, [3, 15, 16] we have generally

    used the third form, which is easy to work with due toits simplicity, and allows the random variable to start

    at zero, instead of a minimum threshold. The question

    we ponder here is does the form used influence the

    results of a queueing model. We attempt to answer

    this question by comparing waiting times for variousforms of the Pareto in P/M/1 and M/P/1 queueing

    models. Actually, the two forms we compare are the

    popular Form Reference Number 2c with a minimum

    x value of, to the single parameter case Form Refer-ence Number 3 where the minimum x value is 0. We

    look at two cases, one matching the first two moments

    of the two forms of the distribution, and the othermatching the shape parameter and the mean. Forthese situations, matching the other forms shown in

    Table 1 reduce to either Form Reference Number 2c or

    Form Reference Number 3, so it suffices to compare

    only these two forms, i.e., one Form Reference Num-ber 2c with a shift parameter, and the other FormReference Number 3 without. Table 2 shows a nu-

    merical example of the matches.Note that when matching mean and variance (line

    1) in the numerical example above, the shift parameterof Form Reference Number 2c, i.e., the minimumvalue of the random variable, is .256 (more than half

    the mean value), while in Form Reference Number 3,

    the minimum value of the random variable is 0. When

    matching alpha and the mean (line 2), the minimum

    value for Form Reference Number 2c is .323, almostthree-fourths of the mean value, while again, the

    minimum value for Form Reference Number 3 is 0.

    Much of the available analytic results in queueing

    theory rely on the input distributions (interarrival andservice times) having closed form expressions of their

    Laplace transforms. The Laplace transform of the

    Pareto (regardless of the particular form) does not.Thus, we employ a technique which we call the Trans-

    form Approximation Method (TAM), and its associ-

    ated numerical procedure called the TAM Recursion

    Method (TRM), to generate the queue waiting times

    for models with either Pareto arrivals (P/M/1) orPareto service (M/P/1) distributions. [3, 15, 17] We

    briefly summarize the TAM and TRM procedures in

    the Appendix to this paper. The next section discussesthe Pareto distributions compared in this study.

    The Pareto Distributions Compared in

    the StudyIn our investigation, we compare two of the forms ofthe Pareto distribution listed in Table 1; namely, oneof the two parameter cases, Form Reference Number

    2c, and the single parameter case, Form Reference

    Number 3. As mentioned above, Form Reference

    Number 2c is the most used in the literature and unlike

    Form Reference Number 3, has a minimum threshold

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    4/14

    The Telecommunications Review 2005 82

    Number of

    Parameters

    Parameter

    RestrictionsF

    c= 1-F(x)

    Form Reference

    Number

    Three , > 0, 0

    +x

    (x ) 1

    Set = 0

    +x (x 0) 2a

    Set =1

    + 1

    1

    x(x ) 2bTwo

    Set =

    x

    (x > 0) 2c

    One Set = 0, =1

    + 11

    x(x 0) 3

    Table 1. Various Forms of the Pareto Distribution, With Parameters = Shape,

    = Scale, and = Location

    One-Parameter: Form Reference Number 3 Two-Parameter: Form Reference Number 2cCase

    Alpha Mean Variance CV Alpha Gamma Mean Variance CV

    Case 1:Matched

    Mean

    and

    Variance

    3.1 0.47619 0.639043 1.678744 2.1639 0.256 0.47595 0.638715 1.679159

    Case 2:

    MatchedAlpha

    and

    Mean

    3.1 0.47619 0.639043 1.678744 3.1 0.32258 0.47619 0.066497 0.54153

    Table 2. Numerical Example of Matching Two Forms of Pareto

    value of the random variable greater than zero. Ta-

    ble 3 shows the complementary CDF and the means

    and variances of the two cases. We note that for themean to exist, the shape parameter must be greaterthan 1, and for the variance to exist, must be greaterthan 2.

    Based on Table 1 (shown earlier) and the twocases, one matching the first two moments of the two

    forms of the distribution (Case 1) and the other

    matching the shape parameter and the mean (Case

    2), we see that if one matched the first two moments,then the one-parameter Pareto has three moments

    (since = 3.1); but the two-parameter only has twomoments (since is just over 2). Therefore, if onewere considering an M/P/1 queueing system, the ex-

    pected queue waiting time would be equal for both

    forms of the Pareto, but the second moment of the

    two-parameter Pareto would not exist. [18] This is the

    first indication that the choice of the form of thePareto does result in different congestion measures,

    even when the first two moments are equal. In this

    example, the differences are significant in that the

    second moment of the waiting time exists in one caseand does not exist in another.

    Let us look at Case 1, where the means and vari-

    ances are equal. Since we are equating means and

    variances, we must have > 2 and 2 > 2. If is

    known then we have 5.02 ))1

    1(2(1

    += and

    )1(

    1

    2

    2

    =

    . As a function of , 2 is monotoni-cally increasing starting at 2 and bounded above by

    2.414. One can also solve for as a function of2,

    which results in .21

    2222

    +

    = For 2 < 2 2.414, is less than 0. Thus, for the case where we have equal

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    5/14

    The Telecommunications Review 2005 83

    Form Reference

    NumberFc(x) Mean Variance Conditions

    3

    + 1x1 1/(-1) 2/[(-2)(-1)]-[1/(-1)]2 x>0, >2

    2c 2x

    2/(2-1) 2 2/(2-2)-[2 /( 2-1)]2 x> , 2>2Table 3. The Pareto Distributions Compared

    means and variances, we will always have > 2 and2.414 > 2 > 2. This implies that the one-parameterPareto could have more than just the first two

    moments but the two-parameter Pareto will only have

    its first two moments.For Case 2, we are considering equal means and

    shape parameters; we have > 1 and =1/ . So is amonotonically decreasing function of and is alwaysless than 1.

    The P/M/1 ModelFor P/M/1, the usual approach for obtaining the

    stationary delay-time distributions and system-size

    probabilities requires solving a root-finding probleminvolving the Laplace-Stieltjes Transform (LST),

    A*(s), of the interarrival-time distribution function.

    The appropriate form of the problem (often called the

    fundamental equation of the branching processes) is tosolve for z in

    )]z1([*Az =

    where 1/ is the expected service time. [9] The load,, equals /, where is the customer arrival rate, andfor the problem to have a non-trivial solution, one

    must have < 1. The unique root of the fundamentalequation of the branching process, say r0 in (0,1), then

    becomes the parameter of a geometric distribution for

    steady-state system sizes at the embedded arrival

    points. These geometric probabilities are then com-

    bined with convolutions of the exponential servicedistribution to derive the stationary line-delay distri-

    bution. For the case of Pareto arrivals, a closed form

    forA*(s) does not exist. We use TAM forA*(s) and

    then use Newtons method to solve for r0.

    Once the root is found, the complete CDF of thequeue or system waiting time is easily determined. [9]

    It has the same functional form as the M/M/1 queue

    except with r0 replacing . The expected queue wait-ing time, Wq, is given by

    )0

    r1(

    0r

    qW

    =

    .

    In actuality, the equivalent load on the system is

    the root, r0, and not . For the case of bursty arrivals,one can show r0 isgreater than .

    Here we look at using the one-parameter or two-

    parameter Pareto as the customer interarrival distribu-tion. We considered two cases; in Case 1, the two

    forms of the Pareto have equal mean and variances,

    and in Case 2, they have equal means and shape pa-

    rameters. Our analysis focuses on solving for the rootof fundamental equation of the branching process.

    First we look at Case 1; in those comparison was fixed at 0.8. For this case, we have seen that theshape parameter () of the one-parameter Pareto isgreater than 2, and the shape parameter of the two-

    parameter Pareto (2) is contained in the interval (2,2.414). Thus, the two-parameter Pareto does not pos-sess moments higher than its second (discussed ear-

    lier).

    In Figure 3, we compared the expected queuewaiting time for the one-parameter and two-parameter

    Pareto, as well as with Poisson arrivals with the same

    mean (but, here, the standard deviation of the Poisson

    equals the mean so that the variance differs from those

    of the Pareto cases where both mean and varianceswere matched). The most important thing we see is

    that using the two-parameter Pareto results in lower

    expected queue waiting times than does the one-pa-rameter Pareto. However, what is more surprising is

    the fact that it is lower than with Poisson arrivals.

    This implies that the root of the fundamental branch-

    ing equation is less than when using the two-pa-rameter Pareto for the arrival distribution.

    Figure 4 shows the root of the fundamental equa-tion of the branching process. For Poisson arrivals,

    the root is ; but for the one-parameter Pareto, the rootis greater than and for the two-parameter Pareto the

    root is less than . This result is quite significant. Letus look into it a bit more.In Figures 3 and 4, for each , the corresponding

    2 and is determined so that the resulting means andvariance are equal. Table 4 presents the results used in

    Figures 3 and 4, and we see that the coefficient of

    variation (CV = the standard deviation divided by themean) is greater than 1 for both the one-parameter and

    two-parameter Pareto. One would expect that this

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    6/14

    The Telecommunications Review 2005 84

    Expected Queue Waiting Time

    0

    2

    4

    6

    8

    10

    2 2.5 3 3.5 4 4.5 5 5.5 6

    Wq

    1Parm 2Parm Poisson

    Expected Queue Waiting Time

    0

    2

    4

    6

    8

    10

    2 2.5 3 3.5 4 4.5 5 5.5 6

    Wq

    1Parm 2Parm Poisson

    Figure 3. Expected Queue Waiting Time Comparisons (Case 1)

    Root

    0.70

    0.75

    0.80

    0.85

    0.90

    0.95

    2 3 4 5 6

    1Parm 2Parm Poisson

    r

    0

    Root

    0.70

    0.75

    0.80

    0.85

    0.90

    0.95

    2 3 4 5 6

    1Parm 2Parm Poisson

    r

    0r

    0

    Figure 4. Root of the Fundamental Equation of the Branching

    Process (Case 1)

    would be reflected in bursty arrivals; that is, arrivals

    that occur in clusters and so experience longer delays

    than seen by the smoother Poisson arrivals and have a

    root greater than . For the one-parameter Pareto, thiswas true (as shown in Figure 4); but for the two-pa-

    rameter Pareto, this was not true. That is, for the two-

    parameter Pareto, we had a coefficient of variationgreater than 1, but the root and expected queue waiting

    time less than 1 would get with Poisson arrivals

    (where the CV = 1).This would tend to say that there is a significant

    difference in the peakedness factor (PF) of the offered

    load. [19] The PF is defined as the standard deviationof the offered load divided by the mean. The mean

    and variance of the offered load is found from the

    probability distribution of the number of customers

    present (or equivalently the number of busy servers) at

    a random point in time in the P/M// system. [19]

    The mean equals , but the variance depends on theform of the Pareto. More specifically, to find the dis-

    tribution of the number of busy servers in P/M//,

    one first uses the results given in Introduction to the

    Theory of Queues [20] to find the number of custom-

    ers an arrival sees in a P/M/S/S with S = . This is theprobability, Pj, of an arrival seeing j customers present

    in a P/M// system. The probability of j customerspresent at a random point in time, Qj, is given by j Qj =

    Pj-1 for j = 1, 2 and Q0 can be found by normali-zation. [20] The mean and variance of the offered

    load is the mean and variance of the probability distri-bution Qj. This procedure was used to generate the PF

    columns as shown inTable 4. We again use the TAM

    approximation for the Pareto arrivals. The mean andvariance in Table 4 is the actual mean and variance of

    the Pareto.

    Bursty arrivals have a PF greater than 1. We see

    the one-parameter Pareto is bursty, but the two-pa-

    rameter Pareto is not. Bursty arrival processes see

    congestion worse than Poisson (r0 > ) and non-burstysee congestion less than Poisson (r0 < ). In this

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    7/14

    The Telecommunications Review 2005 85

    2 Mean Variance CV PF: One-Parameter ParetoPF: Two-

    Parameter Pareto

    2.1 2.024 0.460 0.909 17.355 4.583 1.298 0.987

    2.5 2.095 0.349 0.667 2.222 2.236 1.264 0.976

    2.9 2.145 0.281 0.526 0.893 1.795 1.236 0.971

    3.1 2.164 0.256 0.476 0.639 1.679 1.226 0.969

    3.5 2.195 0.218 0.400 0.373 1.528 1.209 0.965

    3.9 2.219 0.189 0.345 0.244 1.433 1.198 0.965

    4.1 2.230 0.178 0.323 0.203 1.397 1.192 0.962

    4.5 2.247 0.159 0.286 0.147 1.342 1.185 0.960

    4.9 2.262 0.143 0.256 0.111 1.300 1.177 0.959

    5.1 2.268 0.136 0.244 0.098 1.283 1.174 0.959

    5.5 2.279 0.125 0.222 0.078 1.254 1.171 0.957

    5.9 2.289 0.115 0.204 0.063 1.230 1.165 0.955

    Table 4. The Offered Load and PF Comparison (Case 1)

    example, the CV of both forms of the Pareto is greaterthan 1, but the two-parameter Pareto is not bursty, i.e.,

    its PF is < 1, while the PF of the one-parameter Pareto

    is >1.One of the possible reasons is because the two-

    parameter Pareto arrivals are always greater than ,whereas the one-parameter Pareto can have customers

    arriving before . In the case of the two-parameterPareto, this has a tendency to clear out the queue.Figure 5 plots the CDF of the one-parameter Pareto

    evaluated at for the , pairs shown in Table 4. We

    see there is over a 0.45 probability that the one-parameter Pareto will have an arrival in the interval (0,

    ); whereas that probability is 0 in the case of a two-parameter Pareto.

    As , we have 2 21+ and 0.One can numerically investigate this convergence and

    see that it is slow. As gets large, the one-parameterPareto is converging to an exponential distribution, as

    can be seen by looking at its CDF and taking the limit.

    That is, we have F(x) = 1- (1+x)-, and as gets large,it is straight forward to show that F(x) 1- e-x 1-e-(-1) x. Correspondingly, the two-parameter Pareto is

    converging to a distribution that has only two

    moments and has a concentration closer and closer tozero; but is not deterministic because of lack of

    moments greater than two.

    For Case 2, equal means and shape parameters,

    the story is different. In this case, we have

    = 2and = 1/ 2. able 5 gives the root of thefundamental equation of the branching process for

    Case 2, with > 2 and getting larger and = 0.8. Asthe shape parameter gets large, we see that the coeffi-

    cient of variation for the one-parameter Pareto is ap-proaching one from above, and in the two-parameter

    case it is going to zero. Thus, as the shape parameter

    gets large, the one-parameter Pareto is again converg-ing to an exponential distribution and the two-pa-

    rameter Pareto is converging to a deterministic distri-

    bution. This observation is further illustrated in Figure

    6. In that figure, the CDF is plotted for = 10.5. Theone-parameter Pareto CDF is compared to an expo-

    nential distribution with parameter equal to 9.5; that is,

    one with the same mean. We see the two CDFs are

    very close.For the two-parameter Pareto, as 2 gets large, wehave = 1/2 going to zero and the corresponding CVis going to zero. For = 10.5 we see the two-

    parameter CDF is approaching a deterministicdistribution with mean equal to .105. For Case 2, we

    have the two-parameter CDF, 2a

    2)

    x

    1(1)x(F

    = for

    x >1/2 and = 0 for x

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    8/14

    The Telecommunications Review 2005 86

    Probability One-Parameter Pareto Less Than

    0.4600

    0.4800

    0.5000

    0.5200

    0.5400

    0.5600

    2 2.5 3 3.5 4 4.5 5 5.5 6

    F(

    )

    Probability One-Parameter Pareto Less Than

    0.4600

    0.4800

    0.5000

    0.5200

    0.5400

    0.5600

    2 2.5 3 3.5 4 4.5 5 5.5 6

    F(

    )

    Figure 5. CDF of One-Parameter Pareto Evaluated at (Case 1)One-Parameter Pareto Two-Parameter Pareto

    Mean Variance CV r0 2 Mean Variance CV r02.5 0.667 2.222 2.236 0.8894 2.5 0.400 0.667 0.356 0.894 0.7195

    3.5 0.400 0.373 1.528 0.8582 3.5 0.286 0.400 0.030 0.436 0.6682

    4.5 0.286 0.147 1.342 0.8443 4.5 0.222 0.286 0.007 0.298 0.6549

    5.5 0.222 0.078 1.254 0.8349 5.5 0.182 0.222 0.003 0.228 0.6427

    6.5 0.182 0.048 1.202 0.8258 6.5 0.154 0.182 0.001 0.185 0.6325

    7.5 0.154 0.032 1.168 0.8234 7.5 0.133 0.154 0.001 0.156 0.6380

    8.5 0.133 0.023 1.144 0.8231 8.5 0.118 0.133 0.000 0.135 0.6340

    9.5 0.118 0.018 1.125 0.8175 9.5 0.105 0.118 0.000 0.118 0.6344

    10.5 0.105 0.014 1.111 0.8146 10.5 0.095 0.105 0.000 0.106 0.6319

    Table 5. Root of the Fundamental Equation of the Branching Process (Case 2)

    CDF Comparisons

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    0 0.1 0.2

    x

    F(x)

    F(x):1parm F(x):exp(alpha-1) F(x):2parm F(x):D

    CDF Comparisons

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    0 0.1 0.2

    x

    F(x)

    F(x):1parm F(x):exp(alpha-1) F(x):2parm F(x):D

    CDF Comparisons

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    0 0.1 0.2

    x

    F(x)

    F(x):1parm F(x):exp(alpha-1) F(x):2parm F(x):D

    Figure 6. One-Parameter Pareto and Two-Parameter Pareto

    CDF as Shape Parameter Gets Large (Case 2, = 10.5)

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    9/14

    The Telecommunications Review 2005 87

    have a PF greater than 1? To examine this issue, we

    look at Case 2 with 1 < = 2 < 2; that is, their meansexist, but their second moments do not. Thus, their

    coefficients of variation are not strictly defined;

    however, if we assume the variances are infinite, then

    the associated CV > 1 for both forms of the Pareto.

    In Table 6, we examine Case 2 with set to 0.8and, for all but the last row, both the one-parameterPareto and the two-parameter Pareto do not have a

    variance. For those cases, we see that the expected

    queue waiting time was greater than if the arrivalprocess were Poisson. Thus, the two-parameter Pareto

    has peaked or bursty arrivals when 1 < 2 < 2. Wealso see that for this example, the one-parameter

    Pareto is significantly more peaked than the

    corresponding two-parameter Pareto.

    The last row of Table 6 presents the situation

    where the one- and two-parameter Paretos do havevariances. We see the one-parameter Pareto

    maintaining its PF being greater than 1, but the

    expected queue waiting time for the two-parameterPareto is less than that of Poisson arrivals; indicating

    a PF less than 1.

    An important point shown in Table 6 is when 1 0 and so tended to clear out the queue.When used as the service time distribution in M/P/1, it

    would say that the service time is always greater than > 0 and, hence, one would think the two-parameter

    Pareto would introduce more congestion. Figure 7 and

    Table 8 show this is not true. Since this result iscounter to what we expected, we verified our findings

    using a simulation.For Cases 1a and 1b, the CVs were 4.583 and

    1.679, respectively. We thought it would be

    worthwhile to look at a situation where the CV waslarge, say CV = 10, to see if our numerical conclusions

    still held. Again, = 0.8 and because of the largeCV, the expected queue waiting time was 198.02.Table 9 compares the quantiles obtained using TRM

    and the simulation. Again we see very close

    agreement. In addition, the two-parameter Pareto once

    again results in less quantile congestion than the one-

    parameter Pareto, even in the case of a large expectedqueue waiting time.

    In summary for M/P/1, for the case where eachform of the Pareto has equal means and variance or

    just equal mean and shape parameters, using the two-

    parameter Pareto will result in smaller quantile

    congestion. This result is not intuitive, but has been

    validated against simulation results.

    ConclusionsIn this paper, we numerically investigated whether

    using a one- and two-parameter Pareto makes a differ-

    ence in the results obtained from congestion models.Two cases were numerically investigated. In Case 1,

    each form of the Pareto had equal means and vari-ances; in Case 2, each form of the Pareto had equal

    means and shape parameters. We used the TAM to

    find the root of the fundamental branching process in

    P/M/1 and the TRM to numerically find the CDF of

    the queue waiting time in M/P/1.

    For both queueing systems and each case, we nu-merically found the performance measure given by the

    two-parameter Pareto was less than the performance

    measure using the one-parameter Pareto. For theP/M/1 queue, this result made sense as the two-pa-

    rameter Pareto guaranteed no arrivals during a certain

    period of time, thereby clearing out the queue. For theM/P/1, this result was not intuitive, but was substanti-

    ated using simulation results. For both queues, further

    investigation into the reasons for this ordering are

    planned. One of the possible reasons for the ordering

    is that it appears the two-parameter Pareto has a fattertail than the one-parameter Pareto.

    For P/M/1 and for Case 1, the coefficient of

    variation (standard deviation divided by the mean) wasgreater than one for both forms, but for the two-pa-

    rameter Pareto, the PF was less than 1. The PF factor

    is defined as the standard deviation of the number of

    customers in a P/M// divided by the mean at a ran-

    dom point in time. This result was again counter-in-tuitive, as one would expect that if a distribution had

    the coefficient of variation greater than 1, then its as-

    sociated PF would also be greater than 1. This result

    was also verified using simulation. The net effect ofthis was that using a two-parameter Pareto in P/M/1

    could yield performance measures that were less than

    those obtained with comparable Poisson arrivals.

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    12/14

    The Telecommunications Review 2005 90

    Wq(t) Quantiles

    One-Parameter Pareto Two-Parameter Pareto 2 E[Wq] Method0.5 0.8 0.9 0.95 0.5 0.8 0.9 0.95

    SIM 4.67 24.84 52.03 97.11 1.83 7.82 15.29 27.330.8162 2.02 2.005 0.4913 198.02

    TRM 4.68 24.79 51.66 95.99 1.82 7.81 15.26 27.21

    Table 9. Quantile Comparison of TRM and Simulations for CV = 10

    All of these results highlight the importance ofselecting the distribution most appropriate to the

    application or data being studied, as the queueing

    measures can be quite different.

    AcknowledgementsThis work was partially supported by the NationalScience Foundation Grant DMII-0140232:

    Development of Procedures to Analyze Queueing

    Models with Heavy-Tailed Interarrival and ServiceTimes. Drs. Fischer and Masi would also like to thank

    Mitretek Systems for their support of this work.

    Appendix: Transform Approximation

    Method (TAM) and TAM Recursion

    Method (TRM)One problem with using heavy-tailed distributions is

    that they do not have closed-form Laplace transforms.

    This makes numerical techniques involving heavy-tailed distributions more challenging. Such techniques

    generally require nesting two numerical procedures:

    (a)numerically approximating the Laplace transform

    of the heavy-tailed distribution; and (b)numerically

    inverting the Laplace transform of the distribution ofinterest. There are multiple ways to do both (a) and

    (b). In this paper, we use the TAM [17] to do (a). We

    use this method for its generality and ease ofimplementation. Other methods would also work

    (e.g., methods to approximate Laplace transforms of

    heavy-tailed distributions. [21, 22] The results of this

    paper do not depend significantly on the underlyingnumerical methods. To invert the distribution of

    interest (b), we use a recursion method for the M/G/1

    queue that is based on TAM called TRM. [23] Again,

    other inversion methods would also work, [24] but weuse this one for ease of implementation. For

    completeness, we briefly summarize these twomethods.

    Transform Approximation Method

    Given a CDFFwith Laplace-Stieltjes transform

    =

    0

    sx* )x(dFe)s(B

    and pointsx1, ,xN, the TAM approximation is:

    =

    N

    1i

    sxi

    * .ep)s(B i

    where

    1N,,3,2i,2

    )x(F)x(Fp 1i1ii =

    = + K .

    The idea is to assign to xi half of the probability

    between the points to the left and right ofxi. There are

    exceptions at the boundaries where the leftoverprobability near zero and infinity must be counted so

    all weights add to 1:

    2

    )x(F)x(F1p,

    2

    )x(F)x(Fp N1NN

    211

    +=

    +=

    The approximation points xi are arbitrary. One

    way to choose them is as follows: Choose xi so that

    Fc(xi) = qi, for some constant 0 < q < 1. The idea is to

    choose points that rapidly get far out in the tail of thedistribution. For the one-parameter Pareto (Case 3),

    this becomes

    1qx /ii = .

    For the two-parameter Pareto (Case 2c), this becomes

    xi= qi / .

    TAM Recursion Method

    The recursion method for the M/G/1 queue [21] issummarized as follows: Let T >0 be some small

    number and let Fn = F(nT). Fn can be approximated

    through the following recursion:

    T1

    1F0

    , )T1/(FcTFF1N

    0jjjn1nn

    = ,

    where cn is the sum of the pi such that n = Round(xi /

    T), xi and pi are parameters from the TAM

    approximation, and c0 is assumed to be 0.

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    13/14

    The Telecommunications Review 2005 91

    Notes and References1. Crovella, M. E., M. S. Taqqu, and A. Bestavros,

    Heavy-Tailed Probability Distributions in the

    World Wide Web, A Practical Guide to Heavy

    Tails, R. J. Adler, R. E. Feldman, and M. S.Taqqu, Editors, Birkhauser, Boston, MA, pp. 3

    27, 1998.2. Paxson, V. and S. Floyd, Wide Area Traffic:

    The Failure of Poisson Modeling, IEEE/ACM

    Transactions on Networking, Volume 3, pp. 226

    244, June 1995.

    3. Harris, C. M., P. H. Brill, and M. J. Fischer,Internet-Type Queues with Power-Tailed Interar-

    rival Times and Computational Methods for Their

    Analysis, INFORMS Journal on Computing,

    Volume 12, pp. 261271, 2000.4. Juneja, S., P. Shahabuddin, and A. Chandra,

    Simulating Heavy-Tailed Processes Using De-

    layed Hazard Rate Twisting, Proceedings of the

    1999 Winter Simulation Conference, P. A. Far-rington, H. B. Nembhard, D. T. Sturrock, and

    G. W. Evans, Editors, Phoenix, AZ, pp. 420427,

    1999.

    5. Smith, R. L., Statistics of Extremes, With Appli-cations in Environment, Insurance, and Finance,

    Chapter 1, Extreme Values in Finance, Telecom-

    munications and the Environment, B. Finkenstadt

    and H. Rootzen, Editors, Chapman and Hall/CRCPress, London, pp. 178, 2003.

    6. Mayhew, S., Security Price Dynamics and Simu-lation in Financial Engineering, Proceedings ofthe 2002 Winter Simulation Conference, E. Yce-

    san, C. H. Chen, J. L. Snowdon, and J. M. Char-nes, Editors, San Diego, CA, pp. 15681574,

    2002.7. Embrechts, P., S. Resnick, and G. Samorodnitsky,

    Extreme Value Theory as a Risk Management

    Tool, North American Actuarial Journal, Vol-

    ume 3, pp. 3041, 1999.8. Embrechts, P., C. Kluppelberg, and T. Mikosch,

    Modeling Extremal Events, Springer, New York,

    1997.9. Gross, D. and C. M. Harris, Fundamentals of

    Queueing Theory, Third Edition, John Wiley and

    Sons, New York, 1998.

    10. Masi, D. M. B., M. J. Fischer, D. Gross, and J. F.Shortle, Using Quantile Estimates in SimulatingInternet Queues with Heavy-Tailed Service

    Times, Proceedings of 5th World Multi-Confer-

    ence on Systemics, Cybernetics, and Informatics,Orlando, FL, pp. 414419, July 2225, 2001.

    11. Fischer, M. J., D. M. B. Masi, D. Gross, and J. F.Shortle, Loss Systems With Heavy-Tailed Arri-

    vals, The Telecommunications Review, Volume

    15, Mitretek Systems, 2004.

    12. Fischer, M. J., D. M. B. Masi, P. H. Brill,D. Gross, and J. F. Shortle, Using the CorrectHeavy-Tailed Arrival Distribution in Modeling

    Congestion Systems, The 11th International

    Conference on Telecommunication Systems Man-

    agement, Naval Postgraduate School, Monterey,CA, October 25, 2003.

    13. Fowler, T. B., A Short Tutorial on Fractals andInternet Traffic, The Telecommunications Re-view, Volume 10, Mitretek Systems, 1999.

    14. Asmussen, S. and K. Binswanger, Simulation ofRuin Probabilities for Subexponential Claims,

    ASTIN Bulletin, Volume 27, pp. 297318, 1997.15. Shortle, J. F., P. H. Brill, M. J. Fischer, D. Gross,

    and D. M. B. Masi, An Algorithm to Compute

    the Waiting Time Distribution for the M/G/1

    Queue, INFORMS Journal on Computing, Vol-ume 16(2), pp. 152161, 2004.

    16. Gross, D., J. F. Shortle, M. J. Fischer, and D. M.B. Masi, Difficulties in Simulating Queues withPareto Service, Proceedings of the 2002 Winter

    Simulation Conference, E. Ycesan, C. H. Chen,

    J. L. Snowdon, and J. M. Charnes, Editors, San

    Diego, CA, pp. 407415, 2002.

    17. Fischer, M. J., D. M. B. Masi, P. H. Brill,D. Gross, and J. F. Shortle, Development of

    Procedures to Analyze Queueing Models with

    Heavy-Tailed Interarrival and Service TimeA

    Status Report, 2003 NSF Design, Service, andManufacturing Grantees and Research

    Conference Proceedings, R. G. Reddy, Editor,

    Birmingham, AL, January 69, 2003.18. Cohen, J. W., The Single Server Queue, Revised,

    North-Holland, Amsterdam, 1982.

    19. Cooper, R. B., Introduction to Queueing Theory,Third Edition, CEEPress, Washington, DC, 1990.

    20. Takacs, L.,Introduction to the Theory of Queues,Oxford University Press, New York, 1961.

    21. Feldmann, A. and W. Whitt, Fitting Mixtures ofExponential to Long-Tail Distributions to Ana-lyze Network Performance Models,Performance

    Evaluation, Volume 31, pp. 245279, 1998.

    22. Abate, J. and W. Whitt, Computing LaplaceTransforms for Numerical Inversion Via Contin-

    ued Fractions,INFORMS Journal on Computing,Volume 11, pp. 394405, 1999.

    23. Shortle, J. F., M. J. Fischer, D. Gross, and D. M.B. Masi, Using the Transform ApproximationMethod to Analyze Queues with Heavy-Tailed

    Service, Journal of Probability and Statistical

    Science, Volume 1(1), pp. 1730, 2003.

  • 8/2/2019 12 Fischer Masi Gross Shortle Tr2005

    14/14

    The Telecommunications Review 2005 92

    24. Abate, J., G. L. Choudhury, and W. Whitt, AnIntroduction to Numerical Transform Inversion

    and Its Application to Probability Models,

    Computational Probability, W. Grassman, Editor,Kluwer, Boston, MA, pp. 257323, 1999.

    About the Authors

    Martin J. Fischer, Ph.D., is a senior

    fellow at Mitretek. His experienceincludes network design and perform-ance analysis in telecommunications.

    He has published over 30 articles in

    refereed journals. He received his doc-torate degree in operations research

    from Southern Methodist University.Dr. Fischer may be contacted at

    [email protected].

    Denise M. Bevilacqua Masi, Ph.D., is asenior principal engineer at Mitretek.Her experience and research interestsinclude queueing theory and simulation

    applied to telecommunications net-works. She received her doctorate de-

    gree in information technology and

    engineering at George Mason Univer-sity. Dr. Masi may be contacted at

    [email protected].

    Donald Gross, Ph.D., is a researchprofessor in the Department of Systems

    Engineering and Operations Researchat George Mason University and pro-

    fessor emeritus of Operations Researchat George Washington University. He

    is the co-author of the well-known book,

    Fundamentals of Queueing Theory. Hehas authored numerous publications in the field of queueingtheory, and is past president of INFORMS. He has received

    the INFORMS Kimball Medal for Service to the OperationsResearch Profession. Dr. Gross may be contacted [email protected].

    John F. Shortle, Ph.D., is an assistantprofessor of Systems Engineering at

    George Mason University. His experi-ence includes developing stochastic,

    queueing, and simulation models tooptimize networks and operations.

    His research interests include simula-tion and queueing applications in tele-

    communications and air transporta-tion. He received his doctorate degree in operationsresearch from UC Berkeley. Dr. Shortle may be contactedat [email protected].


Recommended