Download - Probability Plotting for the Weibull Distribution - …cbucc.com/pds_file/2005092163603.pdf · Probability Plotting for the Weibull ... ranking methods because median ranks are at

Probability Plotting for the Weibull Distribution

One method of calculating the parameters of the Weibull distribution is by using probability

plotting. To better illustrate this procedure, consider the following example from Kececioglu

[18].

Example 1

Assume that six identical units are being reliability tested at the same application and

operation stress levels. All of these units fail during the test after operating the following

number of hours, : 93, 34, 16, 120, 53 and 75. Estimate the values of the parameters for a

two-parameter Weibull distribution and determine the reliability of the units at a time of 15

hours.

Solution to Example 1

The steps for determining the parameters of the Weibull pdf representing the data, using

probability plotting, are outlined in the following instructions.

First, rank the times-to-failure in ascending order as shown next.

Obtain their median rank plotting positions. Median rank positions are used instead of other

ranking methods because median ranks are at a specific confidence level (50%). Median

ranks can be found tabulated in many reliability books. They can also be estimated using the

following equation,

where i is the failure order number and N is the total sample size.

The exact median ranks are found in Weibull++ by solving,

for MR, where N is the sample size and i the order number.

The times-to-failure, with their corresponding median ranks, are shown next.

On a Weibull probability paper, plot the times and their corresponding ranks. A sample of a

Weibull probability paper is given in Figure 7, and the plot of the data in the example in Figure

8.

Fig. 7: Example of Weibull probability plotting paper

Draw the best possible straight line through these points, as shown below, then obtain the

slope of this line by drawing a line, parallel to the one just obtained, through the slope indicator.

This value is the estimate of the shape parameter , in this case = 1.4.

Fig. 8: Probability plot of data in Probability Plotting Example

At the Q(t) = 63.2% ordinate point, draw a straight horizontal line until this line intersects the

fitted straight line. Draw a vertical line through this intersection until it crosses the abscissa.

The value at the intersection of the abscissa is the estimate of . For this case = 76 hours.

(This is always at 63.2% since Q(T) = 1 - = 1- = 0.632 = 63.2%.)

Now any reliability value for any mission time t can be obtained. For example the reliability for

a mission of 15 hours, or any other time, can now be obtained either from the plot or

analytically.

To obtain the value from the plot, draw a vertical line from the abscissa, at t = 15 hours, to the

fitted line. Draw a horizontal line from this intersection to the ordinate and read Q(t), in this

case Q(t) = 9.8%. Thus, R(t) = 1 - Q(t) = 90.2%.

This can also be obtained analytically, from the Weibull reliability function, since the estimates

of both of the parameters are known or,

Probability Plotting for the location parameter,

The third parameter of the Weibull distribution is utilized when the data do not fall on straight

line, but fall on either a concave up or down curve. The following statements can be made

regarding the value of :

Case 1: If the curve for MR versus is concave down and the curve for MR versus ( - )

is concave up, then there exists a such that 0 < < , or has a positive value.

Case 2: If the curves for MR versus and MR versus ( - ) are both concave up, then

there exists a negative , which will straighten out the curve of MR versus .

Case 3: If neither one of the previous two cases prevail, then either reject the Weibull pdf as

one capable of representing the data, or proceed with the multiple population (mixed Weibull)

analysis.

To obtain the location parameter, :

?/ΦΟΝΤ> Subtract the same arbitrary value, , from all the times to failure, and replot the

data.

?/ΦΟΝΤ> If the initial curve is concave up subtract a negative from each failure time.

?/ΦΟΝΤ> If the initial curve is concave down subtract a positive from each failure time.

?/ΦΟΝΤ> Repeat until the data plots on an acceptable straight line.

?/ΦΟΝΤ> The value of is the subtracted (positive or negative) value that places the points

in an acceptable straight line.

The other two parameters are then obtained using the techniques previously described. Also,

it is important to note that we used the term subtract a positive or negative gamma, where

subtracting a negative gamma is equivalent to adding it. Note that when adjusting for gamma,

the x-axis scale for the straight line becomes (T - ).

Example 2

Six identical units are reliability tested under the same stresses and conditions. All units are

tested to failure, and the following times-to-failure are recorded: 48, 66, 85, 107, 125 and 152

hours. Find the parameters of the three-parameter Weibull distribution using probability

plotting.


The following figure shows the results. Note that since the original data set was concave down,

17.26 was subtracted from all the times-to-failure and re-plotted, resulting in a straight line,

thus = 17.26. (We used Weibull++ to get the results. To perform this by hand, one would

attempt different values of , using a trial and error methodology, until an acceptable straight

line is found. When performed manually, you do not expect decimal accuracy.)

Fig. 9: Probability Plot of data in Probability Plotting for the Location Parameter

example

Weibull Rank Regression on Y

Performing rank regression on Y requires that a straight line mathematically be fitted to a set

of data points such that the sum of the squares of the vertical deviations from the points to the

line is minimized. This is in essence the same methodology as the probability plotting method,

except that we use the principle of least squares to determine the line through the points, as

opposed to just 밻 yeballing?it.

The first step is to bring our function into a linear form. For the two-parameter Weibull

distribution, the cdf (cumulative density function) is,

(8)

Taking the natural logarithm of both sides of the equation yields,

or,

(9)

Now let,

(10)

(11)

and,

(12)

which results in the linear equation of,

The least squares parameter estimation method (also known as regression analysis) was

discussed in the Statistical Background chapter and the following equations for regression on

Y were derived in the Parameter Estimation chapter.

(13)

and,

(14)

In this case the equations for and are,

and,

The F s are estimated from the median ranks.

Once and are obtained, then and can easily be obtained from Eqns. (11) and (12).

The Correlation Coefficient

The correlation coefficient is defined as follows,

where, = covariance of x and y, = standard deviation of x and = standard deviation

of y.

The estimator of is the sample correlation coefficient, , given by,

(15)

Example 3

Consider the data in Example 1, where six units were tested to failure and the following failure

times were recorded: 16, 34, 53, 75, 93 and 120 hours. Estimate the parameters and the

correlation coefficient using rank regression on Y, assuming that the data follow the two-

parameter Weibull distribution.


Construct a table as shown below.

Table 1- Least Squares Analysis

Utilizing the values from Table 1, calculate and using Eqns. (13) and (14),

or,

and,

or,

Therefore, from Eqn. (12),

and from Eqn (11)

or,

The correlation coefficient can be estimated using Eqn. (15),

The above example can be repeated using Weibull++ 6. The steps for using the application

are as follows:

?/ΦΟΝΤ> Start Weibull++ and create a new Data Folio.

?/ΦΟΝΤ> Select the Times to Failure option.

?/ΦΟΝΤ> Enter the times-to-failure in the spreadsheet (ignore the Subset ID column). The

times-to-failure need not be sorted, Weibull++ will automatically sort the data.

?/ΦΟΝΤ> Select the desired method of analysis. Note that we are assuming that the

underlying distribution is the Weibull, so make sure that the Weibull distribution is selected.

(Just click Weibull. It will turn red when selected.)

Also, so that you get the same results as this example, switch to the Set Analysis page

and make sure you are using the Rank Regression on Y calculation method with this

example.

Note that this can also be done from the Main page by clicking the left bottom box under

the Results area. Each time you click that box you will see the method switch between

MLE, RRX, and RRY.

?/ΦΟΝΤ> Under Parameters/Type on the main page, choose 2. Click the Calculate icon or

select Calculate Parameters from the Data menu. The results should appear in the Data

Folio's Results Area. The next figure shows the results for this example.

You can now plot the results by clicking the Plot icon or by selecting Plot Probability from

the Data menu.

The Weibull probability plot for these data is shown next.

The confidence bounds, as determined from the Fisher matrix, can also be plotted.

If desired, the Weibull pdf representing these data can be written as,

or,

You can also plot the Weibull pdf by selecting Pdf Plot from the Special Plot Type option on

the control panel to the right of the plot area.

From this point on, different results, reports and plots can be obtained.

Weibull Rank Regression on X

Performing a rank regression on X is similar to the process for rank regression on Y, with the

difference being that the horizontal deviations from the points to the line are minimized, rather

than the vertical.

Again the first task is to bring our cdf function, Eqn. (8), into a linear form. This step is exactly

the same as in the regression on Y analysis and Eqns. (9), (10), (11) and (12) apply in this

case too. The derivation from the previous analysis begins on the least squares fit part, where

in this case we treat x as the dependent variable and y as the independent variable.

The best-fitting straight line to the data, for regression on X (see the Statistical Background

chapter), is the straight line,

(16)

The corresponding equations for and are,

and,

where,

and,

and the F( ) values are again obtained from the median ranks.

Once and are obtained, solve Eqn. (16) for y, which corresponds to,

Solving for the parameters from Eqns. (11) and (12) we get,

(17)

and,

(18)

The correlation coefficient is evaluated as before using Eqn. (15).

Example 4

Repeat Example 1 using rank regression on X.


Table 1, constructed in Example 3, can also be applied to this example. Using the values from

this table we get,

or,

and,

or,

Therefore, from Eqn. (18),

and from Eqn. (17),

The correlation coefficient is found using Eqn. (15):

The results and the associated graph using Weibull++ 6 are given next. Note that the slight

variation in the results is due to the number of significant figures used in the estimation of the

median ranks. Weibull++ by default uses double precision accuracy when computing the

median ranks.

Three-Parameter Weibull Regression

When the MR versus points plotted on the Weibull probability paper do not fall on a satisfactory straight line and the points fall on a curve, then a location parameter, , might

exist which may straighten out these points.

The goal in this case is to fit a curve, instead of a line, through the data points, using nonlinear

regression The Gauss-Newton method can be used to solve for the parameters, , and ,

by performing a Taylor series expansion on F( ; , , ). Then the nonlinear model is

approximated with linear terms and ordinary least squares are employed to estimate the

parameters. This procedure is iterated until a satisfactory solution is reached.

Weibull++ 6 calculates the value of by utilizing an optimized Nelder-Mead algorithm, and

adjusts the points by this value of such that they fall on a straight line, and then plots both

the adjusted and the original unadjusted points. To draw a curve through the original

unadjusted points, if so desired, choose the Weibull 3P Line Unadjusted for Gamma option

from the Show Plot Line submenu under the Plot Options menu. The returned estimations of

the parameters are the same when selecting RRX or RRY.

The results and the associated graph for the previous example using the three-parameter

Weibull case are shown next:

Maximum Likelihood Estimation for the Weibull Distribution

As it was outlined in the Statistical Background chapter, maximum likelihood estimation works

by developing a likelihood function based on the available data and finding the values of the

parameter estimates that maximize the likelihood function. This can be achieved by using

iterative methods to determine the parameter estimate values that maximize the likelihood

function, but this can be rather difficult and time-consuming, particularly when dealing with the

three-parameter distribution. Another method of finding the parameter estimates involves

taking the partial derivatives of the likelihood function with respect to the parameters, setting

the resulting equations equal to zero and solving simultaneously to determine the values of the

parameter estimates. The log-likelihood functions and associated partial derivatives used to

determine maximum likelihood estimates for the Weibull distribution are covered in the

Distribution Log-Likelihood Equations chapter.

Example 5

Repeat Example 1 using maximum likelihood estimation.


In this case we have non-grouped data with no suspensions or intervals, i.e. complete data.

The equations for the partial derivatives of the log-likelihood function are derived in the

Distribution Log-Likelihood Equations chapter and given next:

and,

Solving the above equations simultaneously we get,

The variance/covariance matrix is found to be,

The results and the associated graph using Weibull++ (MLE) are shown next.

You can view the variance/covariance matrix directly from the Quick Calculation Pad (QCP) of

Weibull++ by clicking the Fisher Matrix button.

Note that the decimal accuracy displayed and used is based on your individual User Setup.