11 BS201 Prob and Stat - Ch4

78

CHAPTER - 4 Sampling Distributions and Tests of Hypothesis

4.1 Concepts of Population and Samples:

4.1.1 Population

A group of individuals under investigation is called a Population.

In Statistical analysis, the word population refers to general magnitude and the study of

variation with respect to one or more characteristics relating to the individuals in a

group is known as population. Thus, the population is nothing but an aggregate study of

group of objects, living or non-living, at a particular time period of the given

geographical location.

The population is further classified into two types: namely, Finite population, which

constitutes a finite or countable finite number of objects, for example: the no. of

engineering colleges under ANU, the no. of sugar cane industries in AP state, etc., On the

other hand an infinite population which consists of an infinite no. of objects, for example:

the no of independent trials of an unbiased coin, etc.,

4.1.2 Sample and Sample Size

A finite subset of observations chosen from a given population is known as a sample and

the number of observations in the sample is known as sample size.

Classification of Samples: Samples are classified into two ways.

Large Sample: If the size of the sample 30 the sample is said to be large sample.

Small Sample: If the size of the sample < 30 the sample is said to be small

sample.

4.1.3 Methods of drawing samples

There are two different methods of selecting sample from the given population.

a) Lottery Method: Under this method, all the members (observations) of the

given population are numbered serially from 1, 2,.., N on an uniform slips and

79

put into a box. A required number of members (observations) are selected from

these thoroughly shuffled slips, is known as a sample.

This method is so simple and easy to apply and understand, if and only if the

population size is small. When the population size is large or sufficiently large

this method is not applicable. Moreover this method is not reliable as since

different persons may get different samples from same set of population and

sample, which we are selected, may not be a true representation for the

population characteristics.

b) Random Numbers Method: This is most important and widely applied method

to select a sample from the given population (finite or infinite). The method

consists of the following steps:

1 Let us consider a population of size N observations, say 1, 2, 3, ...., N.

2 The population size N may be a single digit (0,1, .., 9) or a two digit

(00,01,, 99) or a three digit number etc.,

3 A required sample of size n is selected from the well defined random

numbers tables given by Fisher and Yates tables, by neglecting the starting

and the ending number in the population, by moving in any direction, i.e.,

horizontally or vertically down words or diagonally.

4 The sample by this method is a true representative of the population

characteristics.

4.1.4 Random Sample (Finite Population)

Let us consider (x1, x2, .. , xn) be a sample of n observations from a finite population

and is said to be a random sample, if it satisfies the following two conditions:

1 All the sample observations are independent

2 All the sample observations are known chance of being selected in the

sample (i.e., N

n)

4.1.5 Random Sample (Infinite Population)

Let us consider (x1, x2, .. , xn) be a sample of n observations from an infinite

population and is said to be a random sample, if it satisfies the following two conditions:

80

1 All the sample observations are independent

2 All the sample observations are satisfies the probability function associated

with the given population.

4.2 Notation and Terminology

Population mean

Population variance 2

Population s.d.

Sample mean x

Sample variance s2

Sample s.d. s

4.3 Sampling

This is one of the methods of studying the population and this is most commonly applied

to the data. Under this method, the population is studied on the basis of the

information available in the sample, which is selected from the given population.

4.4 Parameter and Statistic

Any statistical constant of the Population observations is called the Parameter. In other

words, Parameter is a single constant value, which represents the entire population

characteristics. Whereas any statistical constant of the Sample observations is called a

Statistic. In other words, Statistic is a mathematical constant value, which represents

the sample of observations. Statistic is known as a parameter estimate.

4.5 Sampling Distribution

A series of statistic values are arranged in the form of a frequency distribution is known

as a sampling distribution.

More precisely, a sample of size n is selected from a given population of size N. Then

the number of possible samples will be nNC = k, say. Now calculate a statistic such

as mean, median, mode, standard deviation, etc., on each of the k samples, then we will

81

get a series of k statistic values. These statistic values are arrange in the form of a

frequency table is known as sampling distribution.

Suppose if we calculate the arithmetic mean on k samples then we will get sampling

distribution of sample mean, if we calculate the s. d., on k samples then we will get

sampling distribution of sample s. d., etc.,

4.6 Standard Error

The standard deviation of sampling distribution is known as standard error.

4.7 The Sampling Distribution of the Mean ( Known)

If we want to draw a sample of size n (without replacement) from a finite population of

size N, the total no. of possible samples are C(N,n) =

= k(say)

Example 4.1

A population consists of 4 observations 10, 20, 30, 40 write all the possible samples of

size 2 and then construct the sampling distribution (i) about mean (ii) about variance (iii)

also show that the mean of sample means is equal to the population mean

Solution

Given population observations are 10, 20, 30, 40

A sample of size 2 can be drawn in C(4,2) = 6 ways

Sample distribution about mean and Varience:

S.No. Sample observations Sample mean Sample Variance

1) 10, 20 1 = 15 S1

2 = 25

2) 10, 30 2 = 20 S2

2 = 100

3) 10,40 3 = 25 S3

2 = 225

4) 20, 30 4 = 25 S42 = 25

5) 20, 40 5 = 30 S52 = 100

6) 30, 40 6 = 35 S62 = 35

x

x

x

x

x

x

82

(iii) Mean of Sample means = (15+20+25+25+30+35)/6 = 25= x

Population mean = 10+20+30+40 = 25

x

4.8 The Central Limit Theorem

If repeated samples of size n are drawn from any infinite population with mean and

variance 2, and n is large (n 30), the distribution of , the sample mean, is

approximately normal, with mean (i.e. ( )E x ) and variance 2/n (i.e. 2

( )V xn

),

and this approximation becomes better as n becomes larger.

Note

As in the previous illustrating example, we can see the following modifications:

(i) If the population is finite, 2

( ) (1 )n

V xN n

; where (1-n/N) is known as

the finite population correction factor. When N is very big, the factor is

equal to 1.

(ii) If n is small, say less than 30, the sampling distribution is not so normal. A

t-distribution will be used (discussed later).

In the above example, N=4, n=2, 2

( ) (1 )n

V xN n

=(1-2/4)(1.6667/2) = 0.4167. If the

population is big (or the sample is drawn with replacement), then 2

( )V xn

=1.6667/2=0.8333.

In this course we assume a big population or sampling with replacement.

Example 4.2

An electrical firm manufactures light bulbs that have a length of life that is

approximately normal distributed with mean equal to 800 hours and a standard

deviation of 40 hours. Find the probability that a random sample of 16 bulbs will have

an average life of less than 775 hours.

x

83

Solution

Let X be the average life of the 16 bulbs. X N( x 800, n

x

22

16

402)

16

40

800775775)775( ZP

XPXP

x

x

x

x

00621.0)5.2( ZP

Example 4.3

The mean IQ scores of all students attending a college is 110 with a standard deviation

of 10. (a) If the IQ scores are normally distribution, what is the probability that the

score of any one student is greater than 112? (b) What is the probability that the

mean score in a random sample of 36 students is greater than 112? (c) What is the

probability that the mean score in a random sample of 100 students is greater than 112?

Solution

(a) Let X be the student's IQ score. X N(110, 210 )

4207.0)2.0(10

110112)112(

ZPZPXP

(b) Let 1X be the mean score of a sample of 36 students.

1X N( ,110 x 36

10222 n

x

)

1151.0)2.1(

36

10

110112)112( 1

ZPZPXP

(c) Let 2X be the mean score of a sample of 100 students.

2X N( ,110 100

1022

n

)

84

228.0.0)2(

100

10

110112)112( 2

ZPZPXP

4.9 Theory of Estimation:

4.9.1 Estimation

The process of estimating the unknown values of the population parameters on the

basis of information contained in the sample is known as estimation.

4.9.2 Estimator and Estimate

Let us consider (x1, x2, .. , xn) be a random sample of size n is chosen to from a

population, for which the parameter is an unknown value.

Any statistic or mathematical function that is defined on this sample is known as an

estimator and the particular numerical value, which is assigned to the estimator, is

known as an estimate.

The followings are some examples

Estimator Population parameter

1. x

2. s2 2

There are two important properties for an estimator, namely, unbiasedness and

efficiency.

An estimator, for example, x , is unbiased if and only if ( )E x .

Efficiency: The efficiency of an estimator, for example, x , is given by ( )V x . The smaller

the ( )V x , the more accurate will be the x as an estimator.

4.9.3 Point Estimation and Interval Estimation

Let us consider be an estimator to the parameter of the given population.

A point estimate is a single-value estimate of a population parameter, for example

^ ^

;x P p . In other words, if the estimator is a single value in the entire real number

scale then such estimation is known as point estimation.

85

An interval estimate of a population parameter gives an interval that may contain the

true value of the parameter with a certain probability (i.e. confidence); for example,

Pr( ) 0.99.a b In other words, if the estimator provides an interval of values

between which the true values of the population parameter will exists is known as

Interval Estimation. The interval of values is known as Confidence Interval and the limit

is a confidence limits.

For a point estimate, both the accuracy and reliability of the estimation are unknown.

For an interval estimate, the width of the interval gives the accuracy and the probability

gives the reliability of the estimation.

Example 4.4

(a) The mean and standard deviation for the quality point averages of a random

sample of 36 college seniors are calculated to be 2.6 and 0.3, respectively. Find a

95% confidence interval for the mean of the entire senior class.

(b) How large a sample is required in (a) if we want to be 95% confident of is off by

less than 0.05?

Solution

Let be the mean of the entire senior class.

Given: n = 36, 6.2x , s = 0.3, (1 - ) = 0.95 05.0

(a) A 95% confidence interval estimate for the is

36

3.096.16.2

36

3.096.16.2

025.0025.0

nzx

nzx

698.2502.2

(b) Let 1n be the required sample size.

To be 95% confident that if off by less than 0.05 would imply

05.03.0

96.105.0

11

025.0

nnz

13930.13805.0

)3.0)(96.1(2

1

n

86

Note

The following table gives the summary for constructing (1 )% confidence interval for

mean (single mean and difference of means) for large and small samples

Estimating Conditions Formula

Mean Large samples (n 30)

OR is known nZX

nZX

22

Mean Small samples and

unknown n

stX

n

stX

2,

2,

v=n-1

Difference of means Large sample OR 1 and

2 are known 2

2

1

2

1

221

)(

nnZXX

Difference of means

Small sample & 1 and

2 are unknown, assume

21

212

,21

11)()(

nnstXX p

221 nn ,

pooled estimate of sample standard deviation:

2

)1()1(

21

2

22

2

11

nn

snsns p

Difference of means

Small sample & 1 and

2 are unknown,

assume 1 2

2 2

1 21 2 ,

21 2

( )s s

X X tn n

1

)/(

1

)/(

)//(

2

2

2

2

2

1

2

1

2

1

2

2

2

21

2

1

n

ns

n

ns

nsns

Difference of means Paired observations , / 2d

v

sd t

n ; 1 2d x x

and v=n-1

Example 4.5

The contents of seven similar containers of sulfuric acid are 9.8, 10.2, 10.4, 9.8, 10.0,

10.2 and 9.6 liters. Find a 95% confidence interval for the mean of all such containers,

assuming an approximate normal distribution.

Solution Let be the mean of all such containers.

Given: n = 7 70x 48.7002x

87

107

70

n

xx 08.0

6

77048.700

1

)( 2222

n

n

xx

s

s = 2828.008.0 ; (1 - ) = 0.95 05.0 , 447.2025.0,6 t

A 95% confidence interval estimate for the is

7

2828.0447.210

7

2828.0447.210025.0,6025.0,6

n

stx

n

stx

262.10738.9

Example 4.6

A standardized chemistry test was given to 50 girls and 75 boys. The girls made an

average grade of 76 with a standard deviation of 6, while the boys made an average

grade of 82 with a standard deviation of 8. Find a 96% confidence interval for the

difference 1 and 2, where 1 is the mean score of all boys and 2 is the mean score of

all girls who might take this test.

Solution

Given: 751 n , 502 n , 821 x , 81 s , 762 x , 62 s ,

04.096.)1(

A 96% confidence interval for 21 is:

2

2

2

1

2

102.02121

2

2

2

1

2

102.021

)(

)(

nnzxx

nnzxx

50

6

75

805.2)7682(

50

6

75

805.2)7682(

22

21

22

,

301 n & 302 n , so 11 s & 22 s

57.843.3 21

Example 4.7

In a batch chemical process, two catalysts are being compared for their effect on the

output of the process reaction. A sample of 12 batches is prepared using catalyst 1 and

a sample of 10 batches was obtained using catalyst 2. The 12 batches for which catalyst

1 was used gave an average yield of 85 with a sample standard deviation of 4, while the

88

average for the second sample gave an average of 81 and a sample standard deviation of

5. Find a 90% confidence interval for the difference between the population means,

assuming the populations are approximately normally distributed with equal variances.

Solution

Let 1 and 2 be the mean population yield using catalyst 1 and catalyst 2, respectively.

Given: 121 n , 102 n , 851 x , 41 s , 812 x , 52 s ,

10.090.)1( , 2021012221 nn , 725.105.0,20 t

pooled estimate of sample standard deviation 2

)1()1(

21

2

22

2

11

nn

snsns p

478.421012

5)110(4)112( 22

A 90% confidence interval for 21 is:

21

05.0,202121

21

05.0,2021

11)()(

11)()(

nnstxx

nnstxx pp

10

1

12

1)478.4)(725.1()8185(

10

1

12

1)478.4)(725.1()8185( 21

31.769.0 21

Example 4.8

The weight of 10 adults selected randomly before and after a certain new diet was

introduced was recorded as follows:

Adult Before ( 1x ) After ( 2x ) Difference

1 76 81 -5

2 60 52 8

3 85 87 -2

4 58 70 -12

5 91 86 5

6 75 77 -2

7 82 90 -8

8 64 63 1

9 79 85 -6

10 88 83 5

Find a 98% confidence interval for the mean difference in weight.

89

Solution

idd

n

= -1.6 2

2( ( 1.6))

40.71

i

d

ds

n

For v = n-1 = 9; 0.01 2.821t .

A 98% confidence interval is 6.38

1.6 (2,821)10

That is 7.29 4.09d

Exercise 4.1

4.1.1 What is the value of the finite population correction factor for when (i) n = 10

and N = 1000; (ii) n = 5 and N = 200?

4.1.2 How many different samples of size n=2 can be chosen from a finite population of

size (i) N=25; (ii) N=6?

4.1.3 When we sample from an infinite population, what happens to the standard error

of the mean if the sample size is (a) increased from 50 to 200 (b) decreased from

225 to 25?

4.1.4 The mean of a random sample size n=25 is used to estimate the mean of an infinite

population with the s.d. = 2.4. What can we assert about the probability that the

error will be less than 1.2, if we use the central limit theorem?

4.1.5 A random sample of size 100 is taken from an infinite population having the mean

= 76 and the variance of 2 = 256. What is the probability that x will be between

75 and 78?

4.1.6 A random sample of size n = 36 is taken from an infinite population with mean =

63 and the variance of 2 = 81. What can we assert about the probability of getting

a sample mean greater than 66.75, if we use the central limit theorem?

4.1.7 A research worker wants to determine the average time it takes a mechanic to

rotate the tires of a car and she wants to be able to assert with 95% confidence

that the mean of her sample is off by at most 0.50 minute. If she can presume

from past experience that = 1.6 minutes, how large a sample will she have to

take?

90

4.1.8 If we want to determine the average mechanical aptitude of a large group of

workers, how large a random sample will we need to be able to assert with

probability 0.95 that the sample mean will not differ from the true mean by more

than 3.0 points? Assume that it is known from past experience that = 20.0.

4.1.9 What is the maximum error one can expect to make with probability 0.90 when

using the mean of a random sample of size n = 64 to estimate the mean of a

population with 2 = 2.56?

4.1.10 A random sample of 40 drums of a chemical, drawn from among 200 such drums

whose weights can be expected to have the = 12.2, has a mean weight of 240.8

pounds. If we estimate the mean weight of all 200 drums as 240.8 pounds, what

can we assert with 99% confidence about the maximum error?

4.1.11 To estimate the average time it takes to assemble a certain computer

component, the industrial engineer at an electronics firm timed 40 technicians in

the performance of this task, getting a mean of 12.73 minutes with a standard

deviation of 2.06 minutes. Construct a 98% confidence interval for the true

average time it takes to assemble the computer component.

4.1.12 Ten bearings made by certain process have a mean diameter of 0.5060 cm with a

standard deviation of 0.0040 cm. Assuming that the data may be looked upon as a

random sample from a normal population, construct 95% and 99% confidence

interval for the actual average diameter of bearings made by this process.

4.10 Statistical Hypothesis

Any statement regarding the characteristics of the population or the parameter of the

population or the nature of the population probability distribution is known as a

Hypothesis.

For example: i) the ratio of women for every 1000 men is 937 in India, ii) the literacy rate

in India is 56.7%, iii) there is substantial growth in prices of the infrastructure goods in

the past five years in India., and so on.

Further the hypothesis is classified into two types namely: Simple hypothesis any

hypothesis which specifies the population completely or absolutely is known as simple

hypothesis, otherwise the hypothesis is said to be Composite or Comparative

91

hypothesis. In the above example (i) & (ii) are the simple hypothesis and (iii) is the

composite hypothesis.

4.11 Test of Hypothesis

The Test of Hypothesis is nothing but a two-action decision problem, either to accept

the hypothesis or to reject the hypothesis.

4.12 Test of Significance

The test of significance is to test the deviation between

i) the parameter and statistic, or

ii) the difference between two independent statistics, is just because of

chance variation or assignable variation.

4.13 Null Hypothesis

Any statement or hypothesis, which gives no difference between the parameter and a

statistic or no differences in two independent statistics, is known as Null Hypothesis.

According to Prof. Fisher, Null hypothesis is a hypothesis, which is tested for possible

rejection under the assumption that it is true. The null hypothesis is always denoted by H0:

4.14 Alternative Hypothesis

Any statement or hypothesis, which gives a difference between the parameter and a

statistic or difference in between two independent statistics, is known as an alternative

hypothesis. The alternative hypothesis is always denoted by H1

4.15 Errors in Sampling

There are four possible situations in any testing of hypothesis procedure and are given in the following table:

0H is correct 0H is wrong

Accept 0H Correct decision Type II Error

Reject 0H Type II error Correct decision

92

In the above table, there are two types of errors occurred in sampling, while testing an

hypothesis problem, which are given by

Type I Error: Rejecting the Null Hypothesis H0, when it is true.

Type II Error: Accepting the Null Hypothesis H0, when it is false.

4.16 Sizes of Errors

The probabilities of occurring the errors are known as size of the errors and are

respectively given by:

Size of Type-I Error = Prob. {Type I Error} = Prob. {Reject H0 | H0} =

Size of Type-II Error = Prob. {Type II Error} = Prob. {Accept H0 | H1} =

Here , is known as Level of significance and , is known as power of a test.

4.17 Critical Region and Critical Value

The region of rejecting the null hypothesis is known as critical region and is denoted by

. Generally, the critical region is also known as the level of significance, and is fixed in

advance and is either 5% or 1%.

A value, which separates out the acceptance region and rejection region, is known as

critical value.

4.17.1 Significant or Critical Values

The following table represents the significant values at different levels of significance.

Z Test Level of Significance (%)

1% 5% 10%

Two-tailed Test 2.58 1.96 1.645

Right-tailed Test 2.33 1.645 1.25

Left-tailed Test -2.33 -1.645 -1.25

Note:

The significant values for small sampling techniques ( i.e. using t-Test, F-Test and Chi-

Square (2) Test) depends upon the size of the sample.

93

4.18 Hypothesis Testing Terminology

1. Null hypothesis, H0: A hypothesis that is held to be true until very strong

evidence to the contrary is obtained. H0 : 0

2. Alternative hypothesis, H1: It is a hypothesis that is complement to the null

hypothesis. Hence it will be accepted if the null hypothesis is rejected.

1 0:H (two-tail test)

1 0:H (One-tail test)

1 0:H (One-tail test)

In the one-tail test we have some expectation about the direction of the error

when the null hypothesis is wrong, while in the two-tail test we dont have such

expectation.

3. Test statistics is the value, based on the sample, used to determine whether the

null hypothesis should be rejected or accepted.

4. Critical region is a region in which if the test statistic falls the null hypothesis will

be rejected.

5. Types of error

(a) Type I error: Reject H0 when H0 is true

(b) Type II error: Accept H0 when Ha is true

6. The significance level, is the probability of committing a Type I error, i.e.,

P(Type I error) = .

The probability of committing a Type II error is ; i.e., P(Type II error) = .

4.19 Basic Steps in Testing Hypothesis

1. Formulate the null hypothesis.

2. Formulate the alternative hypothesis.

3. Specify the level of significance to be used.

4. Select the appropriate test statistic and establish the critical region.

5. Compute the value of the test statistic.

6. Conclusion: Reject H0 if the statistic has a value in the critical region, otherwise

accept H0.

94

4.20 Tests Concerning Means

The tests concerning means (for large and small samples) are summarized in the

following table.

H0 Conditions Test statistic

0 Large samples (n 30) OR is known n

xz

0

Small samples and unknown ns

xt 0

with 1 n

021 d Large samples OR 1 and

2 are known

2

2

2

1

2

1

021 )(

nn

dxxz

Small sample & 1 and 2

are unknown, assume

21

21

021

11

)(

nns

dxxt

p

with 221 nn and

2

)1()1(

21

2

22

2

112

nn

snsnsp

if 1 = 2 but unknown

Small sample & 1 and 2

are unknown,

assume 1 2

2

2

2

1

2

1

021 )(

n

s

n

s

dxxt

with

1

)/(

1

)/(

)//(

2

2

2

2

2

1

2

1

2

1

2

2

2

21

2

1

n

ns

n

ns

nsns

021 d Paired observations

ns

ddt

d

0 with 1 n

Example 4.9

A manufacturer of sports equipment has developed a new synthetic fishing line that he

claims has a mean breaking strength of 8 kilograms with a standard deviation of 0.5

kilogram. Test the hypothesis that = 8 kilograms against the alternative that 8

kilograms if a random sample of 50 lines is tested and found to have a mean breaking

strength of 7.8 kilograms. Use a 0.01 level of significance.

95

Solution

Null hypothesis: 8 kilograms

Alternative hypothesis: 8 kilograms (Two tailed test alternative)

Level of significance: 0.01

Critical region: Z > 58.2005.0 z or Z < 58.2005.0 z

Computation: n = 50 8.7x 5.0

828.2

505.0

88.7

n

xz

Conclusion: As the sample z (= -2.828) falls inside the critical region, so reject the null

hypothesis at 0.01 level of significance and conclude that is significantly smaller than

8 kilograms.

Example 4.10

The average length of time for students to register for fall classes at a certain college has

been 50 minutes with a standard deviation of 10 minutes. A new registration procedure

using modern computing machines is being tried. If a random sample of 12 students had

an average registration time of 42 minutes with a standard deviation of 11.9 minutes

under the new system, test the hypothesis that the population mean is now less than

50, using a level of significance of (1) 0.05, and (2) 0.01. Assume the population of times

to be normal.

Solution

Let be the population mean time for students to register in the new registration

procedure.

Null hypothesis: 50 minutes

Alternative hypothesis: 50 minutes (left tailed test alternative)


Critical region: (n = 12 < 30; and the new is unknown, so t-test should be used)

degree of freedom ( ) = n -1 = 12 -1 =11

96

t < 796.105.0,11 t

Computation:

n = 12 42x s = 11.9

329.2

129.11

5042

ns

xt

Conclusion:

(1) As sample t (= -2.329) falls inside the critical region, so reject the null hypothesis

at 0.05 level of significance and conclude that is significantly smaller than 50

minutes.

(2) Identical with those of (1) except the critical region would be replaced by:

718.201.0,11 tt

and the corresponding conclusion would be changed as follows:

As sample t (= -2.329) falls outside the critical region, so reject the alternative

hypothesis at 0.01 level of significance and conclude that is not highly

significantly smaller than 50 minutes.

Example 4.11

An experiment was performed to compare the abrasive wear of two different laminated

materials. Twelve pieces of material 1 were tested, by exposing each piece to a machine

measuring wear. Ten pieces of material 2 were similarly tested. In each case, the depth

of wear was observed. The samples of material 1 gave an average (coded) wear of 85

units with a standard deviation of 4, while the samples of material 2 gave an average of

81 and a standard deviation of 5. Test the hypothesis that the two types of material

exhibit the same mean abrasive wear at the 0.10 level of significance. Assume the

populations to be approximately normal with equal variances.

Solution:

Let 1 and 2 be the mean abrasive wear of material 1 and 2 respectively.

Null hypothesis: 21 , i.e. 021

Alternative hypothesis: 21 , i.e. 021


97

Critical region: (As both 1n and 2n are smaller than 30 and their standard deviations are

unknown, so t-test has to be used.)

2021012221 nn , 725.105.0,20 tt or 725.105.0,20 tt

Computation:

121 n 851 x 41 s

102 n 812 x 52 s

05.2021012

5)9(4)11(

2

)1()1( 22

21

2

22

2

112

nn

snsns p

086.2

10

1

12

105.20

0)8185(

11

)()(

21

2121

nns

xxt

p

Conclusion: As the sample t (=2.086) falls inside the critical region, so reject the null

hypothesis at 0.10 level of significance and conclude that the mean abrasive wear of

material 1 is significantly higher than that of the material 2.

Example 4.12

Five samples of a ferrous-type substance are to be used to determine if there is a

difference between a laboratory chemical analysis and an X-ray fluorescence analysis of

the iron content. Each sample was split into two sub-samples and the two types of

analysis were applied. Following are the coded data showing the iron content analysis:

Sample

Analysis 1 2 3 4 5

x-ray 2.0 2.0 2.3 2.1 2.4

Chemical 2.2 1.9 2.5 2.3 2.4

Solution:

Assuming the populations normal, test at the 0.05 level of significance whether the two

methods of analysis give, on the average, the same result.

Let 1 and 2 be the mean iron content determined by the laboratory chemical

analysis and X-ray fluorescence analysis respectively; and

D be the mean of the population of differences of paired measurements.

98

Null hypothesis: 21 or 0D

Alternative hypothesis: 21 or 0D


Critical region: (As n = 5 < 30, so t-test should be used.)

776.2025.0,4 tt or 776.2025.0,4 tt

Computation:

Sample

Analysis 1 2 3 4 5

x-ray 2.0 2.0 2.3 2.1 2.4

Chemical 2.2 1.9 2.5 2.3 2.4

id -0.2 0.1 -0.2 -0.2 0

2

id 0.04 0.01 0.04 0.04 0

5.05

1

i

id 13.05

1

2 i

id

1.05

5.0

5

dd

02.0)4)(5(

)5.0()13.0)(5(

)1(

222

2

nn

ddnsd

5811.1

502.0

0)1.0(

n

s

dt

d

D

Conclusion: As the sample t (=-1.5811) falls outside the critical region, so reject the

alternative hypothesis at 0.05 level of significance and conclude that there is no

significant difference in the mean iron content determined by the above two analyses.

4.21 Chi-square Tests

The various tests of significance studied earlier such that as Z-test, t-test, F-test was

based on the assumption that the samples were drawn from normal population. Under

this assumption the various statistics were normally distributed. Since the procedure of

testing the significance requires the knowledge about the type of population or

parameters of population from which random samples have been drawn, these tests are

99

known as parametric tests.

But there are many practical situations in which the assumption of any kind about the

distribution of population or its parameter is not possible to make. The alternative

technique where no assumption about the distribution or about parameters of

population is made is known as non-parametric tests. Chi-square test is an example of

the non parametric test. Chi-square distribution is a distribution free test.

Chi-square distribution was first discovered by Helmert in 1876 and later independently

by Karl Pearson in 1900. The range of chi-square distribution is 0 to .

The square of a standard normal variate is known as chi-square variate with one degree

of freedom. Symbolically we can write it as

test is used for enumeration data (the data obtained by enumeration or counting is

called enumeration data. For example, number of blue flowers, number of intelligent

boys, number of curled leaves, etc.,) which generally relate to discrete variable where as

t-test and standard normal deviate tests are used for measure mental data which

generally relate to continuous variable.

test can be used to know whether the given objects are segregating in a theoretical

ratio or whether the two attributes are independent in a contingency table.

a) Conditions for the validity of test:

The validity of -test of goodness of fit between theoretical and observed, the

following conditions must be satisfied.

i) The sample observations should be independent

ii) Constraints on the cell frequencies, if any, should be linear oi =ei

iii) N, the total frequency should be reasonably large, say greater than 50

100

iv) If any theoretical (expected) cell frequency is < 5, then for the application of chi-

square test it is pooled with the preceding or succeeding frequency so that the pooled

frequency is more than 5 and finally adjust for the d.f. lost in pooling.

b) Applications of Chi-square Test:

1. testing the independence of attributes

2. to test the goodness of fit

3. testing of linkage in genetic problems

4. comparison of sample variance with population variance

5. testing the homogeneity of variances

6. testing the homogeneity of correlation coefficient

4.21.1 Goodness-of-fit Test

A test to determine if a population has a specified theoretical distribution. The test is

based on how good a fit we have between the frequency of occurrence of observations

in an observed sample and the expected frequencies obtained from the hypothesized

distribution.

A goodness-of-fit test between observed and expected frequencies is based on the

quantity test2

2

( )O E

E

i i

i

where test2 is a value of the random variable whose sampling distribution is

approximated very closed by the Chi-square distribution,

Oi is the observed frequency of cell i,

Ei is the expected frequency of cell i.

The number of degrees of freedom in a Chi-square goodness-of-fit test is equal to the

number of cells minus the number of quantities obtained from the observed data that

are used in the calculations of the expected frequencies.

For a level of significance equal to , test2 2 constitutes the critical region. The

decision criterion described here should not be used unless each of the expected

frequencies is at least equal to 5.

101

Example 4.13 Consider the tossing of a die 120 times.

Faces

1 2 3 4 5 6

Observed 20 22 17 18 19 24 Expected

By comparing the observed frequencies with the expected frequencies, one has to

decide whether the die is fair die or not.

Null hypothesis: the die is a fair die, i.e. 6

1)( iXP for i = 1, 2, 3, 4, 5, and 6

Alternative hypothesis: the die is not a fair die


Critical region: 5161 n ; 07.112 05.0,52

Computation:

Expected value = 20)6

1(120)( iXnP

I 1 2 3 4 5 6

Observed )( iO 20 22 17 18 19 24

Expected )( iE 20 20 20 20 20 20

ii EO 0 2 -3 -2 -1 4

6

1

2

2 )(

i i

ii

E

EO

7.120

4

20

)1(

20

)2(

20

)3(

20

2

20

0 222222

Conclusion: As the sample 2 (= 1.7) falls outside the critical region, so reject the

alternative hypothesis and conclude that the die is a fair die.

4.21.2 Test for Independence

The Chi-square test procedure can also be used to test the hypothesis of independence

of two variables/attributes. The observed frequencies of two variables are entered in a

two-way classification table, or contingency table.

The expected frequency of the cell in the ith row and jth column in the contingency table.

102

The degrees of freedom for the contingency table is equal to (r 1) (c 1) where r is the

number of rows and c is the number of columns in the table.

Example 4.14

Suppose that we wish to study the relationship between grade point average and

appearance of 150 students of a college, whose analysis are given below. Use the level

of significance is 0.05

Grade Point Average

Appearance 1 2 3 4 Totals

Attractive 14 ( ) 11 ( ) 10 ( ) 5 ( ) 40 Ordinary 10 ( ) 16 ( ) 16 ( ) 14 ( ) 56 Unattractive 3 ( ) 4 ( ) 7 ( ) 10 ( ) 24

Totals 27 31 33 29 120

Null hypothesis: There is no relationship between grade point average and appearance.

That is, the two characteristics are independent.

Alternative hypothesis: There is a relationship between grade point average and

appearance. That is, the two characteristics are not independent.


Critical region: 2 05.0,2

, where = (r -1)(c - 1)

Computation:

Grade Point Average

Appearance 1 2 3 4 Totals

Attractive 14

(9)

11

(10.33)

10

(11)

5

(9.67)

40

Ordinary 10 ( 12.6)

16 ( 14.47)

16 (15.4)

14 (13.53)

56

Unattractive 3

(5.4)

4

(6.2)

7

(6.6)

10

(5.8)

24

Totals 27 31 33 29 120

67.9

)67.95(

11

)1110(

33.10

)33.1011(

9

)914()( 222222

E

EO

53.13

)53.1314(

4.15

)4.1516(

47.14

)47.1416(

6.12

)6.1210( 2222

103

818.108.5

)8.510(

6.6

)6.67(

2.6

)2.64(

4.5

)4.53( 2222

Conclusion: Since 596.122 05.0,6 , so sample 2 (=10.818) falls outside the critical

region. So reject the alternative hypothesis and conclude that there is no evidence to

support there is relationship between grade point average and appearance.

4.21.3 Test for Homogeneity

To test the hypothesis that several population proportions are equal.

The approach for the test of homogeneity is the same as for the test of independence of

variables/attributes.

Example 4.15

A study of the purchase decisions for 3 stock portfolio managers, A, B, and C was

conducted to compare the rates of stock purchases that resulted in profits over a time

period that was less than or equal to one year. One hundred randomly selected

purchases obtained for each of the managers showed the results given in the table. Do

these data provide evidence of differences among the rates of successful purchases for

the three portfolio managers? Test with .05.0

Result

Manager

A B C

Purchases show profit 63 71 55

Purchase do no show profit 37 29 45

Total 100 100 100

Null hypothesis: the rates of stock purchases that resulted in profit were the same for

the three stock portfolio managers

Alternative: their rates were not all the same


Critical region: 2)13)(12( ; 991.52 05.0,22

Computation:

104

Result

Manager

A B C Total

Purchases show profit 63

(63) 71

(63) 55

(63) 189

Purchase do no show profit 37

(37) 29

(37) 45

(37) 111

Total 100 100 100 300

49.537

)3745(

37

)3729(

37

)3737(

63

)6355(

63

)6371(

63

)6363( 2222222

Conclusion: As the sample 2 (= 5.49) falls outside the critical region so reject the

alternative hypothesis and conclude that there is no sufficient evidence to support the

rates of purchases resulted in profit of the three portfolio managers were different.

Exercise 4.2

4.2.1 According to the norms established for a mechanical aptitude test, persons who

are 18 years old should average 73.2 with standard deviation of 8.6. If 45

randomly selected persons of that age averaged 76.7, test the null hypothesis =

73.2 against the alternative hypothesis > 73.2 at the 0.01 level of significance.

4.2.2 In 64 randomly selected hours of production, the mean and standard deviation of

the number of acceptable pieces produced by an automatic stamping machine is

1,038 and 146 respectively. At the 0.05 level of significance does this enable us to

reject the null hypothesis = 1000 against the alternative hypothesis >1000?

4.2.3 A random sample of 6 steel beams has a mean compressive strength of 58,392psi,

with a standard deviation of 648psi. Use this information and the level of

significance of = 0.05 to test whether the true average compressive strength of

the steel beam from which this sample came is 58,000psi. Assume normality.

4.2.4 Five measurements of tar content of a certain kind of cigarette yielded 14.5, 14.2,

14.4, 14.3 and 14.6 mg per cigarette. Show that the difference between the mean

of this sample, = 14.4 and the average tar claimed by the manufacturer = 14.0,

is significant at = 0.05. Assume normality.

4.2.5 A manufacturer of gunpowder has developed a new powder which is designed to

produce a muzzle velocity equal to 3000 ft./sec. Eight shells are loaded with the

charge and the muzzle velocities measured. The resulting velocities are as: 3005,

105

2935, 2965, 2995, 3905, 2935 and 2905. Do these data present sufficient evidence

to indicate that the average velocity differs from 3000ft/sec?

4.2.6 An investigation of two kinds of photocopying equipment showed that 71 failures

of the first kind of equipment took on the average 83.2 minutes to repair with a

standard deviation of 19.3 minutes, while 75 failures of the second kind of

equipment took on the average 90.8 minutes to repair with a standard deviation of

21.4 minutes. Test the null hypothesis (the hypothesis that on the

average it takes an equal amount of time o repair either kind of equipment)

against the alternative hypothesis at the level of significance of =

0.05

4.2.7 Suppose that we want to investigate whether on the average men earn more than

$20 per week more than women in a certain industry. If sample data show that 60

men earn on the average per week with a standard deviation of s1 =

$15.60, while 60 women earn on the average per week with a

standard deviation of s2 = $18.20, what can we conclude at the 0.01 level of

significance?

4.2.8 Measuring specimens of nylon yarn taken from two spinning machines, it was

found that 8 specimens from the first machine had a mean denier of 9.67 with a

standard deviation of 1.81, while 10 specimens from the second machine has a

mean denier of 7.43 with a standard deviation of 1.48. Assuming that the

populations sampled are normal and have the same variance, test the null

hypothesis against the alternative hypothesis at the

level of significance of = 0.05.

4.2.9 The following are the Brinell hardness values obtained for samples of two

magnesium alloys:

Alloy 1: 66.3 63.5 64.9 61.8 64.3 64.7 65.1 64.5 68.4 63.2

Alloy 2: 71.3 60.4 62.6 63.9 68.8 70.1 64.8 68.9 65.8 66.2

Use the 0.05 level of significance to test null hypothesis against the

alternative hypothesis .

4.2.10 A genetical law says that children having one parent of blood group M and other

parent of blood group N will always be one of the three groups M, MN and N;

and that the average number of children in these three groups will be in the ratio

106

1:2:1. The report on an experiment states as follows: Of 162 children having

one M Parent, and one N parent, 28% were found to be of group M, 42% of

group MN and the rest of them are group N. Do the data in the report conform

to the expected genetic ratio 1:2:1? (Given value of Chi-square at 2, 0.05 is 5.991)

4.2.11 A chemical extraction plant processes sea water to collect sodium chloride and

magnesium. It is known that sea water contains sodium chloride, magnesium

and other elements in the ratio of 62:4:34. A sample of 200 tones of sea water

has resulted in 130 tones of sodium chloride, 6 tones of magnesium and 64 tones

of other elements. Are these data consistent with the known composition of sea

water at 5% level?

4.2.12 The following table shows the number of aircraft accidents that occurred during

the six days of a week. Find whether the accidents are uniformly distributed over

the week? Given value of 2 = 11.070 with 2, 0.05

Day Mon Tue Wed Thu Fri Sat Total

No. of Accidents

14 18 12 11 15 14 84

4.2.13 200 digits were chosen at random from a set of tables. The frequencies of the

digits were as follows:

Digit 0 1 2 3 4 5 6 7 8 9 Total

F 18 19 23 21 16 25 22 20 21 15 200

Use Chi-square test; to assess the correctness of the hypothesis that the digits

were distributed in equal number in the table, given the value of chi-square at 9,

0.05 is 16.9

4.2.14 In an experiment on immunization of cattle from tuberculosis, the following

results were obtained:

Affected Unaffected

Inoculated 12 28

Not Inoculated 13 7

Examine the effect of vaccine in controlling the incidence of the disease.

4.2.15 Can vaccination be regarded as a preventive measure of small-pox as evidenced

by the following data: Of 1,482 persons exposed to small pox in a locality, 368 in

107

all were attacked. Of these 1,482 persons, 343 were vaccinated and of these,

only 35 were attacked.

4.2.16 Out of 8000 graduates in a city 800 are females; out of 1600 graduate employees

120 are females. Use 2 test to determine if any distinction is made in

appointment on the basis of gender. (Value of 2 is 3.84 for 1d.f., at 0.05 level)

Date post:	25-Nov-2015
Category:	Documents
Upload:	knmounika2395
View:	242 times
Download:	9 times

11 BS201 Prob and Stat - Ch4

Documents