Two Proportion z-Interval and z-Tests

transcript

Small differences.

Recall from Random VariablesIf X and Y are two independent Random

Variables from Normal distributions, we can combine them to get a new Random Variable that is also from a Normal distribution.

Standard Deviation:2 2( ) ( ) ( )SD X Y SD X SD Y

( ) ( ) ( )E X Y E X E Y

Differences between two proportionsWe can create a confidence interval for the

difference between the two proportions.Conditions:

1.Randomization

2. Independence: Independence between the two groups

3.10% Condition: check for both groups

4.Success/Failure: check for both groups separately, use and

All conditions have been met to use a Normal model for the differences in two proportions.

1p̂ 2p̂

Two-proportion z-intervalSince the Normal model is being used we need a

mean and standard deviation(error).Mean: Standard Error:

Be very careful with the conclusion.

1 2ˆ ˆp p

1 1 2 2

ˆ ˆ ˆ ˆp q p q

1 1 2 21 2

ˆ ˆ ˆ ˆˆ ˆ( ) *

p q p qp p z

Example: Gender and Binge DrinkingCollege students were randomly selected and asked

about how much alcohol they consumed on a weekly basis. Over a certain amount of alcohol consumed was considered binge drinking and those individuals were labeled frequent binge drinkers. The following table shows the number of men and women who were frequent binge drinkers. What is the difference in the proportions of male and female binge drinkers?

Frequent Binge Drinker (x)

Total Questioned(n)

Male 1392 5348Female 1748 8471

Conditions:

1.Randomization: Stated as random samples

2. Independence: It is safe to assume that men and women are independent of each other.

3.10% Condition: 5348 male college students is less than 10% of all male college students

8471 female college students is less than 10% of all female college students

4.Success/Failure:

All conditions have been met to use a Normal model for the differences of two proportions.

ˆ 1392 10

ˆ 3956 10

ˆ 1748 10

ˆ 6723 10

Two-proportion z-interval at 95% confidence

=(0.039, 0.069)

We are 95% confident that the true proportion of men who are frequent binge drinkers is between 3.9% to 6.9% higher than the proportion of women who are frequent binge drinkers.

1 1 2 21 2

ˆ ˆ ˆ ˆˆ ˆ( ) *

p q p qp p z

1392 3956 1748 67231392 1748 5348 5348 8471 8471

1.965348 8471 5348 8471

Comparing 2 Group ProportionsThree possibilities:

1.The two proportions are the same.

2.The two proportions are different.

3.One proportion is larger than the other.

The null hypothesis would be that they are the same.

We’ll then run a test to see if there are different(two-tail) or one is larger than the other(one-tail)

HypothesesThe null should be that there is no difference

between the two proportions or that they are equal.

H0: p1 = p2

But we have no probability model for this situation.Since both proportions come from independent(we

hope) Normal distributions, the difference of the two would be a new Normal distribution.

H0: p1 – p2 = 0We can test against this null hypothesis now.

Alternative HypothesisThe alternative hypotheses are then:

p1 – p2 < 0

The proportion of group 1 is lower than group 2 p1 – p2 > 0

The proportion of group 1 is higher than group 2 p1 – p2 ≠ 0

The proportion of group 1 is not the same as group 2

Conditions1. Randomization

2. Independence: Independence between the two groups

3. 10% Condition: check for both groups

4. Success/Failure: check for both groups separately

All conditions have been met to use a Normal model for the difference of two proportions

MechanicsWe need to calculate a pooled proportion from

the two we are given. The assumption is that they are equal, so given:

The pooled proportion is:

This will is used to calculate a standard deviation for the two proprotions.

1 21 2

ˆ ˆ;x x

p pn n

ˆpooled

Pooled Standard Deviation:

Test Statistic:

P-Value = P(z <,> ___)

ˆ ˆ ˆ ˆˆ( ) pooled pooled pooled pooled

pooled

p q p qSE p

1 2 1 2

ˆ ˆ 0 ˆ ˆ

ˆ( ) ˆ ˆ ˆ ˆpooled pooled pooled pooled pooled

p p p pz

SE p p q p q

ConclusionCompare P-Value to stated alpha. Talks about difference between the two

proportions only, not the values of the individual proportions.

Example: Binge DrinkersIs there evidence that male college students have a

higher proportion of frequent binge drinking than female college students?

Hypothesis:

H0: p1 – p2 = 0 The proportion of male and female college students that are frequent binge drinkers is the same.

HA: p1 – p2 > 0 The proportion of male college students that are frequent binge drinkers is more than female college students.

Conditions:

1. Randomization: Stated as random samples

2. Independence: It is safe to assume that men and women are independent of each other.

3. 10% Condition: 5348 male college students is less than 10% of all male college students

8471 female college students is less than 10% of all female college students

4. Success/Failure:

All conditions have been met to use a Normal model for the differences of two proportions.

ˆ 1392 10

ˆ 3956 10

ˆ 1748 10

ˆ 6723 10

Mechanics:1

1392ˆ5348

1748ˆ8471

n 0.05

1392 1748 3140ˆ5348 8471 13819pooled

1392 17485348 8471 7.369

3140 10679 3140 1067913819 13819 13819 13819

5348 8471

1 2ˆ ˆ 0

ˆ( )pooled

14( 7.369) 8.66 10 0P Value P z

Conclusion:

Since the P-Value is so much smaller than alpha (8.66 x 10-14 < 0.05), we reject the null hypothesis. There is statistically significant evidence that the proportion of male college students who are frequent binge drinkers is more than female college students.

Two Proportion z-Interval and z-Tests

Documents