+ All Categories
Home > Documents > Chapter 12

Chapter 12

Date post: 03-Jan-2016
Category:
Upload: ira-levy
View: 23 times
Download: 0 times
Share this document with a friend
Description:
Chapter 12. Analysis of Variance. Goals. List the characteristics of the F distribution Conduct a test of hypothesis to determine whether the variances of two populations are equal Discuss the general idea of analysis of variance Organize data into a ANOVA table - PowerPoint PPT Presentation
Popular Tags:
36
1 Chapter 12 Analysis of Variance
Transcript

1

Chapter 12

Analysis of Variance

2

Goals1. List the characteristics of the F distribution

2. Conduct a test of hypothesis to determine whether the variances of two populations are equal

3. Discuss the general idea of analysis of variance

4. Organize data into a ANOVA table

5. Conduct a test of hypothesis among three or more treatment means

3

F Distribution1. Used to test whether two

samples are from populations having equal variances

2. Applied when we want to compare several population means simultaneously to determine if they came from equal population ANOVA

Analysis of variance In both situations:

Populations must be Normally distributed

Data must be Interval-scale or higher

4

Characteristics Of The F Distribution1. Family of F Distributions

1. Each is determined by:1. df in numerator

comes from pop. 1 which has larger sample variation

2. df in denominator comes from pop. 2 which has smaller sample variation

2. F distribution is continuous F Value can assume an infinite number of values from 0 to ∞

3. Value for F Distribution cannot be negative Smallest value = 0

4. Positively skewed1. Long tail is always to right2. As # of df increases in both the numerator and the denominator,

the distribution approaches normal

5. Asymptotic1. As X increases the F curve approaches the X-axis

5

Why Do We Want To Compare To See If Two Population Have Equal Variances?

What if two machines are making the same part for an airplane?

Do we want the parts to be identical or nearly identical? Yes!

We would test to see if the means are the same: Chapter 10 & 11

We would test to see if the variation is the same for the two machines: Chapter 12

What if two stocks have similar mean returns? Would we like to test and see if one stock has more

variation than the other?

6

Why Do We Want To Compare To See If Two Population Have Equal Variances?

Remember Chapter 11: Assumptions for small sample tests of means:

1. Sample populations must follow the normal distribution

2. Two samples must be from independent (unrelated) populations

3. The variances & standard deviations of the two populations are equal

7

Conduct A Test Of Hypothesis To Determine Whether The Variances Of Two Populations Are Equal

To conduct a test: Conduct two random samples List population 1 as the sample with the largest

variance: n1 = # of observations s1^2 = sample variance n1 – 1 = df1 = degree of freedom (numerator for

critical value lookup) List population 2 as the sample with the smaller

variance: n2 = # of observations s2^2 = sample variance n2 – 1 = df2 = degree of freedom (denominator for

critical value lookup)

Always list the sample

with thelarger

sample variance

as population

1 (allows us

touse fewer

tables)

8

Step 1: State null and alternate hypotheses

• List the population with the suspected largest variance as population 1• Because we want to limit the number of F tables

we need to use to look up values, we always put the larger variance in the numerator and the smaller variance in the denominator

• This will force the F value to be at least 1• We will only use the right tail of the F distribution

• Examples of Step 1:

22

211

22

210

:

:

H

H22

211

22

210

:

:

H

H

9

22

211

22

210

:

:

H

H22

211

22

210

:

:

H

H

Step 2: Select a level of significance:

• Appendix G only lists significance levels: .05 and .01

Significance level = .10.10/2 = .05

Use .05 table inAppendix G

Significance level = .05

Use .05 table inAppendix G

10

Step 3: Identify the test statistic (F), find critical value and draw picture

• Look up Critical value in Appendix G and draw your pictureLevel of Significance 0.05

Degrees of Freedom for Numerator (From Pop 1)

1 2 3 4 5 6 71 161 200 216 225 230 234 237

2 18.5 19.0 19.2 19.3 19.3 19.3 19.4

3 10.1 9.55 9.28 9.12 9.01 8.94 8.89

4 7.7 6.94 6.59 6.39 6.26 6.16 6.09

5 6.61 5.79 5.41 5.19 5.05 4.95 4.88

6 5.99 5.14 4.76 4.53 4.39 4.28 4.21

7 5.59 4.74 4.35 4.12 3.97 3.87 3.79

8 5.32 4.46 4.07 3.84 3.69 3.58 3.5

9 5.12 4.26 3.86 3.63 3.48 3.37 3.29

10 4.96 4.1 3.71 3.48 3.33 3.22 3.14

11 4.84 3.98 3.59 3.36 3.2 3.09 3.0112 4.75 3.89 3.49 3.26 3.11 3 2.9113 4.67 3.81 3.41 3.18 3.03 2.92 2.8314 4.6 3.74 3.34 3.11 2.96 2.85 2.7615 4.54 3.68 3.29 3.06 2.9 2.79 2.7116 4.49 3.63 3.24 3.01 2.85 2.74 2.6617 4.45 3.59 3.2 2.96 2.81 2.7 2.61

Deg

rees

of

Fre

edo

m f

or

Den

om

inat

or

(Fro

m P

op

2)

If you have a dfthat is not listedin the border,

calculate your Fby estimating

a valuebetween

two values.HW #5:df = 11,

use valueBetween10 & 12

Book says:(3.14+3.07)/2 =3.105 3.10

11

Step 4

• Step 4: Formulate a decision rule:• Example:• If our calculated test statistic is greater than

3.87, reject Ho and accept H1, otherwise fail to reject Ho

12

Step 5

• Step 5: Take a random sample, compute the test statistic, compare it to critical value, and make decision to reject or not reject null and hypotheses

Test Statistic F:

• Example Conclusion for a two tail test:• Fail to reject null

• “The evidence suggests that there is not a difference in variation”

• Reject null and accept alternate• “The evidence suggests that there is a difference

in variation”

22

21

s

sF

Larger variancein numerator,

always!!

Let’s Look atHandout

Example 1

The mean rate of return on a sample of 8 utility stocks was 10.9 percent with a standard deviation of 3.5 percent. At the .05 significance level, can Colin conclude that there is more variation in the software stocks?

Colin, a stockbroker at Critical Securities, reported that the mean rate of return on a sample of 10 software stocks was 12.6 percent with a standard deviation of 3.9 percent.

Example 1 continued

221

220

:

:

UI

UI

H

H

Step 3: The test statistic is the F distribution.

Step 1: The hypotheses are

Step 2: The significance level is .05.

Example 1 continued

2416.1)5.3(

)9.3(2

2

F

Step 5: The value of F is computed as follows.

H0 is not rejected. There is insufficient evidence to show more variation in the software stocks.

Step 4: H0 is rejected if F>3.68 or if p < .05. The degrees of freedom are n1-1 or 9 in the numerator and n1-1 or 7 in the denominator.

16

ANOVAAnalysis Of Variance

Technique in which we compare three or more population means to determine whether they could be equal Assumptions necessary:

Populations follow the normal distribution Populations have equal standard deviations () Populations are independent

Why ANOVA? Using t-distribution leads to build up of type 1 error

“Treatment” = different populations being examined

17

Case Where Treatment Means Are Different

18

Case Where Treatment Means Are The Same

19

Example Of ANOVA Test To See If Four Treatment Means Are The Equal

22 students earned the following grades in Professor Rad’s class. The grades are listed under the classification the student gave to the instructor

Is there a difference in the mean score of the students in each of the four categories?

Use significance level α = .01# of "Treatments" 1 2 3 4Rating of Instructor Excellent Good Fair Poor

Course Grades94 75 70 6878 90 80 8281 77 76 7280 83 89 73

88 80 7475 6565

20

Conduct A Test Of Hypothesis Among Four Treatment Means

Step 1: State H0 and H1

H0 : µ1 = µ2 = µ3 = µ4

H1 : The Mean scores are not all equal (at least one treatment mean is different)

Step 2: Significance Level?

α = .01

21

Step 3: Determine Test Statistic And Select Critical Value

# of "Treatments" 1 2 3 4Rating of Instructor Excellent Good Fair Poor

Course Grades94 75 70 6878 90 80 8281 77 76 7280 83 89 73

88 80 7475 6565

k = Number of treatments = 4n = Total number of observations from all the treatments = 22Degrees of Freedom in the numerator = k - 1 3Degrees of Freedom in the denominator = n - k 18α = Level of significance = 0.01F from appendix G ((df = 3, 18), α = 0.01) 5.09

22

Step 4: State Decision Rule

If our calculated test statistic is greater than we reject H0 and accept H1, otherwise we fail to reject H0

Now we move on to Step 5: Select the sample, perform calculations, and make a decision…

Are you ready for a lot of procedures?!!

5.09

23

ANOVA TableSources of Variations Sum of Squares

Degrees of Freedoms

Mean Square (Estimate of Variation) F

Treatments SST (k - 1) SST/(k - 1) = MSTError SSE (n - k) SSE/(n - k) = MSE

TotalSS Total(Total Variation) (n - 1)

MST/MSE

The idea is: If we estimate variation in two ways and use one estimate in the numerator and the other estimate in the denominator: If we divide and get 1 or close to 1, the sample means

are assumed to be the same If we get a number far from 1, we say that the means

are assumed to be different The F critical value will determined whether we

are close to 1 or not

24

ANOVA Table So Far

ANOVA Table

Sources of Variations Sum of Squares

Degrees of Freedoms

Mean Square (Estimate of Variation) F

Treatments SST 3 SST/3 = MSTError SSE 18 SSE/18 = MSE

TotalSS Total(Total Variation) 21

MST/MSE

Let’s go calculate this!

25

Calculation 1: Treatment Means and Overall Mean

Mean Grand Mean Overall X G

# of "Treatments" 1 2 3 4Rating of Instructor Excellent Good Fair Poor

Course Grades94 75 70 6878 90 80 8281 77 76 7280 83 89 73

88 80 7475 6565

Treatment Mean 83.25 82.60 76.43 72.33Grand Mean = Overall Mean for all the data = 77.95

26

Calculation 2: Total Variation

Mean OverallX

nObservatio ParticularA X

Variation Total Total Squares of Sum

Total SS )X - (X 2

G

G

27

Calculation 2: Total VariationExample of 1st Example of 2nd94-77.95 = 16.05 16.05 2̂ = 257.6025

Excellent (X - 77.95) (X - 77.95) 2̂ Good (X - 77.95) (X - 77.95) 2̂94 16.05 257.6025 75 -2.95 8.702578 0.05 0.0025 90 12.05 145.202581 3.05 9.3025 77 -0.95 0.902580 2.05 4.2025 83 5.05 25.5025

88 10.05 101.0025

Totals 271.11 281.3125

Fair (X - 77.95) (X - 77.95) 2̂ Poor (X - 77.95) (X - 77.95) 2̂70 -7.95 63.2025 68 -9.95 99.002580 2.05 4.2025 82 4.05 16.402576 -1.95 3.8025 72 -5.95 35.402589 11.05 122.1025 73 -4.95 24.502580 2.05 4.2025 74 -3.95 15.602575 -2.95 8.7025 65 -12.95 167.702565 -12.95 167.7025

Totals 373.9175 358.615

Total Variation = SS Total = 1284.96

28

ANOVA Table So FarANOVA Table

Sources of Variations Sum of Squares

Degrees of Freedoms

Mean Square (Estimate of Variation) F

Treatments SST 3 SST/3 = MSTError SSE 18 SSE/18 = MSE

Total 1284.96 21

MST/MSE

Let’s go calculate this!

29

Calculation 3: Random Variation

Treatment Particular C

CTreatment for Mean SampleX

nObservatio ParticularA X

Variation Random Error Squares of Sum

SSE )X - (X 2

C

C

30

Calculation 3: Random VariationExcellent (X - 83.25) (X - 83.25) 2̂ Good (X - 82.6) (X - 82.6) 2̂

94 10.75 115.5625 75 -7.6 57.7678 -5.25 27.5625 90 7.4 54.7681 -2.25 5.0625 77 -5.6 31.3680 -3.25 10.5625 83 0.4 0.16

88 5.4 29.16

Xbar 83.25 82.6

Totals 158.75 173.2

Fair (X - 76.43) (X - 76.43) 2̂ Poor (X - 72.33) (X - 72.33) 2̂70 -6.43 41.3449 68 -4.33 18.748980 3.57 12.7449 82 9.67 93.508976 -0.43 0.1849 72 -0.33 0.108989 12.57 158.0049 73 0.67 0.448980 3.57 12.7449 74 1.67 2.788975 -1.43 2.0449 65 -7.33 53.728965 -11.43 130.6449

Xbar 76.43 72.33

Totals 357.7143 169.3334

Sum of Squares Error = SSE = 859

31

ANOVA TableSources of Variations Sum of Squares

Degrees of Freedoms

Mean Square (Estimate of Variation) F

Treatments SST 3 SST/3 = MSTError 859 18 859/18 = 47.72

Total 1284.96 21

MST/47.72

ANOVA Table So Far

Let’s go calculate this!

32

Calculation 4: Treatment Variation

ANOVA TableSources of Variations Sum of Squares

Degrees of Freedoms

Mean Square (Estimate of Variation) F

Treatments 425.96 3 425.96/3 = 141.99Error 859 18 859/18 = 47.72

Total 1284.96 21

141.99/47.72 = 2.98

mean overall grand the

andmean ment each treatbetween sdifference

Sqaure theof Sum The Variation Treatment

Treatment Squares of Sum SSE - Total SS SST

Simple Subtraction!

33

Calculation 5: Mean Square (Estimate of Variation)

ANOVA TableSources of Variations Sum of Squares

Degrees of Freedoms

Mean Square (Estimate of Variation) F

Treatments 425.96 3 425.96/3 = 141.99Error 859 18 859/18 = 47.72

Total 1284.96 21

141.99/47.72 = 2.98

34

Calculation 6: F

ANOVA TableSources of Variations Sum of Squares

Degrees of Freedoms

Mean Square (Estimate of Variation) F

Treatments 425.96 3 425.96/3 = 141.99Error 859 18 859/18 = 47.72

Total 1284.96 21

141.99/47.72 = 2.98

35

Step 5: Make A Decision Because is less than 5.09, we fail to

reject H0

The evidence suggests that the mean score of the students in each of the four categories are equal (no difference)

2.98

36

Summarize Chapter 121. List the characteristics of the F distribution

2. Conduct a test of hypothesis to determine whether the variances of two populations are equal

3. Discuss the general idea of analysis of variance

4. Organize data into a ANOVA table

5. Conduct a test of hypothesis among three or more treatment means


Recommended