1
10/28/2006 P331 Analysis of Variance Introduction
1
Analysis of Variance(ANOVA)
10/28/2006 P331 Analysis of Variance Introduction
2
What We Will Cover in This Section
• Introduction.• Overview.• Simple ANOVA.
– Theory– Practice
10/28/2006 P331 Analysis of Variance Introduction
3
Situation
The management of Saul T. Dogg’srestaurant wanted to see if the saltiness of appetizers would influence the number of drinks people purchased. Three sections of the club are targeted to receive appetizers that have either low, medium, or high saltiness. The dependent variable is the number of drinks ordered.
2
10/28/2006 P331 Analysis of Variance Introduction
4
Appetizer saltiness and number or drinks ordered.
M = 1.80M = 3.90M = 2.00244142222131142362251241143332
Group 3 High SaltGroup 2 Medium SaltGroup 1 Low Salt
10/28/2006 P331 Analysis of Variance Introduction
5
Issue
How to determine if one mean is significantly different from the other means while minimizing the probability of committing a Type I error.
10/28/2006 P331 Analysis of Variance Introduction
6
Analysis of Variance:Background and Theory
3
10/28/2006 P331 Analysis of Variance Introduction
7
t-test Logic
1X 2X
Distance Between Means
Treatment + Random Error
10/28/2006 P331 Analysis of Variance Introduction
8
ANOVA Logic
10/28/2006 P331 Analysis of Variance Introduction
9
Treatment Effects
4
10/28/2006 P331 Analysis of Variance Introduction
10
Total Variability
10/28/2006 P331 Analysis of Variance Introduction
11
Between Groups Variability
:t1 :t2 :t3
10/28/2006 P331 Analysis of Variance Introduction
12
Within Groups Variability
:t1 :t2 :t3
5
10/28/2006 P331 Analysis of Variance Introduction
13
Partitioning the Variance
TotalVariability
Between GroupsVariability
Within GroupsVariability
Treatment Effect Random Error Random Error
10/28/2006 P331 Analysis of Variance Introduction
14
Where We Are Going
( )Random Erro) rR
Treatmandom
ent + (Error
10/28/2006 P331 Analysis of Variance Introduction
15
ANOVA Model
:t1 :t2 :t3
:G
+= RandomErrorTreatment EffectTotal
Effect
+= Within GroupsVariability
Between Groups
Variability
TotalVariability
6
10/28/2006 P331 Analysis of Variance Introduction
16
Partitioning the Variance
+= Within GroupsVariability
Between Groups
Variability
TotalVariability
BetweenGroups WithinGroupsTotal
Total Between Within
SS SSSSdf df df
= +
(N - 1) (k – 1) (N – k)
10/28/2006 P331 Analysis of Variance Introduction
17
The F ratio
:t1 :t2 :t3
:G
F(k-1,N-k) MSW
MSB=
10/28/2006 P331 Analysis of Variance Introduction
18
F-test Compared to t-test
F(k-1,N-k)MSW
MSB=t(df)
σX
X1 - X2=
7
10/28/2006 P331 Analysis of Variance Introduction
19
What Does the F Mean?
Random Error
Treatment Variability+
Random Error=F
10/28/2006 P331 Analysis of Variance Introduction
20
The Critical Value of F
• See page 485 in text book.• Notice
– Need df between (numerator) for columns.– Need df within (denominator) for rows.– As df increases the critical values get
smaller.
10/28/2006 P331 Analysis of Variance Introduction
21
The ProblemReturns
8
10/28/2006 P331 Analysis of Variance Introduction
22
The Situation (in case you forgot)
The management of Sal T. Dogg’s restaurant wanted to see if the saltiness of appetizers would influence the number of drinks people purchased. Three sections of the club are targeted to receive appetizers that have either low, medium, or high saltiness. The dependent variable is the number of drinks ordered.
10/28/2006 P331 Analysis of Variance Introduction
23
Hypotheses
Research Hypothesis.Saltiness of the appetizers will influence the number of drinks that people buy.
Null Hypothesis. Saltiness will not influence the number of drinks that people buy.
Statistical Hypothesis.µ1 ≠ µ2 ≠ µ3
10/28/2006 P331 Analysis of Variance Introduction
24
Appetizer saltiness and number or drinks ordered.
X = 1.80X = 3.90X = 2.00244142222131142362251241143332
Group 3 High SaltGroup 2 Medium SaltGroup 1 Low Salt
9
10/28/2006 P331 Analysis of Variance Introduction
25
Graph of Saltiness Ratings
GROUP
High SaltinessMedium SaltinessLow saltiness
Mea
n of
SAL
TINE
S
4.5
4.0
3.5
3.0
2.5
2.0
1.5
1.0
.5
0.0
10/28/2006 P331 Analysis of Variance Introduction
26
ANOVA Summary Table
2951.37Total
.912724.50Within Groups
14.7713.435226.87Between Groups
F(crit=3.35)MSdfSSSource
10/28/2006 P331 Analysis of Variance Introduction
27
How to Express F
F (2,27) = 14.77, p<.05
Calculated FDegrees of freedom(between, within) Alpha
10
10/28/2006 P331 Analysis of Variance Introduction
28
Post Hoc Tests
When F is significant, how do you determine which of the means differs from the others?
10/28/2006 P331 Analysis of Variance Introduction
29
Tukey Honestly Significant Difference Test (HSD)
( , , )within
withindf k
MSHSD qnα=
q = Value from table, page 488α = desired significance leveldfwithin = within groups dfk = Number of groups.
(.05,27,3).9110
HSD q=
3.53 .091HSD x=
1.06HSD =
10/28/2006 P331 Analysis of Variance Introduction
30
2.00
3.90
1.80
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
How To Use Tukey (HSD = 1.06)
1.902.10
.20
11
10/28/2006 P331 Analysis of Variance Introduction
31
Effect Size: Eta Squared (η2)
2 Between
Total
SSSS
η =2 26.87
51.37η =
2 .523η =
10/28/2006 P331 Analysis of Variance Introduction
32
Assumptions
1. The observations within each sample are independent.
2. The population from which the samples are selected is normally distributed.
3. The population from which the samples are selected have equal variances (homogeneity of variance)
10/28/2006 P331 Analysis of Variance Introduction
33
Another ANOVA Example
Sal O. Gysm felt that the perceived difficulty of logic problems would influence performance on these problems. Sal developed a set of problems and gave them to three groups. One group was told that the problems was easy, another was told that they were moderately difficult, and the third was told that they were difficult. The dependent variable was the number of problems solved.
12
10/28/2006 P331 Analysis of Variance Introduction
34
ANOVA: Example 2
M = 3.0M = 6.0M = 8.0
13452
463210
912487
DifficultModerateEasy
10/28/2006 P331 Analysis of Variance Introduction
35
Logic Problem Results
012345678
No of problems solved
Easy Moderate Difficult
10/28/2006 P331 Analysis of Variance Introduction
36
ANOVA: Summary Table
4.5231.677.00
21214
63.3384.00
147.33
BetweenWithinTotal
FMSdfSSSource
Is this value significant?
13
10/28/2006 P331 Analysis of Variance Introduction
37
Post hoc Analysis: Tukey HSD4.46HSD =
8
6
3
0123456789
Easy Moderate Difficult
10/28/2006 P331 Analysis of Variance Introduction
38
Effect Size: Eta2 2( )η
2 between
total
SSSS
η =2 63.33
147.33η =
2 .428η =
10/28/2006 P331 Analysis of Variance Introduction
39
Key Learning Points
1. The F test is used for comparing three or more means.
2. The Total Variance is broken down into two components
- Between Group Variance (Treatment plus error)- Within Group Variance (Error)
3. The F ratio is computed by dividing Mean Square Between by the Mean Square Within.
14
10/28/2006 P331 Analysis of Variance Introduction
40
Key Learning Points
4. The F ratio tells you ONLY if all of the means together are different.
5. The post hoc tests tell you which pairs of means differ.
6. Eta2 tells you the amount of variability accounted for in the treatment.
10/28/2006 P331 Analysis of Variance Introduction
41
Tess Tosterone is studying aggression among adolescent girls. She believes that there is a relationship between the level of interaction a girl has with her mother and
the girl’s level of aggression. She has identified fifteen girls who fall into one of
three maternal interaction levels (low, medium, and high) and has measured their
aggression scores.
The scores are shown on the next slide.
Another Practice Problem
10/28/2006 P331 Analysis of Variance Introduction
42
Data Summary Table
M = 1.00M = 5.00M = 6.00
026
144
059
485
066
High InteractionModerate InteractionLow interaction
15
10/28/2006 P331 Analysis of Variance Introduction
43
ANOVA Summary Table
FMean SquaredfSum of
Squares
116.00
70.00
14
2Between
Within
Total
10/28/2006 P331 Analysis of Variance Introduction
44
Tukey HSD
( , , )within
withindf k
MSHSD qnα=
3.8333.77
5
HSD = 3.30
10/28/2006 P331 Analysis of Variance Introduction
45
Which Means are Different?
6
5
1
0
1
2
3
4
5
6
7
Low Moderate High
16
10/28/2006 P331 Analysis of Variance Introduction
46
Eta2
2 between
total
SSSS
η = 70.00116.00
.60
10/28/2006 P331 Analysis of Variance Introduction
47