+ All Categories
Home > Documents > STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of...

STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of...

Date post: 28-Dec-2015
Category:
Upload: carmella-waters
View: 217 times
Download: 0 times
Share this document with a friend
Popular Tags:
22
STA291 Statistical Methods Lecture 31
Transcript
Page 1: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

STA291Statistical Methods

Lecture 31

Page 2: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Analyzing a Design in One Factor – The One-Way Analysis of Variance

Consider an experiment with a single factor of k levels.Question of Primary Interest: Is there evidence for differences in effectiveness for the treatments?Let be the mean response for treatment group i. Then, to answer the question, we must test the hypothesis:

i

0 1 2:

: at least one mean is differentk

A

H

H

(no difference in treatments)

(at least one treatment has a different result)

Page 3: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

What criterion might we use to test the hypothesis?

The test statistic compares the variance of the means to what we’d expect that variance to be based on the variance of the individual responses.

Analyzing a Design in One Factor – The One-Way Analysis of Variance

Page 4: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

The F-statistic compares two measures of variation, called mean squares. The numerator measures the variation between the groups (treatments) and is called the Mean Square due to treatments (MST). The denominator measures the variation within the groups, and is called the Mean Square due to Error (MSE). The F-statistic is their ratio:

1,k N k

MSTF

MSE

Every F-distribution has two degrees of freedom, corresponding to the degrees of freedom for the mean square in the numerator and for the mean square (usually the MSE) in the denominator.

Analyzing a Design in One Factor – The One-Way Analysis of Variance

Page 5: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

1) The Mean Square due to Treatments (between-group variation measure)

2

1

1

mean for group

grand mean (mean of all the data)

observations in group

k

i ii

i

i

SSTMST

k

SST n y y

y i

y

n i

To quantify these two classes of variation, we introduce two new measures of variability for one-factor experiments with k levels:

Analyzing a Design in One Factor – The One-Way Analysis of Variance

Page 6: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

2) The Mean Square due to Error (within-group variation measure)

2

1

2

1

variance for group

observations in group

total number of observations

k

i ii

i

i

SSEMSE

N k

SSE n s

s i

n i

N

Analyzing a Design in One Factor – The One-Way Analysis of Variance

Page 7: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

This analysis is called an Analysis of Variance (ANOVA)The null hypothesis is that the means are all equalUsually presented in a table, called the ANOVA table, like this one:

Analyzing a Design in One Factor – The One-Way Analysis of Variance

Page 8: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Example: Tom’s Tom-Toms tries to boost catalog sales by offering one of four incentives with each purchase:

1) Free drum sticks

2) Free practice pad

3) Fifty dollars off any purchase

4) No incentive (control group)

Analyzing a Design in One Factor – The One-Way Analysis of Variance

Page 9: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Here is a summary of the spending for the month after the start of the experiment. A total of 4000 offers were sent, 1000 per treatment.

Analyzing a Design in One Factor – The One-Way Analysis of Variance

Page 10: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Use the summary data to construct an ANOVA table.

(This table is most often created using technology.)•Since P is so small, we reject the null hypothesis and conclude that the treatment means differ. •The incentives appear to alter the spending patterns.

Analyzing a Design in One Factor – The One-Way Analysis of Variance

Page 11: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Assumptions and Conditionsfor ANOVA

Independence Assumption

The groups must be independent of each other.

No test can verify this assumption. You have to think about how the data were collected and check that the Randomization Condition is satisfied.

Page 12: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Equal Variance AssumptionANOVA assumes that the true variances of the treatment groups are equal. We can check the corresponding Similar Variance Condition in various ways:• Look at side-by-side boxplots of the groups. Look for differences in spreads.• Examine the boxplots for a relationship between the mean values and the spreads. A common pattern is increasing spread with increasing mean.• Look at the group residuals plotted against the predicted values (group means). See if larger predicted values lead to larger-magnitude residuals.

Assumptions and Conditionsfor ANOVA

Page 13: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Normal Population AssumptionLike Student’s t-tests, the F-test requires that the underlying errors follow a Normal model. As before when we faced this assumption, we’ll check a corresponding Nearly Normal Condition.

• Examine the boxplots for skewness patterns.• Examine a histogram of all the residuals.• Example a Normal probability plot.

Assumptions and Conditionsfor ANOVA

Page 14: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Normal Population Assumption

For the Tom’s Tom-Toms experiment, the residuals are not Normal. In fact, the distribution exhibits bimodality.

Assumptions and Conditionsfor ANOVA

Page 15: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Normal Population Assumption

The bimodality shows up in every treatment!

This bimodality came as no surprise to the manager. He responded, “…customers …either order a complete new drum set, or…accessories… or choose not to purchase anything.”

Assumptions and Conditionsfor ANOVA

Page 16: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Normal Population Assumption

These data (and the residuals) clearly violate the Nearly Normal Condition. Does that mean that we can’t say anything about the null hypothesis? No. Fortunately, the sample sizes are large, and there are no individual outliers that have undue influence on the means. With sample sizes this large, we can appeal to the Central Limit Theorem and still make inferences about the means. In particular, we are safe in rejecting the null hypothesis.

Assumptions and Conditionsfor ANOVA

Page 17: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Example: A test prep course

A tutoring organization says of the 20 students it worked with gained an average of 25 points on a given IQ test when they retook the test after the course.

Explain why this does not necessarily prove that the special course caused scores to go up.

The students were not randomly assigned. Those who signed up for the course may be a special group whose scores would have improved anyway. Design an experiment to test their claimGive the IQ test to a group of volunteers and then randomly assign them to take the review course or to not take the review course. After a period of time re-administer the test.

Assumptions and Conditionsfor ANOVA

Page 18: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Example: A test prep course

A tutoring organization says of the 20 students it worked with gained an average of 25 points on a given IQ test when they retook the test after the course.

It is suspected that the students with particularly low grades would benefit more from the course, how would you change your design to account for this suspicion. After the initial test, group the volunteers based on their scores. Randomly assign half of each group to either take the review course or not. Compare the results from the two groups in the block design.

Assumptions and Conditionsfor ANOVA

Page 19: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Multiple Comparisons

Knowing that the means differ leads to the question of which ones are different and by how much.

Methods that test these issues are called methods for multiple comparisons.

Question: Why don’t we simply use a t-test for differences between means to test each pair of group means?

Answer: Each t-test is subject to a Type I error, and the chances of committing such an error increase as the number of tested pairs increases.

Page 20: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

• Let J represent the number of pairs of means.

• Then, find the confidence interval for each difference

using the confidence level instead of 1J

1 .

If a confidence interval does not contain 0, then a significant difference is indicated.

The Bonferroni Method

Use the t-test methodology to test differences between the means, but use an inflated value for t* that lessens the accumulated chance of committing a Type I error by keeping the overall Type I error rate at or below This wider margin of error is called the minimum significant difference or MSD.

.

Multiple Comparisons

Page 21: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

• Watch out for outliers.

• Watch out for changing variances.

• Watch for multiple comparisons.

• Be sure to fit an interaction term when it exists.

• When the interaction effect is significant, don’t interpret the main effects.

Page 22: STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

Looking back

Understand how to use Analysis of Variance (ANOVA) to analyze designed experiments. oANOVA tables follow a standard layout; be acquainted with it.oThe F-statistic is the test statistic used in ANOVA. F-statistics test hypotheses about the equality of the means of two groups.oThe Bonferroni Method is for identifying which treatments are different


Recommended