+ All Categories
Home > Documents > Section 9.4

Section 9.4

Date post: 30-Dec-2015
Category:
Upload: talon-chan
View: 48 times
Download: 0 times
Share this document with a friend
Description:
Section 9.4. How Can We Analyze Dependent Samples?. Dependent Samples. Each observation in one sample has a matched observation in the other sample The observations are called matched pairs. Example: Matched Pairs Design for Cell Phones and Driving Study. - PowerPoint PPT Presentation
26
Agresti/Franklin Statistics, 1 of 106 Section 9.4 How Can We Analyze Dependent Samples?
Transcript

Agresti/Franklin Statistics, 1 of 106

Section 9.4

How Can We Analyze Dependent Samples?

Agresti/Franklin Statistics, 2 of 106

Dependent Samples

Each observation in one sample has a matched observation in the other sample

The observations are called matched pairs

Agresti/Franklin Statistics, 3 of 106

Example: Matched Pairs Design for Cell Phones and Driving Study

The cell phone analysis presented earlier in this text used independent samples:

• One group used cell phones

• A separate control group did not use cell phones

Agresti/Franklin Statistics, 4 of 106

Example: Matched Pairs Design for Cell Phones and Driving Study

An alternative design used the same subjects for both groups

• Reaction times are measured when subjects performed the driving task without using cell phones and then again while using cell phones

Agresti/Franklin Statistics, 5 of 106

Example: Matched Pairs Design for Cell Phones and Driving Study

Data:

Agresti/Franklin Statistics, 6 of 106

Example: Matched Pairs Design for Cell Phones and Driving Study

Benefits of using dependent samples (matched pairs):• Many sources of potential bias are

controlled so we can make a more accurate comparison

• Using matched pairs keeps many other factors fixed that could affect the analysis

• Often this results in the benefit of smaller standard errors

Agresti/Franklin Statistics, 7 of 106

Example: Matched Pairs Design for Cell Phones and Driving Study

To Compare Means with Matched Pairs, Use Paired Differences:

• For each matched pair, construct a difference score

• d = (reaction time using cell phone) – (reaction time without cell phone)

• Calculate the sample mean of these differences: xd

Agresti/Franklin Statistics, 8 of 106

For Dependent Samples (Matched Pairs)

Mean of Differences

=

Difference of Means

Agresti/Franklin Statistics, 9 of 106

For Dependent Samples (Matched Pairs)

The difference (x1 – x2) between the means of the two samples equals the mean xd of the difference scores for the matched pairs

The difference (µ1 – µ2) between the population means is identical to the parameter µd that is the population mean of the difference scores

Agresti/Franklin Statistics, 10 of 106

For Dependent Samples (Matched Pairs)

Let n denote the number of observations in each sample

This equals the number of difference scores The 95 % CI for the population mean difference

is:

deviation standard their is s

sdifference theofmean sample theis

d

025.

d

dd

xn

stx

Agresti/Franklin Statistics, 11 of 106

For Dependent Samples (Matched Pairs)

To test the hypothesis H0: µ1 = µ2 of equal means, we can conduct the single-sample test of H0: µd = 0 with the difference scores

The test statistic is:

1 with 0 ndf

nsx

td

d

Agresti/Franklin Statistics, 12 of 106

For Dependent Samples (Matched Pairs)

These paired-difference inferences are special cases of single-sample inferences about a population mean so they make the same assumptions

Agresti/Franklin Statistics, 13 of 106

Paired-difference Inferences

Assumptions:• The sample of difference scores is a

random sample from a population of such difference scores

• The difference scores have a population distribution that is approximately normal• This is mainly important for small samples

(less than about 30) and for one-sided inferences

Agresti/Franklin Statistics, 14 of 106

Paired-difference Inferences

Confidence intervals and two-sided tests are robust: They work quite well even if the normality assumption is violated

One-sided tests do not work well when the sample size is small and the distribution of differences is highly skewed

Agresti/Franklin Statistics, 15 of 106

Example: Matched Pairs Analysis for Cell Phones and Driving Study

Boxplot of the 32 difference scores

Agresti/Franklin Statistics, 16 of 106

Example: Matched Pairs Analysis for Cell Phones and Driving Study

The box plot shows skew to the right for the difference scores• Two-sided inference is robust to violations

of the assumption of normality

The box plot does not show any severe outliers

Agresti/Franklin Statistics, 17 of 106

Example: Matched Pairs Analysis for Cell Phones and Driving Study

Agresti/Franklin Statistics, 18 of 106

Example: Matched Pairs Analysis for Cell Phones and Driving Study

Significance test:• H0: µd = 0 (and hence equal population means for

the two conditions)

• Ha: µd ≠ 0

• Test statistic:

46.5

325.52

6.50 t

Agresti/Franklin Statistics, 19 of 106

Example: Matched Pairs Analysis for Cell Phones and Driving Study

The P-value displayed in the output is 0.000

There is extremely strong evidence that the population mean reaction times are different

Agresti/Franklin Statistics, 20 of 106

Example: Matched Pairs Analysis for Cell Phones and Driving Study

95% CI for µd =(µ1 - µ2):

69.5) (31.7,or

18.950.6 )32

5.52(040.26.50

Agresti/Franklin Statistics, 21 of 106

Example: Matched Pairs Analysis for Cell Phones and Driving Study

We infer that the population mean when using cell phones is between about 32 and 70 milliseconds higher than when not using cell phones

The confidence interval is more informative than the significance test, since it predicts just how large the difference must be

Agresti/Franklin Statistics, 22 of 106

Section 9.5

How Can We Adjust for Effects of Other Variables?

Agresti/Franklin Statistics, 23 of 106

A Practically Significant Difference

When we find a practically significant difference between two groups, can we identify a reason for the difference?

Warning: An association may be due to a lurking variable not measured in the study

Agresti/Franklin Statistics, 24 of 106

Example: Is TV Watching Associated with Aggressive Behavior?

In a previous example, we saw that teenagers who watch more TV have a tendency later in life to commit more aggressive acts

Could there be a lurking variable that influences this association?

Agresti/Franklin Statistics, 25 of 106

Control Variable

A control variable is a variable that is held constant in a multivariate analysis (more than two variables)

Agresti/Franklin Statistics, 26 of 106

Can An Association Be Explained by a Third Variable?

Treat the third variable as a control variable

Conduct the ordinary bivariate analysis while holding that control variable constant at fixed values

Whatever association occurs cannot be due to effect of the control variable


Recommended