Date post: | 30-Mar-2015 |
Category: |
Documents |
Upload: | leonard-flowe |
View: | 212 times |
Download: | 0 times |
• Homogenous group (the subjects are very similar on the variables)
• Unreliable measurement instrument (our measurements can't be trusted and bounce all over the place)
• Nonlinear relationship (Pearson's r is based on linear relationships...other formulas can be used in this case)
• Ceiling or Floor with measurement (lots of scores clumped at the top or bottom...therefore no spread which creates a problem similar to the homogeneous group)
Created by Del Siegle ([email protected] – www.delsiegle.info) for students in EPSY 341
.
Weight
Hei
ght
..........
. ....
.
...
Imagine that we created a scatterplot of first graders’ weight and height. Notice how the correlation is around r=.60.
.
Weight
Hei
ght
..........
. ....
.
...
Imagine that we created a scatterplot of first graders’ weight and height. Notice how the correlation is around r=.60.
..........
. ....
.
...
Now let’s add data from second graders (assuming second graders are generally heavier and taller than first graders but the relationship between their weight and height is similar to first graders).
.
Weight
Hei
ght
..........
. ....
.
...
Imagine that we created a scatterplot of first graders’ weight and height. Notice how the correlation is around r=.60.
..........
. ....
.
...
Now let’s add data from second graders (assuming second graders are generally heavier and taller than first graders but the relationship between their weight and height is similar to first graders).
..........
. ....
.
...
We now have added third graders. Notice how the total scatterplot for first through third graders resembles r=.80 while each grade resembled r =.60.
.
Weight
Hei
ght
..........
. ....
.
...
Imagine that we created a scatterplot of first graders’ weight and height. Notice how the correlation is around r=.60.
..........
. ....
.
...
Now let’s add data from second graders (assuming second graders are generally heavier and taller than first graders but the relationship between their weight and height is similar to first graders).
..........
. ....
.
...
We now have added third graders. Notice how the total scatterplot for first through third graders resembles r=.80 while each grade resembled r =.60.
..........
. ....
.
...
Extending the scatterplot to fourth graders increases the value of r even more.
.
Weight
Hei
ght
..........
. ....
.
...
Imagine that we created a scatterplot of first graders’ weight and height. Notice how the correlation is around r=.60.
..........
. ....
.
...
Now let’s add data from second graders (assuming second graders are generally heavier and taller than first graders but the relationship between their weight and height is similar to first graders).
..........
. ....
.
...
We now have added third graders. Notice how the total scatterplot for first through third graders resembles r=.80 while each grade resembled r =.60.
..........
. ....
.
...
Extending the scatterplot to fourth graders increases the value of r even more.
..........
. ....
.
...
As we add fifth graders, we can see that the correlation coefficient is approaching r=.95 for first through fifth graders.
.
Weight
Hei
ght
..........
. ....
.
...
Imagine that we created a scatterplot of first graders’ weight and height. Notice how the correlation is around r=.60.
..........
. ....
.
...
Now let’s add data from second graders (assuming second graders are generally heavier and taller than first graders but the relationship between their weight and height is similar to first graders).
..........
. ....
.
...
We now have added third graders. Notice how the total scatterplot for first through third graders resembles r=.80 while each grade resembled r =.60.
..........
. ....
.
...
Extending the scatterplot to fourth graders increases the value of r even more.
..........
. ....
.
...
The purpose of this demonstration is to illustrate that homogeneous groups
.
Weight
Hei
ght
..........
. ....
.
...
Imagine that we created a scatterplot of first graders’ weight and height. Notice how the correlation is around r=.60.
..........
. ....
.
...
Now let’s add data from second graders (assuming second graders are generally heavier and taller than first graders but the relationship between their weight and height is similar to first graders).
..........
. ....
.
...
We now have added third graders. Notice how the total scatterplot for first through third graders resembles r=.80 while each grade resembled r =.60.
..........
. ....
.
...
Extending the scatterplot to fourth graders increases the value of r even more.
..........
. ....
.
...
The purpose of this demonstration is to illustrate that homogeneous groups produce smaller correlations than heterogeneous groups.
.. .. ..
. .. . ... .
. .. ...
... ..... .... .
. . ..... ..
. ... ...
. ... ...
Assume that the relationship between Variable 1 and Variable 2 is r = - 0.90.
Variable 1
Var
iabl
e 2
.. .. ..
. .. .. .. .
. .. .
..... ..... .... .
. . ..... ..
. ... ...
. ... ...
If the instrument to measure Variable 1 were unreliable, the values for Variable 1 could randomly be smaller or larger.
Variable 1
Var
iabl
e 2
.. .. . .
. .. .. .. .
. .. .
..... ..... . ..
. .. . ..
... .... .. ..
.. ... ...
This would occur for all of the scores.
Variable 1
Var
iabl
e 2
.. .. . .
. .. .. .. .
. .. .
..... .. ... . ..
. .. . ..
... .... .. ..
.. ... ...
Unreliable instruments limit our ability to see relationships.
Variable 1
Var
iabl
e 2
.. .. ..
. .. . ... .
. .. ... ... ..... ..
.
. ... .. ... ..
. ... ...
. ... ...
Image that each year couples were married they became slightly less happy.
Years’ Married
Hap
pine
ss
.. .. ..
. .. . ... .
. .. ... ... ..... ..
.
. ... .. ... ..
. ... ...
. ... ........
...... .
.......
... .. ..... ..
...
..... ..
...... ..
. .....
Image that after they are married for 7 years, they slowly become more happy each year.
Years’ Married
Hap
pine
ss
.. .. ..
. .. . ... .
. .. ... ... ..... ..
.
. ... .. ... ..
. ... ...
. ... ........
...... .
.......
... .. ..... ..
...
..... ..
...... ..
. .....
The negative correlation for the first 7 years…
Years’ Married
Hap
pine
ss
.. .. ..
. .. . ... .
. .. ... ... ..... ..
.
. ... .. ... ..
. ... ...
. ... ........
...... .
.......
... .. ..... ..
...
..... ..
...... ..
. .....
…cancels the positive relationship for the next 7 years.
Years’ Married
Hap
pine
ss
.. .. ..
. .. . ... .
. .. ... ... ..... ..
.
. ... .. ... ..
. ... ...
. ... ........
...... .
.......
... .. ..... ..
...
..... ..
...... ..
. .....
Pearson’s r would show no relationship (r=0.00) between year’s married and happiness even though the scatterplot clearly shows a relationship.
Years’ Married
Hap
pine
ss
.. .. ..
. .. . ... .
. .. ... ... ..... ..
.
. ... .. ... ..
. ... ...
. ... ........
...... .
.......
... .. ..... ..
...
..... ..
...... ..
. .....
This is an example of a curvilinear relationship. Pearson’s r is not an appropriate statistic for curvilinear relationships.
Years’ Married
Hap
pine
ss
.. .. ..
. .. . ... .
. .. ...
... ..... .... .
. . ..... ..
. ... ...
. ... ...
One of the assumptions for using Pearson’s r is that the relationship is linear. That is why the first step in correlation data analysis is to create a scatterplot.
Years’ Married
Hap
pine
ss
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
Imagine that we are plotting the relationship between Variable 1 and Variable 2.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
As values on Variable 1 increase, values on Variable 2 also increase.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
As values on Variable 1 increase, values on Variable 2 also increase.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
As values on Variable 1 increase, values on Variable 2 also increase.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
As values on Variable 1 increase, values on Variable 2 also increase.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
As values on Variable 1 increase, values on Variable 2 also increase.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
Suppose that the top score on the instrument used to measure Variable 2 is 9 (in other words there is a “ceiling” on Variable 2’s measurement instrument).
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
Our subjects can continue to have higher scores on Variable 1, but they are restricted on Variable 2.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
Our subjects can continue to have higher scores on Variable 1, but they are restricted on Variable 2.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
Our subjects can continue to have higher scores on Variable 1, but they are restricted on Variable 2.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
Our subjects can continue to have higher scores on Variable 1, but they are restricted on Variable 2.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
Our subjects can continue to have higher scores on Variable 1, but they are restricted on Variable 2.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
We can see that the ceiling on Variable 2 is causing us to have a lower correlation than if our subjects were able to continue to score higher on Variable 2.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
..........
. ....
.
........
...... .
.......
..........
. ....
.
... ..........
. ....
.
... ..
..... ...
... ..
... .
Our subjects can continue to have higher scores on Variable 1, but they are restricted on Variable 2.
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16Variable 1
Var
iabl
e 2
1
2
3
4
5
6
7
8
9
10
..........
. ....
.
...
When a variable is measured with an instrument that has a ceilings (or floor), we obtain a lower correlation coefficients than if the variable were measured with an instrument that did not have a ceiling (or floor).
..........
. ....
.
........
...... .
....... .....
...... .
....... .
...
... ....
.. ..... .
• Homogenous group (the subjects are very similar on the variables)
• Unreliable measurement instrument (our measurements can't be trusted and bounce all over the place)
• Nonlinear relationship (Pearson's r is based on linear relationships...other formulas can be used in this case)
• Ceiling or Floor with measurement (lots of scores clumped at the top or bottom...therefore no spread which creates a problem similar to the homogeneous group)
Created by Del Siegle ([email protected] – www.delsiegle.info) for students in EPSY 341