Measurement Statistics

Basic Concepts in Measurement

*

*

Can Psychological Properties Be Measured?A common complaint: Psychological variables cant be measured.

We regularly make judgments about who is shy and who isnt; who is attractive and who isnt; who is smart and who is not.

*

*

QuantitativeImplicit in these statements is the notion that some people are more shy, for example, than othersThis kind of statement is inherently quantitative.Quantitative: It is subject to numerical qualification.If it can be numerically qualified, it can be measured.

*

*

Measurement

The process of assigning numbers to objects in such a way that specific properties of the objects are faithfully represented by specific properties of the numbers.

Psychological tests do not attempt to measure the total person, but only a specific set of attributes.

*

*

Measurement (cont.)

Measurement is used to capture some construct For example, if research is needed on the construct of depression, it is likely that some systematic measurement tool will be needed to assess depression.

*

*

MeasurementMeasurement--defined as application of rules to assign numbers to objects (or attributes).

Measurement rules--the procedures used to transform the qualities of attributes into numbers (e.g., type of scale used).

*

Why bother assigning numbers?quantifying something that is expected to vary.

individual differences -- premise that people will vary (get different scores) on the attribute

*

Individual Differences The cornerstone of psychological measurement - that there are real, relatively stable differences between people.

This means that people differ in measurable ways in their behavior and that the differences persist over a sufficiently long time.

Researchers are interested in assigning individuals numbers that will reflect their differences.

Psychological tests are designed to measure specific attributes, not the whole person.

These differences may be large or small.

*

*

Scales of measurementThree important properties:

Magnitude--property of moreness. Higher score refers to more of something.Equal intervals--is the difference between any two adjacent numbers referring to the same amount of difference on the attribute?Absolute zero--does the scale have a zero point that refers to having none of that attribute?

*

Types of Measurement Scales

1. Nominal

2. Ordinal

3. Interval

4. Ratio*

*


Nominal Scales - there must be distinct classes but these classes have no quantitative properties. Therefore, no comparison can be made in terms of one category being higher than the other.

For example - there are two classes for the variable gender -- males and females. There are no quantitative properties for this variable or these classes and, therefore, gender is a nominal variable.

Other Examples:country of originbiological sex (male or female)animal or non-animalmarried vs. single

*

*

Nominal ScaleSometimes numbers are used to designate category membership

Example:

Country of Origin1 = United States3 = Canada2 = Mexico4 = Other

However, in this case, it is important to keep in mind that the numbers do not have intrinsic meaning

*

*


Ordinal Scales - there are distinct classes but these classes have a natural ordering or ranking. The differences can be ordered on the basis of magnitude.

For example - final position of horses in a thoroughbred race is an ordinal variable. The horses finish first, second, third, fourth, and so on. The difference between first and second is not necessarily equivalent to the difference between second and third, or between third and fourth.

*

*

Ordinal ScalesDoes not assume that the intervals between numbers are equal

Example:finishing place in a race (first place, second place)1 hour2 hours3 hours4 hours5 hours6 hours7 hours8 hours1st place2nd place3rd place4th place*

*

Types of Measurement Scales (cont.)

Interval Scales - it is possible to compare differences in magnitude, but importantly the zero point does not have a natural meaning. It captures the properties of nominal and ordinal scales -- used by most psychological tests.

Designates an equal-interval ordering - The distance between, for example, a 1 and a 2 is the same as the distance between a 4 and a 5

Example - Celsius temperature is an interval variable. It is meaningful to say that 25 degrees Celsius is 3 degrees hotter than 22 degrees Celsius, and that 17 degrees Celsius is the same amount hotter (3 degrees) than 14 degrees Celsius. Notice, however, that 0 degrees Celsius does not have a natural meaning. That is, 0 degrees Celsius does not mean the absence of heat!

*

*


Ratio Scales - captures the properties of the other types of scales, but also contains a true zero, which represents the absence of the quality being measured.

For example - heart beats per minute has a very natural zero point. Zero means no heart beats. Weight (in grams) is also a ratio variable. Again, the zero value is meaningful, zero grams means the absence of weight.

Example: the number of intimate relationships a person has had0 quite literally means nonea person who has had 4 relationships has had twice as many as someone who has had 2*

*


Each of these scales have different properties (i.e., difference, magnitude, equal intervals, or a true zero point) and allows for different interpretations.

The scales are listed in hierarchical order. Nominal scales have the fewest measurement properties and ratio having the most properties including the properties of all the scales beneath it on the hierarchy.

The goal is to be able to identify the type of measurement scale, and to understand proper use and interpretation of the scale.

*

*

Types of scalesNominal scales--qualitative, not quantitative distinction (no absolute zero, not equal intervals, not magnitude)Ordinal scales--ranking individuals (magnitude, but not equal intervals or absolute zero)Interval scales--scales that have magnitude and equal intervals but not absolute zeroRatio scales--have magnitude, equal intervals, and absolute zero (so can compute ratios)

*

Test Your Knowledge:

A professor is interested in the relationship between the number of times students are absent from class and the letter grade that students receive on the final exam. He records the number of absences for each student, as well as the letter grade (A,B,C,D,F) each student earns on the final exam. In this example, what is the measurement scale for number of absences?

Nominalb) Ordinalc) Intervald) Ratio

*

*

In the previous example, what is the measurement scale of letter grade on the final exam?

a) Nominalb) Ordinalc) Intervald) Ratio*

*

A researcher is interested in studying the effect of room temperature in degrees Fahrenheit on productivity of automobile assembly workers. She controls the temperature of the three manufacturing facilities, such that employees in one facility work in a room temperature of 60 degrees, employees in another facility work in a room temperature of 65 degrees, and the last group works in a room temperature of 70 degrees. The productivity of each group is indicated by the number of automobiles produced each day. In this example, what is the measurement scale of room temperature?

a) Nominalb) Ordinalc) Intervald)Ratio*

*

In the previous example, what is the measurement scale of productivity?


*

Select the highest appropriate level of measurement:

Bicycle models:

1= Road2 = Touring3 = Mountain4 = Hybrid5 = Comfort6 = Cruiser


*


Educational Level:

1 = Some High school2 =High school Diploma3 = Undergraduate Degree4 = Masters Degree5 = Doctorate Degree


*


Number of questions asked during a class lecture


*

Select the highest level of measurement:

Categories on a Likert-type scale measuring attitudes:

1 = Strongly Disagree2 = Disagree3 = Neutral4 = Agree5 = Strongly Agree


*

Evaluating Psychological TestsThe evaluation of psychological tests centers on the tests:Reliability - has to do with the consistency of the instrument. A reliable test is one that yields consistent scores when a person takes two alternate forms of the test or when an individual takes the same test on two or more different occasions.

Validity - has to do with the ability to measure what it is supposed to measure and the extent to which it predicts outcomes.*

*

Why Statistics?Correlation - relationship between scores. Variability - measure of the extent to which test scores differ.Statistics are important because they give us a method for answering questions about meaning of those numbers.Three statistical concepts are central to psychological measurement:Prediction - forecast relationships .*

*

Why we need statistics

Statistics for the purposes of description--numbers as summaries.

Statistics for making inferences--logical deductions about events that cant be observed directly (e.g., opinion polls).

*

Three basic statistical conceptsVariability--extent to which individuals differ on the attribute

not simply the range of scoresdetermine how far from the mean each individuals score is--square each value--then sum these values and divide by the number of people

*

Variability

There are four major measures of variability:

1. Range - difference between the highest and lowest scoresFor Example: If the highest score was 60 & lowest was 40 = range of 20

2. Interquartile Range - difference between the 75th and 25th percentile.

3. Variance - the degree of spread within the distribution (the larger the spread, the larger the variance). It is the sum of the squared differences from the mean of each score, divided by the number of scores

4. Standard Deviation - a measure of how the average score deviates or spreads away from the mean.

*

*

Variability (continued)square root of that value is the standard deviation

standard scores (z-scores) -- calculated using the mean (average score) and the standard deviation

positive values are above the mean, negative values are below the mean

*

Standard DeviationStandard deviation is a measure of spread affected by the size of each data value a commonly calculated and used statistic equal to square root of variance

typically about 2/3 of data values lie within one standard deviation of the mean.

*

*

Question: Six masses were weighed as 4, 6, 6, 7, 9 and 10 kgFind the mean, variance and standard deviation of these weights.Example using individual data valuesAnswer: mean Variance is the average square distance from the mean= 7 kg*

*

Question: Six masses were weighed as 4, 6, 6, 7, 9 and 10 kgFind the mean, variance and standard deviation of these weights.Example using individual data valuesAnswer: mean = 7 kgMethod 1 Variance Variance is the average square distance from the mean*

*

Answer: mean Method 1 Variance Variance is the average square distance from the mean= 4 kg2 = 7 kgQuestion: Six masses were weighed as 4, 6, 6, 7, 9 and 10 kgFind the mean, variance and standard deviation of these weights.*

*

Answer: mean Method 1 Variance = 4 kg2 standard deviation = = 7 kgQuestion: Six masses were weighed as 4, 6, 6, 7, 9 and 10 kgFind the mean, variance and standard deviation of these weights.= 2 kg *

*

Variability

Variability is the foundation of psychological testing.

Variability refers to the spread of the scores within a distribution.

Tests depends on variability across individuals --- if there was no variability then we could not make decisions about people.

The greater the amount of variability there is among individuals, the more accurately we can make the distinctions among them.

*

*

Normal Distribution Curve

Many human variables fall on a normal or close to normal curve including IQ, height, weight, lifespan, and shoe size.

Theoretically, the normal curve is bell shaped with the highest point at its center. The curve is perfectly symmetrical, with no skewness (i.e., where symmetry is absent). If you fold it in half at the mean, both sides are exactly the same.

From the center, the curve tapers on both sides approaching the X axis. However, it never touches the X axis. In theory, the distribution of the normal curve ranges from negative infinity to positive infinity.

Because of this, we can estimate how many people will compare on specific variables. This is done by knowing the mean and standard deviation.

*

*

Relational/Correlational Research

Relational Research

Attempts to determine how two or more variables are related to each other.

Is used in situations where a researcher is interested in determining whether the values of one variable increase (or decrease) as values of another variable increase. Correlation does NOT imply causation!

For example, a researcher might be wondering whether there is a relationship between number of hours studied and exam grades. The interest is in whether exam grades increase as number of study hours increase.

*

*

Use and Meaning of Correlation Coefficients

Value can range from -1.00 to +1.00

An r = 0.00 indicates the absence of a linear relationship.

An r = +1.00 or an r = - 1.00 indicates a perfect relationship between the variables.

A positive correlation means that high scores on one variable tend to go with high scores on the other variable, and that low scores on one variable tend to go with low scores on the other variable.

A negative correlation means that high scores on one variable tend to go with low scores on the other variable.

The further the value of r is away from 0 and the closer to +1 or -1, the stronger the relationship between the variables.

*

*

Correlationused to determine the relationship between two variables

scatterplots involve plotting the scores on each of two variables (one along the x-axis and one along the y-axis)

*

Scatter Plots

An easy way to examine the data given is by scatter plot. When we plot the points from the given set of data onto a rectangular coordinate system, we have a scatter plot.

Is often employed to identify potential associations between two variables, where one may be considered to be an explanatory variable (such as years of education) and another may be considered a response variable

*

*

PredictionCorrelations used in prediction

Relation between test score (predictor) and the thing to be predicted (criterion)

E.g., GREs used to predict likely success in graduate school

*

Prediction/Linear Regression Linear regression attempts to model the relationship between two variables by fitting a linear equation to observed data. One variable is considered to be an explanatory variable, and the other is considered to be a dependent variable.

Formula : Y = a + bX ---------- Where X is the independent variable, Y is the dependent variable, a is the intercept and b is the slope of the line. Before attempting to fit a linear model to observed data, a modeler should first determine whether or not there is a relationship between the variables of interest

*

*

Coefficients of Determination

By squaring the correlation coefficient, you get the amount of variance accounted for between the two data sets. This is called the coefficient of determination.

A correlation of .90 would represent 81% of the variance between the two sets of data (.90 X .90 = .81)

A perfect correlation of 1.00 represents 100% of the variance. If you know one variable, you can predict the other variable 100% of the time (1.00 X 1.00 = 1.00)

A correlation of .30 represents only 9% of the variance, strongly suggesting that other factors are involved (.30 X .30 = .09)

*

The coefficient of determination, r 2, is useful because it gives the proportion of the variance (fluctuation) of one variable that is predictable from the other Variable. It is a measure that allows us to determine how certain one can be in making predictions from a certain model/graphThe coefficient of determination represents the percent of the data that is the closest to the line of best fit. For example, if r = 0.922, then r 2 = 0.850, which means that 85% of the total variation in y can be explained by the linear relationship between x and y (as described by the regression equation). The other 15% of the total variation in y remains unexplained. The coefficient of determination is a measure of how well the regression line represents the data. If the regression line passes exactly through every point on the scatter plot, it would be able to explain all of the variation. The further the line is away from the points, the less it is able to explain. *

Factor AnalysisIs a statistical technique used to analyze patterns of correlations among different measures. The principal goal of factor analysis is to reduce the numbers of dimensions needed to describe data derived from a large number of data. It is accomplished by a series of mathematical calculations, designed to extract patterns of intercorrelations among a set of variables. *

e.g. many of the mental disorders specified in the DSM share qualities and Attributes. Could imagine correlating all of the disorders and determining theRelationships (correlation) between the disorders. Would result in far too many Correlations can reduce this through factor analysis which will describe The pattern of relationships among the disorders by extracting the primary Factors associated with the correlations. Doing so may find that only two Underlying factors are needed to account for the intercorrelations in case of DSM disorders two broad factors identified as internalizing vs. externalizingHave been identified as accounting for interrelationships between many of the Disorders.

FA identifies basic underlying factors that account for the correlations Between test scores*

**

*

*

*

*

*

**

**

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*

*The coefficient of determination, r 2, is useful because it gives the proportion of the variance (fluctuation) of one variable that is predictable from the other Variable. It is a measure that allows us to determine how certain one can be in making predictions from a certain model/graphThe coefficient of determination represents the percent of the data that is the closest to the line of best fit. For example, if r = 0.922, then r 2 = 0.850, which means that 85% of the total variation in y can be explained by the linear relationship between x and y (as described by the regression equation). The other 15% of the total variation in y remains unexplained. The coefficient of determination is a measure of how well the regression line represents the data. If the regression line passes exactly through every point on the scatter plot, it would be able to explain all of the variation. The further the line is away from the points, the less it is able to explain. *e.g. many of the mental disorders specified in the DSM share qualities and Attributes. Could imagine correlating all of the disorders and determining theRelationships (correlation) between the disorders. Would result in far too many Correlations can reduce this through factor analysis which will describe The pattern of relationships among the disorders by extracting the primary Factors associated with the correlations. Doing so may find that only two Underlying factors are needed to account for the intercorrelations in case of DSM disorders two broad factors identified as internalizing vs. externalizingHave been identified as accounting for interrelationships between many of the Disorders.

FA identifies basic underlying factors that account for the correlations Between test scores*

Date post:	19-Nov-2015
Category:	Documents
Upload:	prakash-narendran
View:	9 times
Download:	1 times

Measurement Statistics

Documents