2.4 Units of Measurement and Functional Form -Two important econometric issues are: 1) Changing...

2.4 Units of Measurement and Functional Form-Two important econometric issues are:1) Changing measurement

-When does scaling variables have an effect on OLS estimates?-When does scaling variables have no effect on OLS estimates?

2) Functional Forms

-How do natural logs affect linear regressions?-How do functional forms impact elasticity?

2.4 Units of Measurement and Functional Form

• Consider the following model:

-where the number of squirrels and trees are in single units-if trees=2000 then the predicted number of squirrels becomes 1500-if there are no trees, there are 500 squirrels-how does this change if squirrels are measured in hundreds (ie: divided by 100)?

(ie) 5.0500îi treesrrelsisqu


-If squirrels are measured in hundreds, then zero trees would produce 500 squirrels, so B0hat=5 (divided by 100)

-If there are 2,000 trees, then B1hat*2000 must equal 10 (15-5, or 1,000 squirrels)-Therefore B1hat=0.005 (divided by 100)

(ie) 005.05îi treesrrelsisqu

2.4 Acorns are for the squirrelsTherefore

When the dependent variable is multiplied (divided) by a constant…

Multiply (divide) the OLS intercept and slope by the same constant.

IE: Change the y – change OLS the same way.


• Consider the following model:

-where the number of treehuggers and trees are in single units-if trees=800 then the predicted number of treehuggers becomes 900-if there are no trees, there are 500 treehuggers-how does this change if trees are measured in hundreds (ie: divided by 100)?

(ie) 5.0500ˆ ii treesggersutreeh


-If trees are measured in hundreds, then zero trees would produce 500 treehuggers, so B0hat=500 (nothing has changed)-If there are 800 trees, then trees=8 and 8(B1hat) must equal 400 (900-500)

-Therefore B1hat=50 (multiplied by 100)(ie) 50500ˆ

ii treesrrelsisqu

2.4 Acorns are for the squirrelsTherefore

When the independent variable is multiplied (divided) by a constant…

Divide (multiply) the OLS slope (not intercept) by the same constant.

IE: Change the x – change OLS slope the opposite way.

2.4 Units of Measurement and Functional FormHow does R2 (goodness of fit) change when

a variable is scaled?

-It doesn’t-R2 calculates how much of the variation in y is explained by x-this doesn’t depend on scaling-a similar “best fit line” is drawn through data points, regardless of scaling

2.4 Functional FormThus far, we have focused on LINEAR

relationships

-Linear relationships don’t capture all of the possible interaction between variables-Linear relationships assume that the first x has the same impact on y as the last x-changing impacts can be captured through the use of NATURAL LOGARITHMS

2.4 Log-Lin ModelWhen a variable has an increasing (percentage)

impact on y, the log-lin model is appropriate:

(2.42) )log( 10 uxy -note that log(y) indicates the natural log of y-if we assume that u doesn’t change,

(2.43)x )100(% 1 y

-note as x increases, y increases, therefore this equation expresses INCREASING return

2.4 Log-Lin ModelAssume that absence does make

the heart grow fonder:(i.e) 5.03)log( uabsencefondness

-assuming (for simplicity) that u=0, 2 days absence causes fondness of e4 (54.6) while 10 day’s absence causes fondness of e8 (2,981)

-therefore, given another day of absence:

(2.43) 50%x)100(% 1 fondness

2.4 Lin-Log Model

Therefore:-the 3rd day of absence increases fondness by 27.3-the 11th day of absence increases fondness by 1,490.5-we have INCREASING RETURNS

-Note: a Log-lin model can also be expressed: )( 10e uxy

2.4 Log-Log ModelRecall from Econ 299 that

elasticity is calculated as:

)ln(

)ln(

x

y

-if constant elasticity is theoretically important to a model, a log-log functional form ensures that elasticity is constant as B1:

uxy )log()log( 10

2.4 Scaling and Dependent Logs• Consider a Log-Lin model where the y value is

multiplied by c:

-scaling a dependent variable in log form changes the intercept but does not affect the slope

iii

iii

iii

iii

uxcy

uxccy

cuxcy

uxy

10

10

10

10

)log(

)]log([)log(

)log()log()log(

)log(

2.4 Units of Measurement and Functional FormDifferent Functional Forms are Summarized as

Follows:Model Function Interpretation of B1

Lin-Lin y=f(x) ∆y= B1∆x

Lin-Log y=f(log(x)) ∆y= (B1/100)%∆x

Log-Lin Log(y)=f(x) %∆y= 100B1∆x

*also call semi-elasticity

Log-Log Log(y)=f(log(x)) %∆y= B1%∆x

2.4 Units of Measurement and Functional FormNotes:1)Even though non-linear variables are

included in models (ie: log(y) or y2), the models are still considered “Linear Regressions” as they are linear in the parameters B1 and B2

2)Non-linear variables make interpreting B1 and B2 more complicated

3)Some estimated models are NOT linear regression models

2.5 Expected Values and Variances of the OLS Estimators

• This section will, using classical Gauss-Markov Assumptions, find 3 OLS properties:

1) OLS is unbiased2) Sample Variance of OLS Estimators3) Estimated Error Variance

This will be done viewing B0hat and B1hat as estimators of the population model:

ux10 y

Gauss-Markov Assumption SLR.1(Linear in Parameters)

In the population model, the dependent variable, y, is related to the independent variable, x, and the error (or disturbance), u, as

(2.47) 10 uxy

Where B0 and B1 are the population intercept and slope parameters, respectively.

Gauss-Markov Assumption SLR.1(Linear in Parameters)

Notes:1) In reality, x, y and u are all viewed

as random variables 2)Since OLS needs only be linear in B1

and B2, SLR.1 is far from restrictive

Given an equation, an assumption must now be made concerning data

Gauss-Markov Assumption SLR.2(Random Sampling)

We have a random sample of size n, {(x,y): i=1,2,…..n}, following the population model in equation (2.47).

Gauss-Markov Assumption SLR.2(Random Sample)

We will see in later chapters that random sampling can fail, especially in time series data but also in cross-sectional data

-Now that we have a population equation and an assumption about data, (2.47) can be re-written as:

(2.47) n.2,...., 1,i ,10 iii uxy Where ui captures all unobservables for observation I

and differs from uihat

-to estimate B0 and B1 we need a 3rd assumption:

Gauss-Markov Assumption SLR.3(Sample Variation in the

Explanatory Variable)

Sample outcomes of x, namely, {xi, u=1,….,n}, are not all the same value.

Gauss-Markov Assumption SLR.3(Sample Variation in the

Explanatory Variable)-This assumption ensures that the

denominator of B1hat is not zero-This assumption is violated if:

-The variance of x is zero-The standard deviation of x is zero-The minimum value of x is equal to the maximum value

-Although we can now obtain OLS estimates, we need one more assumption to ensure unbiasedness

Gauss-Markov Assumption SLR.4(Zero Conditional Mean)

The error u has an expected value of zero given any value of the explanatory variable. In other words,

0x)|(u E

Gauss-Markov Assumption SLR.4(Zero Conditional Mean)

Given our assumption about random sampling, we can further conclude:

. 0)x|(u 2,...n 1,iii E-this is read “for all 1=1, 2,….n”-given SLR.2 and SLR.4, we can derive the properties of OLS estimators

as conditional on xi’s values-given these 2 assumptions, nothing is lost in derivation by assuming xi

is nonrandom

2.4 OLS is UnbiasedIn order to prove OLS’s unbiasedness,

B1hat must first be algebraically manipulated:

(2.49) )(x

y)(xˆ2

i

ii1

x

x

-By a familiar mathematical property.-Substituting out yi and restating the

denominator gives us:(note that SSTx is not the same as SST)

(2.50) )ux()(xˆ ii10i

1xSST

x

2.4 OLS is UnbiasedUsing summation properties, the

numerator becomes:

iiii1i0

iii1i0i

)(xx)(x)(x

(2.51) )(xx)(x)(x

uxxx

uxxx

-Which is simplified using the properties:

xiii

i

SSTxxxxx

and

xx

2)()(

0)(

2.4 OLS is UnbiasedReturning to our B1hat estimate, we now

have:(2.52) ud

1

u)(xîi1

ii11

xx SSTSST

x

-Which indicates that the estimate of B1 equals B1 plus a term that is a linear combination of errors

-Conditional on values of x, B1hat’s randomness is due solely to the errors

-Note: xx ii d

Theorem 2.1(Unbiasedness of OLS)

Using assumptions SLR.1 through SLR.4,

(2.53) )ˆ( and )ˆ( 110o EE

for any values of B0 and B1. In other words, B0hat is unbiased for B0 and B1hat is unbiased for B1.

Theorem 2.1 ProofSince expected values are conditional on samples

of x, and SSTx and di are functions only of xi, they are nonrandom in conditioning. Therefore:

1i1

i1

i1

i11

0d(1/SST)

)(d(1/SST)

)E(d(1/SST)

]dE[(1/SST))ˆ(

i

i

i

uE

u

uE

Theorem 2.1 Proof-this is proved from the fact that each ui (conditional on sample x’s) is zero from SLR.2 and SLR.4

-”since unbiasedness holds for any outcome on {x1, x2,…,xn}, unbiasedness also holds without conditioning on {x1, x2,…,xn}

-unbiasness of B1hat is now straightforward:

Theorem 2.1 Proof

Since we already proved that0

110

1100

110

11010

])ˆ[(

][])ˆ[()ˆ(

)ˆ(

ˆˆˆ

xE

uExEE

ux

xuxxy

0)-Ê( e therefor)ˆ( 1111 E

Theorem 2.1 Notes-Remember that unbiasedness is a feature of the sample distributions of B1hat and

B2hat-if we have a poor sample, our OLS estimates would be far from the true values

-if any of our 4 initial Gauss-Markov assumptions are not true, OLS’s unbiasedness fails

Assumption Failure-If SLR.1 fails (y and x are not linearly related), very advanced estimation methods

are needed-Failure of SLR.2 (random sampling) is discussed in Chapters 9 and 17

-common in time series and possible in cross-sectional data-If SLR.3 fails (x’s are all the same), we cannot obtain OLS estimates -If SLR.4 fails, OLS estimators are biased, which can be corrected

SLR.4 Failure-If x is correlated with u, we have spurious correlation

-the relationship between x and y is influenced by other factors connected with x

-note that some vague connection is always possible but not statistically significant

SLR.4 Failure Example-Saskatchewan instituted a hypothetical drunk driving

awareness (DDA) campaign as an alternative to jail time for DUI

-It was found that the relationship between DUI’s and enrolment in the program is as follows:

ii DDAIUD 78.023ˆ -Even though the program looks to have failed, it is

due to a spurious correlation:-the existence of drunk drivers both increases the

number of DUI’s and the enrolment in the program

Date post:	04-Jan-2016
Category:	Documents
Upload:	agnes-stevens
View:	216 times
Download:	0 times

2.4 Units of Measurement and Functional Form -Two important econometric issues are: 1) Changing...

Documents