+ All Categories
Home > Documents > Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito...

Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito...

Date post: 26-Apr-2018
Category:
Upload: dangque
View: 229 times
Download: 4 times
Share this document with a friend
21
Transcript
Page 1: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Introduction to Linear Regression Analysis

Samuel Nocito

Lecture 1

March 2nd, 2018

Page 2: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Econometrics: What is it?

I Interaction of economic theory, observed data and

statistical methods.

I The science of testing economic theory.

I The application of statistical techniques for solving

empirical problems.

I The set of tools used either for predicting future variables

(prices, demographic trends, etc.) or for phenomenon

estimation.

I The science of using data to make quantitative inference for

policy recommendations.

Page 3: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Econometrics: Why do we need it?

I Is there gender discrimination in the labor market (wage

gender gap)?

I How much can "carbon tax" reduce the use of fossil fuels?

I Is there racial discrimination in the market for home loans?

I What is the economic return of education?

I What will the life expectancy at birth be in the next 20

years?

Page 4: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Migration Topics Addressed by Econometrics

Broad questions:

(A) Who chooses to migrate?I Impact of personal characteristics.

(B) Why do people migrate to di�erent countries?I Push and pull factors.

(C) What is the impact of emigration?I E�ect on the country of origin.

(D) What is the impact of immigration?I E�ect on the host country.

Page 5: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Migration Topics Addressed by Econometrics

Speci�c questions (examples):

I Does foreign language pro�ciency foster migration of young

individual within the European Union? (Aparicio Fenoll

and Kuehn, 2016)

⇒ Point (A) "broad questions".

I Do immigrants cause crime? (Bianchi et al., 2012)

⇒ Point (D) "broad questions".

Page 6: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Principal Econometrics Methods

I Linear Regression model: Ordinary Least Squares (OLS)

I Non Linear Regression Models:I Maximum Likelihood Estimation (MLE)I Probit, Logit, Tobit

I Di�erences-in-Di�erences

I Instrumental Variable Estimation (IV)

Page 7: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Principal Econometrics Methods in the Literature

1995-1999 2000-2004 2005-2009

Number of papers 31 40 51By empirical technique

OLS 14 11 20MLE, Probit, Logit, Tobit 3 9 9Di�erences-in-Di�erences 1 2 0Instrumental Variable 4 12 8Others 9 6 14By topic

Assimilation 14 17 14Immigrants selection 6 7 8Native outcome 8 9 12Others 3 7 12

American Economic Review, Quarterly Journal of Economics, Journal of PoliticalEconomy, Journal of Labour Economics, and others top journals.Source: Sona Kalataryan, Methodological Workshop, MPC (EUI) 2016.

Page 8: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Principal Econometrics Methods: We focus on

I Ordinary Least Squares (OLS)I Simple mathematical and graphical explanationI Practical examplesI Interpretation of results

I Instrumental Variable (IV)I Very short introduction on the topicI Correlation vs causalityI Interpretation of results (OLS vs IV)I Tackled in lecture 2

Page 9: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Ordinary Least Squares (OLS)

Suppose we have a sample of N observations on individual

wages and personal characteristics:

y X

i Wage Age Gender

1 6 18 M2 5 18 F3 5.8 20 F...

......

...N 6.9 22 M

US National Longitudinal Survey (NLS) of 1987 (Example).

N=3294 young working individuals, 1569 females.

Hourly wage rates. Males average 6.31, females 5.15.

We want to answer:

how in this sample wages are related to other

observables?

Page 10: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Ordinary Least Squares (OLS)

OLS general equation:

yi = β0 + β1Xi + εi

In our empirical case:

Wagei = β0 + β1Genderi + εi

Where:

I yi (individual wage): dependent variable (explained)

I xi (gender): independent variable (explanatory)

I εi: is the error term

Page 11: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Ordinary Least Squares (OLS)

yi = β0 + β1Xi is a linear equation model where

I β0 is the intercept of the curve

I β1 is the slope of the curve

Page 12: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Ordinary Least Squares (OLS)

In the empirical case:

Figure: Fitted line and observation points (Verbeek, Fig. 2.1)

Page 13: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Ordinary Least Squares (OLS)

Figure: Linear Regression Example: Height and Age (months)

I blue dots: observed data (combinations of height and age).

I blue line: OLS linear equation.

I red arrow: error term εi.

Page 14: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Ordinary Least Squares (OLS)

I We observe x and y.

I We want to estimate β0 and β1 to understand the relation

between x and y.

I The distance between the dot and the line is the error

term εi of the OLS.

I We want to minimize the error term.

Page 15: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Ordinary Least Squares (OLS)

Formally:

yi = β0 + β1Xi + εi ⇔ εi = yi − β0 − β1Xi

where εi is the error term.

In particular we want to minimize:∑Ni=1 ε

2i =

∑Ni=1(yi − β0 − β1Xi)

2

Remark: we use the quadratic transformation to avoid issues

with the sign of the error term.

Page 16: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Ordinary Least Squares (OLS)

In the case with one regressor (i.e., gender) and a constant., the

solutions of β0 and β1 that minimize the error are:

β0 = y − β1x

β1 =Cov(x, y)

V ar(x)

Where:

I y is the sample average of the yi.

I x is the sample average of the xi.

I Cov(x, y) is the sample covariance between x and y.

I V ar(x) is the sample variance of x.

The intercept (β0) is determined to make the average error

equal to zero.

Page 17: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

OLS: Application to the Wage Example

We create the variable Male using the information of gender

(dummy variable).

y X

i Wage Age Gender Male

1 6 18 M 1

2 5 18 F 0

3 5.8 20 F 0...

......

......

N 6.9 22 M 1

We use OLS to estimate:

Wagei = β0 + β1Malei + εi

Page 18: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

OLS: Application to the Wage Example

Table: OLS results wage equation (Verbeek, tab. 2.1)

Dependent variable: wageVariable Estimate Standard ErrorConstant 5.1469 0.0812Male 1.1661 0.1122

R2 = 0.0317 F=107.93

Wagei = 5.15 + 1.17Malei

β0 = 5.15 and β1 = 1.17

I β1 = 1.17 means that males receive 1.17 dollar per hour morethan females.

I Standard errors show the error in the estimate of the coe�cient(the smaller the better!).

I R2 = 0.0317 means that approximately 3.2% of the variation inindividual wages is given to gender di�erences.

Page 19: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

OLS: Application to the Wage Example

Figure: Graphical Representation of the Standard Errors (example)

Suppose each dot is a coe�cient estimate:

I The standard error shows the interval in which the

coe�cient lies.

I The smaller is the interval the higher is the precision of the

estimate.

Page 20: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

Lecture 2 in Sketches

I Dependent Variable and Explanatory variablesI How to interpret coe�cient estimates with di�erent variablede�nitions.

I Analysis of an empirical paper results.I OLS issues.

I Correlation vs causalityI Short introduction to IV estimates (conceptual).I Comparison of results (OLS vs IV) of an empirical paper.

Page 21: Introduction to Linear Regression Analysis - unito.it to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018

References

I Marno Verbeek, A Guide to Modern Econometrics, 3rd Ed.,Wiley, 2008, Chapter 2, pp. 6-31.

I Suggested (not used in class):I Stock, James H., and Mark W. Watson, Introduction toEconometrics, Global Edition, MA: Pearson Education,2012.


Recommended