How Much Does Immigration Boost Innovation?ftp.iza.org/dp3921.pdf · and the individual...

IZA DP No. 3921

How Much Does Immigration Boost Innovation?

Jennifer HuntMarjolaine Gauthier-Loiselle

DI

SC

US

SI

ON

PA

PE

R S

ER

IE

S

Forschungsinstitutzur Zukunft der ArbeitInstitute for the Studyof Labor

January 2009

How Much Does Immigration Boost

Innovation?

Jennifer Hunt McGill University,

NBER and IZA

Marjolaine Gauthier-Loiselle Princeton University

Discussion Paper No. 3921 January 2009

IZA

P.O. Box 7240 53072 Bonn

Germany

Phone: +49-228-3894-0 Fax: +49-228-3894-180

E-mail: [email protected]

Any opinions expressed here are those of the author(s) and not those of IZA. Research published in this series may include views on policy, but the institute itself takes no institutional policy positions. The Institute for the Study of Labor (IZA) in Bonn is a local and virtual international research center and a place of communication between science, politics and business. IZA is an independent nonprofit organization supported by Deutsche Post World Net. The center is associated with the University of Bonn and offers a stimulating research environment through its international network, workshops and conferences, data service, project support, research visits and doctoral program. IZA engages in (i) original and internationally competitive research in all fields of labor economics, (ii) development of policy concepts, and (iii) dissemination of research results and concepts to the interested public. IZA Discussion Papers often represent preliminary work and are circulated to encourage discussion. Citation of such a paper should account for its provisional character. A revised version may be available directly from the author.

mailto:[email protected]

IZA Discussion Paper No. 3921 January 2009

ABSTRACT

How Much Does Immigration Boost Innovation?*

We measure the extent to which skilled immigrants increase innovation in the United States by exploring individual patenting behavior as well as state-level determinants of patenting. The 2003 National Survey of College Graduates shows that immigrants patent at double the native rate, and that this is entirely accounted for by their disproportionately holding degrees in science and engineering. These data imply that a one percentage point rise in the share of immigrant college graduates in the population increases patents per capita by 6%. This could be an overestimate of immigration's benefit if immigrant inventors crowd out native inventors, or an underestimate if immigrants have positive spill-overs on inventors. Using a 1940-2000 state panel, we show that immigrants do have positive spill-overs, resulting in an increase in patents per capita of 9-18% in response to a one percentage point increase in immigrant college graduates. We isolate the causal effect by instrumenting the change in the share of skilled immigrants in a state with the state's predicted increase in the share of skilled immigrants. We base the latter on the 1940 distribution across states of immigrants from various source regions and the subsequent national increase in skilled immigrants from these regions. JEL Classification: J61, D24, O32 Keywords: immigration, innovation Corresponding author: Jennifer Hunt Department of Economics University of British Columbia 997-1873 East Mall Vancouver B.C. V6T 1Z1 Canada E-mail: [email protected] * We are grateful to David Munroe for excellent research assistance, and for helpful comments to Francisco Alvarez-Cuadrado, Leah Brooks, David Card, Lee Fleming, Rachel Friedberg, David Green, Francisco Gonzales, Judy Hellerstein, Chad Jones, Bill Kerr, Daniel Parent, Giovanni Peri, Steve Pischke, Regina Riphahn, Eric Stuen and Dee Suttiphisal, seminar participants at the London City University, London School of Economics, NBER (Productivity and Labor Studies), Nürnberg, Simon Fraser, University College London and the SoLE/EALE Transatlantic Conference, and several friends holding patents. We thank Bill Kerr, Nicole Fortin, and Jim Hirabayashi of the USPTO for data and Deven Parmar for obtaining and formatting the USPTO data. Hunt is also affiliated with the CReAM, CEPR, IZA and DIW-Berlin, and acknowledges the Social Science and Humanities Research Council of Canada for financial support.

mailto:[email protected]

Economists have studied the impact of immigration on a variety of host country out-

comes. For example, Card (2007) considers U.S. immigration’s impact on population

growth, skill composition, internal migration, wages, rents, taxes and the ethnic and

income composition of neighborhoods and schools. In contrast, the impact of immigra-

tion on innovation has received less attention. In addition to the direct contributions of

immigrants to research, immigration could boost innovation indirectly through positive

spill–overs on fellow researchers, the achievement of critical mass in specialized research

areas, and the provision of complementary skills such as management and entrepreneur-

ship. Some tantalizing facts hint at the possible importance of these effects for the United

States. Compared to a foreign–born population of 12% in 2000, 26% of U.S.–based Nobel

Prize recipients from 1990–2000 were immigrants (Peri 2007), as were 25% of founders of

public venture–backed U.S. companies in 1990–2005 (Anderson and Platzer 2006), and

founders of 25% of new high–tech companies with more than one million dollars in sales

in 2006 (Wadhwa et al. 2007). Immigrants are over–represented among members of the

National Academy of Sciences and the National Academy of Engineering, among authors

of highly–cited science and engineering journal articles, and among founders of bio–tech

companies undergoing IPOs (Stephan and Levin 2001). Kerr (2007) documents the surge

in the share of U.S. patents awarded to U.S.–based inventors with Chinese and Indian

names to 12% of the total by 2004, and Wadhwa et al. (2007) find that non–U.S. citizens

account for 24% of international patent applications from the United States.

The goal of our paper is to assess the impact of skilled immigration on innovation

as measured by U.S. patents per capita. The purpose of studying patents is to gain in-

sight into technological progress, a driver of productivity growth and ultimately economic

growth. If immigrants increase patents per capita, they may increase output per capita

and make natives better off. This is an important consideration for the debate concerning

how many and what type of immigrants should be admitted to the United States, and

particularly for the discussion of the appropriate number of employer–sponsored H–1B

visas for skilled (especially science and engineering) workers. The context of the dis-

cussion is the shift from European to low and middle–income source countries since the

1

Immigration Act of 1965, and the concomitant faster increase in unskilled immigration

than skilled immigration.

One way skilled immigrants could increase patenting per capita is through a greater

concentration than natives in science and engineering occupations. Immigrants are likely

to be over–represented in such occupations, since scientific and engineering knowledge

transfers easily across countries: it does not rely on institutional or cultural knowledge, is

not associated with occupations with strict licensing requirements like medicine, and does

not require the sophisticated language skills of a field like law.1 Skilled immigrants could

also increase patenting per capita if a combination of immigration policies and immigrant

self–selection leads them to be more educated or of higher unobserved inventive ability.

Even immigrants who do not patent themselves may increase patenting by providing

complementary skills to inventors, such as entrepreneurship. Immigrant inventors may in

turn make natives more inventive. Conversely, immigrant inventors’ contributions could

be offset by negative spill–overs, for example, if their presence discourages natives from

working in science and engineering.2

After establishing theoretical propositions concerning the decision of scientists and

engineers to emigrate, we begin our empirical analysis by examining how much immigrants

patent using the 2003 National Survey of College Graduates (NSCG), which contains

information on both patenting activity and birth place. The individual–level data allow

us to gauge the impact of immigrants on patents per capita under the assumption that

immigrants do not influence the behavior of natives or other immigrants, and allow us to

examine whether immigrants patent more than natives because they have higher inventive

ability or merely different education or occupations.

In order to account for immigrants’ possible influence on natives or other immigrants,

we turn to a panel of U.S. states from 1940–2000, based on data from the U.S. Patent

and Trademark Office, the decennial censuses and other sources. We provide estimates

1See Chiswick and Taengnoi (2007) and Peri and Sparber (2008) for evidence.2In the most relevant paper, Borjas (2006) finds that immigrants do not crowd out natives as a whole

from graduate school.

2

of skilled immigrants’ impact on patents per capita that encompass both immigrants’

own patenting and any positive or negative spill–overs immigrants might have. To obtain

the causal effect of immigrants despite their endogenous choice of destination state, we

difference the data across census years, and instrument the change in the share of skilled

immigrants in a state with the state’s predicted increase in the share of skilled immigrants.

We base the latter on the 1940 distribution across states of immigrants from various source

regions and the subsequent national increase in skilled immigrants from these regions.

We contribute to two understudied areas, the impact of immigration on innovation

and the individual determinants of innovation, as well as to the study of the regional

determinants of innovation. Our work is also relevant for the macroeconomic growth

literature, where the link between innovation and the number of researchers is the key to

growth.3

We go beyond the most closely related paper linking immigration and innovation, Peri

(2007), by adding individual–level analysis, extending the state panel, using instrumen-

tal variables to correct for endogeneity, and defining skilled immigration more broadly

and consistently across time. These considerations also distinguish our paper from the

time–series analysis of Chellaraj, Maskus and Mattoo (2008). Both of these papers find

skilled immigration increases U.S. patenting. Our analysis is more general than that of

Stuen, Mobarak and Maskus (2007) and Kerr and Lincoln (2008). The former authors find

that immigrant students increase U.S. university patenting and science and engineering

publishing. The latter authors find that when the national population of H–1B visa–

holders increases, patenting by inventors with Indian and Chinese names rises in states

that have many H–1B applications. A related paper by Niebuhr (2006) concludes that

German regions with more diverse worker nationalities (as measured by the Herfindahl

index) patent more. The result is not robust to region fixed effects, however, no doubt

in part because she has only two years of data close in time (1997 and 1999). Paserman

(2008) finds no effect of skilled immigration on Israeli manufacturing productivity. We

are not aware of previous papers with regression analysis of the individual determinants of

3Aghion and Howitt (1992), Grossman and Helpman (1991a,b), Jones (1995), Romer (1990).

3

patenting, though Morgan, Kruytbosch and Kannankutty (2003) note in passing the im-

migrant advantage in patenting in the 1995 NSCG, and economic historians have studied

the characteristics of nineteenth century inventors (e.g. Khan and Sokoloff 1993).

There is a large literature on the regional determinants of patenting, but the analysis

relies primarily on cross–section variation or qualitative analysis. The literature considers

the effects of private and public R&D spending, the presence of a university, the presence of

small firms, the competitiveness of product markets, the presence of an airport, geographic

centrality, population density and size and the presence of skilled workers, especially

scientists and engineers.4 The most closely related paper is by Zucker and Darby (2006):

they pool data on Bureau of Economic Analysis regions for 1981–2004, and find that

non–university patenting is affected by neither the presence of star scientists, a high wage

(proxying for education) nor a high stock of relevant journal publications (representing

the stock of knowledge).5

Our theoretical analysis shows that workers with science and engineering education are

more likely to emigrate than other (“professional”) skilled workers if the expected wage

premium commanded by professional over unskilled jobs in the destination is smaller than

the cost of adapting professional skills to the destination institutions. We demonstrate

this in Appendix A. Our empirical analysis of the NSCG data shows that immigrants

account for 24% of patents, twice their share in the population, and that the skilled im-

migrant patenting advantage over skilled natives is entirely accounted for by immigrants’

disproportionately holding degrees in science and engineering fields. The data imply that

a one percentage point increase in college–graduate immigrants’ share of the population

increases patents per capita by 6%.

4See, for example, Acs (2002), Bottazzi and Peri (2003), Hicks et al. (2001), and the papers in Acs et

al. (2002); Jaffe, Trajtenberg and Henderson (1993) and successor papers study geographic patterns of

patent citations.5Other relevant papers include Agrawal, Kapur and McHale (2002), who find that emigration from

India reduces access to knowledge in India, Zucker et al. (2006), who examine the determinants of

a region’s publications in nanotechnology, and Marx, Strumsky and Fleming (2007) and Stuart and

Sorenson (2003), who examine the effect of a state’s enforcing non–compete laws on inventor inter–firm

mobility and biotech IPOs respectively.

4

This could overestimate the contribution of immigrants, if immigrants crowd out na-

tives from science and engineering, or could underestimate the contribution, if immigrants

have positive spill–overs. The state panel analysis, which estimates the contribution in-

cluding any spill–overs, shows evidence of positive spill–overs of immigrants, since the

estimates of their impact on patents per capita are higher than in the NSCG: a one per-

centage point rise in the share of immigrant college graduates in the population increases

patents per capita by 9–18%. The state–level results mean that the 1990–2000 increase

in the population share of this group from 2.2% to 3.5% increased patents per capita by

12–21% in a period when patents per capita rose 63%. We find that immigrants who are

scientists and engineers or who have post–college education boost patents per capita more

than immigrant college graduates.

1 Empirical methodology

We use individual–level data to measure and explain differences in patenting behavior

between immigrants and natives, and to gauge the contribution of immigrants to patenting

per capita under the assumption that immigrants do not affect the behavior of natives or

other immigrants. We then use state–level data to estimate the effect of immigrants on

patenting per capita, including any positive or negative spill–overs.

1.1 Individual–level data

A measure of the increase in patenting per capita owing to skilled immigrants can be

calculated as follows.6 Let the skilled immigrant share of patents be α0 (we obtain this

value from the NSCG) and the skilled immigrant share of the population be α1 (we

obtain this value from the census). Let MS be the number of skilled immigrants and

PMS their patents. If the skilled immigrant share of the population increases by one

percentage point, the percent increase in skilled immigrants is ∆MS

MS = 1α1

0.010.99−α1

, the

percent increase in the population is ∆MS

POP= 0.01

0.99−α1and the percent increase in patents

6The full algebra is presented in Appendix B.

5

is ∆PMS

P= 1

PPMS

MS ∆MS = α0∆MS

MS . The percent increase in patents per capita is therefore

1 + ∆PMS

P

1 + ∆MS

POP

− 1 = (0.01)α0 − α1

α1(1− α1). (1)

We shall establish below that skilled immigrants patent more than skilled natives,

and that this difference is driven by the difference in patenting at all. For policy–makers

contemplating reducing skilled immigration and inducing more natives to study science

and engineering, it may be interesting to understand the reasons for the immigrant ad-

vantage. To explore these reasons, we estimate a probit for the probability of having a

patent granted, or the probability of commercializing or licensing a patent, weighted by

the survey weights:

P (patentj) = β0 + β1IMj +Xjβ2 + εj, (2)

where j indexes individuals and IM is a dummy for the foreign–born. The coefficient

of interest is β1. We are interested in how much of the raw patenting gap between

immigrants and natives (the value of β1 with no X covariates) can be explained by adding

the covariates X: field of study of the highest degree, the highest degree, and demographic

variables. We perform the regressions for three samples: college graduates, post–college

degree holders, and scientists and engineers.

1.2 State–level data

We supplement the analysis using a panel of U.S. states with decennial data from 1940–

2000. By extending the period of observation back to 1940, we are able to distinguish

long run and short run effects by differencing the data in lengths varying from ten to 50

years.7 We estimate

∆logPi,t+1

POPi,t+1

= γ0 + γ1∆ISit + γ2∆NSit + ∆Xitγ3 + Zi,1940γ4 + µt + ∆ηit, (3)

where i indexes states, P is the number of patents, POP is state population, IS is the share

of the population or workforce (18–65) composed of skilled immigrants, NS represents the

7Strictly speaking, we should refer to low–frequency and high–frequency effects.

6

corresponding share for skilled natives, Zi,1940 are characteristics of the state in 1940, X

are contemporaneous state characteristics and µt are year dummies. The coefficient of

interest is γ1, though its size relative to γ2 is also of interest. We also present results from

specifications where the dependent variable is not in logs.8

We define a skilled person variously as one with a college degree or more, one with

post–college education, or one working in a science, engineering or computer science oc-

cupation. We include characteristics of the state in 1940 (including land area), to capture

the convergence in patents per capita occurring over the time period. The X covariates

comprise the log of defense procurement spending and the log of the average age of state

residents (18–65). We deliberately do not include R&D spending, as we believe this to

instead be a potential outcome variable. We lead the dependent variable by one year to

allow for a year of research time between the change in the inputs and the patent appli-

cation, as anecdotal evidence suggests the lag can vary between a few months and two

years.

There were several major changes to the patent system between 1980 and 1998 (see

Hall 2005). One change led to a large increase in patenting in electrical engineering

relative to other sectors. To capture potentially differential effects of this by state, we

include among the X’s the share of employment in electrical engineering–related sectors

in 1980, interacted with year dummies.9 Alternatively, we capture this by controlling for

region–specific dummies interacted with a dummy for differences involving years beyond

1980.

We use state populations to weight the regressions,10 since in some small states one

company drives the time series of patenting,11 and we cluster standard errors by state

to allow for serial correlation. Because we account for state fixed effects by estimating

8All patents are filed in Washington D.C.; they are attributed to states based on the home address of

the first inventor.9We use 1980 values as electrical engineering employment was still tiny in most states in 1940–1970.

10Specifically, we weight by 1/(1/popi,t+1 + 1/popi,t−k+1), where k is the length of the difference.11Idaho’s emergence as the state with most patents per capita has been driven by one semi–conductor

company, Micron Technology Inc., founded in 1978, which was granted 1643 patents in 2001 and was the

fourth–ranked company in this regard.

7

equations differenced across time, we elect not to include the change in the patent stock

among the regressors as would be suggested by patent models. Furthermore, because we

analyze long–run changes, we have chosen not to use a partial adjustment model.12

Equation (3) suffers from an endogeneity problem. Skilled workers are likely to migrate

to states which are growing or innovating, causing γ̂1 and γ̂2 to be biased upward in least

squares estimation. On the other hand, γ̂1 in particular could be biased towards zero

owing to measurement error.13 We devise an instrument to address these problems for

skilled immigrants, inspired by a shift–share type analysis of the change in popularity of

a state stemming from changes in the origin regions of skilled immigrants at the national

level.14 To illustrate, if immigrants from Europe prefer the northeastern United States

because it is closer to home and because other Europeans are already there because of

geography, and Asian immigrants prefer the west coast for the same reasons, the large

national increase in the share of skilled immigrants that are Asian will lead to an increase

in skilled immigration to the west coast relative to the northeast. If the national increase

in skilled Asian immigrants is caused by the change in U.S. immigration policy in 1965,

the opening of China to the world in 1979, along with increases in tertiary education in

China and India, it is orthogonal to shocks to west coast patenting.

For a state i, the predicted change in the number of skilled immigrants, caused by

changing origin regions k, can be written as

∆M̂Si =

∑k

Mik

Mk

∆MSk =

∑k

λik∆MSk ,

where λik is state i’s share in 1940 of the national total of immigrants who originate

12We have estimated these models. The coefficient on the change in the stock of patents is close to

one, rendering all other coefficients insignificant, while the coefficient on the partial adjustment term is

insignificant.13There is considerable measurement error for small states in the 1950 census, which was a smaller

sample than later years and which asked certain key questions of only one quarter of the sample. There

is also measurement error for the share of immigrant post–college and immigrant scientists and engineers

in small states in the 1940–70 censuses.14This instrument is similar to the instrument developed by Card (2001). For ∆NS , the change in the

share of native skilled workers, we have experimented unsuccessfully with lagged college enrollments as

an instrument. The enrollment data only begin in the 1970s.

8

from region k, and ∆MSk is the national change in the number of skilled immigrants from

that region. We use 18 source regions or countries, listed in Appendix Table 1. Because

the variable to be instrumented, ∆ISit , is a percentage point change, we convert ∆M̂Si to

percentage points by dividing by the population level at the start of the period to which

∆ refers, to define our final instrument as:

∆M̂Si

POPi.

We deliberately base the λik on immigrants of all educations (and ages) to emphasize

the role of geography and taste and minimize the role of economic factors that might

attract skilled workers specifically. We can control for the λik in the regressions, to ensure

that the instrument is not correlated with the error term due to their omission, and

the year dummies control for all national trends. If controlling for λik and the other

covariates does not account for state–specific patenting shocks which are very persistent

and influence national inflows of particular immigrant groups (e.g. California has serially

correlated positive patenting shocks which caused low–skill Chinese to settle there before

1940 and which incited skilled Chinese to move to the United States in more recent years),

the instrument could be correlated with the error term. We present evidence below that

suggests this is not the case.

Information on Alaska and Hawaii is not available until 1960. If the instrument is con-

structed using the 1960 shares for Alaska and Hawaii instead of the missing 1940 shares,

Hawaii is such an outlier (due to its high share of Asian immigrants in 1960) that the in-

strument does not statistically significantly predict immigration patterns even in weighted

regressions. We therefore drop Hawaii from the analysis entirely, and for simplicity drop

Alaska as well. This makes an imperceptible difference to weighted least squares regres-

sions, and only a small difference to the less preferred unweighted regressions.

9

2 Data and Descriptive Statistics

2.1 Individual–level data

We use the individual–level data from the 2003 National Survey of College Graduates

(NSCG). These data are a stratified random sample of people reporting having a bachelor’s

degree or higher on the long form of the 2000 census. In 2003, all respondents who

had ever worked were asked whether they had applied for a U.S. patent since October

1998, whether they had been granted any U.S. patent since October 1998, and if so, how

many, and how many had been commercialized or licensed.15 The survey will not capture

patents by those with less than a college degree, but we assume that most patents are

captured. Appendix C provides more information on the NSCG. We include in our sample

respondents 65 or younger (the youngest respondent is 23, but few are younger than 26).

Immigrants are those born outside the United States.

We define three (not mutually exclusive) skill categories, motivated in part by con-

sistency with categories that can be distinguished in the censuses: college graduates (i.e.

the full sample); holders of a post–college degree; and those working as scientists and

engineers in the survey week. Only 51% of respondents who had been granted a patent

reported working in a science or engineering occupation. Another 18% reported a man-

agement occupation: a research team’s manager is sometimes listed as a co–inventor on

a patent, and all inventors listed are captured in the data; also, many inventors will have

been promoted to management since obtaining a patent. Science and engineering tech-

nicians represent 2.5% of patent holders, and respondents in health–related occupations

represent another 3.0%.

Table 1 shows details of how patenting varies by immigrant status for the three skill

groups. For college graduates (the whole sample, columns 1–2), 1.9% of immigrants were

granted patents compared to 0.9% of natives, a ratio of 2.1, and patents per capita were

15Questions on patents were also asked in the 1995 NSCG, but only of respondents who said they

worked in research and development in the survey week, which will cause the patents of job changers to

be missed.

10

0.057 for immigrants and 0.028 for natives, a ratio of 2.0. Immigrants therefore patent

at about twice the native rate, with the difference being principally in the probability of

patenting at all. Immigrants held a slightly smaller advantage in patents commercialized

or licensed, patents likely to benefit society more than others: 1.2% immigrants had

commercialized a patent compared to 0.6% for natives, but commercialized patents per

capita were 0.029 for immigrants and 0.017 for natives. The immigrant–native gap is

larger for the sample with post–college education (columns 3–4), but much smaller for

the sample working in science and engineering occupations (columns 5–6). For example,

6.2% immigrants in the latter sample had been granted a patent, compared to 4.9%

natives, and immigrants hold 1.35 times the patents per capita of natives. Appendix

Table 2 contains the means of variables used in the regression analysis below.

2.2 State–level data

The patent data used in the state–level analysis come from the U.S. Patent and Trademark

Office (USPTO). Patents are attributed to states based on the home address of the first

inventor on the patent. We merge a series based on electronic data from 1963 onwards with

a series from paper records for 1883–1976 (see Appendix C for the merging procedure).

Patents are classified according to application (filing) date. Figure 1 shows the evolution

of total patents and patents per 100,000 residents from 1941-2001, our study period.

In Figure 2 we use patent data from 1929 to 2001 to display the long–run convergence

across states in patenting, as measured by changes in the (unweighted) standard deviation

of log patents. The convergence in patents, shown by the downward slope of the top line, is

not merely a function of convergence in population, as is demonstrated by the convergence

in patents per capita (bottom line).16 However, there is divergence in patents per capita

from 1990–2001, and there have historically been other periods of divergence. California

is a force for divergence, as may be seen by the growing gap between the inequality of

state patent counts with California (top line) and without California (middle line).

16Papers such as Co et al. (2006) have previously noted cross–state convergence in patents per capita.

11

We have also used an extract from the Harvard Business School patent data file,

which contains information on patents granted from 1975 to 2007, arranged by year of

application and patent class.17 We have aggregated the patent classes to six categories

using the classification of Hall, Jaffe and Trajtenberg (2001) and our own classification

of patent classes created since 1999. The extract contains the number of citations made

to patents in each patent class, state and application year. These may be viewed as a

proxy for the quality of the patent. We analyze 1971–2001 data using this extract (see

Appendix C for how we approximate 1971 values).

To compute the shares of the population in various education and occupation classes,

to divide these into immigrant and native, and to calculate the average age of the state’s

population, we use the IPUMS microdata of the decennial censuses. Post–college educa-

tion is the highest education level that can be measured consistently throughout 1940–

2000. We define immigrants to be the foreign born. Information for Alaska and Hawaii

is not available in 1940 and 1950, and we elect to drop these states entirely for reasons

explained in the methodology section.

The variable means for the full 1940–2000 sample, weighted by population, are re-

ported in Table 2. Between 1940 and 2000, the share of the population 18–65 composed

of immigrants with college education or more increased twelvefold to 3.5%, while the

equivalent share for post–college increased twelvefold to 1.6%. The population shares

comprising natives with at least college and with post–college increased from 4.1% to

20.0% and from 1.1% to 7.7% respectively. The share of workers composed of immigrant

scientists and engineers multiplied elevenfold to 0.9%, while the native share rose from

0.6% to 3.5%. The Appendix Table 2 contains information about the variables used in

the construction of the instruments.

17We are very grateful to Bill Kerr for making this extract for us.

12

3 Results

3.1 Individual determinants of patenting

The NSCG data may be used to estimate the direct effect of immigration on patenting,

ignoring possible crowd–out or spill–over effects, using equation (1). Immigrants hold

24.2% of patents in the (weighted) data (α0 = 0.242), and in the 2000 census (the basis of

the NSCG sampling frame), college–graduate immigrants were 3.5% of the U.S. population

(α1 = 0.035). A one percentage point rise in the share of college immigrants in the

population therefore implies an increase in patents per capita of 0.061, or 6.1%. The same

exercise may be performed for natives, with the result that a one percentage point rise in

the share of college natives increases patents per capita by 3.5%. As immigrants with post–

college education have 2.0 (=0.112/0.057) times as many patents per capita as immigrants

with only a college degree (see Table 1), the direct impact of an extra percentage point

of post–college immigrants in the population is likely to be 2.0 times higher, or an extra

2.0×6.1 = 12.2%. Similarly, the contribution of an additional percentage point immigrant

scientists and engineers is likely to be 3.1× 6.1 = 18.9%.

To assess the reasons for the immigrant patenting advantage, we first observe that

in Table 1 immigrants’ patenting advantage over natives is much smaller in the scientist

and engineer sample (columns 5 and 6) than in the overall sample (columns 1 and 2).

This suggests that immigrants’ advantage is due in large part to a greater science and

engineering orientation. Table 3 lends further support to this. Column 1 shows that, for

the whole sample, 6.6% of those with a highest degree in physical science and 6.1% of those

with a highest degree in engineering had patented, far ahead of other fields. Column 2

shows a qualitatively similar picture for commercialized or licensed patents. Immigrants’

education is therefore well–suited to patenting, since columns 3 and 4 show that the share

of immigrants with physical science and engineering degrees is more than twice as high

as for natives.

In Table 4, we pursue this explanation using the probit of equation (2) for the prob-

ability of patenting. Column 1 shows that immigrants are 1.0 percentage points more

13

likely to have been granted a patent in the sample of college graduates (top panel), 2.3

percentage points more likely in the sample of post–college educated (second panel) and

1.3 percentage points more likely in the sample of scientists and engineers (third panel).

In the second column, we control for the field of study of the highest degree obtained

by the respondent by adding 30 dummies. For all three samples, the gap becomes small:

6–9% of the original size for college and post–college graduates, and 24% for scientists

and engineers. In the third column, we control for the highest degree obtained by the

respondent. For college graduates and scientists and engineers, the direction of the gap is

reversed: immigrant scientists and engineers are a statistically significant 0.95 percentage

points less likely to patent than natives. Controlling for age, age squared, sex and current

employment status in column 4 changes little. Skilled immigrants’ advantage is therefore

entirely due to the nature of their education, and not to any selection on unobservables

such as ability.18 In columns 5 and 6 we show that the same conclusions may be drawn

for the probability of commercializing or licensing a patent.

3.2 State determinants of patenting

In Table 5, we estimate the state determinants of log patenting per capita using differences

of varying lengths, with a college degree as the measure of skill and least squares estimation

of equation (3). The regression in column 1 is unweighted, while those in the other columns

are weighted. A ten–year difference is taken in columns 1 and 2, a thirty–year difference

in column 3 and a fifty year difference in column 4. The coefficients on the changes in the

immigrant college shares are positive and significant. A one percentage point increase in

the share of the population composed of immigrant college graduates is associated with

a 12–15% increase in patenting per capita. These effects are larger than the 6% impact

calculated based on the NSCG data, implying positive spill–over effects of immigrants.

The coefficients on the change in the share of native college graduates are smaller

than the coefficients for immigrants. The point estimate increases as the difference length

18It is possible that unobservable effects cancel out e.g. immigrants may have higher ability but lower

quality education.

14

increases, and for 50–year differences the coefficient is a significant 5.8 in column 4. The

immigrant/native ratio is 2.6, somewhat larger than the 1.9 ratio in the NSCG. The

coefficient suggests that skilled natives too have positive spill–overs, as the effect of a one

percentage point increase in their population share based on the NSCG data was 3.5%.

The absence of significance at short differences probably reflects the emphasis of short

differences on high–frequency events (Baker, Benjamin and Stanger 1999), since the share

of native college graduates changes only gradually.

Older populations appear to be more innovative, as indicated by the positive coeffi-

cients on the average age of the state. This may reflect the importance of management or

other skills complementary to innovation. As suggested by time series work in Griliches

(1990), Department of Defense procurement spending lowers patenting, presumably in

part because military invention is primarily protected by secrecy rather than patents. Fi-

nally, the importance of the 1940 conditions (and land area) increases with the difference

length, and the coefficients indicate that patent growth was lower for initially richer and

more densely populated states.

We reproduce key Table 5 results in Table 6 Panel A columns 1–2, to facilitate their

comparison with the equivalent coefficients for post-college education (Panel B) and sci-

entists and engineers (Panel C). The coefficients for immigrant post–college are 20.7 and

29.8 in columns 1 and 2, 1.6–2.0 times as high as for immigrant college graduates, com-

pared to a ratio of 2.0 the NSCG data. The coefficients indicate positive spill–overs, as

the effect of a one percentage point increase in the immigrant post–college share in the

individual data was 12%. The coefficients for the share of native post–college educated

are not statistically significant, though the point estimates are higher for the longer dif-

ferences, yielding an immigrant/native ratio of 3.5 at 50–year differences compared to 3.1

in the NSCG. For scientists and engineers in panel C columns 1–2, a one percentage point

increase raises patents by 52 log points. This is high compared with the direct NSCG

effect of about 19% and compared with the effect of natives at 50–year differences (25

log points), given that in the NSCG the immigrant patenting advantage over natives was

only 35% amongst scientists and engineers. We return to this discrepancy below.

15

In columns 3 and 4 we repeat the regressions of columns 1 and 2, using the unlogged

dependent variable and reporting coefficients multiplied by 100. For immigrants, the

50–year difference yields smaller coefficients than the ten–year difference. For immigrant

college graduates in Panel A, a one percentage point increase in their share is associated

with a 0.000037 (column 3) or 0.000027 (column 4) increase in patents per capita, which

represent respectively 16.1% and 11.7% increases compared to the mean, close to the

estimates in columns 1 and 2. The coefficients for post–college immigrants (Panel B) are

1.9–2.0 times the magnitude of the college immigrant coefficients, as we would expect,

but are statistically insignificant. The disparity between ten and 50–year differences is

large for the regressions using scientists and engineers (Panel C). The 50–year coefficient

is similar in magnitude to the results from the log specification: the coefficient of 1.35

(column 4) means that a one percentage point increase in the skilled immigrant share is

associated with a 58.7% increase compared to the mean. The skilled immigrant coefficients

in columns 3–4 are not very sensitive to the covariates included, while the results in

columns 1–2 are much smaller if the 1940 covariates (including land area) are not included.

We have repeated all the least squares regressions of Table 6 splitting the skilled

natives according to whether they lived in the state of their birth or not (these results

are not reported). For long differences, it is the coefficient on the change in the share of

skilled natives born in another state that is larger and statistically significantly positive.

For short and medium differences, which coefficient is more larger depends on the skill

group.

Before presenting instrumental variables results, we display in Figure 3 the correla-

tion between the change in the immigrant college share and its instrument (the predicted

change) for each decade, and plot the weighted regression line. If the instrument were

picking up shocks to patenting from around 1940, we would expect the instrument to grad-

ually weaken over time. Instead, the instrument is weak in the low immigration decades

of 1940–1950 and 1950–1960, strong thereafter. The unreported figures for immigrant

post–college and scientist and engineer shares indicate similar correlations, though they

are less significant for scientists and engineers.

16

We now proceed to present the results of instrumental variables estimation and other

specification checks, focusing on the log specification and ten–year differences, and re-

porting only the coefficient on the change in the skilled immigrant share. For conciseness

we present the full results only for the case of college proxying for skill, in Table 7. Row 1

shows that in the base specification, instrumental variables increases the coefficient to

30, more than twice the least squares coefficient of 13. The instrument is strong in the

first stage, as indicated by the value in brackets of 27 for the F–statistic for the excluded

instrument’s significance in the first stage (the first stage itself, along with all other first

stages for the table, is shown in Appendix Table 3). The higher magnitude of the instru-

mental variables estimate may indicate that in least squares, measurement error’s bias

towards zero is more important than upward bias due to the endogenous location choice

of immigrants, a possibility mooted by Card and DiNardo (2000) in a similar context.

It seems less likely that skilled immigrants whose behavior is affected by the instrument

(skilled immigrants whose location decision is affected by historical geographic or taste

considerations) are more inventive than other immigrants.

In rows 2 and 3 we check the robustness of the results to different samples. Without

California (row 2), the least squares estimate falls from 13.2 to 9.2 and the instrumental

variables estimate from 30 to 26. Without differences involving the year 2000, the point

estimates fall more. In row 4 we return to the full sample and add seven dummies for

BEA regions, which pick up region–specific trends in per capita patenting. This decreases

the point estimates compared to row 1. The region coefficients are jointly significant,

as are the coefficients for all sets of covariates we add in this table. In row 5, we use

state instead of region dummies, which mainly serves to increase the standard errors. In

row 6, we revert to using BEA region dummies, and add controls for the share of workers

in electrical engineering in 1980 interacted with year dummies, which decreases the least

squares estimate from 11.5 in row 4 to 9.6, but leaves the instrumental variables estimate

little changed at 23.

Until this point, the regressions have not controlled for the state’s 1940 shares of the

eighteen immigrant groups that enter the calculation of the instruments (λik). We add

17

these shares to the controls in row 7, which increases the coefficients slightly as well as the

standard errors, rendering the least squares estimate insignificant. As the point estimates

change little, we do not include the 1940 shares in the remaining rows.

Since the change in the native skilled share is endogenous, the instrumental variable

estimate of the effect of the change in the immigrant skilled share is only unbiased if the

two variables are independent. Since this is obviously not the case (the correlation in the

first stage is significantly negative), we experiment in row 8 with dropping the control

for the change in the share of college natives. Compared to row 6, the least squares

coefficient falls slightly, but the instrumental variables coefficient falls considerably, from

23.1 to 17.6.

In row 9 we propose a more parsimonious specification aimed at capturing post–1980

patenting changes, adding to the base specification only BEA region dummies interacted

with a dummy for post–1980 (i.e. for differences including 1990 or 2000). These results

are very close to those of row 8, although the addition of these covariates has weakened

the instrument considerably in the first stage, as evidenced by the fall in the F–statistic

in brackets. In row 10 we drop the change in the native college share from the covariates

of the row 9 specification, and the estimates fall to the lowest values in the table: 7.0 for

least squares and 12.3 for instrumental variables.

Our preferred specifications for least squares are rows 6 and 9 (which include the native

skilled share), while for instrumental variables they are the counterparts excluding the

native skilled share in rows 8 and 10. We present their results in Table 8 for all three

skill groups (repeating the college results). For immigrant college graduates (columns 1

and 2), a one percentage point increase in share increases patenting per capita by 8–10%

in least squares and 12–18% in instrumental variables, more than the 6% based on the

individual–level data (statistically significantly so in the case of the highest coefficient),

and therefore implying positive spill–overs. For post–college immigrants (columns 3 and

4), the upper specifications lead to insignificant coefficients similar in size to the college

coefficients: 11.3 and 18.9 for least squares and instrumental variables respectively. It

seems implausible that the effects would not be larger than for college immigrants (and this

18

specification is unusual in this regard), so we put more weight on the lower specifications

which yields coefficients of 15.9 and 27.0, about double the college coefficients, as would

be expected from the individual–level analysis. The individual–level effect calculated for

the immigrant post–college share was 12%, so overall the coefficients imply considerable

positive spill–overs.

We do not present instrumental variables results for immigrant scientists and engineers,

as the instrument is too weak in the first stage, but the least squares coefficients in

column 5 are 38.2 and 30.7 (significant only at the 10% level), compared to a 19% effect

calculated with the individual–level data. The scientist and engineer coefficients are 3.7–

4.0 times their college counterparts in column 1, which still seems slightly high compared

to the ratio of 3.1 in the individual–level data. For immigrant scientist and engineers in

the specifications of this table, contrary to the cases of immigrant college and post–college,

the coefficient on the share generally falls as the difference length increases. This means

that at longer differences, the immigrant and native shares of scientists and engineers

have similar coefficients, as would be expected based on the individual–level results. For

example, in the 50–year counterpart to the upper specification of Table 8, the coefficient

on the immigrant scientist and engineer share is 27.0 (standard error 16.2) while the native

coefficient is 22.9 (standard error 5.8). These values imply ratios of immigrant to native

and of immigrant scientists/engineers to immigrant college very similar to in the NSCG.

The instrument is also more powerful at this long difference (an F–statistic of 11 in the

first stage), and the instrumental variables coefficient is 61.4 (standard error 28.7). These

are our preferred coefficients for scientists and engineers.

In Table 9 we investigate further using the Harvard Business School patent data for

1971–2001. In row 1 we repeat the least squares base specification for log patents per

capita for this smaller sample, and obtain slightly smaller point estimates for immigrant

college (10.7), post–college (15.2) and scientists and engineers (45.7) compared to those

in Table 6. In row 2 we change the dependent variable to be the log of patent citations

per capita, and in row 3 we add region dummies to this specification. The results are

not very different from the results for patent counts, suggesting immigrants are not gen-

19

erating patents of lower quality than native patents. Unreported instrumental variables

coefficients are larger (21 for college graduates in the base specification) but insignificant.

In rows 4–9 we present the results for the six categories of patent. The standard errors

are high when the data are split in this way, and we elect simply to present the least squares

base specification rather than various specifications with insignificant coefficients. From

the reported results as well as the unreported results, the only firm conclusions are that

there is no effect for mechanical patents, and a negative effect for “other” patents. The

beneficial effects of immigration are therefore spread across computer and communications

patents, electrical and electronic patents, drug and medical patents, and chemical patents.

4 Conclusions

In this paper we have combined individual and aggregate data to demonstrate the im-

portant boost to innovation provided by skilled immigration to the United States in

1940–2000. A calculation for 1990–2000, when patenting per capita rose 63%, puts the

magnitudes of the effects in context. The 1.3 percentage point increase in the share of

the population composed of immigrant college graduates and the 0.7 percentage point

increase in the share of post–college immigrants both increased patenting per capita by

about 12% based on least squares19 and 21% based on instrumental variables. The 0.45

percentage point increase in immigrant scientists and engineers increased patenting per

capita by about 13% based on least squares20 and 32% based on instrumental variables.

These impacts include the positive spill–overs of skilled immigrants, which are a substan-

tial share of the total impact: calculations based on individual–level data of the impacts

without spill–overs suggest impacts of about 8–9% for all three skill groups.21

We find that a college graduate immigrant contributes at least twice as much to

patenting as his or her native counterpart. The difference is fully explained by the greater

19College: Table 8 column 1 average coefficient 9.0, 9.0 × 1.3 = 11.7 log points= 12%; Post–college:

Table 8 column 3 coefficient 15.9, 15.9× 0.7 = 11.1 log points = 12%.20Coefficient 27.0 from text, 27.0× 0.45 = 12.1 log points= 13%.216.1%× 1.3 = 7.9%; 12.2%× 0.7 = 8.5%; 18.9%× 0.45 = 8.5%

20

share of immigrants with science and engineering education, implying immigrants are

not innately more able than natives. Indeed, immigrants are less likely to have patented

recently than observably similar native scientists and engineers. Despite this, the fact that

immigrants increase patenting per capita shows that their presence in the United States

provides a previously uncharacterized benefit to natives, assuming the immigrants would

have been less innovative or less able to commercialize their innovation elsewhere or that

U.S. natives benefit more from innovation and commercialization in the United States

than abroad. We can make a crude calculation of the benefit using the results of Furman,

Porter and Stern (2002), who find that the elasticity of a country’s GDP with respect

to its patent stock is 0.113, controlling for capital and labor. This elasticity implies that

the influx of immigrant college graduates in the 1990s increased U.S. GDP per capita by

1.4–2.4%.

One should be cautious in drawing the conclusion that innovation could be sustained

by simultaneously subsidizing natives to study science and engineering and cutting im-

migration of scientists and engineers. The additional natives drawn into science and

engineering might have lower inventive ability than the excluded immigrants, and such

natives might have contributed more to the U.S. economy outside science and engineering.

While evidence in the paper of positive spill–overs from scientists and engineers appears to

support the dual policies of subsidizing native science and engineering study and increas-

ing immigration of scientists and engineers, it is possible that members of other skilled

professions provide equally large spill–overs that are simply more difficult to measure.

Finally, the paper leaves for future research the question of which immigration pathways

lead to the most successful immigrants: for example, admitting science and engineering

graduate students and facilitating their remaining in the United States after graduation

compared with admitting more scientists and engineers on temporary work visas.

21

References

Acs, Zoltan J., 2002. Innovation and the Growth of Cities, Northampton, M.A.: EdwardElgar.

Acs, Zoltan J., Henri L.F. de Groot and Peter Nijkamp eds. 2002. The Emergence of theKnowledge Economy, Berlin: Springer Verlag.

Aghion, Philippe and Peter Howitt. 1992. “A Model of Growth through Creative Destruc-tion”. Econometrica, 60 pp. 323–351.

Agrawal, Ajay, Devesh Kapur and John McHale. 2007. “Brain Drain or Brain Bank?The Impact of Skilled Emigration on Poor–Country Innovation”. University of Torontoworking paper.

Anderson, Stuart and Michaela Platzer. 2006. “American Made: The Impact of ImmigrantEntrepreneurs and Professionals on U.S. Competitiveness”. National Venture CapitalAssociation.

Baker, Michael, Dwayne Benjamin and Shuchita Stanger. 1999. “The Highs and Lows ofthe Minimum Wage Effect: A Time–Series Cross–Section Study of the Canadian Law”.Journal of Labor Economics, 17 (2) pp. 318–350.

Borjas, George J. 2006. “Do Foreign Students Crowd Out Native Students from Gradu-ate Programs?”. In Ronald G. Ehrenberg and Paula E. Stephan eds. Science and theUniversity, Madison: University of Wisconsin Press.

Bottazzi, Laura and Giovanni Peri. 2003. “Innovation and spillovers in regions: Evidencefrom European patent data”. European Economic Review, 47 pp. 687–710.

Card, David. 2007. “How Immigration Affects U.S. Cities”. CReAM Discussion Paper No.11/07.

Card, David. 2005. “Is the New Immigration Really So Bad?” Economic Journal, 115(507) pp. F300–323.

Card, David. 2001. “Immigrant Inflows, Native Outflows and the Local Labor MarketImpacts of Higher Immigration”. Journal of Labor Economics 19 (1) pp. 22–64.

Card, David and John DiNardo. 2000. “Do Immigrant Inflows Lead to Native Outflows?”American Economic Review Papers and Proceedings, 90 (2) pp. 360–367.

Chellaraj, G. Keith E. Maskus and A. Mattoo. 2008. “The Contribution of Skilled Immi-gration and International Graduate Students to U.S. Innovation”. Review of Interna-tional Economics, 16 (3) pp. 444–462.

Chiswick, Barry and Sarinda Taengnoi. 2007. “Occupational Choice of High Skilled Im-migrants in the United States”. International Migration, 45 (5) pp. 3–34.

22

Co, Catherine Y., John S. Landon–Lane and Myeong–Su Yung. 2006. “Inter–State Dy-namics of Invention Activities, 1930–2000”. Journal of Applied Econometrics, 21 pp.1111–1134.

Furman, Jeffrey L., Michael E. Porter and Scott Stern. 2002. “The determinants of na-tional innovative capacity”. Research Policy, 31 pp. 899–933.

Griliches, Zvi. 1990. “Patent Statistics as Economic Indicators: A Survey”. Journal ofEconomic Literature, 28 (4) pp. 1661–1707.

Grossman, Gene and Elhanan Helpman. 1991a. Innovation and Growth in the GlobalEconomy, Cambridge: MIT Press.

Grossman, Gene and Elhanan Helpman. 1991b. “Quality Ladders in the Theory ofGrowth”. Review of Economic Studies, 58 pp. 43–61.

Hall, Bronwyn H. 2005. “Exploring the Patent Explosion”. Journal of Technology Trans-fer, 30 (1/2) pp. 35–48.

Hall, Bronwyn H., Adam Jaffe and Manuel Trajtenberg. 2001. “The NBER Patent Ci-tation Data File: Lessons, Insights and Methodological Tools”. NBER Working Paper8498.

Hicks, Diana, Tony Breitzman, Dominic Livastro and Kimberly Hamilton. 2001. “Thechanging composition of innovative activity in the US – a portrait based on patentanalysis”. Research Policy, 30 pp. 681–703.

Jaffe, Adam B., Manuel Trajtenberg and Rebecca Henderson. 1993. “Geographic Local-ization of Knowledge Spillovers as Evidence by Patent Citations”. Quarterly Journal ofEconomics, 108 (3) pp. 577–598.

Jones, Charles I. 1995. “Time Series Tests of Endogenous Growth Models”. QuarterlyJournal of Economics, 110 (2) pp. 495–526.

Kerr, William R. 2007. “The Ethnic Composition of US Inventors”. Harvard BusinessSchool Working Paper No. 08-006.

Kerr, William R and William Lincoln. 2008. “The Supply Side of Innovation: H–1B VisaReforms and U.S. Ethnic Invention”. Harvard Business School working paper.

Khan, B. Zorina and Kenneth L. Sokoloff. 1993. “Schemes of Practical Utility: En-trepreneurship and Innovation Among Great Inventors in the United States, 1790–1865”. Journal of Economic History, 53 (2) pp. 289–307.

Marx, Matt, Deborah Strumsky and Lee Fleming. 2007. “Noncompetes and InventorMobility: Specialists, Stars, and the Michigan Experiment”. Harvard Business SchoolWorking Paper 07-042.

23

Morgan, Robert P., Carlos Kruytbosch and Nirmala Kannankutty. 2001. “Patenting andInvention Activity of U.S. Scientists and Engineers in the Academic Sector: Compar-isons with Industry”. Journal of Technology Transfer, 26 pp. 173–183.

Niebuhr, Annekatrin. 2006. “Migration and Innovation: Does Cultural Diversity Matterfor Regional R&D Activity?” IAB Discussion Paper No. 14/2006.

Paserman, Daniele M. 2008. “Do High–Skill Immigrants Raise Productivity? Evidencefrom Israeli Manufacturing Firms, 1990–1999”. IZA Discussion Paper 3572.

Peri, Giovanni. 2007. “Higher Education, Innovation and Growth”. In Giorgio Brunello,Pietro Garibaldi and Etienne Wasmer eds. Education and Training in Europe, Oxford:Oxford University Press.

Peri, Giovanni and Chad Sparber. 2008. “Highly Educated Immigrants and OccupationalChoice” U.C. Davis working paper.

Romer, Paul M. “Endogenous Technological Change”. Journal of Political Economy, 98pp. S71–103.

Stephan Paula E. and Sharon G. Levin. 2001. “Exceptional contributions to US scienceby the foreign–born and foreign–educated”. Population Research and Policy Review, 20pp. 59–79.

Stuart, Toby E. and Olav Sorenson. 2003. “Liquidity Events and the Geographic Dis-tribution of Entrepreneurial Activity”. Administrative Science Quarterly, 48 (2) pp.175–201.

Stuen, Eric T., Ahmed Mushfiq Mobarak and Keith E. Maskus. 2007. “Foreign PhD Stu-dents and Knowledge Creation at U.S. Universities: Evidence from Enrollment Fluctu-ations”. University of Colorado working paper.

U.S. Department of Commerce, Patent and Trademark Office. 1977. Technology Assess-ment and Forecast, Seventh Report, Washington, D.C.

Wadhwa, Vivek, AnnaLee Saxenian, Ben Rissing and Gary Gereffi. 2007. “America’s NewImmigrant Entrepreneurs”. Kauffman Foundation report.

Zucker, Lynne G. and Michael R. Darby. 2006. “Movement of Star Scientists and Engineersand High–Tech Firm Entry”. NBER Working Paper 12172.

Zucker, Lynne G., Michael R. Darby, Jonathan Furner, Robert C. Liu and Hongyan Ma.2006. “Minerva Unbound: Knowledge Stocks, Knowledge Flows and New KnowledgeProduction”. NBER Working Paper 12669.

24

Appendix A: Theory

A.1 Model

The occupations of skilled immigrants to the United States depend in part upon thechoices of potential immigrants, even if other factors are also at work. U.S. employersand universities, for example, influence the allocation of visas. Also, scientists and en-gineers might be common among immigrants because market conditions in the sendingcountries lead a larger share of foreigners than Americans to study science and engineer-ing. Nevertheless, it is likely that self–selection is in part responsible for the fact thatskilled immigrants of all visa types are more likely than skilled natives to have studiedscience and engineering (as shown by the National Survey of College Graduates).

We consider a world with two countries, the origin o and the potential destinationd, and three types of labor Lk: scientific labor Ls, professional labor Lp and unskilledlabor Lu. We assume that wages for each type of labor are higher in country d, so thatimmigration goes in one direction only: wdk > wok for all k. The migration cost is Md

with a distribution g(Md) on [MdL,M

dH ]. The cost may vary for an individual for many

reasons such as relatives in the destination country, number of children, language skills,adaptation capacity, etc.

Consider the decision of an origin worker to emigrate. If she chooses to stay in theorigin country, a worker of skill category k will receive a real net wage wok with certainty.Workers of all three skill categories can find an unskilled job with certainty if they moveto country d, but professional migrants can only find a professional job in country d withprobability P d

pp, and scientific migrants can only find a scientific job with probability P dss

(and scientific workers cannot work as professionals and vice–versa). Moreover, whilescientific knowledge is equivalent in the two countries, a professional migrant needs toadapt her skills at cost Cd

p > 0 in order to get a professional job in country d. Thus,P dpp = 0 unless the worker adapts her skills.

We assume workers are risk neutral, have perfect access to credit, care only aboutconsumption, and therefore maximize the expected present value of lifetime income. Theexpected wage of an worker in the origin country is

E[w|k] = P (emigrate to d)[E(wd|k)−Md|emigrate to d]+(1−P (emigrate to d))wok, (4)

where P (emigrate to d) is the probability of migration. Assuming that the worker prefersthe status quo in case of indifference, she will migrate if

E(wd|k)−Md > wok ⇐⇒ Γdk ≡ E(wd|k)− wok > Md. (5)

Thus, the worker will emigrate to country d with probability G(Γdk). The expected gainfrom emigration to country d for unskilled workers is

Γdu = wdu − wou, (6)

25

for professional workers is

Γdp =

E[wd|k=p]︷︸︸︷max{wdu, P d

ppwdp + (1− P d

pp)wdu − Cd

p}−wop= wdu + max{0, P d

pp(wdp − wdu)− Cd

p} − wop (7)

= wdu − wou + max{0, P dpp(w

dp − wdu)− Cd

p} − wop + wou

= Γdu +max{0, P dpp(w

dp − wdu)︸︷︷︸

Expected productive skill premium in country d

−Cdp}

︸︷︷︸Net expected productive skill premium in country d

− (wop − wou)︸︷︷︸Productive skill premium at home

,

and for scientific workers is

Γds =

E[wd|k=s]︷︸︸︷P dssw

ds + (1− P d

ss)wdu−wos

= wdu + P dss(w

ds − wdu)− wos (8)

= wdu − wou + P dss(w

ds − wdu)− wos + wou

= Γdu + P dss(w

ds − wdu)︸︷︷︸

Net expected innovative skill premium in country d

− (wos − wou)︸︷︷︸Innovative skill premium at home

.

We assume that the cost of acquiring professional and scientific skills is the same, andthat the expected benefits must therefore be the same. If this were not the case, workersin the origin country would all choose the more profitable skill category. This woulddecrease the marginal value of labor of this category, and thus its wage, until equality wasattained again. Therefore, we have

G(Γds)E(Γds −Md|Md < Γds) + wos = G(Γdp)E(Γdp −Md|Md < Γdp) + wop, (9)

which can be rearranged as

G(Γds)E(Γds −Md|Md < Γds)−G(Γdp)E(Γdp −Md|Md < Γdp) = wop − wos . (10)

Let ϕ(Γdk) ≡ G(Γdk)E(Γdk−Md|Md < Γdk). Noting22 that∂ϕ(Γd

k)

∂Γdk> 0 (strict inequality if

Γdk > IdL), we have that Γds −Γdp is of opposite sign from wos −wop. We can use this to showthat, under some conditions, we must have Γds > Γdp i.e. a greater return to migration forscientific than professional workers.

Proposition 1. If wdu < wos, then a necessary condition that Γds > Γdp is

Ppp(wdp −wdu)−Cd

p <1

G(Γdp)[(wos −wop)[1−G(Γdp)] + φ(Γdp)− φ(Γds) +G(Γds)Pss(w

ds −wdu)],

and a sufficient condition isCdp > Ppp(w

dp − wdu).

22For the proof, see below.

26

Proof. See below.

Focusing on the simpler sufficient condition, we see that for an origin country whosescientific workers earn more at home than unskilled workers in the destination, scientificworkers have a larger expected gain from migration than professional workers if the skilladaptation costs for professional migrants are larger than the expected professional skillpremium in the destination. Therefore, scientific workers are more likely to migrate thanprofessional workers. No correspondingly simple condition exists for the case of wdu > wos ,where unskilled workers in the destination earn more than scientific workers in the origincountry.23

A.2 Proof of Proposition 1

Proposition 1. If wdu < wos, then a necessary condition that Γds > Γdp is


p <1


ds −wdu)],


dp − wdu).

Proof. This is a proof by contradiction. First, replace (7) and (8) in (9). This gives

G(Γds)[Γdu + Pss(w

ds − wdu)− (wos − wou)− E(Md|Md < Γds)] + wos =

G(Γdp)[Γdu + max{0, Ppp(wdp − wdu)− Cd

p} − (wop − wou)− E(Md|Md < Γdp)] + wop.

Using (6) and rearranging, we get

G(Γds)[wdu + Pss(w

ds − wdu] + [1−G(Γds)]w

os − φ(Γds) =

G(Γdp)[wdu + max{0, Ppp(wdp − wdu)− Cd

p}] + [1−G(Γdp)]wop − φ(Γdp),

whereφ(Γdk) ≡ G(Γdk)E(Md|Md < Γdk).

Also, using the Fundamental Theorem of Calculus, we have that∂φ(Γd

k)

∂Γdk> 0.

Adding and subtracting G(Γdp)wos on the right side and rearranging gives

(wdu − wos)[G(Γds)−G(Γdp)] = (wop − wos)[1−G(Γdp)] + φ(Γds)− φ(Γdp)

+G(Γdp) max{0, Ppp(wdp − wdu)− Cdp} −G(Γds)Pss(w

ds − wdu).

23In principle, we can maximize discounted lifetime income of workers in the origin country, takingthe migration option into account and assuming a distribution of the cost of skilled education relative tounskilled education, and calculate the share of workers at each skill level, for migrants and non–migrants.In practice, the resulting non–linear equations cannot be solved analytically.

27

Now, suppose that wdu < wos and Γdp > Γds. We have that the left side of the equation ispositive, since G(�) is a density function, so it is increasing. Now, let us look at the rightside of the equation. We have:

(wop − wos)︸︷︷︸60, since Γd

p>Γds

[1−G(Γdp)]︸︷︷︸>0, since G(�) is a density︸︷︷︸60

+φ(Γds)− φ(Γdp)︸︷︷︸60, since Γd

p>Γds

+G(Γdp) max{0, Ppp(wdp − wdu)− Cdp} −G(Γds)Pss(w

ds − wdu)︸︷︷︸

<0, else, no one wants skilled job

.

Therefore, the right side is negative if

max{0, Ppp(wdp − wdu)− Cdp} <

1

G(Γdp)[(wos − wop)[1−G(Γdp)] + φ(Γdp)− φ(Γds) +G(Γds)Pss(w

ds − wdu)]︸︷︷︸

>0

.

Thus, a necessary condition to have the right side negative is


p <1


ds −wdu)],


dp − wdu).

Under these conditions, we get a contradiction, since the left side is strictly positive,but the right side is negative. As long as either of these conditions holds, we must haveΓdp < Γds if wdu < wos .

A.3 Proof that ∂ϕ(Γdk)/∂Γd

k ≥ 0

Proof. We will show that ∂ϕ(Γdk)/∂Γdk ≥ 0 with strict inequality if Γdk > MdL, where Md

L

is the lower bound on immigration cost. First, note that

ϕ(Γdk) = G(Γdk)E(Γdk −Md|Md < Γdk) =

∫ Γdk

MdL

(Γdk −Md)g(Md)dMd

= Γdk

∫ Γdk

MdL

g(Md)dMd

︸︷︷︸G(Γd

k)

−∫ Γd

k

MdL

Mdg(Md)dMd

︸︷︷︸φ

∂ϕ(Γdk)/∂Γk = G(Γdk) + Γkg(Γdk)− ∂φ/∂Γdk︸︷︷︸=Γkg(Γ

dk) using Fundamental Theorem of Calculus

= G(Γdk) ≥ 0, since G(.) is a density function

28

Appendix B: Algebra for Equation (1)

If immigrants’ share of the population grows by one percentage point,

MS + ∆MS

POP + ∆MS− MS

POP= 0.01,

which implies after some rearranging that

∆MS =0.01POP

0.99POP −MS.

Therefore, the percent increase in the population can be calculated as

∆MS

MS=POP

MS

0.01

0.99− MS

POP

=1

α1

0.01

0.99− α1

.

The percent increase in the population is

∆MS

POP=

0.01

0.99− MS

POP

=0.01

0.99− α1

. (11)

We calculate the percent increase in patents assuming that the additional immigrantscontinue to patent at rate PMS

MS :

∆PMS

P=

1

P

PMS

MS∆MS = α0

∆MS

MS. (12)

The expressions (11) and (12) can then be substituted into the expression for the percentincrease in patents per capita on the left hand side of equation (1) to obtain the expressionon the right hand side.

29

Appendix C: Data

C.1 National Survey of College Graduates

The data were collected between October 2003 and August 2004 by the U.S. Bureauof the Census, on behalf of the National Science Foundation. The data consist of astratified random sample of people reporting having a bachelor’s degree or higher on thelong form of the (April) 2000 census, who were under age 76 and living in the UnitedStates or its territories including Puerto Rico in the reference week of October 1, 2003.We drop respondents living outside the United States and define as immigrants thoseborn outside the United States. Missing information is imputed with a hot deck pro-cedure, and imputed values are not flagged. More information on the data is providedat www.nsf.gov/statistics/showsrvy.cfm?srvy CatID=3&srvy Seri=7#fn1. The data areavailable at www.nsf.gov/statistics/sestat/.

C.2 Patents

We combine two patent series from the U.S. Patent and Trademark Office. The firstseries was compiled for me by the USPTO based on their electronic records which beginin 1963. This series is utility patents by state and year of application. Year of applicationis preferred to year of grant as it is a more accurate match to the time of invention. Thesecond series (U.S. Department of Commerce 1977) is from paper–based USPTO recordsof patents by state and grant year 1883–1976 (application year is not available pre–1963).Grants lag applications by a median of three years between 1950 and 1963 (accordingto our US–wide calculations based on Lexis–Nexis), so we lead this series three years.Patents grants are also more volatile than patent applications (Hall 2005), so we smooththe series with a three year moving average. Finally, because for 1930–1960 plants anddesigns cannot be separated from utility patents, we leave them in for the whole series,calculate by state the average percent gap in the overlap years of the two series (18% onaverage), and reduce the old series by this percent. We then merge the series, using theadjusted paper series values only for pre–1963. The USPTO attributes a patent to a stateaccording to the home address of the first–listed inventor.

We have also used an extract from the Harvard Business School patent data file, whichcontains information on utility patents granted from 1975 to 2007, arranged by year ofapplication and patent class. We have aggregated the patent classes to six categoriesusing the classification in Hall, Jaffe and Trajtenberg (2001) and our own classificationof patent classes created since 1999. In particular, we attribute classes 506 and 977 tochemical patents; classes 398, 701–720, 725 and 726 to computers and communicationpatents; and classes 901 and 903 to mechanical patents. We have not been able to finddefinitions for some patent classes created in 2006 or later (which affects some patentsapplied for in earlier years), and a small number of patents have a missing patent class.For the application years we used, 0.04% of patents are not allocated to one of our sixcategories. To examine patents by category, we have simply attributed 1974 values (mostpatents granted in 1975 were applied for in 1974 or earlier) to 1971, then used 1971, 1981,

30

1991 and 2001 patent values, and 1970, 1980, 1990 and 2000 values for the dependentvariables. Some small states do not have patents in every category in every year, and inthe analysis of log patents these observations are missing.

The extract also contains the number of citations made to patents in each patentclass, state and application year. These may be viewed as a proxy for the quality of thepatent. We calculate citations per patent from 1974 onwards for each state. We then runa regression of this ratio on a trend for each state from 1974–1980, and use the resultingcoefficient to predict the 1971 value of citations per patent for each state. We then returnto our original, longer patent series obtained directly from the USPTO, and multiply thepatents by the ratio for 1971 onwards. We can then study citations, or quality–adjustedpatents, for 1971, 1981, 1991 and 2001.

C.3 Immigration, education, age, occupation, labor force status

We use extracts from the Integrated Public Use Microdata Series for the United StatesCensus, available at usa.ipums.org/usa/, and aggregate to the state level using the weightsprovided. Variables computed as shares (other than the excluded instruments) are com-puted as shares of the population or workers aged 18–65, and average population age isthe average age of people aged 18–65. Immigrants are people born outside the UnitedStates. We use the census–provided edurec variable to identify college graduates (16 yearsof education or more in the 1940–1980 censuses, and a college or higher degree in the 1990and 2000 censuses) and high–school dropouts (11 or fewer years of education). Peoplewith post–college education are people with 17 or more years of education in the 1940–1980 censuses, and a post–college degree in 1990 and 2000. This is the highest level ofeducation that can be distinguished for the whole 1940–2000 period. Alaska and Hawaiiare not in the 1940 and 1950 IPUMS and we drop them from the analysis. The SIC codeswe count as electrical engineering are 321, 322, 342, 350, 371, 372.

C.4 Other data

We use Bureau of Economic Analysis data for total state population (used to weight theregressions) and for state personal income per capita (available from 1929 onwards, unlikegross state product which is not available for our whole period). The data are availableat www.bea.gov/regional/spi/.

Department of Defense procurement contracts by state are available on paper for theearly years in Prime Contract Awards by State, Fiscal Years 1951–1978, published by theDepartment of Defense, OASD (Comptroller), Directorate for Information Operationsand Control. The later years are available online at www.fpds.gov. Some measurementerror in the attribution to states is involved, as recipient firms may subcontract the workto firms in other states. Also, in the electronic records for 1978–1983, 1986 and 1989 (ofwhich only 1980 is relevant for the paper), the California numbers seem to be too small bya factor of 1000, so we have multiplied them by 1000. (We have obtained scanned versionsof the paper documents for these years: the values for the non–problematic states and

31

years are only approximately the same as those online, but the problematic Californiayears are indeed about 1000 times higher than the online version.) We attribute the1951 value to 1950, and set changes in the values involving 1940, for which data are notavailable, to zero.

We obtain the land area of each state from the US. Census Bureau atwww.census.gov/population/censusdata/90den stco.txt.

32

Table 1: Patenting by immigrant status (1) (2) (3) (4) (5) (6) College graduates Post-college graduates Scientists and

engineers Immigrant Native Immigrant Native Immigrant Native Any patent granted 0.019 0.009 0.036 0.013 0.062 0.049 Number patents granted

0.057 0.028 0.112 0.036 0.178 0.132

Any patent commercialized

0.012 0.006 0.021 0.008 0.037 0.030

Number patents commercialized

0.029 0.017 0.054 0.019 0.084 0.074

Share immigrant 0.136 0.161 0.240 Observations 20,294 71,186 11,729 30,410 6724 15,502 Notes: Shares weighted with survey weights. Patents questions only asked of respondents who had ever worked. Whether a patent has been granted refers to period from October 1998 to the survey in 2003, and whether a patent has been commercialized or licensed refers to those patents granted in the same period. Source: 2003 National Survey of College Graduates.

Table 2: Means of aggregate variables (1) (2) (3) 1940-2000 1940 2000 Patents/population, x100 0.023

(0.015) 0.018

(0.013) 0.035

(0.020) Share of population 18-65 that is: Immigrant, college education and above 0.015 0.003 0.035 Native, college education and above 0.127 0.041 0.200 Immigrant, post-college education 0.007 0.001 0.016 Native, post-college education 0.050 0.011 0.077 Share of workers 18-65 that are: Immigrant, scientists and engineers 0.003 0.001 0.009 Native, scientists and engineers 0.022 0.006 0.035 Age of population 18-65 38.7

(1.0) 37.7 (1.0)

39.5 (0.6)

DoD prime military procurement contracts (millions of nominal $)

3236 (4386)

1500 (1679)

5528 (5809)

State personal income per capita (nominal $) 11976 (11098)

594 (204)

29,851 (4094)

Land area (millions of square kilometers) 0.190 (0.161)

0.166 (0.145)

0.207 (0.173)

Observations 343 49 49 Notes: Means of state-level variables for population 18-65, weighted by state population the year after the census. Standard deviations in parentheses. Patents and population are led by one year. Alaska and Hawaii are excluded. Patents are classified by year filed. The predicted increases in immigrant college share (instruments) are based on states’ shares of 1940 immigrant high school graduates from various countries and national growth in college graduates from those countries (see text). The 1940 value of DoD procurement spending is not available, and the 1950 value is given instead of 1940, and the 1950-2000 average instead of 1940-2000. Sources: Education, age, occupation, nativity: U.S. Census Bureau, IPUMS decennial census microdata usa.ipums.org/usa/ Patents: U.S. Patent and Trademark Office, electronic and paper data. State income, population: Bureau of Economic Analysis www.bea.gov/regional/spi/ Land Area: U.S. Census Bureau www.census.gov/population/censusdata/90den_stco.txt

Table 3: Patenting by field of study and field of study by immigrant status, college graduates (1) (2) (3) (4) Field of highest degree

Any patent granted

Any patent commercialized

Share immigrants

Share natives

Computer science, math 0.017 0.012 0.079 0.036 Biological, agricultural and environment sciences

0.023 0.011 0.055 0.040

Physical sciences 0.066 0.039 0.036 0.017 Social and related sciences 0.004 0.002 0.093 0.108 Engineering 0.061 0.042 0.137 0.053 Other S&E (mainly health) 0.007 0.004 0.166 0.121 Non-S&E 0.004 0.003 0.434 0.624 All fields 0.011 0.007 1.00 1.00 Notes: Shares weighted by survey weights. “S&E” means science and engineering. Full sample (i.e. college graduates), 91,480 observations. Whether a patent has been granted refers to period from October 1998 to the survey in 2003, and whether a patent has been commercialized or licensed refers to those patents granted in the same period. Source: 2003 National Survey of College Graduates.

Table 4: Effect of immigrant status on patenting (1) (2) (3) (4) (5) (6) Any patent granted? Any patent

commercialized? College graduates 0.0100**

(0.0010) 0.0009** (0.0005)

-0.0007* (0.0004)

-0.0005 (0.0003)

0.0062** (0.0008)

-0.0004 (0.0003)

Pseudo-R2 0.01 0.15 0.19 0.21 0.01 0.18 Post-college graduates

0.0226** (0.0018)

0.0014** (0.0008)

0.0004 (0.0006)

0.0005 (0.0006)

0.0135** (0.0014)

0.0002 (0.0004)

Pseudo-R2 0.02 0.21 0.24 0.26 0.02 0.21 Scientists and engineers

0.0131** (0.0039)

0.0031 (0.0031)

-0.0095** (0.0027)

-0.0074** (0.0026)

0.0063** (0.0030)

-0.0052** (0.0020)

Pseudo-R2 0.00 0.08 0.12 0.13 0.00 0.09 Major field of highest degree

-- Y Y Y -- Y

Highest degree -- -- Y Y -- Y Age, age2, sex, employed

-- -- -- Y -- --

Notes: Marginal effect on dummy for foreign-born from weighted probits. There are 91,480 observations in the college graduate sample, 42,139 in the post-college sample and 22,226 in the scientist and engineer sample. All scientists and engineers are employed in the reference week. Post-college degrees include master’s (including MBA), PhD and professional. There are 30 major field of study dummies (we combine the two S&E teacher training categories into one). Standard errors are in parentheses. ** indicates coefficients significant at the 5% level, * indicates coefficients significant at the 10% level.

Table 5: Effect of share of immigrant college graduates on log patents per capita (1) (2) (3) (4)

Difference: 10 year 10 year 30 year 50 year Δ Immigrant college as share of population

14.7** (5.3)

13.2** (4.3)

12.1** (3.2)

14.9** (4.3)

Δ Native college as share of population

4.1** (1.9)

1.9 (2.3)

4.8** (1.8)

5.8** (2.1)

Δ Age (average) 0.064** (0.028)

0.104** (0.035)

0.163** (0.041)

0.137** (0.066)

Δ DoD procurement (log) -0.033 (0.025)

-0.039** (0.017)

-0.081** (0.035)

-0.063 (0.041)

Land area (log) 0.046** (0.007)

0.063** (0.013)

0.169** (0.032)

0.281** (0.055)

Population 1940 (log) -0.056** (0.015)

-0.062** (0.015)

-0.161** (0.032)

-0.252** (0.060)

State personal income per capita 1940 (log)

-0.223** (0.037)

-0.162** (0.048)

-0.544** (0.103)

-1.006** (0.189)

Weighted No Yes Yes Yes R2 0.47 0.64 0.63 0.70 Observations 294 294 196 98 Notes: The dependent variable is the difference in log patents per capita across periods ranging from ten to 50 years, with a lead of one year compared to the independent variables. Weighted least squares regressions have weights 1/(1/popt+1+1/popt-k+1), where k the length of the difference. Regressions also include year dummies. Standard errors clustered by state are in parentheses. ** indicates coefficients significant at the 5% level, * indicates coefficients significant at the 10% level.

Table 6: Effect of skilled immigrant shares on patents per capita (1) (2) (3) (4) Δ Log patents per capita Δ Patents per capita

Difference: 10 year 50 year 10 year 50 year Panel A. Skilled: college graduates as share of population Δ Skilled immigrant 13.2**

(4.3) 14.9** (4.3)

0.368** (0.173)

0.271* (0.158)

Δ Skilled native 1.9 (2.3)

5.8** (2.1)

-0.010 (0.108)

0.134* (0.081)

R2 0.64 0.70 0.49 0.42 Panel B. Skilled: post-college educated as share of population Δ Skilled immigrant 20.7*

(10.7) 29.8** (10.9)

0.686 (0.501)

0.538 (0.403)

Δ Skilled native -1.3 (3.3)

8.4 (5.8)

-0.086 (0.138)

0.127 (0.224)

R2 0.63 0.66 0.48 0.38 Panel C. Skilled: scientists and engineers as share of workers Δ Skilled immigrant 52.4**

(20.7) 52.6** (16.1)

2.318** (0.990)

1.354** (0.661)

Δ Skilled native 9.4* (5.4)

24.7** (8.0)

0.212 (0.239)

0.665** (0.278)

R2 0.67 0.74 0.56 0.52 Observations 294 98 294 98 Notes: The dependent variable is the difference in (log) patents per capita across periods ranging from ten to 50 years, with a lead of one year compared to the independent variables. Weighted least squares with weights 1/(1/popt+1+1/popt-k+1), where k is equal to the difference length. All regressions include the covariates of Table 5. Standard errors clustered by state are in parentheses. Coefficients in columns 3-4 are multiplied by 100. ** indicates coefficients significant at the 5% level, * indicates coefficients significant at the 10% level.

Table 7: Effect of college immigrant shares on log patents per capita, ten-year differences, further specifications

(1) (2) WLS IV 1. Base specifications

13.2** (4.3)

30.3** (7.2) [27]

2. Base specifications without California (288 observations) 9.2** (4.3)

26.3** (7.1) [24]

3. Base specifications without year 2000 (245 observations) 8.8** (3.5)

18.9** (7.1) [51]

4. Include BEA region dummies

11.5** (4.5)

23.4** (5.4) [30]

5. Include state dummies 11.6** (5.6)

24.5** (6.3) [31]

6. Include BEA region dummies and % electrical workers 1980*year dummies

9.6** (4.7)

23.1** (7.0) [28]

7. Include BEA region dummies and % electrical workers1980*year dummies and 1940 immigrant shares (λ)

10.1* (5.8)

24.5** (7.8) [39]

8. Include BEA region dummies and % electrical workers1980*year dummies; exclude share college natives

8.6* (4.3)

17.6** (5.6) [27]

9. Include BEA region dummies*post-1980 8.4** (3.4)

18.6** (7.6) [9]

10. Include BEA region dummies*post-1980; exclude share college natives

7.0** (3.4)

12.3** (6.1) [10]

Notes: Each coefficient reported is the effect of a change in college immigrant share of the population from a different regression. The dependent variable is the difference in log patents across ten years, with a lead of one year compared to the independent variables. 294 observations unless otherwise noted. Weighted least squares (column 1) or instrumental variables (column 2) with weights 1/(1/popt+1+1/popt). The instruments are the predicted increase in college immigrant shares, based on states’ shares of 1940 immigrants from various countries and subsequent national growth in college graduates from those countries (see text). F-statistic for test of joint significance of excluded instrument in the first stage in brackets. All regressions also include the covariates of Table 5 including the appropriate differenced share of skilled natives. Standard errors clustered by state are in parentheses. ** indicates coefficients significant at the 5% level, * indicates coefficients significant at the 10% level.

Table 8: Effect of skilled immigrant shares on log patents per capita, ten-year differences, preferred specifications

(1) (2) (3) (4) (5)

Skill represented by: College graduates Post-college Scientists/ engineers

WLS IV WLS IV WLS 1. Include BEA region dummies, % electrical workers1980*year dummies; IV excludes share college natives

9.6** (4.7)

17.6** (5.6) [27]

11.3 (8.3)

18.9 (14.3) [18]

38.2* (20.3)

2. Include BEA region dummies*post-1980; IV excludes share college natives

8.4** (3.4)

12.3** (6.1) [10]

15.9** (6.2)

27.0** (13.4) [11]

30.7* (16.3)

Notes: Each coefficient reported is the effect of a change in skilled immigrant share from a different regression. The dependent variable is the difference in log patents across ten years, with a lead of one year compared to the independent variables. Weighted least squares (odd columns) or instrumental variables (even columns) with weights 1/(1/popt+1+1/popt). The instruments are the predicted increase in skilled immigrant shares, based on states’ shares of 1940 immigrants from various countries and subsequent national growth in college graduates from those countries (see text). F-statistic for test of joint significance of excluded instrument in the first stage in brackets. All regressions also include the covariates of Table 5 including the appropriate differenced share of skilled natives. Standard errors clustered by state are in parentheses. 294 observations. ** indicates coefficients significant at the 5% level, * indicates coefficients significant at the 10% level.

Table 9: Effect of skilled immigrant shares on log patent citations per capita and log patents per capita by type, ten-year differences, 1970-2000

(1) (2) (3)

Skill represented by: College graduates Post-college Scientists/engineers 1. Patents 10.7**

(4.6) 15.2

(11.6) 45.7** (22.0)

2. Patent citations

9.9** (4.8)

18.5 (12.4)

54.4** (24.2)

3. Patent citations, include BEA region dummies

11.3** (5.6)

24.2* (12.2)

56.2* (29.6)

4. Computer and communications patents (137 observations)

10.2 (7.4)

1.7 (19.5)

35.0 (23.5)

5. Electrical and electronic patents (145 observations)

0.1 (6.1)

-2.2 (15.4)

6.9 (23.9)

6. Drug and medical patents (146 observations)

12.5* (6.7)

9.2 (14.8)

50.6 (30.7)

7. Chemical patents (145 observations)

7.5 (6.4)

8.7 (14.9)

10.3 (26.5)

8. Mechanical patents -3.0 (3.9)

-15.7 (9.0)

3.8 (16.1)

9. Other patents -7.7** (2.6)

-19.2** (4.9)

-15.5 (9.7)

Notes: Each coefficient reported is the effect of a ten-year change in skilled immigrant share from a different regression. The dependent variable is the difference in log patents per capita or log of patent citations per capita across ten years, with a lead of one year compared to the independent variables. Weighted least squares for 1970-2000 with weights 1/(1/popt+1+1/popt). All regressions include the covariates included in Table 5. Standard errors clustered by state are in parentheses. 147 observations, unless otherwise specified. ** indicates coefficients significant at the 5% level, * indicates coefficients significant at the 10% level.

Appendix Table 1: States’ 1940 shares of national-level immigrants from various origins (1) (2) (3) (4) State’s 1940 share of national total, all immigrants College grad immigrants Origin Mean Maximum State of maximum National total 2000 -

national total 1940 United Kingdom 0.050 0.19 New York 192,750 Ireland 0.059 0.32 New York 22,667 Italy 0.062 0.35 New York 45,368 Germany 0.052 0.23 New York 216,513 Poland 0.060 0.28 New York 66,284 Russia 0.064 0.37 New York 236,052 Other Europe 0.050 0.19 New York 348,471 Canada 0.038 0.21 Massachusetts 208,124 Mexico 0.043 0.39 California 263,006 Puerto Rico 0.096 0.90 New York 121,761 Cuba 0.047 0.41 Florida 125,089 Other Caribbean 0.073 0.61 New York 234,065 Central America 0.052 0.26 California 130,725 South America 0.069 0.47 New York 368,186 China 0.058 0.39 California 617,482 India 0.047 0.26 California 773,840 Other Asia 0.045 0.62 California 1,426,943 Rest of world 0.054 0.26 New York 549,302 Notes: State-level variables, means weighted by state population. Standard deviations in parentheses. 49 observations. Alaska and Hawaii are excluded. The minimum share is zero for all origins except for the United Kingdom, Germany, other Europe and Canada. Columns 1-3 are based on the full population, while column 4 is based on the population 18-65.

Appendix Table 2: Means of individual-level variables College graduates Post-college Scientists/engineers Immigrant Native Immigrant Native Immigrant Native Highest degree: Bachelor’s 0.58 0.65 -- -- 0.44 0.68 Master’s 0.28 0.26 0.66 0.74 0.39 0.26 Doctorate 0.07 0.03 0.17 0.08 0.16 0.06 Professional 0.07 0.06 0.17 0.17 0.01 0.01 Field of highest degree Computer science, mathematics

0.076 0.036 0.091 0.027 0.219 0.168

Biological, agricultural, environment science

0.056 0.040 0.061 0.030 0.092 0.093

Physical science 0.035 0.017 0.044 0.017 0.077 0.072 Social science 0.091 0.108 0.069 0.078 0.026 0.046 Engineering 0.132 0.053 0.131 0.037 0.397 0.321 Other S&E 0.164 0.121 0.199 0.157 0.069 0.058 Non-S&E 0.446 0.624 0.406 0.653 0.120 0.243 Sex (female) 0.48 0.50 0.43 0.49 0.24 0.23 Age 43.4

(9.9) 44.7

(10.3) 43.8 (9.9)

46.6 (10.3)

40.4 (9.0)

42.4 (9.5)

Employed 0.86 0.85 0.89 0.87 1.00 1.00 Observations 21,248 71,304 12,042 30,460 6840 15,519 Notes: Means weighted with survey weights. S&E means science and engineering. “Other S&E” includes the social sciences. Native movers are natives living in a different census region from their census region of birth. Source: National Survey of College Graduates

Appendix Table 3: First stage of instrumental variables regressions for ten-year differences

(1) (2) Skill represented by: College graduates Post-college

1. Base specifications

0.31** (0.06)

0.29** (0.06)

R2 [R2 without excluded instrument] 0.78 [0.69] 0.75 [0.67] 2. Sample without California (288 observations)

0.25** (0.05)

0.24** (0.06)

R2 [R2 without excluded instrument] 0.70 [0.63] 0.69 [0.63] 3. Sample without year 2000 (245 observations)

0.45** (0.06)

0.40** (0.06)

R2 [R2 without excluded instrument] 0.79 [0.59] 0.69 [0.55] 4. Include BEA region dummies 0.29**

(0.05) 0.27** (0.06)

R2 [R2 without excluded instrument] 0.83 [0.76] 0.80 [0.74] 5. Include state dummies 0.34**

(0.06) 0.33** (0.07)

R2 [R2 without excluded instrument] 0.89 [0.82] 0.86 [0.79] 6. Include BEA region dummies and % electrical workers 1980*year dummies

0.25** (0.05)

0.25** (0.06)

R2 [R2 without excluded instrument] 0.83 [0.79] 0.80 [0.76] 7. Include BEA region dummies and % electrical workers 1980*year dummies and 1940 immigrant shares

0.29** (0.05)

0.30** (0.06)

R2 [R2 without excluded instrument] 0.89 [0.85] 0.85 [0.81] 8. Include BEA region dummies and % electrical workers 1980*year dummies; exclude native skilled share

0.27** (0.05)

0.29** (0.06)

R2 [R2 without excluded instrument] 0.83 [0.78] 0.84 [0.80] 9. Include BEA region dummies*post-1980 0.25**

(0.09) 0.26** (0.08)

R2 [R2 without excluded instrument] 0.81 [0.77] 0.78 [0.73] 10. Include BEA region dummies*post-1980; exclude native skilled share

0.27** (0.09)

0.26** (0.08)

R2 [R2 without excluded instrument] 0.81 [0.76] 0.77 [0.73] Notes: The dependent variable is the ten-year difference in the share of the population that is a skilled immigrant, with skill proxied by college in column 1, post-college in column 2. The coefficient reported is that on the excluded instrument, which is the predicted increase in immigrant college shares, based on states’ shares of 1940 immigrant graduates from various countries and subsequent national growth in college graduates from those countries (see text). Weighted least squares with weights 1/(1/popt+1+1/popt). All regressions also include the covariates of Table 5 (except the difference in the skilled immigrant share). Standard errors clustered by state are in parentheses. 294 observations unless otherwise specified. ** indicates coefficients significant at the 5% level, * indicates coefficients significant at the 10% level.

Patents

Patents per 100,000

0

10

20

30

40

50

60

70

8085

Pate

nts

per 1

00,0

00 re

siden

ts

0

20000

40000

60000

80000

100000Nu

mbe

r of p

aten

ts

1941 1951 1961 1971 1981 1991 2001Year (Application Date)

Source: USPTO, BEA and authors’ calculations

Figure 1: U.S. Origin U.S. Patents 1941−2001

Patents, all states

Patents, without California

Patents per capita

.5

.75

1

1.25

1.5

1.75St

anda

rd d

evia

tion

of lo

g pa

tent

s

1930 1940 1950 1960 1970 1980 1990 2000Year (Application Date)

Source: USPTO, BEA and author’s calculations

Figure 2: Convergence in Patenting Across States 1929−2001

AL

AZ

AR CA

COCT

DEDC

FLGA

ID

IL

IN IAKSKY

LA

ME

MDMAMI

MNMS

MOMTNENV

NHNJNM

NYNC

ND

OHOK OR

PA

RI

SCSDTN

TXUT

VT

VA

WA

WVWI

WY

−.005

0

.005

.01

0 .001 .002 .003 .004

1940−50: slope 0.24 (0.19)

AL

AZ

AR CACO

CT

DE

DC

FLGA

ID

ILINIAKS

KY

LA

ME

MDMAMI

MN

MSMOMT

NENV

NH

NJNMNYNCND

OHOKOR

PA

RI

SCSDTNTXUT

VT

VA WAWV

WIWY

−.01

−.005

0

.005

−.0015 −.0005 0 .0005 .0015

1950−60: slope 0.22 (0.42)

AL AZAR

CA

COCT

DE

DC

FL

GA ID

IL

IN

IAKSKY LA ME

MD

MAMIMN

MSMO

MTNE

NV NHNJ

NM

NY

NC ND

OHOK

ORPA

RI

SCSD

TNTX UT

VTVA

WAWVWI

WY0

.002

.004

.006

.008

0 .002 .004 .006 .008 .01

1960−70: slope 0.37 (0.05)

ALAZ

AR

CA

CO

CT

DE

DC

FL

GA

ID

IL

INIA

KSKY

LAME

MD

MA

MIMNMSMOMTNE

NVNH

NJ

NM

NY

NC NDOHOK ORPA

RISC

SDTN

TX

UTVTVA

WA

WVWI WY

−.005

0

.005

.01

.015

0 .01 .02 .03

1970−80: slope 0.50 (0.05)

AL

AZ

AR

CA

CO

CT

DE

DC

FL

GA

ID

IL

INIAKSKY

LAME

MD MA

MIMNMSMO MTNE

NVNH

NJ

NM

NY

NC

NDOHOK

ORPA

RI

SCSD

TN

TX

UTVT

VA

WA

WVWI WY0

.005

.01

.015

.02

0 .01 .02 .03 .04

1980−90: slope 0.46 (0.05)

AL

AZ

AR

CA

COCT

DE

DC

FL

GA

ID

IL

IN IAKSKY LAME

MDMA

MIMN

MS

MOMT

NE

NV

NH

NJ

NM

NY

NC

NDOHOK

OR PARI

SCSD

TN

TX

UTVT

VA WA

WVWI

WY0

.01

.02

.03

0 .02 .04 .06

1990−2000: slope 0.33 (0.05)

Actu

al c

hang

e in

imm

igra

nt c

olle

ge s

hare

Predicted change in immigrant college share of population (instrument)

Fig 3: Actual, Predicted Change in Immigrant College Share

Date post:	28-May-2020
Category:	Documents
Upload:	others
View:	5 times
Download:	0 times

How Much Does Immigration Boost Innovation?ftp.iza.org/dp3921.pdf · and the individual...

Documents