Internal Migration and Firm Growth: Evidence
from China.∗
Clement Imbert Marlon Seror Yifan Zhang
Yanos Zylberberg
Preliminary and incomplete – do not circulate
Abstract
This paper provides some of the first empirical evidence on the role of
internal migration in manufacturing growth, using Chinese data. We first
identify shocks to rural livelihoods caused by variation in international agri-
cultural prices and local climatic conditions. We then combine these shocks
with a gravity model to predict yearly migrant inflow into each urban center.
Finally, we use household survey data and a census of large firms to estimate
the causal impact of migrant inflows on the urban economy. Preliminary re-
sults suggest that by increasing labor supply, migration lowers labor costs and
increases the profitability of manufacturing firms.
JEL codes: D24; J23; J61; O15.
1 Introduction
As countries develop, labour shifts from the traditional—agriculture—to the modern
sector—manufacturing,—which implies migration from rural to urban areas (Lewis,
1954; Kuznets, 1964; Harris and Todaro, 1970).1 Despite the fact that the movement
∗Imbert: Warwick University, [email protected]; Seror: PSE, [email protected]; Zhang:CUHK, [email protected]; Zylberberg: Bristol University, [email protected] are grateful to Gharad Bryan, Jon Temple, Christine Valente, Thomas Vendryes, ChrisWoodruff for useful discussions and comments. We also thank participants in Bristol, CUHKand Warwick for helpful comments. The usual disclaimer applies.
1In a recent study of the determinants of structural change in today’s developed economies,Alvarez-Cuadrado and Poschke (2011) propose an empirical exercise to distinguish “push” from“pull” models. In “labour push” models, rising productivity in agriculture releases labour, which in
1
of labour is central in structural transformation, there is little empirical evidence on
its short- and medium-run impact on the urban manufacturing sector.
The objective of this paper is to estimate the causal impact of migration inflows
on urban labor markets and manufacturing firms in China. We use variation in
rainfall and world prices for agricultural commodities, combined with information on
cropping patterns and potential yields to construct exogenous shocks to agricultural
labour returns in each rural prefecture.2 We then combine these shocks with a
gravity model, which includes distance between rural origin and urban destination
and population at destination to predict migration inflows into each urban area.
Finally, we use these origin-based fluctuations to instrument immigrant inflows and
estimate their effect on the urban economy through the observation of workers and
firms.
China arguably offers the best context to study the role of migration in economic
development. The Chinese economy has experienced a remarkably rapid structural
transformation, with a sharp fall in the share of agriculture and a symmetric rise
in manufacturing and services for the last three decades. China’s agricultural em-
ployment share was about 70% in 1980 and is predicted to taper off at 24% in 2020
(ADB, 2014).3 At the same time, China has seen massive migration flows from rural
to urban areas. The stock of rural-to-urban migrants, i.e. the urban population with
a rural household registration or residence permit (hukou), rose from 46.5 million in
1982 to 205.6 million or 30.9% of the total urban population in 2010 (Chan, 2012).4
This rapid evolution allows us to study migration and manufacturing growth with
coherent data sources spanning a significant part of the structural transformation
period.
Our empirical strategy proceeds in three steps. In a first step, we construct shocks
to agricultural incomes. For this we collect geocoded grids (1km×1km) provided by
the Food and Agricultural Organisation (FAO). We multiply the 1990 harvested
area with a model-based measure of potential yield combining crop requirements
and soil characteristics to create a measure of expected output for each crop and
turn triggers industrialization (Gollin et al., 2002). In “labour pull” models, technological changeincreases productivity in manufacturing, which attracts workers out of agriculture (Herrendorf etal., 2013). In both narratives, migration—and thus structural change—is prompted by a produc-tivity gap between sectors.
2Prefectures are the second administrative division in China below the province (there wereabout 345 prefectures in 2005).
3In comparison, Alvarez-Cuadrado and Poschke (2011) find that it took 108 years on averagefor agricultural employment share to decline from 60% to 20% of the labour force in 12 of today’sdeveloped economies.
4This figure suggest that internal migration in China alone is of the same order of magnitude asinternational migration worldwide. In 2010 the stock of international migrants was an estimated222 million (United Nations, 2015).
2
prefecture. We then combine the expected output with two crop-specific shocks.
First, we isolate short-term fluctuations in international crop prices and transform
these price variations into variations in expected agricultural income (for a fixed
agricultural portfolio). Second, we interact the crop water requirement during the
growing season with monthly precipitation to create a yearly distance to ideal water
requirement in each prefecture. These origin shocks exhibit a large time-varying
volatility coming from the World demand and supply or rainfall cycles but also
large cross-sectional differences due to the wide variety of harvested crops across
China.
In a second step, we combine rural income shocks with a gravity model which uses
distance between rural origins and urban destinations and population at destination
to predict migration inflows into urban areas. Fluctuations in agricultural income
due to international prices and rainfall generate significant variations in outflows
from rural areas. An origin-specific agricultural portfolio 10% above its long-term
value (about 1 standard deviation) is associated with a 0.25 p.p. lower outmigration
incidence. Similarly, a 1 standard deviation increase in our measure of distance to
ideal water requirement is associated with a 0.18 p.p. lower outmigration incidence.
Both effects are very robust and generate economically significant variations in mi-
gration outflows (the average outmigration incidence is around 1.4 p.p.). We next
use a gravity model based on geographic distance and historical data on destina-
tion populations to transform these rural outflows into immigration inflows to urban
destinations. Our approach is similar in that respect to Boustan et al. (2010).
In a third step, we identify the causal impact of migrant inflows on the urban
economy. We first use an annual survey of urban households (Urban Household
Survey) and estimate the effect of migration on wages and employment for urban
“natives”. We find that migration inflows exert a downward pressure on urban wages
and crowd urban residents out of wage employment. The implied wage elasticity
with respect to migration is 0.15 to 0.28. As expected, the effects are stronger for
less educated workers, who are close substitutes for migrant labour. We next use
a yearly census of large firms from the National Bureau of Statistics (NBS) and
estimate the effect of migration on labour costs and profitability. We show that
migration inflows markedly reduce the wage rate (wage bill divided by employment)
and increase profitability (value added minus labour costs divided by revenues) for
urban firms. The effects are stronger for firms employing mostly unskilled labour.
Our (preliminary) findings contribute to different strands of the literature. First,
this paper contributes to the literature on structural transformation by estimating
the direct impact of rural-to-urban migration on the modern sector, using worker and
3
firm data. Our findings that migration decreases wages and increases profitability in
urban areas relate to “labour push models,” which generally imply that, by releasing
labour, labour-saving rising agricultural productivity may trigger industrialization
(Gollin et al., 2002; Alvarez-Cuadrado and Poschke, 2011; Bustos et al., 2015). Our
results may also complement Marden (2015), who finds that for an earlier period
of Chinese development (the 1990s), the increase in farm profits due to agricultural
reforms provided credit to finance non-agricultural sector growth.
Second, this paper relates to the nascent literature which uses firm-level data to
study how migrants’ labour supply is absorbed by the economy (Peri, 2012; Kerr et
al., 2015; Dustmann and Glitz, 2015).5 The context of our study is however very
different. Urban China has experienced massive flows of internal migrants and its
economy has been expanding at a very high rate, with a constant reallocation of
resources toward small, young and productive firms (Song et al., 2011).
Third, this paper relates to the literature on the effects of immigration on labour
markets (Borjas, 2003), and more specifically to studies that focus on internal mi-
gration. Boustan et al. (2010) study the labour market effects of changes in internal
migration in the US during the Great Depression. El Badaoui et al. (2014), Imbert
and Papp (2014), Kleemans and Magruder (2014) and Feng et al. (2015a) among
others study the labour market effects of migration in Thailand, India, Indonesia
and the United States, respectively.
Fourth, this paper contributes to the literature on the role of migration in shap-
ing economic development in China. Ge and Yang (2014) use wage decomposition
methods and a simple calibration to show that migration depressed unskilled wages
in urban areas by at least 20% throughout the 1990s and 2000s. Based on aggregated
data at the provincial level, De Sousa and Poncet (2011) find that migration helped
alleviate upward pressures on Chinese wages in 1995-2007. In contrast, Meng and
Zhang (2010) provide evidence of a modestly positive or zero effect of rural migrants
on native urban workers’ labour market outcomes, and Combes et al. (2015) put
forward a strong positive externality on local wages. Mayneris et al. (2014) consider
another type of shock to the labour market, an increase in legislated minimum wages.
As we do with migration, they assess the impact of the shock on firm outcomes in
China. To the best of our knowledge, our paper is the first microeconomic paper
to investigate and provide evidence of the effect of migration on firm outcomes, and
thus contributes to linking to empirics “push” models of rural outmigration fuelling
modern sector growth. It complements Facchini et al. (2015), who show that trade
5Giesing and Laurentsyeva (2015), provide evidence on the effect of emigration on firm outcomesin Eastern European countries.
4
shocks increase demand for labor in manufacturing and stimulate internal migration
(which is consistent to a “pull” model).6
Much attention has been given to the mechanisms and patterns of the Chinese
growth, and a large body of literature is devoted to migration in China. However,
while the role of rural-to-urban migration in fuelling economic growth finds a large
echo in the policy debate in China,7 the economic literature has given it much less
attention. For example, Song et al. (2011) focus on three main features of the
Chinese economic take-off—high output growth, with high and sustained returns
to capital, reallocation within the manufacturing sector from large state-owned en-
terprises (SOEs) to smaller private firms, and large savings invested abroad. Their
explanation relies on credit market imperfections, which force small productive firms
to save before growing at the expense of larger, less productive firms. Interestingly,
migration from rural areas may also help explain these stylized facts. Indeed, the
constant increase in labour supply of migrants, by moderating urban wage growth,
may have allowed firms to sustain high profits, accumulate internal savings and
finance profitable investments despite credit constraints.
The remainder of the paper is organized as follows. In Section 2, we describe
our three main data sources allowing us to create migration flows, labour market
outcomes and firm-specific outcomes. We also detail how we isolate exogenous vari-
ations at origin that impact migration flows. In Section 3, we describe our empirical
strategy, in particular how we generate synthetic migration flows thanks to our
agricultural productivity shocks and estimates of migration flows on urban labour
markets and firm outcomes at destination. We present our main results in section 4.
Section 5 concludes.
2 Data
This section presents the data we use and how we construct the main variable of
our analysis.8. We first present our two main sources of exogenous variation in
agricultural returns to labour, i.e. the price and yield shocks. We next present our
measures of migration flows and urban outcomes.
6Macroeconomic discussions of the link between migration and productivity can be found inAu and Henderson (2006), Au and Henderson (2007) and Tombe and Zhu (2015). These papersall focus on mobility restrictions, as do Bosker et al. (2012), an economic geography analysis, andVendryes (2011), a theoretical paper.
7See Meng and Zhang (2010) for a survey.8As we rely on many data sources, we describe them briefly below and provide a more detailed
discussion in the appendix.
5
2.1 Rural income shocks
In order to construct shocks to productivity of labor in agriculture, we combines
three types of information: potential agricultural output, international prices and
rainfall.
Potential Agricultural Output We construct the potential output for each crop
in each prefecture, by combining a measure of harvested area, and a measure of yield
both provided by the Food and Agriculture Organization (FAO).
First, we extract from the 1990 World Census of Agriculture the geo-coded map
of harvested area for each crop (in a 30 arc-second resolution, approximately 1km).
We then overlay this map with a map of prefectures, and we construct total harvested
area hc,o for a given crop c and a given prefecture o.9
Second, we use a measure of potential yield per hectare as computed in the Global
Agro-Ecological Zones (GAEZ) Agricultural Suitability and Potential Yields dataset.
The measure is model-based and uses information on crop requirements (e.g. the
length of yield formation period and the stage-specific crop water requirements), soil
characteristics (i.e. the ability of the soil to retain and supply nutrients) in order
to generate a potential yield for a given crop, and a given soil under 5 scenarios:
rain-fed (high/intermediate/low water input), and irrigated crop (high/intermediate
water input). For each crop c and prefecture o, we use information on whether it
was rain-fed or irrigated in 1990 to construct potential yield yic,o.10
The interaction between harvest area and potential yield hc,oyic,o is our mea-
sure of potential agricultural output for each crop in each prefecture in 1990. Fig-
ure 3 displays potential output hc,oyic,o for rice and cotton, and illustrates the large
geographic variation in agricultural portfolios. By construction, hc,oyic,o is time-
invariant. We next combine potential output at the prefecture level with two time-
varying shocks, international prices and rainfall shocks.
International price shock As a measure of exogenous changes in international
demand for crops, we use the World Bank Commodities Price Data (“The Pink
Sheet”).11. We consider prices in constant 2010 USD and per kg between 1980
and 2009 for the following commodities: banana, cassava, coffee, cotton, an index
9We collapse our analysis at the prefecture level to match migration data but agricultural shockscan be constructed at a 30 arc-second resolution over the whole country.
10The measure is given as a 30 arc-second resolution geo-coded map which we overlay withprefecture maps to generate the prefecture average.
11The data is freely available online at http://data.worldbank.org/data-catalog/commodity-price-data
6
of foddercrops, groundnut, maize, millet, potato, pulses, rapeseed, rice, sorghum,
soybean, sugar beet, sugar cane, sunflower, tea and wheat.12 These crops account
for the lion’s share of China’s agricultural production over the period of interest
(they represented 90% of total agricultural output in 1998 and 79% in 2007).13
We also collected producer prices, exports and production as reported by the FAO
between 1991 and 2013 for China (and other countries) to check that international
price variations translate into producer price variations.
In order to identify shocks in international prices, we use next a deviation from
long-term trend hpc,t by applying a Hodrick-Prescott (HP henceforth) filter on the
logarithm of nominal prices. The Appendix Figure A3 presents the series for three
crops, i.e. rice, bananas and groundnuts, and illustrates the magnitude of fluctua-
tions: The market value of rice production decreases by 40% between 1998 and 2001
and increases by 70% between 2007 and 2008. As shown in Figure A3, fluctuations
in prices are not pure transitory shocks but rather behave as an AR(1) process with
rare and large jumps. Hence our price shocks capture the equivalent of business
cycle fluctuations in international crop prices.
Finally, we transform fluctuations in World prices into an estimate of the value of
crop production for each year in each prefecture. In order to do this, we construct for
each prefecture o the value gap for the agricultural portfolio. We consider the crop-
specific deviations from long-term trend, {hpc,t}c, and weight them by a constant
weight equal to the expected share of agricultural revenue for crop c in prefecture o.
These shares are {hc,oyic,opc}c where hc,oyic,o is potential output in 1990 described
above and pc is a snapshot of international crop prices in 1980.
po,t =
(∑c
hc,oyic,opchpc,t
)/
(∑c
hc,oyic,opc
)(1)
The price shocks po,t exhibit some time-varying volatility coming from World demand
and supply, but there are also large cross-sectional differences. A prefecture is only
exposed to the variations in the prices of crops that it produces. The wide variety
of harvested crops across China guarantees a large cross-sectional variance in prices
po,t that will be exploited in our main empirical strategies. Panel A of Figure A4
shows price shocks po,1999−2000 in 1999 and 2000, just before farmers experienced a
crisis across China due to a strong decrease in the price of rice.
These shocks are likely to have a strong effect on outmigration. Indeed, fluc-
12We exclude from our analysis one crop, i.e. tobacco, for which (i) China has a dominant positionand directly influences the international prices and (ii) China National Tobacco Corporation, astate-owned enterprise, has a monopoly on cigarette production.
13http://data.stats.gov.cn/english/easyquery.htm?cn=C01
7
tuations in prices exhibit some persistence: prices follow a process that looks like
an AR(1). Accordingly, a negative shock does not only affect returns to labour in
the same year but also the following ones. This persistence helps us in triggering
migration outflows but will also introduce some auto-correlation in the resulting
immigration inflows to urban centers.
As the fluctuations in po,t entirely come from fluctuations in the World com-
modity prices, we need to assume that these prices are driven by supply shocks in
other exporting countries, demand fluctuations in importing countries or the World
agricultural market integration, but that these demand and supply fluctuations are
orthogonal to Chinese urban labour demand.14.
Rainfall shocks In our analysis, we use a second type of shocks to agricultural
income based on rainfall deficit during the growing period of each crop.
Our rainfall data is a monthly precipitation measure (0.5 degree latitude x 0.5
degree longitude precision) which covers the period 1901-2011 and mostly relies on
the Global Historical Climatology Network.15 Once collapsed at the prefecture level,
This provides us with a measure rao,m,t of rainfall for prefecture o in month m and
year t.
We refine this rainfall measure to account for the growing cycle of each crop,
i.e. (i) the harvest season and (ii) rainfall requirements. For a given year, there are
several sources of variation across Chinese prefectures in actual yields due to rainfall.
First, different locations receive different levels of rainfall. Second, exposure to
rainfall depends on the growing cycle of the different harvested crops (winter, spring
or summer/autumn crops). In addition, some crops are resistant to large water
deficits while others immediately perish with low rainfall. The large cross-sectional
variations in each year may come from (i) a direct effect of local rainfall, (ii) an
indirect effect coming from the interaction with the crop-specific growing cycle.
We rely on the measure rao,m,t of rainfall for prefecture o in month m and year t
and we construct for each crop a measure wrc of the minimum crop water require-
ment during the growing season Mc as predicted by the yield response to water.16
14One potential issue is that agricultural prices could have a direct effect on firms which useagricultural products as inputs. We test the robustness of our results by excluding these firmsfrom our analysis
15UDel AirT Precip data was provided by the NOAA/OAR/ESRL PSD, Boulder, Colorado,USA, from their Web site at http://www.esrl.noaa.gov/psd/.
16http://www.fao.org/nr/water/cropinfo.html
8
We then generate
ro,t =
(∑c
(max{
∑m∈Mc
wrc − rao,m,t, 0}wrc
)αhc,oyic,opc
)/
(∑c
hc,oyic,opc
). (2)
This measure has a very intuitive interpretation. The quantity max{∑
m∈Mcwrc −
rao,m,t, 0} is the deficit between actual rainfall and the minimum crop water require-
ment wrc during the growing season. We then penalize this deficit with a factor α
capturing potential non-linearities in the impact of rainfall deficit. In our baseline
specification, this penalization parameter α will be set equal to 3.17 A high ratiomax{
∑m∈Mc
wrc−rao,m,t,0}wrc
would be associated with a bad harvest for the specific crop.
We then weight these ratios by potential output for each crop in each prefecture.
Panel B of Figure A4 displays rainfall shocks rao,1999−2000 in 1999 and 2000. As
expected, there is large year-to-year variation in rainfall availability. Also, for a
given year, because of differences in cropping patterns across prefectures, the spatial
auto-correlation of rainfall shocks is much lower than the correlation of rainfall
itself. While the exogeneity of rainfall shocks is not questionable, in order to use
it as instrument for rural to urban migration we need to assume that urban labor
demand is not directly affected by rainfall.18
We view price and rainfall shocks as complement, since they have different
strengths. On the one hand, price shocks reflect business fluctuations and will likely
have a stronger effect on rural to urban migration than rainfall shocks, which are
short-lived. On the other, rainfall shocks are idiosyncratic, which makes it more
likely for us to identify their immediate effect on migration flows.
2.2 Migration and urban outcomes
We now describe our measures of migration flows and workers and firms outcomes
in urban areas.
Migration flows In order to measure migration flows, we use a random 20% ex-
tract of the 1% Population Survey 2005, also called “2005 mini-census”. These data
are representative of the whole of China and contain data on occupation, industry,
income, ethnicity, education level and housing characteristics. Most importantly
for our purpose, the 2005 mini-census is the first to contain comprehensive data
on migration status. This can be determined thanks to information on household
registration type (agricultural or non-agricultural) and on the places of registration
17The results are robust to more conservative values for α, e.g. α = 1 or α = 2.18In the analysis, we test the robustness of our results by controlling for local rainfall shocks.
9
and of residence, which are available down to the prefecture level.19 Migrants are
further asked the main reason for leaving their place of registration and when they
did so. Because of their quality and degree of detail, the census data collected by the
National Bureau of Statistics are widely used in the literature (Combes et al., 2015;
Facchini et al., 2015; Meng and Zhang, 2010; Tombe and Zhu, 2015, inter alia).
Moreover, information on places of origin and residence can be combined with
retrospective data on the year that the respondent first left her place of registration
(censored above six years prior to the interview) in order to create a matrix of yearly
net migration flows across all Chinese prefectures between 1999 and 2005, as well
as to determine the migrant stock in 1999. There were about 345 prefectures (diji
qu/shi) in China over this period, home to 3.7 million people on average. Prefec-
tures are the third tier of government in China, below the central and provincial
governments, and the lowest level of government with accessible data on bilateral
migration flows.
Unlike most studies relying on census data, migration flows are directly observed
rather than computed as a difference of stocks. However, our measure of migration
has two limitations that must be borne in mind in the subsequent analysis. First,
since migration flows are reconstructed ex post, we expect some attrition, i.e. re-
turnees are not counted as former migrants but as agricultural hukou holders living
in their prefectures of registration. Second, the census does not record when the
respondent arrived at her place of residence but only when she left her place of
registration, hence we have to assume that the two happen in the same year. Some
migrants may have however resided in other urban centers in between (step migra-
tion). Return and step migration may dilute the effect of the shocks at origin on
migration flows.20
Figure 4 illustrates the rise in migrant flows between 2000 and 2005 as a share of
the total locally registered urban population, i.e. locally registered (at the prefecture
level) non-agricultural hukou holders. We consider only inter-prefectural migration
flows. The rising trend and the magnitude of migration flows is striking: In 2005, the
inflow of migrants from other prefectures was in excess of 6%, as against less than 2%
in 2000. Two interesting facts pertain to the composition of the incoming migrants.
First, between 78% in 2000 and 83% in 2005 of the yearly migrant inflow consist of
rural hukou holders, the remainder being accounted for by urban dwellers originating
19Unfortunately, information on the place of residence does not distinguish between rural orurban settings.
20The 2005 mini-census also contains information on the place of residence one and five yearsprior to the interview. In appendix B.1, we use these data to quantify return and step migration.The results suggest that return migration is substantial, but step migration negligible.
10
from other prefectures. Second, on average more than 78% of interprefectural rural-
urban migrations recorded over the period 2000-2005 involved the crossing of a
provincial border. We provide in the Appendix (B.1) a more detailed description of
migration flows and migrants.
Wages and employment In our analysis, we first study labour market outcomes
from the worker point of view, using household survey data. The household data
used to assess the link between migration and destination-specific labour market
outcomes come from the national Urban Household Survey (UHS) collected by the
National Bureau of Statistics.
The UHS is a nationally representative survey of Urban China that covers the
period 2002-2008. It is based on a three-stage stratified random sampling, whose
design is similar to that of the Current Population Survey in the United States (Ge
and Yang, 2014; Feng et al., 2015b). Its sample includes 18 provinces and 207 pre-
fectures.21 The data we use for our analysis are annual cross-sections, with a sample
size that ranges from 68,376 to 94,428 individuals (in 2002 and 2008 respectively).
Before 2002, the population covered by the UHS explicitly excluded the “floating
population” of agricultural hukou holders living in urban areas. Since 2002, all
households living in urban areas are eligible. However, sampling still ignores urban
dwellers living in townships and in the suburban districts of Beijing, Chongqing,
Shanghai, and Tianjin (Park, 2008). Rural-urban migrants, who are more likely to
live in peripheral areas of cities, are therefore under-represented. Our analysis is
thus restricted to the locally registered urban population.22
The UHS is a very rich dataset with detailed information on individual employ-
ment, income —including monthly wages, bonuses, allowances, housing and medi-
cal subsidies, overtime, and other income from the work unit—and household-level
characteristics. It also includes detailed data on household expenditures collected
using diaries—see Feng et al. (2015b) for more detail—. As our main income mea-
sure, we use monthly wages divided by a prefecture- and year-specific consumer
price index which we constructed ourselves using consumption data.23 We also
construct three employment outcomes: wage employment, unemployment and self-
employment (which also includes firm owners).24 Table A1 in the Appendix provides
21Although the 18 provinces capture much of China’s regional disparities, it must be noted thatthey may not constitute a faithful picture of China as a whole. The provinces are Beijing, Shanxi,Liaoning, Heilongjiang, Shanghai, Jiangsu, Zhejiang, Anhui, Jiangxi, Shandong, Henan, Hubei,Guangdong, Chongqing, Sichuan, Yunnan, Shaanxi and Gansu.
22The UHS data also tend to oversample employees from state and collective enterprises, whereresponse rates are higher (Ge and Yang, 2014; Feng et al., 2015b).
23Statistical Yearbooks in China do not publish CPIs below the province level.24Working hours in the month preceding the survey were also recorded in UHS 2002-2006. How-
11
average summary statistics of key variables over the period 2002-2008.
Firms Our second piece of information on the urban economy comes from firm-
level data spanning 1998-2007 from the National Bureau of Statistics (NBS).25 The
NBS implements every year a census of all state-owned enterprises and all non-state
firms with sales exceeding 5 million RMB.26 It covers the industrial sector, which
is defined as mining, manufacturing and utilities In our analysis, we focus on the
manufacturing sector in which large firms are responsible for most of the production:
the NBS sample is responsible for 90% of gross manufacturing output.
The data are based on a standard firm survey and contain information on each
firm’s location, industry, ownership type, number of employees and a wide range
of accounting variables (e.g. output, input, value added, wage bill, fixed assets,
financial assets, etc.). In our preliminary analysis we consider two firm outcomes:
labor costs and profitability. As our measure of labor costs, we construct the wage
rate as the wage bill divided by the total number of employees. As our measure of
profitability, we use revenues divided by sales.
There is a number of caveat with using the NBS census. First, the 5 million
RMB threshold that defines whether a firm belongs or not to the NBS census was
loosely implemented. In effect, it is impossible to know the exact level of sales
before implementing the survey and some firms only entered the database several
years after having reached the sales cut-off.27 This truncation potentially introduces
a selection bias. For that reason, we restrict ourselves to the balanced panel of firms
over the period in most of our analysis, and we only resort to the unbalanced panel
as a robustness check.
Second, matching firms over time in the NBS is difficult because of frequent
changes in firm identifiers. In order to match “identifier-switchers,” we use the
fuzzy algorithm developed by Brandt et al. (2014), which uses slowly-changing firm
characteristics such as its name, address or phone number. While total sample size
ranges between 150,000 and 300,000 per year, we end up with 55,000 firms when we
limit the sample to the fully balanced panel between 1999 and 2005.
Third, although we shall use the terms “firm” and “enterprise” interchangeably
ever, as pointed out by Ge and Yang (2014), they vary within a very narrow range, which meansthat the UHS measure might understate actual variations in working hours. For this reason, wedo not use hours of work in our analysis.
25The following description borrows heavily from a detailed discussion in Brandt et al. (2014).26The average exchange rate over the period of interest was 8.26 RMB to the USD, so 5 million
RMB represent about $605,000.27Conversely, about 5% of private and collectively-owned firms, which are subject to the thresh-
old, continue to participate in the survey even if their annual sales fall short of the threshold.
12
in the remainder of the paper, the NBS data cover “firms” in the narrow sense
of “legal units” (faren danwei). Subsequently, different subsidiaries of the same
enterprise may be surveyed, provided they meet a number of criteria, including
having their own names, being able to sign contracts, possessing and using assets
independently, assuming their liabilities and being financially independent.
3 Empirical strategy
In this section, we first describe how we create exogenous rural-to-urban migration
flows based on our price and rainfall variations at origin. Our strategy closely follows
Boustan et al. (2010) to evaluate labour market effects of internal migration in the
United States.28 We then present the empirical strategy we use to estimate the
causal impact of migrant inflows on the urban sector.
3.1 Predicting rural-urban migration flows
Let Mo,d,t denote the migration flows between origin o (rural areas of a prefecture
o) and destination d (a “city,” i.e. urban areas in a prefecture d) in a given year
t = 2000, . . . , 2005, which we construct using retrospective questions of the 2005
mini-census.29 We construct the outmigration rate in year t, mo,t, by dividing the
sum of migrants who left o in year t by the number of adults who still reside in o,
which we denote with Ro. Formally, we have:
mo,t =
∑dMo,d,t
Ro
.
We also construct the probability that a migrant from o goes to d at time t, which
we denote with po,d,t =Mo,d,t∑dMo,d,t
.
For the sake of exposition, we describe our strategy for a given shock so,t to the
rural origin o in year t, which may either be a price shock or a rainfall shock.
In order to estimate the causal effect of migration inflow on urban destinations,
we need variations in migration flows that are unrelated to potential destination
outcomes. Our empirical strategy follows Boustan et al. (2010), and interacts two
sources of exogenous variation. First, we use price and rainfall variations as ex-
ogenous determinants of migration outflows in each rural prefecture. Second, we
28A similar approach is adopted by El Badaoui et al. (2014), Feng et al. (2015a) and Kleemansand Magruder (2014).
29A ”prefecture” comprises both urban and rural areas. There is some debate on how wellurbanisation is captured in Chinese data—see Chan (2007). Note that our results are not sensitiveto such a measurement issue since we assume, based on the literature and Census data on reasonsfor migration, that rural outmigrants settle in urban areas.
13
combine we use a gravity model which includes geographic distance between prefec-
tures and urban population in 1990 to allocate rural migrants to urban destinations.
This provides us with a prediction of migrant inflow in each urban area that is
exogenous with respect to urban outcomes.
Exogenous variations in migration outflows We first regress migration out-
flow from each rural area on shocks to agricultural income. Formally, we estimate
the following equation:
mo,t = β0 + βsso,t−1 + δt + νo + εo,t, (3)
where o indexes the origin, and t indexes time t = 2000, . . . , 2005. mo,t and so,t
denote is the outmigration rate and the shock at origin o in year t, respectively.
νo denotes for origin fixed effects and captures any time-invariant characteristics of
origins, e.g. barriers to mobility. We use 1990 population at origin as weight to
generate consistent outmigration predictions in the number of migrants.
As our measure of shock so,t, we use the average of rainfall or price shocks in
t − 1 and t − 2. A migration spell at date t = 2005 for instance corresponds to a
migrant worker who moved between October 2004 and October 2005. Hence, given
the timing of the growing cycle for most crops in our sample, migration spells in
period t are most likely to be impacted by rainfall and price variations in t− 1 and
before—especially if there are lags in the decision to migrate.30
Estimating equation 3 yields the predicted migration rate mo,t from origin o in
year t:
mo,t = β0 + β1so,t + νo + δt
where δt is the average of the time effect.31 We then multiply the migration rate by
rural population at origin Ro to compute predicted migration flows from o:
Mo,t = mo,t ×Ro
We present the estimation of equation (3) in Table 1. In the first two columns,
we report the estimates for the price variations with lags (column 1) and with lags
and forwards (column 2). The third and fourth columns display the estimates for
the rain variations, and the last two columns include both price and rainfall shocks.
30Incorporating contemporary price/rainfall shocks in the analysis does not change the results.We also estimate the same specification using forward shocks, i.e. the average of prices in t + 1and t+ 2, to show that shocks are not anticipated.
31We remove time variation from our predictions, in order to avoid correlation between ourmigration flows and destination trends in outcomes.
14
In Table 1 – columns 1 and 2, we see that migration flows are negatively correlated
with (lagged) price deviations from their long-term values. A price 10% above its
long-term value is associated with a 0.25 p.p. lower migration incidence, which is on
average around 1.3 p.p. in our sample. In order to better understand the magnitude
of this effect, let us normalize by the standard deviations of our variables. An
additional standard deviation in the price shocks decreases migration incidence by
0.18 standard deviations. This effect is thus economically large, and quite precisely
estimated. Figure 1 plots the residuals of outmigration (y-axis) against the residual
value of the prefecture-specific agricultural portfolio as predicted by international
prices (x-axis), once cleaned by prefecture and year fixed-effects. The relationship
is globally linear.
As shown in Table 1 – columns 3 and 4, migration flows are positively correlated
with rainfall deficits (see definition in section 2). A standard deviation increase in
rainfall deficits is associated with a 0.18 p.p. higher migration incidence. Figure 2
displays the relationship between outmigration and rainfall deficits, once prefecture
and year fixed effects are partialled out. The relationship seems linear.
As a robustness check, we test whether shocks are anticipated and find that
forward variations in rainfall or prices do not predict migration outflows (Column
2, 4 and 6 of Table 1). Finally, we include both types of shocks in the estimation.
As columns 5 and 6 of Table 1 show, price and rainfall shocks have independent
effects on migration outflows. The estimated coefficients on the lags and forwards
of our constructed shocks in the joint regression are similar to those in the separate
specifications.
Exogenous variations in origin-destination migration flows We next esti-
mate the following equation:
po,d = f(disto,d) + γPopd,1990 + µo + εo,d, (4)
where po,d is the share of migrants from prefecture o who went to prefecture d,
disto,d is the distance between o and d, f is a parametric function of distance and
Popd,1990 is the total urban population of prefecture d in 1990. Equation 4 yields
po,d, the predicted probability for migrants from prefecture o to go to prefecture
d based on distance, a fixed and exogenous characteristic of the pair (o, d), and
the attractiveness of d captured by its lagged population. The specifications are
weighted by Popd,1990.
We report the results of this estimation for three simple parametric specifications
15
in Table 2. In the first column, we use a linear specification in distance.32 In column
2, we add a quadratic term and we use the inverse of distance in column 3. As
apparent in this table, (i) distance is a very strong predictor of migration flows and
(ii) the last specification in column 3 generates a much better fit of the data. 33
Predicted migration flows Finally, we combine predicted migration outflows
(Equation 3) and predicted probabilities to come from each origin to each destination
(Equation 4) to predict migration inflows into each urban destination. Formally, we
compute :
Md,t =∑o 6=d
Mo,t × po,d, (5)
where Md,t are migration inflows in destination d in year t, Mo,t is predicted migration
outflow from origin o in year t and po,d is the predicted probability that a migrant
from o goes to d. In order to avoid that migration inflows are correlated with
destination outcomes, we exclude from Md,t immigration flows attributable to rural
areas of prefecture d.
This two-stage process yields synthetic migration inflows into prefectures of des-
tination that are exogenous with respect to destination outcomes. We first pro-
vide some intuition about the nature of these exogenous variations in Figures A6
(measure Md,t as predicted by price variations) and A7 (measure Md,t as predicted
by rainfall variations). We report these measures cleaned for cross-sectional time-
invariant factors in 2001 (left panels) and 2004 (right panels). As shown in Fig-
ure A6, there is some spatial auto-correlation in these measures arising from the
spatial auto-correlation of crop composition across prefectures and the transforma-
tion of outflows to inflows involving distance between prefectures. There is also
some auto-correlation across periods as international prices exhibit persistence in
their fluctuations. However, there are also large cross-sectional and time-varying
fluctuations that we can use for our analysis. Figure A7 illustrates cross-sectional
and time-varying fluctuations for the immigrant inflows measure as predicted by
rainfall variations.
In order to test whether our migration predictions are accurate, we regress the
actual migrant inflows observed in the mini-census data on the predicted immigrant
inflows. Table 3 reports the correlation between actual and predicted migration
rates. As Columns 1 and 3 show, the relationship is strong, positive and significant
32The distance between two prefectures o and d, po,d, is measured as the distance between thecentroids of o and d.
33This is confirmed by Figure A5 in Appendix which displays the average migration share toeach destination by distance from the origin.
16
with destination-fixed effects. It remains so after adding year fixed effects (Columns
2 and 4). The coefficient in both specifications is close to one. This suggests that,
as expected and by construction, our instrument successfully predict variation in
migration inflow between years for a given prefecture and across prefectures for a
given year, even if they do not explain most of the total variation in migration rates.
This baseline relationship between actual and exogenous variations in immigration
rates will serve as a first stage in our analysis to estimate the impact of migration
on urban labour markets and firm outcomes.
In a robustness check, we only keep migrants from different provinces and run
a similar exercise as in Table 3 (see Table A6). The predictive power of the syn-
thetic migration flows is not affected by the restriction to migration spells between
provinces. This feature is important because it allow us to separate the potential di-
rect effects of price or rainfall shocks on a province (through demand for non-tradable
goods for instance) from the indirect effects through the arrival of workers.
We now turn to the second stage of our analysis, which estimates how rural-to-
urban migration is absorbed by the urban modern sector.
3.2 Migration flows and labour market outcomes
In order to estimate the effect of migration on urban labour market outcomes, we
use employment and wage data from the Urban Household Survey.34
We estimate the impact of migration on labour market outcomes of individual i
in destination d in year t by regressing each outcome, which we denote with yi,d,t,
on predicted migration that year, Md,t, and a vector of individual characteristics Xi.
The vector Xi includes dummy variables for individual i’s marital status, gender,
education level (primary, lower secondary, upper secondary and tertiary), and age
(24-35, 35-44, 45-54 and 54-64). We also include seven occupation dummies in order
to better control for workers’ skills.35 In order to control for labour market conditions
at destination and aggregate fluctuations in labour market outcomes, we also include
destination and year fixed effects. The effect of Md,t on yi,d,t is estimated through
34Since UHS does not cover all prefectures, but only a representative sample of 18 provincesand 207 prefectures, we checked that our predictions and actual migration rates are indeed wellcorrelated within the UHS sample (Results available upon request).
35UHS occupation categories are “Head of organization,” “Professional skill worker,” “Staff,”“Commercial and service worker,” “Agriculture,” “Production operator,” “Soldier” and “Otheroccupations”. Since occupation itself may be an outcome of migration, we check that our resultsare robust to excluding it from the vector of controls.
17
Two-Stage Least Squares (2SLS) with Md,t as an instrument:{Md,t = b0 + bmMd,t + bxXi + ed + nt + ed,t
yi,d,t = β0 + βmMd,t + βxXi + ηd + νt + εi, (6)
and standard errors are clustered at the level of the prefecture of destination×year.36
3.3 Migration flows and firm outcomes
We next turn to the estimation of the effect of migrant inflows on firm outcomes.
One challenge with firm data is that some variables, e.g. size, are not station-
ary and these differential trends would not be captured by firm fixed effects. We
describe below our strategy when dealing with stationary variables: wage rate and
profitability (profits normalized by sales) and we describe in the appendix the em-
pirical strategy to deal with non-stationary variables.
We take advantage of the panel structure of the data and implement a 2SLS-FE
specification in which we regress the outcome of firm j in year t in urban prefecture
d on migration inflow in d, which we denote Md,t, using predicted migration Md,t as
instrument and including firm fixed effects ηj.{Md,t = b0 + bmMd,t + ej + nt + ed,t
yj,d,t = β0 + βmMd,t + ηj + νt + εj,t, (7)
with standard errors clustered at the level of the prefecture of destination×year.37
4 Results and discussion
In this section, we discuss our preliminary findings on the absorption of labour supply
in urban centers. We first analyze the impact on urban labour markets, which helps
identify the nature of the shock induced by immigant inflows at destination. We
then discuss preliminary findings on firm outcomes.
36Because the regressor of interest, the migration rate, is itself predicted, correct inference re-quires to bootstrap the first stage. The standard errors in the second stage are however correctlyestimated through 2SLS.
37Because the regressor of interest, the migration rate, is itself predicted, correct inference re-quires to bootstrap the first stage. The standard errors in the second stage are however correctlyestimated through 2SLS.
18
4.1 Effects of migration inflow on urban workers
In order to identify the shift in urban labour supply, we use repeated cross sections
from the National Urban Household Survey and consider the effect of migration
inflows on labour market outcomes of locally registered urban residents aged 15
to 64. In this exercise, we ignore the existence of heterogeneity between migrants
and “natives,” i.e. assume that they are perfectly substitutable. As Table A3 in
the Appendix shows, however, migrants are significantly less skilled than urban
workers.38 For this reason, we also estimate the change in labour market outcomes
for “natives” with primary education and lower secondary education only.
Table 4 presents our estimates of the effect of migration inflows on four outcomes:
wages, wage employment, unemployment and self-employment of urban residents.
The first column presents results from a simple OLS regression of each outcome
on the actual immigration rate. The second and third columns present 2SLS es-
timations, using rainfall and price shocks, respectively, as instruments for migrant
inflows.
We first consider the impact on urban wages. The OLS estimate is negative but
small: a 1 p.p. increase in the immigration rate is associated with a 0.09% decrease
in wages. The IV estimates are negative and larger in magnitude: If migrants are
attracted to cities that offer higher wages, OLS estimates should indeed be biased
upwards. Using rainfall and price shocks to predict migration, we find that a 1
p.p. higher immigration rate is associated with 0.17% - 0.22% lower wages. The
effects become larger when we focus our attention on urban residents with lower
secondary education or less, who are more likely to compete for jobs with migrants.
A 1 p.p. higher immigrant rate is associated with a 0.17% - 0.31% decrease in wages.
Overall, these estimates suggest that, once cleaned for the potential demand-driven
fluctuations, an influx of rural migrants depresses urban wages. Following Borjas
(2003) we can recover the elasticity of urban wages with respect to migration by
multiplying the coefficient by 1(1+m)2
, where m is the ratio of migrants to native.
In our context, the migration rate is about 5%, hence 1(1+m)2
≈ 0.90. The implied
wage elasticities from our estimates are between 0.15 and 0.28, which is lower than
Borjas’s (2003) own estimates (0.3− 0.4).
We next consider the effect of rural to urban migration on the status of active
urban residents (wage employment, self-employment or unemployment). The OLS
estimates are close to zero and mostly insignificant, as are the IV estimates using
price shocks as instrument. The IV estimates using rainfall shocks, however dis-
38See section B.1 in the Appendix for a systematic comparison between rural migrants and urbanresidents.
19
play significant decrease in wage employment: a one percentage point increase in
migration decreases wage employment of urban residents by 9 percentage point (the
average participation to wage employment is above 90%). Correspondingly, unem-
ployment and self-employment seem to increase (the effect on self-employment is not
significant). These results provide some evidence that employers substitute urban
workers with rural migrants, leaving urban residents unemployed or leading them to
become self-employed. However, these effects are not consistent across instrumen-
tation strategies.
Overall, our results confirm that the arrival of migrants shifts labour supply
downward. The estimated effect of migration on wages is relatively small, as com-
pared to those from the literature on international migration into developed coun-
tries (Borjas, 2003) and to other studies on internal migration in developing countries
which use a similar strategy (Boustan et al., 2010; El Badaoui et al., 2014; Imbert
and Papp, 2014; Kleemans and Magruder, 2014). One reason behind such pattern
could be that the labour market for urban residents is regulated with the existence
of minimum wages, while the labour markets for migrants are unregulated. The
marginal labour cost may thus drastically respond to the arrival of migrants when
the average labour cost (mostly driven by residents) remains quite high.
4.2 Effects of migration inflow on manufacturing firms
We now turn to the firm side and analyze the impact of exogenous changes in
migration inflows on labor costs and profitability.
In Table 5, we analyze specification 7 on the subsample of firms present from
1999 to 2005. We look at two “stationary” outcome variables: wage rates, which
are defined as total wage bill divided by total labour force, and profitability, which
is equal to total profits (value added minus wage bill) divided by total revenue.39
We take the logarithm of both variables. In column 1, we report the correlation
between these variables (at the end of period t) and migrant inflows during period t.
In column 2 (resp. 3), we use our migration flows as predicted by the rainfall (resp.
price) shocks to instrument actual movements from rural to urban areas. Note that
firms relying on migrants may be selected in terms of unobservable characteristics.
All regressions in Table 5 and the following thus include firm fixed effects to clean
for fixed firm-specific determinants of their reliance on migrant workers.
As shown in the top panels of Table 5 – column 1, the correlation between firm-
level wage rates and total migrant inflows is negative and significant. As expected
given the lower education level of migrants—see Table A3,—the effect is slightly
39We ignore here the ownership structure of the firm.
20
larger in absolute value when one restricts the analysis to firms that rely heavily
on unskilled labour, i.e. food manufacturing, beverage manufacturing, footwear,
wood processing, and textile. To interpret the size of these correlations, the within
standard deviation of migration flows is around .2, which implies that current mi-
gration flows higher by 1 within standard deviation would be associated with a small
0.03% decrease in wage rates. These results are however likely to be biased upwards
(towards zero) as migrants tend to settle in high-wage destinations.
Columns 2 and 3 address this concern thanks to the instrumental variable strat-
egy delineated in Section 3 and show a very different picture. Coefficients become
much larger in absolute value when migration flows are purged of the endogeneity
in migration decisions. A 1% higher immigration rate translates to 1.2% lower wage
rates when using rainfall as a source of exogenous variation and 0.6% lower when we
rely on price shocks. In standardised terms, a 1 within standard deviation increase
in the immigration rate yields a .25% (resp., .13%) drop in wage rates using the
rainfall- (resp., price-) based instrument. The range can be explained by two fac-
tors. First, the estimation based on rainfall tends to be noisier. Second, rainfall and
price shocks identify different local average treatment effects (LATEs). Whereas a
shortage of rainfall is likely to trigger distress migration, price fluctuations exhibit
some serial correlation and might thus lead rural dwellers relying on agriculture for a
living to update their expectations on returns to farming and engage in more planned
migration. We would therefore expect a stronger immediate impact of immigration
on urban labour markets when rainfall is exploited as a source of identification.
Finally, we can note that the effects are larger in magnitude—albeit imprecisely
estimated—when we focus on low-skill industries.
The bottom panels of Table 5 explore the effect of migrant inflows on firms’ prof-
itability, for the whole sample first and then focusing on low-skill sectors. Profitabil-
ity is positively and consistently affected by migration flows. The effect becomes
positive and significant when we implement our instrumental variable strategy.40
The effect of a 1 within standard deviation increase in the immigration rate iden-
tified thanks to international prices (resp., rainfall) on firms’ profitability is a .2%
(resp., 3%) rise in profitability for the whole sample. Firms that hire mostly low-
skilled workers enjoy a larger positive effect of immigration: +.3% (resp., +.4%).
We run a number of robustness checks to verify that our estimates are indeed cap-
40Note that the OLS effect is lower than the coefficients on the instrumented migration flowsand statistically indistinguishable from zero. One explanation is that opposite effects are at work:First, an influx of migrants enables firms to enhance their profits and grow; second, destinationsexperiencing migration flows have already experienced some economic growth with larger and moreestablished firms than in other regions, thereby attracting migrants through higher posted wages.
21
turing labour supply shifts induced by origin-driven fluctuations. All corresponding
tables are in the Appendix.
One concern could be that firms in cities rely on the provision of important crops
as intermediate inputs, and are directly affected by World crop prices as final good
producers (rice vinegar exporters for instance). One possible solution is to exploit
the differential flows between products and migrants with the latter moving much
farther than the former (migration costs are paid once while transportation costs are
paid continuously). Instead, we use the precise indicators of industries and perform
two robustness checks. First, in order to clean for the potential shortages in crop
provisions for some cities close to the fields, we exclude all firms potentially using
one of our crops as an input. These consist of—among others—food exporters and
part of the textile industry. We report the results of this analysis in Table A7. The
results are virtually unchanged compared to Table 5. We also control for the direct
effect of price and rainfall shocks in destination prefectures. These controls are built
based on equations 1 and 2, respectively, and lagged in order to match the way
migration shocks were created. The results, reported in Table A8, are imprecisely
estimated but confirm our findings that immigration depresses wages and boosts
firm productivity at destination.
Second, there may be some delay between the arrival of migrants and the result-
ing increase in firm factor use. Although this does not jeopardise our identification
strategy or the interpretation of the results, we provide in Table A9 the effects of
lagged shocks. Results are consistent with contemporary shocks.
The results of this section give some credit to our constructed migration flows:
migration shifts labour supply to the right, thereby decreasing wages and boosting
labour demand. In the next section, we look more precisely at this effect and better
identify which firms gain from the newly-available resources.
4.3 Reallocation of resources across firms [Work in progress.]
5 Conclusion
A key link in the chain of events between rural to urban migration and manufacturing
growth is the impact of migration on urban labor markets and firms. This paper
provides some of the first causal empirical evidence of this impact using Chinese
data.
We predict migrant inflows into urban areas based on shocks to agricultural in-
comes in rural origins and distance between prefectures of origin and destination.
These predictions are exogenous with respect to urban workers’ and firms’ environ-
22
ments, which allows us to tackle the issue of migrants self-selecting into buoyant
labour markets and provide causal estimates of the effect of migration on urban
outcomes. Using a representative survey of urban households, we find that migrant
inflows from rural areas have a negative effect on urban dwellers’ wages and—to
a lesser extent—employment. We next use a census of large firms and show that
migration decreases labor costs and improves profitability of manufacturing firms
This new piece of evidence brings together two main features of Chinese devel-
opment, massive internal migration and manufacturing growth despite severe credit
constraints (Song et al., 2011). By keeping labor costs low, rural-to-urban migration
may have allowed firms to accumulate larger profits, which were then reinvested to
finance future growth.
23
References
ADB, “The Declining Share of Agricultural Employment in the People’s Republic
of China: How Fast?,” ADB Economics Working Paper Series 419, Asian Devel-
opment Bank (ADB) November 2014.
Alvarez-Cuadrado, Francisco and Markus Poschke, “Structural Change Out
of Agriculture: Labor Push versus Labor Pull,” American Economic Journal:
Macroeconomics, July 2011, 3 (3), 127–58.
Au, Chun-Chung and J. Vernon Henderson, “How migration restrictions limit
agglomeration and productivity in China,” Journal of Development Economics,
2006, 80, 350–88.
and , “A numerical simulation analysis of (Hukou) labour mobility restrictions
in China,” Journal of Development Economics, 2007, 83, 392–410.
Borjas, George J., “The Labor Demand Curve Is Downward Sloping: Reexamin-
ing The Impact Of Immigration On The Labor Market,” The Quarterly Journal
of Economics, November 2003, 118 (4), 1335–1374.
Bosker, Maarten, Steven Brakman, Harry Garretsen, and Marc
Schramm, “Relaxing Hukou: Increased labor mobility and China’s economic
geography,” Journal of Urban Economics, 2012, 72, 252–66.
Boustan, Leah Platt, Price V. Fishback, and Shawn Kantor, “The Effect
of Internal Migration on Local Labor Markets:American Cities during the Great
Depression,” Journal of Labor Economics, October 2010, 28 (4), 719–746.
Brandt, Loren, Johannes Van Biesebroeck, and Yifan Zhang, “Challenges
of working with the Chinese NBS firm-level data,” China Economic Review, 2014,
30 (C), 339–352.
Bustos, Paula, Bruno Caprettini, and Jacopo Ponticelli, “Agricultural pro-
ductivity and structural transformation: evidence from Brazil.,” 2015. Forthcom-
ing American Economic Review.
Chan, Kam Wing, “Misconceptions and Complexities in the Study of China’s
Cities: Definitions, Statistics, and Implications,” Eurasian Geography and Eco-
nomics, 2007, 48 (4), 383–412.
, “Migrant and development in China: trends, geography and current issues,”
Migration and Development, 2012, 1 (2), 187–205.
24
Combes, Pierre-Philippe, Sylvie Demurger, and Shi Li, “Migration exter-
nalities in Chinese cities,” European Economic Review, 2015, 76 (C), 152–167.
Dustmann, Christian and Albrecht Glitz, “How Do Industries and Firms Re-
spond to Changes in Local Labor Supply?,” Journal of Labor Economics, 2015,
33 (3), 711 – 750.
El Badaoui, Eliane, Eric Strobl, and Frank Walsh, “The Impact of Internal
Migration on Local Labour Markets in Thailand,” EconomiX Working Papers
2014-12, University of Paris West - Nanterre la Defense, EconomiX 2014.
Evans, David S, “Tests of alternative theories of firm growth,” The Journal of
Political Economy, 1987, pp. 657–674.
Facchini, Giovanni, Maggie Y. Liu, Anna Maria Mayda, and Minghai
Zhou, “The impact of China’s WTO accession on internal migration,” December
2015.
Fan, Cindy C., China on the Move, Routledge, 2008.
Feng, Shuaizhang, Michael Oppenheimer, and Wolfram Schlenker,
“Weather Anomalies, Crop Yields, and Migration in the US Corn Belt,” March
2015.
, Yingyao Hu, and Robert Moffit, “Long Run Trends in Unemployment and
Labor Force Participation in China,” August 2015.
Ge, Suqin and Dennis Tao Yang, “Changes In China’s Wage Structure,” Journal
of the European Economic Association, 04 2014, 12 (2), 300–336.
Giesing, Yvonne and Nadzeya Laurentsyeva, “Brain Drain and Firm Pro-
ductivity: Evidence from the Sequential Opening of EU Labour Markets,” 2015.
Unpublished.
Gollin, Douglas, Stephen Parente, and Richard Rogerson, “The Role of
Agriculture in Development,” American Economic Review, May 2002, 92 (2),
160–164.
Harris, John R. and Michael P. Todaro, “Migration, Unemployment and De-
velopment: A Two-Sector Analysis,” The American Economic Review, 1970, 60
(1), pp. 126–142.
25
Herrendorf, Berthold, Christopher Herrington, and Akos Valentinyi,
“Sectoral Technology and Structural Transformation,” Technical Report 9386,
C.E.P.R. Discussion Papers March 2013.
Imbert, Clement and John Papp, “Short-term Migration and Rural Workfare
Programs: Evidence from India,” 2014. Manuscript.
Kerr, Sari Pekkala, William R. Kerr, and William F. Lincoln, “Skilled
Immigration and the Employment Structures of US Firms,” Journal of Labor
Economics, 2015, 33 (S1), S147 – S186.
Kleemans, Marieke and Jeremy Magruder, “Labor Market Changes in Re-
sponse to Immigration: Evidence from Internal Migration Driven by Weather
Shocks in Indonesia,” 2014. Manuscript.
Kuznets, Simon, Agriculture in Economic Development, McGraw-Hill Book Com-
pany, 1964.
Lewis, Arthur, “Economic Development with Unlimited Supplies of Labour,” The
Manchester School, 1954, 22 (2), 139–191.
Marden, Sam, “The agricultural roots of industrial development: “forward link-
ages” in reform era China,” 2015. Manuscript.
Mayneris, Florian, Sandra Poncet, and Tao Zhang, “The cleansing effect of
minimum wages. Minimum wages, firm dynamics and aggregate productivity in
China,” Core Discussion Papers 2014/44 October 2014.
Meng, Xin and Dandan Zhang, “Labour Market Impact of Large Scale Inter-
nal Migration on Chinese Urban ’Native’ Workers,” IZA Discussion Papers 5288
October 2010.
Park, Albert, China Urbanizes: Consequences, Strategies, and Policies, The World
Bank, 2008.
Peri, Giovanni, “The Effect Of Immigration On Productivity: Evidence From U.S.
States,” The Review of Economics and Statistics, February 2012, 94 (1), 348–358.
Song, Zheng, Kjetil Storesletten, and Fabrizio Zilibotti, “Growing Like
China,” American Economic Review, 2011, 101 (1), 196–233.
Sousa, Jose De and Sandra Poncet, “How are wages set in Beijing?,” Regional
Science and Urban Economics, 2011, 41 (1), 9–19.
26
Tombe, Trevor and Xiaodong Zhu, “Trade, Migration and Productivity: A
Quantitative Analysis of China,” June 2015.
United Nations, “Trends in International Migrant Stock: The 2015 revision.,”
Technical Report, United Nations, Department of Economics and Social Affairs
2015.
Vendryes, Thomas, “Migration constraints and development: Hukou and capital
accumulation in China,” China Economic Review, 2011, 722, 669–92.
27
A Figures and tables
Figure 1. Value of agricultural portfolio at origin and outmigration rates.
Notes: This Figure illustrates the relationship between the standardized value of the prefecture-specific agriculturalportfolio as predicted by international prices (x-axis) and outmigration (y-axis). We consider the residuals of allmeasures once cleaned by prefecture and year Fixed-Effects. For the sake of exposure, we group prefecture×yearobservations, create 100 bins of observations with similar price shock and represent the average outmigration ratewithin a bin. The lines are locally weighted regressions on all observations.
28
Figure 2. Rainfall deficits relative to water requirements at origin and outmigration rates.
Notes: This Figure illustrates the relationship between the standardized rainfall deficit relative to water requirementsfor the origin-specific agricultural portfolio (x-axis) and outmigration (y-axis). We consider the residuals of allmeasures once cleaned by prefecture and year Fixed-Effects. For the sake of exposure, we group prefecture×yearobservations, create 100 bins of observations with similar rainfall shock and represent the average outmigration ratewithin a bin. The lines are locally weighted regressions on all observations.
Figure 3. Potential output in China for rice and cotton (1990).
(a) Paddy rice. (b) Cotton.
Notes: These two maps represent the potential output constructed with 1990 harvested areas and potential yield(GAEZ model) in 1990 for 2 common crops in China, i.e. paddy rice (left panel), and cotton (right panel).
29
Figure 4. Evolution of migration rates between 1999 and 2005.
Sources: 2005 Mini-Census.
30
Table
1.
Mig
rati
on
flow
san
dpri
ce/ra
infa
llsh
ock
s(2
000-2
005).
Spe
cifi
cati
on
(3)
Mig
rati
onou
tflow
s(1
)(2
)(3
)(4
)(5
)(6
)
Pri
ceL
agsp[t−2,t−1]
-0.0
249*
**-0
.0196***
-0.0
213***
-0.0
143**
(0.0
036)
(0.0
059)
(0.0
037)
(0.0
060)
Pri
ceF
orw
ard
sp[t+1,t+2]
0.0
0889
0.0
130*
(0.0
079)
(0.0
074)
Rai
nfa
llL
agsr [t−
2,t−1]
0.0
617***
0.0
623***
0.0
518***
0.0
552***
(0.0
069)
(0.0
068)
(0.0
067)
(0.0
064)
Rai
nfa
llF
orw
ard
sr [t+
1,t+2]
-0.0
0629
-0.0
177**
(0.0
086)
(0.0
088)
Ob
serv
atio
ns
2,02
22,0
22
2,0
22
2,0
22
2,0
22
2,0
22
R-s
qu
ared
0.80
70.8
08
0.8
07
0.8
07
0.8
11
0.8
12
Ori
gin
FE
Yes
Yes
Yes
Yes
Yes
Yes
Yea
rF
EY
esY
esY
esY
esY
esY
es
Rob
ust
stan
dar
der
rors
are
rep
orte
db
etw
een
par
enth
eses
.T
he
un
itof
ob
serv
ati
on
isan
ori
gin×
aye
ar
an
dth
ere
gre
ssio
nis
wei
ghte
dby
ori
gin
rura
lp
opu
lati
onin
1990
.M
igra
tion
outfl
ows
are
year
lyou
tflow
sn
orm
ali
zed
by
the
pre
fect
ure
’sru
ral
pop
ula
tion
in2005.
Pri
ce(r
esp
.R
ain
fall
)L
ags
are
defi
ned
asth
eav
erag
enor
mal
ized
pri
ced
evia
tion
s(r
esp
.ra
infa
lld
efici
ts)
inp
erio
dt−
1an
dt−
2.
Pri
ce(r
esp
.R
ain
fall
)F
orw
ard
sare
defi
ned
as
the
aver
age
nor
mal
ized
pri
ced
evia
tion
s(r
esp
.ra
infa
lld
efici
ts)
inp
erio
dt
+1
an
dt
+2.
See
sect
ion
2fo
ra
com
ple
ted
escr
ipti
on
of
the
pri
cean
dra
infa
lldefi
cit
con
stru
ctio
n.
31
Table 2. Distance and migration flows between origins and destinations (2000-2005).
Specification (4)
Migration flows (share) (1) (2) (3)
Distance do,d (1,000 km) -0.0116*** -0.0449***(0.000539) (0.00286)
Squared Distance d2o,d 1.04e-08***
(8.50e-10)Inverse Distance 1/do,d 9.424***
(0.757)Destination population (1,000), 1990 Popd,1990 0.943*** 0.956*** 0.949***
(0.0557) (0.0552) (0.0546)
Observations 116,622 116,622 116,622R-squared 0.206 0.231 0.255Origin FE Yes Yes Yes
Robust standard errors are reported between parentheses. The unit of observation is an origin×adestination×a year. Migration flows (share) are the number of migrants going from origin o todestination d normalized by the total number of migrants from origin o. For the sake of exposition,we normalize distance do,d and destination population Popd,1990 by 1, 000.
Table 3. Comparison of actual and predicted immigration rate in urban areas (2000-2005).(1) (2) (3) (4)
Prediction - rainfall 1.328*** 0.914***(0.334) (0.241)
Prediction - price 0.757*** 0.911***(0.246) (0.224)
Observations 2,028 2,028 2,028 2,028R-squared 0.812 0.875 0.813 0.879Year FE No Yes No YesDestination FE Yes Yes Yes Yes
Standard errors are clustered at the destination level and are reported between parentheses. ***p<0.01, ** p<0.05, * p<0.1. An observation is a destination×year. The immigration rate is thenumber of agricultural hukou holders from all origin prefectures who went to a destination prefec-ture d in a given year divided by population at destination. The independent variable correspondto Md,t as defined in equation 5. Regressions are weighted by total urban adult population atdestination.
32
Table 4. Effect of migration flows on wages earned by urban residents and unemployment proba-bility.
OLS 2SLS: rainfall 2SLS: priceEffect of migration inflows on ... (1) (2) (3)Real monthly wages -0.090*** -0.224* -0.170*
(0.023) (0.131) (0.0906)[191,394] [190,989] [191,394]
Real monthly wages (low skill) -0.081*** -0.306* -0.169**(0.016) (0.166) (0.0855)[48,375] [48,375] [48,375]
Wage Employment -0.0063 -0.0991* 0.000309(0.0066) (0.0590) (0.0114)[212,197] [212,197] [212,197]
Wage Employment (low skill) -0.0072 -0.177* 0.0040(0.014) (0.102) (0.018)[58,045] [58,045] [58,045]
Unemployment 0.0081* 0.0408** -0.0081(0.0044) (0.0169) (0.0116)[212,197] [212,197] [212,197]
Unemployment (low skill) 0.0089*** 0.0318** -0.0032(0.0032) (0.0137) (0.0091)[58,045] [58,045] [58,045]
Self-Employment -0.0018 0.0583 0.0078(0.0027) (0.0451) (0.0119)[212,197] [212,197] [212,197]
Self-Employment (low skill) -0.0017 0.146 -0.0008(0.0112) (0.0923) (0.0202)[58,045] [58,045] [58,045]
Prefecture and Year FE Yes Yes Yes
Standard errors are clustered at the prefecture/year level. The unit of observation is an individual.In the first two panels, the dependent variable is the log of wages deflated using a consumer priceindex computed by the authors using the UHS data. In the next six panels, the dependent variablesare dummies that take the value one if the individual works for wage, is unemployed or is self-employed. See section 3 for a complete description of the price- and rainfall -related migrationflows. All specifications include characteristics of the resident population (proportions by maritalstatus, gender, age group, education level, rural registration, and firm ownership for the wagespecifications) and log adult population, as well as year and prefecture fixed effects. The firststages are reported in Table A6.
33
Table 5. Effect of migration flows on wages and profitability using firm data.
OLS 2SLS: rainfall 2SLS: priceEffect of migration inflows on ... (1) (2) (3)Wages -0.168*** -1.221* -0.614**
(0.0518) (0.706) (0.255)[327,070] [327,070] [327,070]
Wages (low skill) -0.185*** -1.452 -0.707(0.0546) (1.515) (0.645)[179,984] [179,984] [179,984]
Profitability -0.134 1.541* 0.890***(0.0978) (0.865) (0.330)[303,957] [303,957] [303,957]
Profitability (low skill) -0.0370 1.990* 1.408***(0.0813) (1.160) (0.470)[167,829] [167,829] [167,829]
Prefecture and Year FE Yes Yes Yes
Standard errors are clustered at the prefecture/year level. The unit of observation is a firm ×a year. In the top two panels, the dependent variable is the log of total wage bill divided bythe number of employees. In the bottom two panels, the dependent variable is the log of profitsdivided by revenues. See section 3 for a complete description of the price- and rainfall -relatedmigration flows. The first stages are reported in Table A6. Low skill indicates firms in sectorsemploying mostly low-skill workers (i.e. food manufacturing, beverage manufacturing, footwear,wood processing, and textile).
34
A Additional tables and figures
Figure A1. Share of return migrants by age.
Sources: 2005 Mini-Census.
Figure A2. Share of step migrants as a function of age and time since departure.
Sources: 2005 Mini-Census.
35
Figure A3. Price deviations from trends on International Commodity Markets 1998-2010 (blue:banana, red: rice, teal: groundnut).
Note: These series represent the Hodrick Prescott residual applied to the logarithm of internationalcommodity prices for three commodities: banana, rice and groundnut. For instance, the price ofrice can be interpreted as being 35% below its long-term value in 2001.
Figure A4. Price and rainfall shocks across Chinese prefectures in 1999/2000.
(a) Price shock. (b) Rainfall shock.
Notes: These two maps represent the standardized price shock po,t in 1999/2000 (left panel), and standardizedrainfall shock ro,t in 1999/2000 (right panel). Note that 1999/2000 corresponds to a pre-crisis period: in 2001, theprice of rice decreases which generates a very negative shock across China concentrated in rice-producing prefectures.
36
Figure A5. Origin-destination migration predictions—the role of distance.
Notes: Migration flows constructed with census data (2000-2005).
Figure A6. Measure Md,t of immigrant inflows to cities as predicted by prices in 2001 and 2004.
(a) 2001 (b) 2004
Notes: These two maps represent the quantities Md,2001 and Md,2004, where Md,t is the measure of immigrantinflows as predicted by price variations and the weighting distance matrix between origins and destinations.
37
Figure A7. Measure Md,t of immigrant inflows to cities as predicted by rainfall in 2001 and 2004.
(a) 2001 (b) 2004
Notes: These two maps represent the quantities Md,2001 and Md,2004, where Md,t is the measure of immigrantinflows as predicted by rainfall variations and the weighting distance matrix between origins and destinations.
Figure A8. Evolution of the share of private firms in the industrial sector.
Sources: 1998-2007 NBS above-scale firm data.
38
Table A1. Descriptive statistics from the UHS data (2002-2008).
Mean St. Dev.
Age 43.17 11.00Female 0.50 0.50Married 0.88 0.33Born in prefecture of residence 0.61 0.49
Education:Primary education 0.05 0.21Lower secondary 0.27 0.45Higher secondary 0.25 0.43Tertiary education 0.42 0.49
Unemployed 0.02 0.14Self-employed/Firm owner 0.05 0.23Employee 0.71 0.45
Public sector 0.63 0.48Private sector 0.37 0.48
Total monthly income (RMB) 1537.52 1416.81Monthly wage income (RMB) 1353.36 1264.84Monthly transfer income (RMB) 56.71 287.76
Industry:Agriculture 0.01 0.10Mining 0.02 0.14Manufacturing 0.22 0.42Utilities 0.03 0.18Construction 0.03 0.17Transportation 0.06 0.24Information transfer, etc. 0.04 0.18Wholesale and retail trade 0.12 0.33Accommodation and catering 0.03 0.16Finance 0.02 0.15Real estate 0.04 0.19Leasing and commercial services 0.02 0.15Scientific research 0.03 0.18Public facilities 0.01 0.11Resident services 0.10 0.30Education 0.06 0.23Health care 0.03 0.18Culture and entertainment 0.01 0.11Public administration 0.10 0.30
Obs.2002 54,5642003 62,1942004 65,8062005 77,9762006 70,8532007 75,5392008 76,874
All variables except Age and Income are dummy-coded. The table displays averages over the period 2002-2008. Thesample is restricted to locally registered urban hukou holders aged 15-64.
39
Table A2. Descriptive statistics from the 2005 mini-census (1/2).
Count Share of total Std. Dev.resident urban
population
Rural migrants from another province 94,326 0.15 0.36Rural migrants from another prefecture 122,756 0.19 0.40
Count Percent CumulativePercent
Reason for moving
Work or business 100,670 82.01 82.01Follow relatives 6,474 5.27 87.28Marriage 5,783 4.71 91.99Support from relatives/friends 4,461 3.63 95.62Education and training 1,367 1.11 96.73Expropriation and relocation 603 0.49 97.22Job transfer 522 0.43 97.65Mission 498 0.41 98.06Recruitment 158 0.13 98.19Deposit household registration demand 142 0.12 98.31Other 1,956 1.59 99.90Missing 122 0.10 100.00
Count Percent CumulativePercent
Starting year of last migration spell
2005 25,968 21.18 21.182004 24,917 20.32 41.502003 17,893 14.59 56.092002 11,110 9.06 65.152001 7,468 6.09 71.242000 7,325 5.97 77.211999 or before 27,954 22.79 100.00“Rural migrants” are defined as inter-prefectural migrants with an agricultural hukou aged 15-64. “Total residenturban population” refers to the population in the prefecture that is either locally registered and holds a non-agricultural hukou or resides in the prefecture but holds an agricultural hukou from another prefecture. The samplein the middle and bottom panels is restricted to inter-prefectural rural migrants.
40
Table A3. Descriptive statistics from the 2005 mini-census (2/2).
Rural-urban Local urban Difference p-valuemigrants hukou
Age 30.22 38.54 -8.32* 0.000Female 0.49 0.49 -0.00* 0.009Married 0.64 0.76 -0.12* 0.000
Education:Literate 0.97 0.99 -0.02* 0.000Primary education 0.20 0.08 0.12* 0.000Lower secondary 0.60 0.33 0.27* 0.000Higher secondary 0.14 0.33 -0.19* 0.000Tertiary education 0.02 0.24 -0.22* 0.000
Unemployed 0.02 0.09 -0.07* 0.000Self-employed/Firm-owner 0.20 0.16 0.04* 0.000Employee 0.77 0.81 -0.04* 0.000
Employee w/o labour contract 0.48 0.29 0.18* 0.000Public sector 0.11 0.72 -0.61* 0.000Private sector 0.89 0.28 0.61* 0.000
Total monthly income (RMB) 961.84 1157.07 -195.24* 0.000
Industry:Agriculture 0.05 0.06 -0.01* 0.000Mining 0.01 0.03 -0.02* 0.000Manufacturing 0.51 0.20 0.31* 0.000Utilities 0.00 0.03 -0.03* 0.000Construction 0.09 0.04 0.05* 0.000Transportation 0.03 0.08 -0.05* 0.000Information transfer, etc. 0.00 0.01 -0.01* 0.000Wholesale and retail trade 0.15 0.14 0.00 0.078Accommodation and catering 0.06 0.04 0.03* 0.000Finance 0.00 0.03 -0.03* 0.000Real estate 0.01 0.01 -0.01* 0.000Leasing and commercial services 0.01 0.02 -0.01* 0.000Scientific research 0.00 0.01 -0.01* 0.000Public facilities 0.00 0.01 -0.01* 0.000Resident services 0.05 0.03 0.02* 0.000Education 0.00 0.10 -0.10* 0.000Health care 0.00 0.04 -0.04* 0.000Culture and entertainment 0.01 0.01 -0.01* 0.000Public administration 0.00 0.11 -0.10* 0.000International organisations 0.00 0.00 0.00 0.200
Obs. 122,756 509,817
All variables except Age and Income are dummy-coded. Only the income of individuals who reported having a jobis considered. The sample is restricted to individuals aged 15-64. * p<0.01
41
Table A4. Descriptive statistics from the NBS firm-level data.Public sector Domestic Foreign
private sector private sector
Real capital stock 37539.69 20346.01 47592.38Sales revenue 63149.08 71267.68 167520.80Value added 18470.79 17106.11 40216.00Total wage bill 3695.91 2938.08 6613.63Total number of employees 340.20 216.93 318.76
All variables except “Total number of employees” are in RMB 1,000. The table displays yearly averages over theperiod 1998-2007.
Table A5. Correlation between crop international prices and local Chinese prices/production.
VARIABLES Prices Output
Price (International) .402*** .201**(.0861) (0.0623)
Price (China) .0824*(.0432)
Observations 210 210R-squared .579 .337Trends Yes Yes
Robust standard errors are reported between parentheses. The unit of observation is a crop×ayear. The two regressions include time trends, and weighted by the average crop production shareover the period 1991-2010. Dependent and the main explaining variables are in logs.
Table A6. Comparison of actual and predicted immigration rate in urban areas (robustness checkwithout intra-province migration spells, 2000-2005).
(1) (2)
Prediction - rainfall 0.889*** 0.735***(0.324) (0.269)
Prediction - price 0.547** 0.988***(0.244) (0.275)
Observations 2,028 2,028 2,028 2,028R-squared 0.807 0.861 0.807 0.863Year FE No Yes No YesDestination FE Yes Yes Yes Yes
Standard errors are clustered at the destination level and are reported between parentheses. ***p<0.01, ** p<0.05, * p<0.1. An observation is a destination×year. The immigration rate is thenumber of agricultural hukou holders from all origin prefectures who went to a destination prefec-ture d in a given year divided by population at destination. The independent variable correspondto Md,t as defined in equation 5. Regressions are weighted by total urban adult population atdestination.
42
Table A7. Effect of migration flows on wages and profitability using firm data – robustness check:industries linked with agriculture.
OLS 2SLS: rainfall 2SLS: priceEffect of migration inflows on ... (1) (2) (3)Wages -0.162*** -1.348* -0.663**
(0.0512) (0.744) (0.262)[293,385] [293,385] [293,385]
Profitability -0.133 1.256 0.718**(0.0979) (0.776) (0.305)[272,361] [272,361] [272,361]
Prefecture and Year FE Yes Yes Yes
Standard errors are reported between parentheses and clustered at the prefecture×year level. Theunit of observation is a firm in a given year. In the top panel, the dependent variable is the log oftotal wage bill divided by the number of employees. In the bottom panel, the dependent variableis the log of profits divided by revenues. See section 3 for a complete description of the price- andrainfall -related migration flows.
Table A8. Effect of migration flows on wages and profitability using firm data – robustness check:controlling for shocks in the prefecture of destination.
OLS 2SLS: rainfall 2SLS: priceEffect of migration inflows on ... (1) (2) (3)Wages -0.139*** -1.264 -0.473
(0.0465) (0.979) (0.296)[326,367] [326,367] [326,367]
Profitability -0.172* 1.890 0.959**(0.101) (1.466) (0.461)
[303,436] [303,436] [303,436]
Prefecture and Year FE Yes Yes Yes
Standard errors are reported between parentheses and clustered at the prefecture×year level. Theunit of observation is a firm in a given year. In the top panel, the dependent variable is the log oftotal wage bill divided by the number of employees. In the bottom panel, the dependent variableis the log of profits divided by revenues. See section 3 for a complete description of the price- andrainfall -related migration flows.
43
Table A9. Effect of migration flows on wages and profitability using firm data – robustness check:lagged shocks.
OLS 2SLS: rainfall 2SLS: priceEffect of lagged migration inflows on ... (1) (2) (3)Wages -0.111*** 0.335 -0.629***
(0.0383) (0.577) (0.243)[323,730] [273,390] [273,390]
Profitability -0.234* 2.058 1.005**(0.120) (1.769) (0.409)
[299,422] [252,694] [252,694]
Prefecture and Year FE Yes Yes Yes
Standard errors are reported between parentheses and clustered at the prefecture×year level. Theunit of observation is a firm in a given year. In the top panel, the dependent variable is the log oftotal wage bill divided by the number of employees. In the bottom panel, the dependent variableis the log of profits divided by revenues. See section 3 for a complete description of the price- andrainfall -related migration flows.
44
B Data description
B.1 Migration flows and census
In this section, we provide some descriptive statistics about migrants and migration
flows.
Patterns of migration in the mini-census Table A2 displays the shares of
rural-to-urban migrants in the total urban population of prefectures. We define
rural-to-urban migrants as agricultural hukou holders who crossed a prefecture
boundary and belong to working-age cohorts (15-64).41
The upper panel of Table A2 distinguishes between inter-prefectural migrants
and those who left their provinces of origin. We see that inter-prefectural migrants
represented 19% of a prefecture’s total number of urban residents on average in 2005,
while inter-provincial migrants accounted for 15% of it, which reveals that a majority
(77%) of inter-prefectural migrations imply the crossing of a provincial boundary.
The middle panel presents the reasons put forward by inter-prefectural agricultural
hukou migrants for leaving their places of registration. A vast majority (82%) moved
away in order to seek work (“Work or business”), mostly as labourers, while all other
rationales attracted much smaller shares.42 When we look at the last migration
spell for these migrants (lower panel), we see that most inter-prefectural migrants
(56.46%) arrived in the three years before the survey, illustrating the acceleration of
migration in the early 2000s and potentially the selection bias generated by return
migration.43 We now investigate the extent to which return migration and step
migration affect our description of migration flows.
Return and step migration in the mini-census In this paper, we construct
annual migration flows between each prefecture of origin and destination by com-
bining information on the current place of residence (the destination), the place of
41Although data are not available, it is clear from the literature that rural-to-rural migration,represents a small share of outmigration from rural areas, not least because most of it is explainedby marriages, which usually give right to local registration (Fan, 2008; Chan, 2012). Only 4.7%of agricultural hukou inter-prefectural migrants in the 2005 mini-census reported having left theirplace of registration to live with their spouses after marriage.
42The only other reasons that display shares in excess of 1% are “Education and training,”“Other,” “Live with/Seek refuge from relatives or friends,” which Fan (2008) based on metadatafrom the Population Census Office dubs “Migration to seek the support of relatives or friends,”“Following relatives,” which should be understood as “Family members following the job transferof cadres and workers” (ibid.), and “Marriage”.
43Data on return migration are scarce. Chan (2012) highlights a “noticeable, though still small,but increasing amount of outmigration” from provinces that have been migration magnets sincethe early 2000s.
45
registration (the origin) and the year in which the migrant left her place of regis-
tration. We implicitly assume that all migrants who left the origin in year Y have
reached the destination that same year and stayed there. As discussed in section 2,
we may underestimate migration flows in year Y if some of the migrants who left in
year Y have gone back to their place of origin before the census (return migration).
We may also be mistakenly assigning the arrival of a migrant to year Y if instead
of directly going to destination she stopped on the way and only arrived some years
later (step migration).
In order to measure return and step migration, we use the information from the
2005 census about the province of residence in 2004 and 2000. Unfortunately, the
census does not report the prefecture of residence in 2004 and 2000. However, as
shown in Figure 4, a majority of rural to urban migrants go beyond province borders.
We first consider the extent of return migration. Among all migrants from rural
areas who lived in their province of registration in 2000 and who lived in another
province in 2004, we compute the fraction that had returned to their province of
registration by 2005. As Figure A1 shows, this share is not negligible: in a given year,
between 4 and 6% of rural migrants who have left their province of registration in the
last six years go back a year later. This fraction is higher for older migrants. Return
migration is hence an important phenomenon, which leads us to underestimate true
migration flows, and the effect of shocks on out-migration.
We next study the importance of step-migration. Among all migrants who lived
in their province of registration in 2000 and are living in another province in 2005,
we compute the fraction that lived in yet another province in 2004. As Figure A2
shows, only a minority of migrants have changed provinces of destination in the last
year. Step-migration is concentrated in the first year of migration and virtually zero
thereafter. One limitation of this approach is that we cannot measure step-migration
if it occurs within a province. With this caveat in mind, these results do suggest
that for most migrants we correctly assign the year of arrival at destination.
Comparison of urban dwellers by hukou status The UHS data are represen-
tative of urban “natives,” not of the urban population as a whole, and urban workers
differ significantly depending on their hukou status. As is usual with internal migra-
tion, we consider in the main specifications that migrants and “natives” are highly
substitutable. However, Chinese rural-to-urban migrants tend to be younger (and
thus less experienced) and less educated, which reduces their ability to compete with
urbanites for the same jobs.
Table A3 provides summary statistics on key characteristics of inter-prefectural
46
migrants and compares them with the locally registered urban population. It appears
that migrants and natives are statistically significantly different on most accounts,
the former being on average younger, less educated, more likely to be illiterate,
and more often single, and employed without a labour contract. Important facts
for the analysis that follows are that rural-to-urban migrants are overrepresented
in privately owned enterprises and in manufacturing and construction industries:
91% of them are employed in the private sector as against 42% of locally registered
non-agricultural hukou holders; and the share of rural-to-urban migrants working
in manufacturing and construction is 51% and 9%, as against 20% and 4% for
urban natives, respectively. Migrants also stand out as earning significantly less.
The simple t test reported in Table A3 shows that migrants’ monthly income is 17%
lower than urban natives’; the difference increases to about 40% when one takes into
account the fact that migrants are attracted to prefectures where they can expect
higher wages.44 As expected, notable differences from urban natives in the 2005
mini-census data can be spotted. This should be kept in mind when extrapolating
results based on the UHS to the rest of China.
B.2 NBS data
We discuss here some issues with NBS data and how we tackle them, and provide
some descriptive statistics.
B.2.1 Issues with the firm panel There are a number of issues with using the
NBS data to study the effect of migration on firm growth. We now discuss these
issues and explain how we take them into account while constructing our variables
of interest.
First, firms may have an incentive to under-report the number of workers as it
serves as the basis for taxation by the local labour department. This should be a par-
ticular concern with migrants, who represent a large share of the workforce and may
be easier to under-report. Along the same lines, workers hired through a “labour
dispatching” (laodong paiqian) company are not included in the employment vari-
able.45 This implies that migrant workers are likely to be severely under-counted in
the firm data. We will estimate the impact of migration inflows on firm performance
without being able to observe the firm-specific increase in employment.46
44Results available upon request.45In manufacturing SOEs, there was also a practice of reclassifying and gradually excluding laid-
off workers—euphemistically, on “furlough” (xiagang)—from their accounts. Although much ofthis process had been completed by the start of our study period, it may still induce some declinein employment in the first couple of waves.
46Wage bill may also be slightly under-estimated as some components of worker compensation
47
Second, some variables are not documented the same way as in standard firm-
level datasets. In particular, fixed assets are reported in each data wave by summing
nominal values for different years. We use the procedure developed in Brandt et al.
(2014) using (i) the change in nominal capital stock as a proxy for nominal fixed
investment, (ii) a fixed depreciation rate at 9% and (iii) the investment deflator
developed by Loren Brandt and Thomas Rawski. Following Brandt et al. (2014),
if the firm’s past investments and depreciation are not available in the data, we
use information on the age of the firm and estimates of the average growth rate of
nominal capital stock at the 2–digit industry level between 1993 and the firm’s year
of entry in the database.
Descriptive statistics from the firm panel Table A4 displays key descrip-
tive statistics across public, domestic private and foreign private firm ownership
over the period 1998-2007.47 Public enterprises, a broad category that encompasses
state-owned and collective enterprises, have a larger capital stock, spend more on
their wage bills and have more employees than domestic private firms. Conversely,
the latter report significantly higher sales revenues and perform better in terms of
value added. Table A4 yields a very different image of state-owned and collective
enterprises when compared to the foreign private sector: Real capital stock, sales
revenues, value added and the total wage bill are all higher in foreign-owned firms;
only the total number of employees is higher in the public sector.
Figure A8 shows the evolution of the share of private firms in the NBS sample
along the same characteristics. Private firms still accounted for a relatively small
share of total real capital stock, value added, sales revenues, wage bill and employ-
ment in 1998 but represented over 80% of the total under all five indicators by 2007.
The evolution in terms of employment is particularly striking: Whereas only 32% of
total employment could be attributed to private firms in the NBS sample in 1998,
they accounted for 89% of it in 2007.
B.2.2 Issues with non-stationary variables In order to estimate the effect of
migration on firm growth, we use a strategy which accounts for the non-stationarity
of firm size and thus most variables characterizing firm output or factor use. In
are not recorded in all years, e.g. pension contributions and housing subsidies, which are reportedonly since 2003 and 2004, respectively but accounted for only 3.5% of total worker compensationin 2007.
47Ownership type is defined based on official registration (qiye dengji zhuce leixing). Out of 23exhaustive categories, Table A4 uses three categories: (i) state-owned, hybrid or collective, (ii)domestic private, and (iii) foreign private firms, including those from Hong Kong, Macau, andTaiwan.
48
order to illustrate our approach, consider a certain firm j located in city d and using
a bundle of input Hj,dt in order to produce a numeraire good. Let W d
t denote the
unit cost of input, and Aj,dt the firm-specific productivity.
The firm maximization problem is:
maxHj,d
t
{Aj,dt (Hj,d
t )α −W dt H
j,dt
},
which generates the following input demand schedule (in which lower case letters
are the logarithm of variables):
hj,dt =ln(α)
1− α+
aj,dt1− α︸ ︷︷ ︸
firm-specific growth process
− wdt1− α︸ ︷︷ ︸
factor shock
.
As a consequence, the stationarity of firm demand (and the subsequent firm out-
comes) depends on the stationarity of the firm-specific technological process (Evans,
1987). For instance, under Gibrat’s law, firm i would grow at a certain given growth
rate νi and aj,dt+1 = aj,dt + νi + εj,dt+1 where εj,dt+1 is the innovation. In such case, it
is important to take the difference in the previous equation in order to have the
firm-specific growth component as a “fixed effect”:
∆t,t−1hj,dt =
νi1− α
+∆t,t−1w
dt
1− α+ εj,dt .
We base our empirical strategy on this assumption of constant firm-specific growth
rate, and consider first-differences so as to keep stationary variables on both sides.
We then estimate the impact of migration on firm growth for firm j in destination
d at time t by regressing each firm outcome, which we denote yj,d,t, on predicted
migration, and time and firm fixed effects.{∆t,t−1Md,t = b0 + bm∆t,t−1Md,t + bzZi + ed + nt + ed,t
∆t,t−1yj,t = β0 + βm∆t,t−1Md,t + δt + πj + εj,t, (8)
where standard errors are clustered at the level of the prefecture of destination×year.
49