PENNSYLVANIA ECONOMIC REVIEWaux.edinboro.edu/pea/pub/PER/PER-V18-2-2011.pdfPennsylvania Economic...

PENNSYLVANIA ECONOMIC

REVIEW

Volume 18, Number 2 Fall 2011

Ralph E. Ancil Questioning the Margin: Using Averages and

an Extended Break-Even Analysis to Find

Optimal Solutions

Muhittin Kaplan Testing Aggregation Bias for the Impact of

Huseyin Kalyoncu Devaluation on the Trade Balance: An

Application to Turkey

Mitchell W. Derubis Access to Formal Credit and Rural Economic

Christian N. Minich Development: An Examination of Prevailing

Assumptions in the Land Rental Market of

Rural India

William K. Bellinger Retail and Population Density in Smaller U.S.

Jue Wang Cities

PENNSYLVANIA ECONOMIC REVIEW


Co-Editors

Thomas Tolin & Orhan Kara, West Chester University

Editorial Board

Gerald Baumgardner, Pennsylvania College of Technology Michael Hannan, Edinboro University of Pennsylvania

Rocky Huang, Pennsylvania State University – Berks

James Jozefowicz, Indiana University of Pennsylvania Yaya Sissoko, Indiana University of Pennsylvania

Kenneth Smith, Millersville University of Pennsylvania

Roger White, Franklin and Marshall College

The Pennsylvania Economic Review is a biannual, refereed journal sponsored by

the Pennsylvania Economic Association. The journal accepts articles and book

reviews on any economic topic. Article submission fee is $25 for nonmembers

and members of the PEA. The subscription price for institutions and nonmembers

is $25 per year. (Members of the Association receive the journal without

additional cost.) Send all subscription orders to the editor. Note: All fees and

subscriptions prices are subject to revision at the discretion of the Editorial Board.

Copyright 2000 by the Pennsylvania Economic Association. Permission is

granted to reproduce articles published in this journal, so long as reproduced

copies are used for non-profit educational or research purposes. For other

purposes, permission must be obtained from the Editor. (ISSN 1531-0949).

PENNSYLVANIA ECONOMIC REVIEW Style and Submission Instructions for Authors

Initial submissions must be made in triplicate with a cover letter indicating that

the manuscript is not currently being considered at another publication. Also include a

complete electronic copy of the paper in MS Word or as a PDF file, or email file to the co-

editors. If a PDF file is sent, remove any author identifying information from the file. The

submission fee is $25. Send submissions to:

Drs. Thomas Tolin & Orhan Kara, Co-Editors

Pennsylvania Economic Review

Department of Economics and Finance

Anderson Hall, 309

West Chester University

West Chester, PA 19383

Email: [email protected] & [email protected]

Papers should have a cover page with the author’s name, address, and affiliation.

The second page should have centered in all caps and bolded the paper title, double space

and in upper and lower case include the authors name followed by affiliation. Triple space

and a fifty to one hundred word abstract should be included. The paper itself should be

typed in times new roman 10 point. Do not number the pages electronically but instead

write them in pencil on the back of each page. All margins should be 1.7 inches.

All papers accepted by the PER for publication must be submitted in Microsoft

Word. Complete guidelines will be sent upon acceptance. Some basic guidelines include:

1. Equations: Default settings should be used. Set point size at 10 points; use Arabic

numbers flush right in parenthesis.

2. References in the text should be parenthetical references with the author’s last name

and date of publication. Alphabetical reference list should be included. See reference

lists in this issue for style format.

3. Tables should be numbered in Arabic numbers, e.g., Table 1. Columns of the tables

should be set using tab stops, NOT SPACES, to align OR the table setting in Word can

be used. Tables should be placed at the end of your article and their appropriate

placement in the text noted.

4. Figures are numbered similarly to tables, e.g., Figure 1. The author will provide the

figures in a Microsoft Word compatible program.

5. If you need to use footnotes, DO NOT use the endnote or footnote setting in Word.

Use superscript numbers to refer to your endnotes, which should appear at the end of

the article.

6. Sections of the paper, such as INTRODUCTION, should be flush left and bolded in all

capital letters with one extra line space between section head and text. For example:

INTRODUCTION

Subheads should be in upper and lower case letters, flush left and italicized, with

one extra line spacing above and below subhead. For subheads below the first level

subhead, indent one tab for second subhead. Please limit subheads to no more than two.

mailto:[email protected]

PENNSYLVANIA ECONOMIC REVIEW


Table of Contents

Ralph E. Ancil

Questioning the Margin: Using Averages and an Extended Break-Even Analysis to Find Optimal

Solutions ……………………………………………………………..………….……………….…1

Muhittin Kaplan and Huseyin Kalyoncu

Testing Aggregation Bias for the Impact of Devaluation on the Trade Balance: An Application to

Turkey …...……………………………………….……………………………...…….…...…… 20

Mitchell W. Derubis and Christian N. Minich

Access to Formal Credit and Rural Economic Development: An Examination of Prevailing

Assumptions in the Land Rental Market of Rural India ………………………………...........35

William K. Bellinger and Jue Wang

Retail and Population Density in Smaller U.S. Cities ……………………………………..….….53

1


Volume 18, Number 2, Fall 2011

QUESTIONING THE MARGIN: USING AVERAGES AND AN EXTENDED BREAK-

EVEN ANALYSIS TO FIND OPTIMAL SOLUTIONS

Ralph E. Ancil

Geneva College

(The author wishes to thank James Jozefowicz, Brian O’Roark, and William O. Pearce for their

constructive comments on an earlier draft of this paper and to the anonymous reviewers for their

effort in this process. Naturally, the author retains full responsibility for any errors.)

ABSTRACT

This paper shows that for some economically significant cases marginal analysis can be replaced

with simple algebra, averages, and an extended version of break-even analysis. Two examples are

used to illustrate the point. The first is historical based on Friedrich von Wieser’s solution to the

imputation problem in Menger’s analysis for explaining factor prices. It is shown that factor

prices can be understood as averages with variations (or break-even points) instead of as marginal

values. In the second case, an extended version of break-even analysis in average revenue/

average cost functions using basic algebra illustrates how the profit-maximizing level of output

can be predicted and how the tangency solution slopes can be derived without the use of the

differential calculus or concepts of marginal revenue and marginal cost. It is concluded that

mainstream theory should recognize such cases where applicable and come to terms with their

implications.

INTRODUCTION

In his Economic Theory in Retrospect, Mark Blaug writes: “…there is a mutual

interaction between past and present economic thinking which illuminates both.” (Blaug, 1997, p.

134) In this spirit, two examples -- one from the past and one from the present -- are used to

illustrate the fact that there are important cases where a simpler mathematical technique can be

used to derive results usually reserved for some form of marginal analysis. The two cases

illustrate that there is often an understandable mixture of technique and ideology which obscures

the possibility of alternative interpretations of results or alternative pathways to those results.

Contrasting models or visions of how the economy works can bring a desirable clarity to present

practices and show that in some instances the method of technique or interpretation of results is a

matter of choice or preference rather than of analytical necessity.

In the first of the following two examples, the Austrian view of determining value

through the process of imputation is presented with special attention to Friedrich von Wieser’s

correction of that process. The second example shows that with an extended form of break-even

analysis and average cost and average revenue functions one can also derive the profit-

2

maximizing level of output, which itself is seen as an average. In both cases, the solution of the

technical problem is the same but either the interpretation of the results or the pathway is different.

EXAMPLE ONE: VON WIESER’S IMPUTATION PROBLEM

Problem Statement and Literature Review

In the Austrian view the problem of value was treated by Carl Menger's method of

"imputation." According to this method, the satisfaction experienced from final consumer goods

is the source of economic value. Starting on the side of final demand, goods of first or lowest

order, entrepreneurs impute value to the upstream, intermediate goods, goods of higher order

(capital and labor). To determine the value of these intermediate goods, one determines the value

of an input variable to the total value by removing it from the product and determining the loss of

satisfaction that results. This approach, his famous “loss principle”, is often described as linear

and causal.

However, Wieser understood that this method in some cases could lead to an over-

valuation of the product where the sum of all the variables' values would be greater than that of the

total product (Natural Value, 1956, pp. 86-89). The problem can be put this way: Suppose a

hunter shoots a dangerous tiger. The success of the shot, 100%, is dependent on both the

cartridge and the rifle, so that x + y = 100. However, we do not know how to value each input

separately. If we follow Menger’s method, we must ask ourselves, what the loss of satisfaction is

if we removed the cartridge. The answer is a total loss or 100%. But if we remove the rifle, that,

too, is a 100% loss. Hence, by this line of reasoning, each input is valued at 100%, an obvious

error.

Wieser proposed to improve upon the matter by a method focused on the factor’s

“productive contribution.” He stressed that the valuation process must include a circular or

cooperative level of interdependence, as different factors are used in different industries, the least

profitable of which establishes their opportunity costs. The solutions for the values of these

interdependent factors were found in the use of the familiar method of simultaneous equations.

The historical literature on Wieser’s contribution gives a mixed evaluation of its merit.

Stigler’s work (1941, pp. 158- 178) is the classic study frequently cited. He gives a number of

criticisms including those attending the assumption of fixed production coefficients, infinite

elasticity of demand, and the limitations of a homogeneous equation of the first degree. He argues

that there is no final difference between Menger’s method (“loss principle”) and Wieser’s

(“productive contribution”) even though he admits that Menger’s method does over-determine the

value of a factor (p. 147 ). Ultimately, he finds the approach “ useless.” Schumpeter (1954, pp.

913-917) describes Wieser as one of the three great Austrian economists but the least technically

competent. He also points out that the difference between Menger’s “loss principle” and Wieser’s

“productive contribution” is fully removed by the method of infinitesimals, i.e., the differential

calculus. Wesley Mitchell (1969, pp. 345-374) gives a lengthy but uncritical exposition of

Wieser’s views including the valuation problem. Maurice Dobb (1973, pp. 193-197) is less

critical of Wieser than Stigler is. Although aware of the limitations of his analysis, Dobb believes

a number of the weaknesses are correctable. He especially emphasizes Wieser’s concern for

absolutely fixed proportions and indivisibilities deserves a respectful hearing. He recognizes that

though infinitesimals resolve the difference between continuous and fixed proportions, it does so

at a loss of realism. He finds most fault with Wieser’s assumption of what is given on the supply

side and its implications for the concept of capital goods. Pribram (1983, pp. 314-318) explains

3

the solution verbally and speaks of it in a positive manner. Mark Blaug (1997, pp. 453-454)

explains the contribution in detail, noting its similarity to linear programming. For Blaug, who is

not exactly keen on Austrian thinking, the Austrian concern for indivisibilities, and with it

Wieser’s solution, is implicitly vindicated by the rise of linear programming techniques among

economists dealing with problems not solvable by the use of the calculus and the assumptions of

continuous differentiability. More recently, Ekelund and Hebert (2007, pp. 307-309) thoroughly

explain Wieser’s contribution and place it in a favorable light, arguing that it not only solves

Menger’s valuation problem, but also emphasizes the Austrian view of demand as the ultimate

source of economic value and the importance of the marginal unit.

In the following, Ekelund and Hebert’s perspective will be used, especially since the

criticisms of various aspects by others of Wieser’s approach are not relevant to the point of

concern here, namely, the mixture of technique and ideology in contrast to an alternative way of

thinking.

Analysis

As indicated above, Wieser deals with the valuation problem by using simultaneous

equations. He poses three industries with three factors, x, y and z, which are used in the

following ways:

x + y = 100 (1)

4y + 5z = 590 (2)

2x + 3z = 290 (3)

Each output value is the price of one unit of the product. The goal is to find the prices of the

factor inputs which exhaust the product.2

To explore the matter further it is convenient to drop the third dimension (z) and work in

the familiar Cartesian plane with two variables (x and y) while retaining three industries. To do

this we can simply unzip the third dimension, z, and place it in the third quadrant. This preserves

Wieser’s main point, interdependency of uses leads to the correct valuation of inputs, while

making a graphical analysis easier. It also allows us to visualize the linear process of imputation

simultaneously with that of circular interdependence.

His equations can be rewritten as:

x + y = 100 (4)

y + 5/4z = 147½ (5)

x + 3/2z = 145 (6)

And rearranging in terms of y (and without z) gives:

y(1) = 100 – x (7)

y(2) = 147 ½ - 5/4y (8)

y(3) = 96 2/3 – 2/3x (9)

Spreading them out over three quadrants with the appropriate sign changes gives:

y(1) = 100 – x (QI) (10)

y(2) = 147 ½ + 5/4x (QII) (11)

y(3) = -96 2/3 + 2/3x. (QIV) (12)

4

Since this problem is being transformed to a two dimensional graph, the z function is

pictured in QIII connecting QII and QIV and this requires appropriate sign changes. In QIII, the z

variable from QI becomes input to both QII and QIV. Or, one can say the function in QIII merely

transforms the x output from QII to the y input to QIV and its slope is accordingly set as -1. It

operates, in other words, like a slack variable. Overall, in this entire process, since the output of

one function is the input of another, the dependent and independent variables alternate for QII and

QIV as explained above. With these points in mind, we have the following equations:

y1 = 100 – x1 (QI) (13)

x2 = -118 + 4/5y1 (QII) (14)

(y2 = -140 – x2) (QIII) (15)

x1 = 145 + 3/2y2 (QIV) (16)

The output value of y1 becomes the input value for the second equation and so on through all four

quadrants. (See Figure 1.)

In Figure 1, we can visualize the causal direction coming from the outlying field in each

quadrant with their product values. Moving through the production functions, entrepreneurs

impute the values to the input factors. At the same time we see the circular or interdependent

movement, say, from QI through QIV coordinating these input values among the industries to

achieve equilibrium. The solutions are: (40, 60) in QI; (-70, 60) in QII; (-70, -70) in QIII; and (-70,

40) in QIV. (See Figure 2.) In terms of the original three variables, the solutions are: x = 40, y =

60, and z = 70.

While, as indicated above, there have been a number of technical criticisms of his

solution from various economists, these points are not the subject matter of this paper.3 Rather, it

is the marginalist understanding given to the solution by Wieser himself and others that is of

concern here. Wieser writes:

In the case of production goods which are available, not individually but in stocks,

imputation of the productive contribution follows the marginal law. To each single item

or quantity is imputed the smallest contribution which, under the circumstances, can be

economically aimed at by the employment of this particular item or quantity…” (pp. 96-

97)

As Ekelund and Hebert (2007) put it:

Given that an input is used in the production of a number of final or consumer goods,

its value will be determined by the least valuable good it produces. This value is

determined at the margin, by the marginal utility of the last unit of the least valuable good

the input is producing. (original emphasis, p. 308)

They conclude that “…the marginal utility of final output is presented as the source of value by

Austrian economists.” (original emphasis, p. 309)

However, we don’t really see this marginality in the graphs or the equations. Instead, it

must be assumed or read into the technique. One could, alternatively, with equal validity, envision

a classical gloss on the matter and recast the solution in those terms. One could, for example, see

market prices and output in terms of averages with fluctuations around a central tendency.

There are at least two different ways of doing so. One is to transpose the solution lines

from Figure 2 into QI and find the simple averages for the variables x and y. Doing so for all four

quadrants, and using all positive numbers, gives a mean value of x* = 55, and standard deviation

of +/-15, and mean of y* = 65, with standard deviation of +/- 5. (See Figure 3, where A = (40,

5

70); B = (70, 70); C = (70, 60); D = (40, 70); and the means are at point E = (55, 65); their ratio

also gives the slope of 1.1818.)

If we take Figure 2 and make some adjustments we can display these results over all four

quadrants. By connecting the solution points in QII and QIV, we have a diagonal line with slope

= -1.1818. Doing the same thing for QI and QIII, we have a diagonal line with slope = +1.1818.

Finding the intersection of these two lines, we have (-15, -5). This forms a solution box and

effectively moves the origin from (0, 0) to (-15, -5). In this case the means and standard

deviations reverse themselves: x* = -15 +/- 55 and -y* = -5 +/- 65. (See Figure 4.)

Another approach is to work the solution backwards, so to speak, by assuming an initial

knowledge of the total values of the input factors, x and y. (This is really a summing up of the

solution coordinates.) The total value of the x’s is 220 and that of the y’s is 260. The ratio of the

y’s to the x’s is:

260/220 = 1.1818. (17)

This ratio of total values is the same as the slope derived from finding the intersection of the two

diagonal lines in the solution box.

With this information one can predict the specific values in each quadrant in this way:

QI: x = -15 + 55 = 40 (18)

y = -5 + 65 = 60 (19)

QII: x = -15 – 55 = -70 (20)

y = - 5 + 65 = 60 (21)

QIII: x = -15 – 55 = -70 (22)

y = -5 – 65 = -70 (23)

QIV: x = -15 +55 = 40 (24)

y = -5 – 65 = -70. (25)

Using the total values, we are able to deduce the specific equilibrium values for each industry even

without knowing the production function of those industries. (See Figure 4.)

With simple averages, there is no assertion about the role of the margin, because all four

solution points are taken to be equally distant from the origin. The value of x = 40 is just as far

from x* = -15 as is x = -70, namely 55. Or, in terms of Figure 2, they are equally distant from

the mean of x* = 55, namely, 15. But there is no one least valuable product which can be argued

to cause or determine the value of the factors. It can be seen as a collective, interdependent,

statistical result where the industries participate in a common pool of factor values.

Weiser himself argues that the cost of production approach is proximately valid, for once

the marginal values are widely dispersed through the economy, the summing of factor prices is a

reasonable approach to their product values (Natural Value, pp. 171ff). From the point of view

of any one individual producer the value of the product must at least be equal to the sum of the

costs that go in to making it up. But, he insists, this is merely a relative assessment since the

ultimate value comes from demand via the theory of imputation.

However, if the results of his imputation analysis in the case discussed above are

reconfigured as a simple average, the sense of the margin is lost. A Smithian cost of production

sees the natural price as an average with no distinctive marginal value determining final value.

The interdependence is not only between industries with respect to factor prices, but also within

6

them with respect to factor and product prices. The marginal is replaced with the central; the

individual addition is replaced by the collective mean.

So much for the historical example. The same principle can be applied to current

mainstream thought.

EXAMPLE TWO: STANDARD REVENUE AND COST CURVES

Problem Statement and Literature Review

Because entrepreneurs may make decisions in a manner different from marginal analysis,

say, by rules of thumb that involve simple averaging, it is useful to see if one can set that heuristic

approach on a sound theoretical basis. Therefore, in the previous paper, the problem of finding a

different pathway to the same marginal result, namely, profit-maximizing level of output, was

looked at more closely in terms of break-even points and averages (Ancil, 2010). In that paper it

was shown that an optimal solution can be found without using marginal analysis, and the

differential calculus, relying instead on simple algebra.1 The success of the examples given there

was due either to the symmetry of the total cost and revenue functions, (i.e., either both were

simple quadratic equations, or they were a combination of linear with quadratic equations), which

allowed one to average either the break-even points or the vertices to arrive at the profit-

maximizing level of output.2 In the present instance, we extend the technique to apply to the case

where this symmetry is not there, i.e., where one of the functions is cubic. It is found that there is

no problem in obtaining an algebraic solution using break-even points and averages and without

the use of marginal analysis.

The familiar break-even analysis, or cost-volume-profit analysis, is usually displayed as

a rough rule of thumb which has some validity in narrowly confined situations, that is, where the

assumptions of constant costs and prices are valid approximations, but which is not capable of

delivering a precise, theoretically sound result, an optimal result, because the concept of

diminishing returns is not considered. Consider, for example, Davies and Lam (2001, p. 190) who

compare the differences between standard economic theory and break-even analysis: “In the first

place, the economic model is an optimising model, which identifies the profit- and contribution-

maximising level of output and price. The break-even model shows no optimum as the levels of

profit and contribution simply increase with the level of output.” (emphasis added). In a similar

vein, the authors Keat and Young (2009, pp. 352-353) summarize the limitations of break-even

analysis, two of which are:

It assumes the existence of linear relationships, constant prices, and constant average

variable costs. However, when the effects of relatively small changes in quantity are

measured, linear revenues and variable costs are certainly good approximations of

reality…The analysis does not result in identification of an optimal point; it focuses on

evaluating the effect of changes in quantity on cost and profits.

What these authors conclude is in fact correct for the usual break-even analysis considered.3 But

if we extend the method, i.e., improve on it to account for the effect of diminishing returns, we

will be able to find optimal solutions after all.

Analysis

Consider the following total revenue and cost functions:

7

TR = 2x – x2 and TC = x

3. See Figure 5. (26)

The relevant break-even points are at (0, 0) and (1, 1). A simple rule of thumb to take the break-

even points and divide by two yields the correct answer, that is, the correct profit-maximizing

level of output (PMO), for the case where TC = x2. However, it yields the same answer here,

where TC = x3. Obviously, this is not right. The solution from marginal analysis

4 is:

MR = 2 – 2x, and (27)

MC = 3x2 . (28)

Solving in the usual manner gives:

(x + 1/3)2 = 7/9, and (29)

x = (√7 – 1)/3

= .5486.

To correct this and arrive at this same answer by an alternative method, we begin with the

average revenue and average cost functions:

AR = 2 – x, and (30)

AC = x2. (31)

To find the break-even points for these, we set them equal to each in the familiar way and

solve:

2 – x = x2 (32)

0 = x2 + x – 2.

After completing the square this becomes:

(x + ½)2 = 9/4. (33)

Solving for x gives the break-even x-values:

x = (√9 – 1)/2 (34)

= 3/2 – ½

= 1 or -2.

The relevant break-even points are (1, 1) and (-2, 4). See Figure 6.

How can the optimal output level which maximizes profit (PMO) be found with the mere

use of break-even points and averages? We make the following observations:

The greatest difference between the AR and AC functions, 1.75, occurs at x = -1/2. This

can be determined by inspection as well as by comparing the slopes of the two functions. The

slope of the AR line is -1 and the slope of the line tangent to the AC curve at x = -1/2 is also -1.

(This is equivalent to shifting the AR line down until it is tangent to the AC curve at this point.)

The axis of symmetry (AS) value is given by the average of the x-values of the break-

even points (BEP):

AS = (BEP1 + BEP2)/2 (35)

8

= - ½.

In the break-even equation, the combined AR/AC function, the slope at x = -1/2 is no

longer -1 but 0, at the vertex.

How do we arrive at the maximum total profit from the maximum average profit?5 Since

the vertex on the original average cost function is at x = 0 (or 0, 0) but the maximum difference

now is at the value x = -1/2 (y = ¼), they are obviously out of alignment with one another. The

maximum total profit can be found by bringing these two into harmony, by making the vertex

equal to 1.75 (7/4) at x = -1/2. This can be accomplished by either shifting the AR curve to the

left by the amount of the AS (x = -1/2) or by shifting the AC curve to the left by that amount.

This is also seen from the quadratic equation combining the AR/AC functions. The

completion of the square technique results in a constant (1/2) whose opposite is the axis of

symmetry (x = -1/2).

Working with the AR curve in Figure 6, we have:

AR = (2 – ½) – x (36)

= 1.5 – x.

This line passes through the point (-1/2, 2) which lies directly above (-1/2, 0). Solving for this

break-even equation:

1.5 – x = x2 (37)

(x + ½)2 = 7/4.

The break-even points at y = 0 are:

BEP1 = (.8229, .67716) (38)

BEP2 = (-1.8229, 3.32296). (39)

Re-graphing the equation, (x + ½)2 = 7/4, gives us Figure 7. The graph shows the break-

even points on the x-axis .8229 and -1.8229. It also shows a line passing with slope = 1/√7 +1,

through BEP2, passing through y = ½, and intersecting the main function at the PMO.

However, we can also achieve the same results from the equation (x + ½)2 = 9/4, by

solving for y = ½. (It is as if these two equations reduce the slope of AR to 0, shift the points to

the left and invert the function so that it opens downward. See Figure 8, where the B1 x-value is

.8229 and B2 is -1.8229.)

The profit-maximizing level of output is found along with the tangency solution and

other results by simple combinations of the break-even points. The formulas for doing so are:

1. PMO = │1/BEP2│ (40)

= 1/1.8229

= .5486.

2. PMO = 1 + BEP1/BEP2 (41)

= 1 + .8229/-1.8229

= 1 - 4514.

= .5486.

3. BEP1 - BEP2 = √7, and (42)

(BEP1 - BEP2)/2 = √vertex

9

= distance between BEP1 and BEP2.

4. The slope, TS, of a line tangent to the TR and TC curves at

the PMO is found by:

TS = (BEP1/BEP2)/AS (43)

= (.8229/-1.8229)/-1/2

= .9028.

5. And finally, the profit-maximizing level of revenue (PMR)

is a line passing through the origin and passing through the

TR function at the PMO point is:

PMR = (1- BEP1/BEP2)x (44)

= (1-.8229/-1.8229)x

= 1.4514x.

= 1.4514(.5486)

= .7962.

The PMR can also be described as a linear revenue growth function which identifies the

maximum revenue at the PMO in terms of the break-even points as:

PMR = │1/BEP2│ + BEP1/BEP22

(45)

= │1/-1.8229│ + .8229/(-1.8229)2

= .7962.

One may reconfigure this information in a stylized graph so that it looks like Figure 9.

Here we modify the AR equation by using BEP2 as the slope:

AR’ = 2 + (BEP2)x (46)

= 2 – [(√7 + 1)/2]x

= 2 – 1.8229x.

This line intersects the x-axis at x = 1.0972. Dividing this in half gives the PMO of .5486. The

difference between this value (1.0972) and 2 is the tangency slope of .9028.

Finally, we can focus on quadrant one and break-even point1 (BEP1) exclusively and

derive all the relevant results. Reasoning in the manner explained above, we adjust the

intersection of the AR and AC from the nominal values to those resulting by shifting the AR line

down by ½. (Recall Figure 6). By drawing a line from the origin through BEP1 (.8229, .6771,

point A in Figure 10), we derive a new break-even line (BEL) which passes through this point and

intersects the original AR line at x = 1.0972 (point B). Dividing this in half gives .5486, the PMO.

(Another adjusted break-even line is the inverse of the value 1.0972 which is .9114, the slope of a

line passing through the PMO at y = ½. This is not on the graph.)

Alternatively, we can continue following the line until it intersects y = 1 at x = 1.21533

which is the inverse of .8229 (point C). Adding one to this figure gives the ratio of BEP2/BEP1 or

2.21533. Subtracting the inverse of this value from 1 gives the PMO, and dividing this into 2 we

get the slope of .9028 at the PMO. We have then the following results, for the adjusted break-

even line (ABEL):

ABEL = .8229x. (47)

10

At ABEL = AR, x = 1.0972 and

y = 2-1.8229x (48)

PMO = 1.0972/2 (49)

= .5486.

Also,

PMO = 1 – BEP2/BEP1 (50)

= 1 – 1/2.21533

= .5486.

And,

TS = 2/2.21533 (51)

= .9028. See Figure 10.

We can also transpose the adjusted break-even line (ABEL) to the upper part of the graph

so that it intersects the AR line at point D in Figure 10 at the PMO.

Note also that the PMO is derived by multiplying the BEP1 by the ratio of the exponents:

PMO = 2/3 (.8229) (52)

= .5486.

For two other applications see the appendix.

CONCLUSION

The above examination adds further evidence that it is possible to consider some

economic models in terms of averages and break-even points and to see economic behavior as

revolving around central tendencies, fixed proportions, and constants rather than as small,

marginal changes. If entrepreneurs make price and output decisions based on such simple rules of

thumb, these decisions may have a more solid analytical foundation than has heretofore been

admitted. Recasting the results of these two cases into a non-marginal mode also emphasizes the

difference between the technical analysis and the verbal (ideological?) gloss and shows that the

marginal analysis or a corresponding marginalist interpretation, is optional, a matter of choice,

not a matter of analytic necessity. The alternative pathways to profit-maximization can be

motivated by a different theory or vision of how the economy works, more in the spirit of a

Smithian, classical view. If so, it may well be that Smith’s description of entrepreneurial

behavior still has a measure of merit. After explaining his view that the “ordinary and average”

rates of wages, profit and rent constitute the “natural price”, he writes:

The natural price, therefore, is, as it were, the central price, to which the prices of

all commodities are continually gravitating. Different accidents may sometimes

keep them suspended a good deal above, and sometimes force them down even

somewhat below it. But whatever may be the obstacles which hinder them from

settling in this center of repose and continuance, they are constantly tending towards it.

(Wealth of Nations, Bk I, vii, 15; emphasis added.)

The full implications of this are beyond the scope of this paper but will certainly depend

on how widely applicable the alternative techniques are. Clearly, the wider the application, the

11

more it impacts research and education in economic theory and the more demands it will make

on the flexibility of the economics discipline to accommodate such a different model.

ENDNOTES

1 It was also shown in that paper that Boehm-Bawerk’s horse-trading example -- in which he

argued that the market clearing price was uniquely determined by a marginal pair -- was also

explicable as a simple average derivable from the total value of reservation prices, without

knowledge of any marginal pairs.

2 It is clear that for some aspects of this discussion one could refer to Rolle’s theorem or the mean

value theorem. However, as indicated in the main text, the point is to see how far one can go just

with simple algebra.

3Nonlinear or curvilinear break-even analysis is, of course, used when the costs and prices are not

constant. But, as Salvatore (2004, p. 300) writes, though this approach is similar to standard

marginal analysis of profit- maximization, the “objective shifts from the determination of the

optimum price and output in optimization analysis to the determination of the output levels at

which the firm breaks even or earns a target profit…” In other words, the goal of nonlinear break-

even analysis is the same as in the linear case, which is non-optimal.

4 Compare 2/(√7 + 1) in the solution of the combined AR/AC functions with the marginal result of

(√7 - 1)/3. Both of these expressions give the same answer for the profit-maximizing level of

output.

5 The familiar algebraic approach of the latus rectum with foci and directrix allows us to identify

the maximum, at the vertex, and the relevant break-even values and the mean as the axis of

symmetry.

REFERENCES

Ancil, Ralph E. (2010) “Using Averages and Break-even Points to Find Optimal

Solutions” in Proceedings of the Pennsylvania Economic Association 2010

Conference, June 3-5, 2010, pp. 26-37.

Blaug, Mark. (1997) Economic Theory in Retrospect. Cambridge University Press.

Cambridge.

Davies, Howard and Pun-Lee Lam. (2001) Managerial Economics (3rd

ed.). Prentice Hall.

Harlow, England.

Dobb, Maurice. (1973) Theories of Value and Distribution Since Adam Smith. Cambridge

University Press. Cambridge.

Ekelund, Jr., Robert B. and Robert Hebert. (2007) A History of Economic Theory and

Method. Waveland Press, Inc. Long Grove.

Keat, Paul G. and Philip K. Young. (2009) Managerial Economics (6th

ed.). Prentice Hall.

Upper Saddle River.

12

Mitchell, Wesley C. (1969) Types of Economic Theory (vol. 2). Augustus M. Kelly

Publishers. New York.

Pribram, Karl. (1983) A History of Economic Reasoning. The Johns Hopkins University

Press. Baltimore.

Salvatore, Dominick. (2004) Managerial Economics in a Global Economy (5th

ed.).

Thomson/South-Western, Australia.

Schumpeter, Joseph. (1954) History of Economic Analysis. Oxford University Press. New

York.

Smith, Adam. (1981) An Inquiry into the Nature and Causes of the Wealth of Nations. Liberty

Fund. Indianapolis.

Stigler, George. (1941) Production and Distribution Theories : The Formative Period. The

Macmillan Company. New York.

Wieser, Friedrich von. (1956) Natural Value. Kelly and Millman, Inc. New York. [German ed.

1889, English ed., 1893]

13

APPENDIX

One can also approach the solution by working with the y–axis as shown in Figure A-1.

One example, if the total cost were simply TC = x2, the y-value corresponding to the PMO is 1.5,

or one half of the distance between y = 2 and y = 1. In the present case where TC = x3 the

corresponding value is 1.4514. We can think of an average value centered at mean y = 1with

variations of +/- .4514, whose total distance equals the slope of the tangency lines of .9028. We

still have y = 1.4514 intersecting AR at point D, and y = .5486, intersecting the original break-

even line (BEL) at y = .5486.

We can also think of the solution being centered at mean y = 1.4514 (point D) with

variations of +/- .4514. The total vertical distance then goes from y = 1 to y = 1.9028. At y = 1,

this intersects AR and TR and TC at x =1.

An alternative way of visualizing the process is seen in Figure A-2. At zero costs the

center is at point (1, 1) where TR and AR are equal. As soon as costs are introduced a cost box

begins to expand around this point. In the present case, we take the ratio of the break-even points,

│.8229/-1.8229│ to derive the value of .4514 and apply it equally from the point (1, 1).

Travelling from this point leftward to the vertical line DF, we have the solution, since that line

intersects the original break-even line at y = x = .5486 (point F). It also insects the TR curve at its

point of maximum profit, at y = .7962.

We further observe that the distance from TR = .7962 to y = 1 is .2038 which is simply

the ratio of the break-even points squared, .45142. Adding this distance to y = 1 allows us to

interpret this portion of the y-axis as TR since 2 - 1.2038 = .7962. Shifting the AR line inward so

that it intersects at the point (.5486, 1.2038), the new slope for this line is the price of 1.4514. We

then have a budget graph showing that consumers’ budget is equal to the TR of .7962 since they

purchase quantity .5486 at a price 1.4514.

Lastly, we observe that 1.02038 is equal to 1.0972 squared.

14

15

16

AR =

2 - x

TR =

2x - x2

TC

=

x3

AC

=

x2

AR

= 2-

x

AR' =

3/2 - x

17

(x + 1/2) = 9/4

Figure 8

Slope = 1/ (7 + 1)

Figure 7

18

19

20



TESTING AGGREGATION BIAS FOR THE IMPACT OF DEVALUATION ON THE

TRADE BALANCE: AN APPLICATION TO TURKEY

Muhittin Kaplan and Huseyin Kalyoncu

Meliksah University, Turkey

ABSTRACT

The empirical literature on the relationship between devaluation and trade balance has mainly

concentrated on aggregate analysis and provides mixed results on the impact of devaluation on

trade balance. However, an analysis undertaken with aggregate data leads to the loss of valuable

information and econometric estimates of the relations at the aggregate level are subject to

aggregation bias. This is because an aggregate analysis of devaluation cannot fully accommodate

the information on the structural change, heterogeneities among trading partners of the economy

and the dynamics of the relationship. To deal with these deficiencies, the current study investigates

the impact of devaluation on trade balance employing data disaggregated by trading partners and

tests whether the parameters obtained from an aggregate trade balance model differ significantly

from the average of those of the corresponding disaggregate trade balance models using

aggregation bias tests of *

1q and *

2q developed and employed in a series of papers (Pesaran et al.

(1989), and Lee et al. (1990a, b), and Lee and Pesaran (1993) and Lee (1997)). This allows us to

find out whether aggregation biases are present in the previous studies of devaluation and to

discover the impact of devaluation on bilateral trade balances through a disaggregate analysis of

the Turkish trade data. The empirical results from our study indicate that devaluation has a

positive impact on both aggregate and bilateral trade balances of the Turkish economy. We also

find that the responsiveness of bilateral trade balances to devaluation differ significantly among

trading partners in terms of their contribution in this process. More importantly, the empirical

evidence in this paper suggests that the existing studies in the literature, carried out at the

aggregate level, underestimate the impact of devaluation on trade balance and, thus suffer from

aggregation bias.

INTRODUCTION

This study evaluates the impact of devaluation on trade balance using both aggregate and

disaggregated data by trading partner countries and to test whether aggregate analysis is subject to

aggregation bias. The relationship between devaluation and trade balance has attracted a lot of

interest in the recent empirical literature which provides mixed results on the effect of devaluation

on trade balance in the long-run. Most of the studies on the subject have mainly concentrated on

aggregate analysis and paid little attention to the links between devaluation and trade balance

operating at the disaggregate trading partner country level. However, this relationship is based on

complex structural relationships and an analysis undertaken with aggregate data is subject to two

deficiencies: The first, economic deficiency, concerns the loss of important information that can

21

be obtained from an investigation of the dynamics of the devaluation-trade balance relationship at

the bilateral level since aggregate analysis disguises a lot of information. A disaggregate analysis

is better able to reflect these important dynamics and economic linkages. The second, statistical

deficiency, is closely related to the fact that there are complex interrelations among trading

partners and that the structure of the economy changes over time. These features mean that

econometric estimates of the relations at the aggregate level are subject to structural breaks and

aggregation bias.

For these reasons, the potential effect of devaluation on the trade balance cannot be fully

evaluated without taking the structural change, heterogeneities among trading partners and the

dynamics of the devaluation-trade balance relationship into account. Because this information

cannot be accommodated into an analysis carried out at the aggregate level, the results from

studies undertaken at aggregate level may be subject to bias. To avoid the potential for aggregation

bias in estimated aggregate relationships, researchers investigate the trade balance-devaluation

relationship at the disaggregate trading partner country level. However, to our best knowledge, no

study has statistically tested the aggregation bias in the empirical literature on the trade balance-

devaluation relationship.

Taking these discussions together, we evaluate the impact of devaluation on the trade

balance econometrically using both aggregate and disaggregate data. In this way, we identify

important aspects of the relationship between devaluation and trade balance which are not possible

in the aggregate analysis and improve the precision of our results by avoiding the aggregation

bias. Furthermore, using recently developed aggregation bias tests, we test for possible biases

which may have occurred in the previous studies on the subject carried out at the aggregate level.

Hence, we provide detailed analysis of the effects of devaluation on trade balance. The rest of the

study is organized as follows. Section 2 provides a brief survey of the empirical literature on the

relationship between devaluation and trade balance. Section 3 introduces the modeling strategy

and the data. Sections 4 presents the results obtained from estimating the trade balance equation at

aggregate and disaggregate level, provide aggregation bias tests results and comments on the

findings. Section 5 concludes.

THE IMPACT OF DEVALUATION ON TRADE BALANCE: A LITERATURE REVIEW

The question of the impact of devaluation on the trade balance has been studied in recent

years in a large literature. Since the publication of Magee’s paper (1973), two strands of the

empirical literature have developed in testing the impact of devaluation on the trade balance. The

first involves estimating the trade balance equation at the aggregate level. The second estimates

the same equation using the data disaggregated by countries. However, the results from these

literatures provide mixed results and hence no consensus view is reached on the impact of

devaluation on trade balance until now.

The aggregate approach considers the trade flow of a country with the rest of the world

using aggregate data. Several studies have examined the impact of devaluation on the trade

balance employing this methodology. Himarios (1985), in his study of 10 countries, and Himarios

(1989), in his study of 15 developing countries, show that devaluations, in general, improve the

trade balance. Recently, Gomes and Paz (2005) found the Marshall-Lerner condition held for the

Brazilian trade balance in the 1990s implying the positive effect of devaluation. In his empirical

analysis of the long-run response of 11 Middle Eastern countries’ trade balances to devaluation,

Bahmani-Oskooee (2001) also concluded that real depreciation has a favorable long-run effect on

the trade balances of most non-oil exporting Middle Eastern countries.

The findings of the studies by Narayan (2004), for New Zealand, and Buluswar et al

(1996), for India, however, indicate that devaluations have had no significant long-run effect on

the trade balance. In a similar study, Upadhyaya and Dhakal (1997) tested the effectiveness of

22

devaluation on the trade balance in eight developing countries and found that devaluation, in

general, did not improve the trade balance in the long run. Bahmani-Oskooee and Alse (1994)

investigated the relationship between trade balance and exchange rate using the quarterly data

from 19 developed and 22 less developed countries, including Turkey, over the years 1971-1990.

Employing the Engle-Granger cointegration technique, the authors found that while devaluation

had no effect on trade balances of Canada, Denmark, Germany, Portugal, Spain, Sri Lanka, UK

and the USA, it had a positive effect for Costa Rica, Brazil, and Turkey, and a negative effect for

Ireland in the long-run. The list also includes Miles (1979) for 14 countries, Bahmani-Oskooee

(1985) for four developing countries, Brissimis and Levantakis (1989) for Greece, Felmingham

(1988) for Australia, Demirden and Pastine (1995) and Brada et al. (1997) for Turkey.

The disaggregate approach considers the bilateral trade flow of a country with the partner

country using the disaggregated trade data. Rose and Yellen (1989) take up bilateral approach on

the US’s bilateral trade with her six trading partners and report the lack of long run effect.

Marwah and Klein (1996) tried to test the phenomenon between Canada and its five largest trading

partners as well as the U.S. and her five trading partners and found mixed results. However,

Bahmani-Oskooee and Brooks (1999), in their study on the effect of devaluation on US’s bilateral

trade with her six trading partners found that real depreciation of the dollar has a favourable long-

run effect on US trade balance. Similarly, Shirvani and Wilbratte (1997) test the relationship

between the real exchange rate and the trade balance using monthly US bilateral trade data for the

flexible exchange rate regime, over the period 1973:1–1990:8. They focus on the USA as the

home country, and Canada, France, Germany, Italy, Japan and the UK as her trading partners.

They find that with the exception of Italy, there is a statistically significant association (in the

expected direction) between the real exchange rate and the trade balance in all cases.

Bahmani-Oskooee and Goswami (2003), investigates the effect of devaluation on Japan’s

bilateral trade with her trading partners. The long-run effect of currency depreciation is found to

be favourable. Ahmad and Yang (2004), investigates the effect of devaluation on Chins’s bilateral

trade with the G-7 countries. They found that real depreciation improves the trade balance with

some countries. Other studies that investigated the effect of devaluation on trade balance at

bilateral basis are: Arora et al (2003) for the India vis-à-vis her trading partners, Wilson (2001) for

Singapore, Malaysia and Korea vis-à-vis their trading partners, Bahmani-Oskooee and Kantipong

(2001) for Thailand vis-à-vis their trading partners, Bahmani-Oskooee et al (2006) for UK vis-à-

vis their trading partners and Halıcıoglu (2007) for Turkey’s bilateral trade with 13 trading

partners.

In summary, the review of the empirical literature shows that there are important

differences among trade balances with trading partners in terms of responsiveness of devaluation.

The level of development also seems to be important in determination of the impact of devaluation

on the trade balance. Taken together, these imply that the aggregate analysis of devaluation may

suffer from aggregation bias. However, the presence of aggregation bias is not tested statistically

in the empirical literature. To close this gap in the literature, we will estimate both aggregate and

disaggregate trade balance equations and test the aggregation bias statistically. The details of the

methodology employed in this study are given in the following section.

METHODOLOGY AND DATA

The modeling framework provides an adequate representation of trade balance model

which allows for structural change, heterogeneities among responsiveness of devaluation to trade

balances with trading partners, and the dynamics of trade balance-devaluation relationship;

namely, disaggregate analysis of devaluation-trade balance relationship. The trade balance

equations developed in this section will be used in the empirical analysis below.

23

Many researchers, for example Bahmani-Oskooee (1991, 2001), Rose (1991) and Gomes

and Paz (2005) used a reduced form equation for the trade balance (TB) in order to study the effect

of devaluation on the trade balance. Kalyoncu and Kaplan (2007) and Kalyoncu et al (2009) stated

that there is an advantage of formulations that used in these studies. Without estimating the

structural parameters of the export and import functions this formulation allows a straightforward

test of the effect of devaluation on trade balance. In its natural logarithm form, the trade balance

(TB) is defined as the difference between the nominal imports )( *MEP in terms of domestic

currency units minus nominal exports )(PX such as )log()log()log( * PXMEPTB . We

can write TB equation using small case letters for the log of variables,

ttttttttt erxmppexmtb )( * (1)

In this equations pandpmx ,,, * represent the natural logarithm of the volume of exports and

imports, foreign import prices and domestic prices respectively. ter is the log of real exchange

rate. Import and exports functions can be defined as:

txtterbyax *

(2)

tmtt erdycm (3)

In these equations a and c are constants, b and d are foreign and domestic income elasticity, x

and m

are real exchange rate elasticity of exports and imports, respectively. After substituting the

equation (2) and (3) into equation (1), we obtain an expression that represents the long run TB,

given by equation 4.

tmxttt erdybyactb )1()( * (4)

By checking the coefficient of t

er variable, which gives the Marshall-Lerner (ML)

condition, one can easily test the effectiveness of devaluation. A negative coefficient on the t

er

variable indicates that the ML condition holds and devaluation improves the trade balance. To take

the dynamics of trade balance into account, the trade balance equation (4) is written as:

ititiitiitiitiitiitiitiiit tbereryyyytb 17165143

*

12

*

10 (5)

We estimate both bilateral trade balance equations (5) for each of the thirteen countries

using once-lagged values of explanatory variables and a lagged dependent variable. In the

aggregate counterpart of the trade balance equation of (5), the aggregate trade balance equation is

estimated. The lagged values variables are introduced into the equation to take short-run dynamics

into account following general to specific modeling approach developed by Davidson et.al. (1978,

p. 680) and Hendry (1979). Then a specification search was carried out on the OLS estimates of

the equations. This involved dropping those variables whose coefficients had t-values, which were

insignificant at a 10 percent level of significance (See Garderen and Lee and Pesaran (2000,

p.316); Pesaran and Pierse and Lee (1994, p.17); Lee and Pesaran (1993, p.39) for a similar

24

specification search methodology). Moreover, since the main aim of the study is to investigate the

trade balance-devaluation relationship, we have always retained the real exchange rate variable

even when they were more insignificant than other variables of the equation. Finally, in our

empirical analysis, we compare the average elasticity from the disaggregate model with the

aggregate model using aggregation bias tests.

Aggregation bias test

As explained in the literature review, most studies on the subject were undertaken at the aggregate

level. However, from an economic and statistical point of view, it seems necessary to study the

relationship between devaluation and trade balance at the disaggregate level. From the economic

point of view, the responsiveness of bilateral trade balances to devaluation and the dynamics of

this relationship may be different across trading partners and aggregate analysis will obscure these

relationships. Furthermore, considering the fact that theories in economics are based on highly

disaggregated units (households, firms behaviour), one will see that any degree of aggregation

might incur a problem of coherence with the underlying theory. In this sense, even more detailed

model of economic process provides more accurate representation of reality.

From a statistical point of view, the empirical studies carried out at the aggregate level

may suffer from aggregation bias in the estimated parameters of the aggregate equation. This issue

is considered in a series of papers (Pesaran et al. (1989); Lee et al. (1990a, b); Lee and Pesaran

(1993); Lee (1997)), where general tests of aggregation bias are developed and employed. An

aggregate model may lead to biased estimates because valuable pieces of information about the

working of the trade balance simply cannot be characterized within an aggregate model. These

involve the change in composition of traded goods over time, heterogeneities among bilateral trade

balances in terms of responsiveness to devaluation, and differences in dynamics of bilateral trade

balance equations. For these reasons, the aggregate parameters will obscure the structural

relationships and it is unlikely that the aggregate model adequately represents the disaggregate

model.

We now turn our attention to comparing the estimates obtained from the disaggregate

model with its aggregate counterpart. For a simple trade balance function, considered to be a

function of only real exchange rate, say, the disaggregate model for m trade partners can be

written as follows:

,:iiiid

uerbH mi ,.....,2,1 (6)

where i

b is the 1nx vector of observations on the dependent variable, trade balance with the

thi trading partner, i

er is the 1nx matrix of observations on the regressor, bilateral real exchange

rate for the thi trading partner, i is the 11x vector of coefficient, real exchange rate elasticity,

associated with i

er , and iu is the 1nx vector of disturbances for the thi trading partner.

The corresponding aggregate equation is given by,

,b :aaaaa

verH (7)

where

m

i

ia

m

i

iaerbb

11

er ,

25

and a

is the 11x aggregate real exchange rate elasticity, and a

v is the 1nx disturbance vector.

The problem of ‘aggregation bias’ is then defined as the deviations of macro parameters from the

averages of the corresponding micro parameters ( mm

i

id /1

). More specifically, a

measure of aggregation bias is defined as dab . A test of aggregation bias then involves

testing the null hypothesis of no aggregation bias ( 0:0 H ) under the assumption that the

disaggregate model holds. The potential for aggregation bias in estimated aggregate relationships

is addressed in Lee, Pesaran and Pierse (1990a,b) and they suggest two statistics, namely the *

1q and *

2q tests, which allow us to test whether the parameters obtained from an aggregate model

differ significantly from the average of those of the corresponding disaggregate models. However,

in practice, it is often the case that we are interested in long-run elasticities and we use a dynamic

model in our estimations. This can be shown in the context of the following simple trade balance

function with lag dependent variable,

ititiitiiituberb

1210 (8)

In this function, we are interested in testing statistically the bias in long-run real exchange rate

elasticity, which is equal to )1/()( 21 iiig . However, the average long-run real

exchange rate elasticity estimated from the disaggregate model can be written in alternative non-

linear and linear forms as shown below. When the average of long-run elasticity is defined in non-

linear form, an average of individual elasticities and the corresponding null hypothesis will follow

as,

m

i i

i

erm 1 2

11

1

1

0

1

1:

1 2

11

0

m

i i

i cm

H

When the average of long-run elasticity is based on average parameters (linear form), the average

elasticity and the corresponding null hypothesis will be,

m

i

i

m

i

i

rer

m

m

1

2

1

1

2

)/1(1

)/1(

011

:1

2

1

1

2

0

cm

cm

Hm

i

i

m

i

i

In a formal setting, the null hypothesis of no aggregation bias when the parameter of interest is

some function of estimated parameters (linear or non-linear) will be,

m

i

ig gm

bg1

)(1

)(

A test of aggregation bias then involves testing the hypothesis 0:0 gH . In testing the

hypothesis, two different aggregation bias statistics, *

1q and *

2q tests, are defined depending on

whether aggregate exchange rate elasticity )(bg is known a priori or not. If the aggregate model

is estimated with certainty (given a priori) and only the disaggregate model is estimated with

uncertainty, then the *

1q statistics will be used in testing the aggregation bias and defined as,

26

m

i

n

m

i

imm

q1

1

1

*

1 )ˆ(1

)(ˆ)ˆ(1

)( iβgbgβgbg (9)

where

m

ji

jjiin Covm1,

2 ˆ)ˆ,ˆ(ˆˆ GββGΩ . On the null hypothesis of 0gη ,

*

1q is distributed

as )1(2 .

If it is assumed that both aggregate and disaggregate parameters are obtained from

estimation, the *

2q statistics is employed in the test of 0H based on the OLS estimate of ,

namely

gngnq ˆˆˆ 11*

2

(10)

where

m

ji

jiijnn

1,

1 ˆ PPΦ and '

iˆmˆ X)X(XGX)X(XGP

1

i

'

ii

1'

a

1

a

'

aai

in which )bG(Ga

ˆˆ and )β(Gii

ˆˆ . On the null hypothesis of 0ˆ , *

2q is distributed as

)1(2 .

In our empirical analysis, we employed both *

1q and *

2q tests because we want to test for

possible biases in real exchange rate elasticity. In this context, the use of aggregation bias tests is

very useful to characterize and summarize the overall properties of the model. If the test results

show evidence of aggregation bias, it is clear that important data are being lost or distorted

through the process of aggregation.

Data

Before undertaking the econometric analysis, this section introduces the data employed in the

applied work and provides a preliminary statistical analysis of it. Quarterly time series data are

used, and the sample period is from 1987: Q1 to 2005: Q4. All data has been taken from the

International Financial Statistics (IFS) and the Central Bank of the Republic of Turkey (CBRT)

electronic data delivery system. The series are seasonally adjusted using X-12 procedure.

Trade balance (bi) is defined as the Turkey’s trade balance with her trading partners. It is

defined as the ratio of the Turkish imports to exports with her trading partners. Logarithmic form

of trade balance is used. Domestic income (y) is defined as the logarithm of the Turkey’s real

GDP, foreign income (yi*) is defined as logarithm of the partner countries real GDP, eri is the

logarithm of the bilateral real exchange rates. Those countries which accounts for two percent or

more of total trade of Turkey are selected as trading partners of Turkey, namely Austria, France,

Germany, Iran, Italy, Japan, South Korea, Netherlands, Spain, Sweden, UK, and USA. The rest of

the trading partners are taken together and named as OTHERS. Hence, total number of trading

partners employed in the empirical analysis is equal to thirteen countries.

The data employed in estimation of the aggregate model is defined as follows: world

income (ya*) is defined as logarithm of Industrial production index, aggregate trade balance (ba) is

the natural logarithm of ratio of total imports over total exports and era is the natural logarithm of

the reel effective exchange rate.

27

Unit root tests

As a first stage in the analysis of the data, it is useful to obtain a broad overview of the properties

of the output series involved. In particular, it is important to establish the level of integration of the

series before carrying out a regression analysis because non-stationary regressors invalidate most

of the standard empirical results. Such an analysis was undertaken for each of the variables of

interest considered at levels using the Augmented Dickey-Fuller (ADF) test. In each case, ADF

statistics were calculated for the bi , y, yi*, eri series and for the ba, ya* and era series in the

‘levels’ case. The selection of the number of lags is carried out using the Akaike Information

Criteria (AIC). The results of the ADF tests computed over the sample period for the levels and

first differences of variables. The 95% critical value for the ADF test statistic including an

intercept in the underlying Dickey-Fuller regression is -2.903 for 76 observations. The results of

ADF test show that there was evidence that the variables considered are I(1) in almost all cases

except trade balance variables, in where most of them are I(0).

It is well known that ADF tests have low power meaning that ADF tests do not reject the

null hypothesis of the series being I(1) often enough (Maddala and Kim, 2000). For this reason,

and because we have a panel of data here, in this work, we also applied the standardized "t bar"

test that proposed by Im, Pesaran and Shin (2003) for the unit root tests. The "t- bar" test increases

the power by exploiting the panel structure of the data, and is based on the average value of the

Augmented Dickey-Fuller statistics estimated for each sector. Using the AIC criterion to

determine the order of ADF regression ip , the test statistics on the data using standardized t-bar

formula were 0.781 for eri series and 0.931 for yi* series, which do not rejects the unit root null at

standard significance levels. For bi series, the null hypothesis of unit root is rejected with t-bar

value was -3.896 at standard significance levels. Taken these test results together, it seems

reasonable to proceed an empirical analysis of trade balance-devaluation relationship under the

assumption that eri and yi* series are I(1) and trade balance variables, bi, are I(0).

When the dependent variable is integrated of I(0) and independent variables are integrated of

I(1), it is still possible for these variables to be cointegrated if the linear combination of

independent variable have I(0) process. In this case error term will be stationary, since both the

dependent variable and the linear combination of independent variables are stationary (Charemza

and Deadman, 1997: 126-127). Considering the discussion given above, the stationarity of error

terms is tested for detecting the existence of long-run relationship.

EMPIRICAL RESULTS

The primary concern of this section is to identify the impact of devaluation on trade

balance based on the results obtained from estimating the bilateral and corresponding aggregate

trade balance equations. We then look at the results from the ‘aggregation bias’ tests. The trade

balance equations are estimated for thirteen bilateral trade balances in Turkey, using quarterly data

over the period 1987Q1-2005Q4 and employing the estimation and specification search procedure

described above. The rest of the section provides the results obtained from the disaggregate model,

the corresponding aggregate model and the aggregation bias tests.

Bilateral trade balance

In this sub-section, we will present and comment on the results obtained in estimating model (5)

for each of the thirteen bilateral trade partners of Turkey in turn. The results obtained in the

28

underlying trade balance equations are presented in Table 1(a). The diagnostic statistics associated

with the trade balance equations are given in Table 1(b). These include the adjusted 2R , the

estimated standard error of the equation, , the value of maximum of log-likelihood (LLF), and

statistics for the tests of residual serial correlation2

.cs , functional form 2

ff , normality 2

n ,

heteroskedasticity 2

H and stationarity.

Table 1(a). The effect of devaluation on Turkish bilateral trade balance, 1987q1-2005q4.

i0 i1 i3 i4 i5 i6 i7

Long-run

er

elasticity

AUST 18.188 -0.387 2.043 -1.391 -0.549 0.370 -0.873 (1.02) (-0.78) (3.05) (-1.96) (-2.45) (3.372)

FR 12.635 -1.604 1.173 -0.319 0.589 -0.779 (0.80) (-4.13) (4.35) (-1.88) (7.64)

GER -14.097 -0.329 0.777 -0.086 0.513 -0.177 (-1.49) (-1.88) (3.51) (-0.98) (5.44)

IR -27.814 -0.075 1.287 -0.272 0.493 -0.538 (-0.55) (-0.17) (0.76) (-1.54) (4.77)

IT 6.218 -0.682 0.285 -1.002 0.746 0.886 -2.245 (0.48) (-1.65) (0.90) (-3.38) (2.37) (14.22)

JAP -84.970 0.884 1.593 -0.063 0.545 -0.139 (-2.06) (0.88) (2.54) (-0.31) (6.09)

KOR -13.895 0.359 0.993 -0.723 0.541 -1.579 (-0.31) (0.76) (0.70) (-1.31) (5.53)

NET -3.296 -0.510 0.421 -0.110 0.764 -0.468 (-0.24) (-2.52) (1.87) (-0.73) (10.51)

SPA 54.709 -1.331 2.391 -1.563 -0.827 0.628 -2.229 (2.76) (-4.35) (4.09) (-2.54) (-3.10) (7.73)

SWE -5.166 -1.163 0.908 -0.190 0.814 -1.026 (-0.19) (-2.39) (2.45) (-0.64) (14.03)

UK -16.915 -1.166 0.900 -0.082 0.673 -0.252 (-0.98) (-4.62) (3.50) (-0.50) (9.50)

USA 50.007 -2.676 1.235 -0.267 0.395 -0.442 (2.92) (-6.22) (3.57) (-1.70) (4.47)

OTHERS 24.783 0.039 0.649 -0.781 -0.325 0.648 -0.925 (1.56) (0.15) (1.89) (-2.26) (-2.03) (7.25)

Notes: Coefficients refer to the following estimated regression

ititiitiitiitiitiitiitiiit bereryyyyb 17165143

*

12

*

10. Values in

parentheses are t-statistics. Variable definitions are provided in the text. *, ** indicate significant

at 5% and 10% level respectively.

The estimated equations presented in Table 1(a) have generally acceptable diagnostics

and explain a considerable amount of variation in bilateral trade balances ranging from 0.46 for

Austria to 0.87 for USA. The diagnostic statistics given in Table 1(b) are generally acceptable,

29

although normality is rejected in only the case of Iran. The last columns in these tables present the

results from the unit root tests applied to the error terms. The results indicate that there is no sign

of a unit root in the errors of the regressions.

Table 1(b). Summary and diagnostic statistics

2R LLF )1(2

sc )1(2

ff )2(2

N )1(2

H ADF

AUST 0.456 0.096 71.905 0.194 1.010 0.418

2.708 -2.763(2)*

FR 0.711 0.064 102.266 1.087 0.164 0.303 0.061 -8.079(0)*

GER 0.706 0.046 126.590 3.250 0.083 0.233 1.474 -5.112(3)*

IR 0.524 0.373 -29.936 1.036 3.957 1260* 0.003 -5.605(3)*

IT 0.823 0.068 97.945 0.080 0.016 1.252 1.055 -7.946(0)*

JAP 0.766 0.129 49.583 0.118

0.963 1.032 6.014 -8.921(0)*

KOR 0.597 0.302 -14.186 0.446 0.317 6.610 6.718 -5.986(3)*

NET 0.674 0.063 102.748 0.0003 0.645 0.385 0.746 -5.657(1)*

SPA 0.754 0.084 82.264 0.023 0.102 18.599* 0.139 -8.588(0)*

SWE 0.813 0.103 66.558 0.473 0.188 1.323 1.968 -4.427(3)*

UK 0.837 0.056 112.368 0.382 0.179 0.792 0.263 -7.667(0)*

USA

0.873 0.068 97.047 1.838 0.524 3.286 2.635 -9.156(0)*

REST 0.647 0.048 123.202 1.734 0.0005 2.993 0.0002 -7.612(0)*

Notes: ADF is the Augmented Dickey-Fuller test statistics carried out on the regression residuals.

The number given in parenthesis next to ADF test statistics are the number of augmentation in the

underlying Dickey-Fuller regression. (*) indicates that the null hypothesis is rejected at

conventional 5% significance level.

Taken together, these tables shed light on some important features of the process of trade

balance determination and the role of devaluation in this process. Firstly, there are considerable

differences among the bilateral trade balances of the economy in terms of their long-run responses

to the change in real exchange rates which range from -0.139 for Japan to -2.245 for Italy. The last

column in Table 1(a) provides the long-run exchange rate elasticity of trade balances. The

inspection of this table shows that the long-run impact of devaluation on the trade balance differs

significantly among bilateral trade balance equations. It seems that devaluation has relatively more

impact on the bilateral trade balances with those countries that are relatively similar level of

development with Turkey, such as Italy, Spain, South Korea than the trade balances with more

advanced countries, such as UK, USA, Germany (the long-run effect of devaluation on trade

balances with these countries are -0.252, -0.442, and -0.177 respectively).

Secondly, it is clear from the results tables that there are important dynamic adjustments

influencing bilateral trade balances. The lag dependent variables have statistically significant

coefficients in all of bilateral trade balance equations. In particular, relatively higher coefficients

on the lagged dependent variables (0.89 for Italy and 0.81 for Sweden) have important

implications for the impact of devaluation on trade balances. They imply that adjustment takes

time and only a small part of the long-run effects of devaluation is observed in the short-term. In

other words, the positive effects of devaluation can only be fully realised over time. In addition, it

can be argued that the differences in the extents of adjustment coefficients across bilateral trade

30

balance equations show the importance of evaluation of effects of devaluation at disaggregate

level.

Thirdly, the size of the economic activity in trading partner countries seems to be closely

related to bilateral trade balances. The foreign income variable, y*, was negative as expected

(except in cases of Japan and Korea) and statistically significant in most of the cases. The results

indicate that the impact of change in income of trading partners on trade balance differ

significantly ranging from -0.075 for Iran to -2.676 for USA. This implies that Turkish exports to

France, Spain, Sweden, UK and USA is very sensitive to the economic activities in these

countries.

Fourthly, the results indicate that increase in domestic economic activity has a positive

and statistically significant impact on bilateral trade balance in most of the cases. While the short-

run coefficient on domestic income variable is higher than one only for France, Japan, Iran and

USA, it is less than one for the rest of the trading partners. Considering the sizes of coefficients on

eri variable, these findings imply that for the first group of countries, domestic economic activity

seems to be more important in determining the bilateral trade balances than the devaluation.

In summary, the estimation results clearly indicate that the estimates of bilateral trade

balance equations differ significantly from each other. While in some cases, devaluation plays an

important role in determining trade balances, in others, domestic income or foreign income seem

to be the main determinant of bilateral trade balances implying the presence of different trading

patters at the bilateral level. As mentioned earlier, the observed heterogeneities among estimates

and differences in dynamics in bilateral trade balance equations may lead to biases at the

aggregate level. In the following section, we first present the results from aggregate trade balance

model and then test statistically for the aggregation bias.

Aggregate trade balance equation

Having found that the long-run real exchange rate elasticities differ significantly among trading

partners of the economy and it plays an important role in determination of the level of trade

balance, we now turn our attention to the application of the tests of aggregation bias, which

compare the estimates of the disaggregated model with the corresponding estimates of the

aggregate trade balance equation for real exchange rate elasticity. To consider these issues, we

estimated the aggregate trade balance equation (5) and the results for the equation (5) are

presented in Table 2. It is worth noting that variables used in the estimation of the aggregate

equations are obtained by aggregating over the bilateral data employed in the disaggregated

analysis above.

Table 2. Aggregate trade balance equation, 1987q1-2005q4.

Dependent

Variable i0 i1 i2 i3 i4 i5 i6 i7

ba 2.072 -0.246 - 0.775 -0.527 -0.470 0.419 0.777 (t-values) (0.18) (-1.18) (3.00) (-1.88) (-2.97) (2.50) (9.50)

Summary and Diagnostic Statistics

2R 0.732 )1(2

sc 0.749

0.453

)1(2

ff 2.106

LLF -43.444 )2(2

N 1.928

31

)0(ADF -8.697* )1(2

H 2.532

Note: see the notes in Table 1(a) and Table 1(b)

The results in Table 2 demonstrate that, while it is possible to obtain an aggregate

equation that is able to explain a very high proportion of the variability in aggregate trade balance,

it is not easy to accommodate the heterogeneities in coefficients and differences in dynamics that

exist in disaggregate models into the aggregate model. The diagnostic statistics related to these

results are acceptable and show that the aggregate equations are well specified. Since the

independent variables in the aggregate equation are I(1), we applied the ADF tests to the residuals

in the aggregate equation. The ADF test result, provided in Table 2, indicates that there are no unit

roots in the error terms.

The coefficient of real exchange rate has a correct sign and statistically significant. It is

equal to -0.231 which is smaller than the average of the disaggregate model. It is also worth noting

that the long-run coefficient of the domestic income and foreign income variables are almost equal

in size with -1.09 and 1.11 respectively. Taking these findings together it seems that domestic and

foreign economic activities seem to be more important in determining trade balance than

devaluation at the aggregate level in Turkey. It is also evident that the coefficient of the lag

dependent variable is relatively higher in the aggregate model with 0.776 than the average of

disaggregate model (0.604) implying that trade balance adjustments take a long time following a

devaluation.

Aggregation bias tests

In this section, we provide formal test results obtained in applying the aggregation bias tests, to

investigate whether the observed differences between aggregate and disaggregate long-run real

exchange rate elasticities are statistically significant.

The aggregation bias test results are presented in Table 3. At the risk of repetition, it

should be reiterated that we considered linear and non-linear cases corresponding to linear and

non-linear formulation of restrictions. Furthermore, each case involves two tests, one of which is

called the *

1q test and the other one the *

2q test. The first test of aggregation bias, the *

1q test,

contrasts average values of the disaggregate estimates with a 'consensus' view of the real exchange

rate elasticity of aggregate trade balance. In our study, we used the coefficient of real exchange

rate obtained from the aggregate model as a 'consensus' view elasticity, because the empirical

literature provides ambiguous results on the impact of devaluation on trade balance. The second

test, *

2q , compares the estimated elasticity based on the disaggregate model with the one obtained

from the aggregate model. Table 3 presents these elasticities. The test results that were given in

Table 3 show that there is a clear evidence to reject the null hypothesis of no aggregation bias in

both linear cases and in one of the non-linear case which is *

1q test. Failure to reject the null

hypothesis for the non-linear case (*

2q test) might be due to the imprecision of the estimated

coefficients in some of the disaggregate trade balance models.

32

Table 3. The aggregation bias test results

Non-linear Case Linear Case

Aggregation bias

test

Q1-CHI-SQ(1) Q2-CHI-SQ(1) Q1-CHI-SQ(1) Q2-CHI-SQ(1)

4.49440** 1.17270 6.73213*

3.12174***

Average Coef. -0.898 (0.315) -0.453 (0.086)

Aggregate Coef. -0.231 (0.107)

Notes: Critical values of CHI-SQ(1) test are (2.706),(3.841) and (6.635) for 10%, 5% and 1%

significance level respectively. *,**,*** indicates the rejection of the null at 1%, 5% and 10%

significance level respectively. The number in parenthesis indicates standard errors of coefficients.

Taken together, the aggregation bias test results given in Table 3 clearly provide strong

evidence that empirical work on the aggregate level is subject to bias. The ‘aggregation bias’ test

results provided above have also important implications for the analysis of the devaluation. It

seems that the studies carried out at the aggregate level underestimate the impact of devaluation on

trade balance in Turkey. In this sense, the use of aggregate data for policy purposes may be

misleadingly in predicting the full impact of devaluation. It is also true that the knowledge on

bilateral responses to devaluation will help to exploit fully the potential positive effects of

devaluation and to achieve macroeconomic stability.

CONCLUSION

In this study, the effects of devaluation on trade balance of Turkey have been investigated

econometrically using the relevant statistical methods. Based on the information provided in the

literature, we identified the three important points that needs to be addressed in any study that

attempts to investigate fully the impact of devaluation on trade balance of Turkey; namely,

compositional change in traded goods, and the heterogeneities and differences in dynamics among

bilateral trade balances of the economy.

The brief overview of the empirical literature on trade balance-devaluation relationship

revealed that the most of the studies on the subject were carried out at aggregate level and

provided ambiguous results on the impact of devaluation on trade balance. We argued that because

the studies in the literature were carried out at the aggregate level they cannot accommodate the

information mentioned above satisfactorily and they may lead to biased results. In order to

accommodate this information into an empirical analysis, we set out our empirical analysis at the

disaggregate level.

The findings of the study indicated that there are significant differences in estimates and

adjustment coefficients among trading partner countries implying that it takes time to benefit from

devaluation and bilateral responsiveness of devaluation need to be taken into account in the

formulation of economic policies. These results together imply that important aspects of the trade

balance-devaluation relationship can only be accommodated within a disaggregate model. This is

confirmed in the statistical comparison of the estimated disaggregate model with its aggregate

counterpart, which indicates that trade balance is best modeled through a disaggregate model. The

aggregation bias test results also show that the aggregate model underestimates the effect of

devaluation in Turkey and may be misleading. Given the significant effects of devaluation on

trade balance determination, these results certainly indicate that the work undertaken at the

bilateral level is necessary for better policy formulation and for evaluation of the impact of

devaluation in Turkey.

33

REFERENCES

Ahmed, J., Yang J. (2004) Estimation of the J-curve in China. East-West Center Working Papers

67: 1-23.

Arora, S., Bahmani-Oskooee, M., Goswami, G. G. (2003) Bilateral J-curve between India and

her trading partners. Applied Economics 35, 1037–41.

Bahmani-Oskooee, M. (1985) Devaluation and the J-curve: some evidence from LDCs. The

Review of Economics and Statistics 67: 500–504.

Bahmani-Oskooee, M. (1991) Is there a long-run relation between the trade balance and the real

effective exchange rate of LDCs? Economic Letters 36: 403-407.

Bahmani-Oskooee, M., (2001) Nominal and real effective exchange rates of Middle Eastern

countries and their trade performance. Applied Economics 33: 103-111.

Bahmani-Oskooee, M., Alse, J. (1994) Short-run versus long run effects of devaluation: error

correction modeling and cointegration, Eastern Economic Journal 20(4): 453–64.

Bahmani-Oskooee, M., Brooks, T. J. (1999) Bilateral J-curve between US and her trading

partners, Weltwirtschaftliches Archiv 135: 156–65.

Bahmani-Oskooee, M., Economidou, C., Goswami, G.G. (2006) Bilateral J-curve between the

UK vis-à-vis her major trading partners. Applied Economics 38: 879-88.

Bahmani-Oskooee, M., Goswami, G. G. (2003) A disaggregated approach to test the J-curve

phenomenon: Japan vs. her major trading partners. Journal of Economics and Finance 27:

102–13.

Bahmani-Oskooee, M., Kantipong, T. (2001) Bilateral J-curve between Thailand and her trading

parteners. Journal of Economic Development 26: 107-17.

Baluswar, M. D., Thomson, H., Upadhyay, K. P. (1996) Devaluation and the trade balance in

India: stationarity and cointegration. Applied Economics 10(2): 263–72.

Brada, J.C., Kutan, A., Zhou, S. (1997) Exchange rate policy and outward orientation in

developing countries: the Turkish experience. Journal of Development Studies 33: 675-92.

Brissimis, S. N., Leventakis, J. A. (1989) The effectiveness of devaluation: a general equilibrium

assessment with reference to Greece. Journal of Policy Modeling 11(2): 247–71.

Charemza, W.W., Deadman, D.F. (1997) New Directions in Econometric Practice, Edward

Elgar Publishing Limited, UK.

Demirden, T., Pastine, I. (1995) Flexible exchange rates and the J curve: an alternative approach,

Economics Letters 48: 373-7.

Felmingham, B.S. (1988) Where is the Australian J-curve? Bulletin of Economic Research 40,

43-56.

Gomes, F.A.R., Paz, S. (2005) Can real exchange rate devaluation improve the trade balance?

The 1990-1998 Brazilian case. Applied Economics Letters 12: 525-528.

Halıcıoglu, F. (2007) The Bilateral J-curve: Turkey versus her 13 Trading Partners. MPRA Paper,

No:3564.

Himarios, D. (1985) The effects of devaluation on the trade balance: a critical view and

reexamination of Miles’s (New Results). Journal of International Money and Finance 4: 553–

63.

Himarios, D. (1989) Do devaluations improve the trade balance? The evidence revisited.

Economic Enquiry 27: 143–68.

Im, K. S., Pesaran, M. H., Shin, Y. (2003) Testing for unit roots in heterogeneous panels. Journal

of Econometrics 115: 53-74.

Kalyoncu, H., Kaplan, M. (2007) Currency devaluation and trade balance in Turkey. The

Empirical Economic Letters, 6(6): 473-479.

34

Kalyoncu, H., Ozturk, I., Artan, S. and Kalyoncu, K. (2009) Devaluation and trade balance in

Latin American countries, Proceedings of Rijeka Faculty of Economics: Journal of

Economics and Business, 27/1: 115-128.

Lee, K. (1997) Modelling economic growth in the UK: An econometric case for disaggregated

sectoral analysis. Economic Modelling 14: 369-394.

Lee, K., Pesaran M. H. (1993) The role of sectoral interaction in wage determination in the UK

economy. Economic Journal 103: 21-55.

Lee, K. C., Pesaran M. H., Pierse R. G. (1990a) Testing for aggregation bias in linear models.

Economic Journal 100: 137-150.

Lee, K. C., Pesaran M. H., Pierse R. G. (1990b) Aggregation bias and labour demand equations

for the UK economy. in T. S. Barker and M. H. Pesaran (eds), Disaggregation in Econometric

Modelling, London, Routledge.

Maddala, G.S., Kim I. M. (2000) Unit roots, cointegration, and structural change, Cambridge

University Press.

Magee, S.P. (1973) Currency contracts, pass through and devaluation. Brooking Papers on

Economic Activity 1: 303-325.

Marwah, K., Klein L.R. (1996) Estimation of J-curve: United States and Canada. Canadian

Journal of Economics 29: 523-539.

Miles, M. A. (1979) The effects of devaluation on the trade balance and the balance of payments:

some new results. Journal of Political Economy 87(3): 600–20

Narayan, P.K. (2004) New Zealand’s trade balance: evidence of the J-curve and granger

causality. Applied Economics Letters 11: 351-354.

Pesaran, M. H., Pierse R. G., Kumar M. (1989) Econometric analysis of aggregation in the

context to linear prediction models. Econometrica 57: 861-88.

Pesaran, M. H. and R. G. Pierse and K. Lee (1994) Choice between disaggregate and aggregate

specifications estimated by Instrumental Variables Methods, Journal of Business and

Economic Statistics, 12: 11-21.

Rose, A.K. (1991) The role of exchange rates in a popular model of international trade: does the

‘Marshall-Lerner’s condition hold? Journal of International Economics 30: 301-16.

Rose, A.K., Yellen J.L. (1989) Is there a J-curve? Journal of Monetary Economics 24: 53-68.

Shirvani, H., Wilbratte, B. (1997) The relation between the real exchange rate and the trade

balance: an empirical reassessment. International Economic Journal 11(1): 39–49.

Upadhyaya, K.P., Dhakal, D. (1997) Devaluation and the trade balance: estimating the long run

effect. Applied Economics Letters 4: 343-345.

Van Garderen, K. J. and K. Lee and M. H. Pesaran (2000) Cross-sectional aggregation of non-

linear models, Journal of Econometrics, 95: 285-331.

Wilson, P. (2001) Exchange rates and the trade balance for dynamic Asian economies: does the J-

curve exist for Singapore, Malaysia and Korea? Open Economies Review 12: 389–413.

35



The following article won the Best Undergraduate Student Paper Award at the 2011

Pennsylvania Economic Association Conference.

ACCESS TO FORMAL CREDIT AND RURAL ECONOMIC DEVELOPMENT: AN

EXAMINATION OF PREVAILING ASSUMPTIONS IN THE LAND RENTAL MARKET

OF RURAL INDIA

Mitchell W. DeRubis and Christian N. Minich*

Indiana University of Pennsylvania

(We would like to acknowledge Dr. James Jozefowicz and Jeff Moon for their help in clarifying

and strengthening this paper. Also, thanks to the International Crops Research Institute for the

Semi-Arid Tropic (ICRISAT) for giving us their Village Level Study data for 2001-2004

ABSTRACT

The effect of access to formal credit on the land rental market, a key input suggested by prior

studies as representative of production decisions overall, is analyzed econometrically. Using

pooled cross-sectional data from six Indian villages in semi-arid provinces of India over the years

2001-2004, a regression equation is estimated using logit. The dependent variable is the decision

to participate in the land rental market. Key types of independent variables are the log of formal

credit, the log of various kinds of informal credit, district, village, and individual characteristics.

Our results indicate that formal credit exerts neither a significant nor a substantial effect on the

decision to participate in the land rental market. The amount of land owned, the amount of rainfall

the year prior, and whether a farm was designated small all exert a negative, significant influence

on dependent variable; whether a farm is designated large and the value of draft animals exert a

positive, significant effect on the dependent variable.

INTRODUCTION

Background

In rural India, land is a scarce resource (Kochar, 1997b). This is particularly true in the

semi-arid climate of the villages in the sample of this paper. In the Indian economy, land is rented

in or rented out depending on each farmer's supply of other inputs, such as draft animals or farm

equipment. The land rental market is known to function well, and it is used to equilibrate

nonmarketable or imperfectly marketable goods, such as draft animals, farm equipment, or family

labor (Skoufias, 1995). Thus, the land rental market in India is an essential component in

examining development in rural Indian economies.

36

As Kochar (1997b) notes, there are multiple reasons to cast doubt on the assumption that

lack of access to formal credit constrains agricultural production decisions in the case of

economies in which land is scarce. There are low levels of fixed capital and infrastructural

development, while fragmented landholdings keep the working capital required to operate a farm

low (Kochar, 1997b). Also, there could be substitutes for formal credit for rural farmers, such as

rental markets (Kochar, 1997b).

If access to formal credit does influence the production decisions of rural farmers, then

the farmers should equilibrate their access to formal credit by using the land rental market

(Kochar, 1997b). In other words, if access to formal credit is significant in the production

decisions of Indian farmers, then there should be a correlation between access to formal credit and

those farmers' use of the land rental market. In this way, analyzing how the land rental market

responds to formal loan amounts should provide a good indication of the impact of access to

formal credit on the land rental market. According to Kochar (1997b), little empirical research has

been conducted on this question, although the assumption has been widely held (see Jaynes

(1984); Eswaran and Kotwal (1989)). This paper is a response to that call for further empirical

investigation.

As Binswanger and Khandker (1992) point out, the benefits from a subsidized

agricultural formal credit program have affected primarily the farmers who currently receive these

subsidized loans. They note that if many of the farmers who receive these loans have larger

resource endowments, then this differential access will result in a selection bias that will

artificially make the program appear more beneficial than it actually is. This selection bias is

crucial, because one of Kochar’s (1997b) primary conclusions was that farmers’ production

decisions are more due to resource endowments, such as the amount of irrigated land, rather than

due to access to formal credit. Farmers who lack access to formal credit but borrow from informal

sources do have a higher rate of participation in the land rental market; however, the impact of the

informal credit sector appears to be less significant than the effect of owning other inputs, such as

irrigated land (Kochar, 1997b).

Kochar (1997b) has already challenged this assumption with an empirical study using the

All-India Debt and Investment Survey of 1981-1982 (1997b), and found that lack of access to

formal credit did not, in fact, exert a significant influence on the land rental market. However, her

study was limited, and there has been a lack of continued literature that surveys more modern data.

The goal of this paper is to provide an updated empirical study using data from 2001-2003,

implementing Kochar's (1997b) methodology in order to ascertain whether her conclusions hold

true across time, as well as to note the impact of economic development in India over the last few

decades.

One aspect of Kochar’s (1997b) original research that this paper should shed additional

light on is that the sample size is in a more arid locale than her sample in Uttar Pradesh. The

sample in this paper comes from six semi-arid villages across multiple provinces. If irrigated land

increases the likelihood to participate in the land rental market, then lack of irrigated land should

have a greater impact on the decision to participate. Kochar (1997b) also notes that her sample did

not include enough specific data on the type of informal credit being lent, whereas the data in this

paper contains more information on the nature of informal credit, enabling a more thorough

examination of the role of informal credit in the land rental market, in addition to the effect of

formal credit on the land rental market. Also, this study will use a pooled cross-sectional dataset,

rather than the cross-sectional analysis conducted by Kochar (1997b). Finally, the inclusion of

variables not present in Kochar’s study but suggested by other literature (e.g. Skoufias (1995))

should help to better examine the relationship between production decisions and their

determinants. These variables include a measure of the education of the members of a household,

dummy variables for the size of the farm, and an improved measure of draft animals.

37

This paper will explore the effects of formal credit on the production decisions of Indian

farmers, which could have key policy implications for developing countries operating under the

assumption that better formal credit access is the highest priority for economic development. The

analysis in this study will corroborate or refute Kochar's (1997b) results, shedding further light on

prevailing assumptions on the role of formal credit in developing economies.

The second section will review the literature on the role of credit in developing

economies. The data and variables for this study are discussed in the third section. Section four

will explicate the methodology and the functional form selected for this analysis. The results will

be discussed in the fifth section. The conclusions and potential extensions of this research will be

discussed in the final section.

LITERATURE REVIEW

Kochar (1997b) examines the assumption that lack of access to formal, cheap credit is a

determining factor in rural farm rental markets in India. Other regressors considered are informal

credit, assets, land, the number of males and females in a household, irrigated land, how many

plots the land is divided into, and the ownership of draft animals. Kochar takes her data from the

All India Debt and Investment Survey from 1981-1982, and she uses the province of Uttar Pradesh.

Similar to the present study, Kochar uses a probit regression to analyze her data, and she divided

her full sample into borrowers and non-borrowers in order to see whether each group had different

determinants of their activity in the land rental market. Kochar concludes that her analysis of the

data suggests that credit constraints, whether formal or informal, do not determine the level of

input use in farming; rather, other inputs, such as the amount of irrigated land, were far more

significant. This study demonstrates the impact of formal credit on land tenancy, which in turn

impacts the overall income of these small rural farmers.

Krause et al. (1990) examine the determining factors of adoption of technology for small

farmers. Their data came from the National Agronomic Research Institute of Niger. They measure

the effect of interest charged for credit, the amount of equity a farmer has, credit repayment

insurance based on the weather, and the sharing of risk by suppliers and laborers on the adoption

of new technology. They find that risk aversion was a key variable in the decision to adopt

technology. The researchers conclude that low-interest credit is not an effective method for

encouraging technology adoption, since it does not address the risk-averse nature of farmers in

Niger. The method they found to be the most effective was having fertilizer-suppliers share in the

risk of the farmers.

Jamajou and Baker (1980) examine the effects of cheap credit on economic development

in Cameroon. They study the impact of cheap credit programs on the farmers' income, using the

credit limit and the rate of interest as the two regressors. Using a model to extrapolate the data

they received from small farmers in Cameroon, they conclude that the credit limit and flexibility

of the use of that credit is more effective for improving the small farmers’ income than a low

interest rate. The data is survey data, and the model is a projection based a series of assumptions

about the way in which small farmers will behave.

Binswanger (1980) conducts an experiment using game theory that subjected about 300

households in areas of India that are characterized as high in climatic volatility. He measures the

impact of income level, age, schooling, assets, net transfers, and “luck” (or experience) on a

farmer's risk aversion. The study concludes that at low levels of payoff, the risk aversion was

widely distributed; however, at higher levels of payoff, risk aversion is concentrated at

intermediate levels. Interestingly, risk aversion is not correlated with the level of wealth. This

article establishes a profile of the risk-averse nature of small rural farmers, particularly in India.

This corroborates the picture of farmers as risk-averse in the other articles (e.g. Krause et al.

(1990), Skoufias (1995)).

38

Skoufias (1995) and Jodha (1981) provide further support for the use of the land rental

market as a proper input to use to measure the impact of formal credit on the production decisions

of rural farmers. Skoufias suggests that rural Indian farmers use the land rental market to

equilibrate nonmarketable and imperfectly-marketable inputs, such as draft power or family size.

Skoufias and Jodha also establish that the land rental market is a market that functions well.

Skoufias’ sample comes from the same villages as the sample for this paper, and a number of the

variables used in this study, such as education, percent of irrigated land, and the numbers of adult

males and females, come from his study. He also provides a precedent for using pooled cross

sectional data.

Binswanger and Khandker (1992) provide a 30-year panel analysis across 85 districts of

India, examining the benefit-cost ratio of India’s aggressive formal credit subsidy program. Their

specific attention is to the agricultural credit targets and their effect on the economic situation of

the rural farmers. They note that the program impacted farmers’ production decisions positively,

in general, but that wages did not increase. They found a selection bias of formal credit to favor

the farmers with larger resource endowments, and used variables for the investment of the

districts, agricultural variables, and banking variables.

Taylor and Adelman (2003) construct a model for agricultural households. While theirs is

not an empirical model, it is significant because it explores the nature of the household-farm

model, in which the demand of a household can also be supplied by itself, if the costs of obtaining

the good elsewhere are prohibitive. Household-farms are both producers and consumers in the

model. As the authors note, an agricultural-household perspective is implicit in models describing

or including access to credit, including Kochar (1997a and 1997b), and this paper. The authors

note that households in these rural economies face “missing markets” when the cost of

participating in the market are higher than the cost to produce the service itself. This is an

important insight for our present analysis, because it implies that our methodology of evaluating

the participation in the land rental market in terms of the costs or availability of the goods

(transaction costs) is a sound one.

Barslund and Tarp (2008) explore how the rural credit market operates in Vietnam by

constructing models of formal and informal credit demand as well as credit rationing. The authors

find, with the use of a probit regression, that land holdings, number of adults in a household, the

value of feed used, and the total value of the household’s assets are both statistically significant

and positively affect the probability of demanding formal loans. Age alone is deemed statistically

significant and negatively related to the probability of demanding formal loans. The value of feed,

poor credit history, and the number of dependents in a household is found to be statistically

significant and positively related to the probability of demanding informal loans. Age, educational

attainment level, and the value of total assets are found to be significant and negatively related to

the probability of demanding informal loans. Our paper explores very similar relationships and

uses many of the variables found to be significant by Barslund and Tarp (2008). The value of feed

variable was used as a proxy for livestock holdings and is improved upon in this paper by using

the value of livestock owned, which we believe will quantify both the number of livestock and

their respective conditions. It appears that the rural credit market in Vietnam operates in a very

similar fashion to the rural credit market in India.

Golait (2007) discusses trends in the agricultural credit market in India. This paper

describes the trends in formal lending and the market share of formal credit, trends in the market

share of co-op loans, and recent policy initiatives. It is important to note that while Golait (2007)

finds the market share of institutional lending to be generally increasing, to 66% in 1991 from 7%

in 1951, our sample shows informal lending to be much more prevalent than formal lending, in

line with the latest National Sample Survey Organization (NSSO) Survey as of 2007 (Golait

2007). Also of interest, Golait discusses the government’s recent finding that suicide among

39

farmers is highly correlated with indebtedness. The paper concludes with recent policy changes in

rural agricultural lending, however, these changes did not take place during our sample years.

DATA

This study uses data provided by the International Crops Research Institute for the Semi-

Arid Tropics (ICRISAT), and the amount of rainfall was obtained from the data website India

Water Portal. Researchers at ICRISAT conducted a yearly study of six villages in the semi-arid

climate of India for the years 2001-2002, 2002-2003, 2003-2004, and 2004-2005. This is the

second generation of the Village Level Studies (VLS) conducted by ICRISAT on these villages.

ICRISAT conducted extensive surveys of households in six villages in rural India: Aurepalle,

Dokur, Shirapur, Kalman, Kanzara, and Kikheda. They took a representative sample of the

number of households in the villages at the beginning of 2000-2001, and then used the same

households throughout the study. These six villages are in three districts: Sholapur,

Mahaboobnagar, and Akola; these districts are in two states: Andhra Pradesh and Maharashtra.

The VLS collects agronomic, socioeconomic, and other data. The total number of cultivator

households surveyed per year is 334 in all six villages. This study uses data from the years 2000-

2001, 2002-2003, and 2003-2004, with 330 households which have all the variables of this study,

yielding 990 total observations. Due to missing values for some of the data, the full sample

analyzed in this study is 904.

This dataset is relevant for several reasons. It includes extensive documentation of a wide

range of sociological, economic, and agronomic data. Also, the survey data was collected by

individuals who know the local customs and speak the language of the village. The dataset

additionally categorizes informal credit by the identity of the lender—whether it is a moneylender,

a co-op, or friends and relatives. This allows for a more careful analysis of the impact of informal

credit than has been possible for other researchers (such as Kochar (1997b), in addition to the

impact of formal credit.

This study uses the dichotomous decision to participate in the land rental market as the

dependent variable (LEASED). The land leased out per household is not included, because

landlords often underreport their leasing-out of land in order to avoid taxes (Kochar, 1997b). Thus

leasing-in land is the dependent variable to measure the household's participation in the land rental

market.

Most of the variables in this paper have been either shown or predicted to have

significance in the literature (e.g. Skoufias (1995) and Kochar (1997b)). There are only a few

exceptions, which represent improvement on prior literature. First, we divided the informal loan

amount into three variables. The reason for this change is that the interest rates charged by co-ops

averaged less than 10% interest, friends and family loans were often lent at 0% interest, and the

money lenders often charged over 30%. There is a clear distinction there, and Kochar (1997b)

points out that further research could be conducted by examining more precisely the role of

informal credit on production decisions. The other deviation is including the education variable,

though it has been included in Binswanger (1980) and Skoufias (1995) for evaluating the risk-

averseness of farmers. We also added a net income variable, and a variable capturing farm size

variables in three groups.

Due to lack of data availability or integrity, we were unable to include a variable

capturing the infrastructural development of each village (or even the district). We were also

unable to obtain the number of land plots per household for a sufficiently large sample.

Variables that measure the impact of other inputs, such as draft power and family labor,

as well as the loan amounts, must be included. Additionally, some measure of the household's

assets at the start of each year is also necessary. In the literature, leasing decisions are assumed to

be an outcome of a given household's resources at the beginning of a crop year prior to the rental

40

decision (Kochar, 1997b). These resources include the household's assets, measured as the values

of a household's farm equipment (EQUIP) and buildings owned by the household (BLDG). Family

labor is measured by the number of males (MALES) and females (FEMALES) over the age of 14.

Skoufias (1995) includes the family labor variables, household equipment and buildings.

The value of the draft animals within each household, measured in real 2001 rupees

(DRAFT), is included as part of the household’s resource endowment. This is an improvement

over Skoufias (1995), Kochar (1997b), and Barslund and Tarp (2008), who use the number of

draft animals or value of feed. Skoufias (1995) mentions that using the value of draft animals

would be preferable, but did not have access to that data. Another resource input is a household's

irrigated land holdings (IRR), measured as the percentage of the total land owned in acres, and the

household's land holdings, measured in acres (LAND). Kochar (1997b) uses the land irrigated in

acres, but this presents a collinearity issue with LAND. Skoufias (1995) uses the percentage of

irrigated land holdings. The formal loan amount (FORMAL) and informal loan amount are also

included as regressors. The informal loan is broken up by source: moneylenders (LENDERS),

friends and relatives (FRFAM), and co-ops (COOP). They are measured as the natural log of real

2001 rupees, as supported by Kochar (1997b), Barslund and Tarp (2008). Dummies are included

for each to measure any bias created by transforming values of “0” to “1” for the natural log to be

zero.

In order to capture the education within a household, which Binswanger (1980) linked as

a measure of the risk-averseness of farmers, we have included the individual in each household’s

maximum years of education (EDUC). We took two measures of this. First, we used a count

variable (EDUC), which took the number of years educated. We also took a dummy variable

(EDUCDUM) which has a value of “1” if there is an individual in the household who has any

education. The use of the dummy is supported by Skoufias (1995). We have also included

intercept dummy variables to capture the relative size of the farms, as defined within ICRISAT’s

measurement system, with one variable for farms designated “large” (LARGE), and farms

designated “small” (SMALL), with medium farms being the omitted condition.

RAIN1 and RAIN2 are the total amount of rainfall, in millimeters, in the province of the

village, in the year prior and two years prior, as supported by Kochar (1997a). WAGE is the

average wage rate in each village for that year, in real 2001 rupees, as used by Kochar (1997b).

INCOME is the net income of a household, in the natural log of 2001 rupees. POP is the total

number of households in each village in 2001.

Expected Signs

LAND is expected to have a negative sign, because as the household’s resource

endowment increases, the likelihood to participate in the market should decrease. The asset

variables (EQUIP, BLDG, and CONS) should have a negative impact, as increased assets will

lessen the need to engage in the leasing-in market. MALES and FEMALES are expected to be

positive factors, because family labor is assumed to be an imperfectly marketable good. DRAFT is

expected to be positive in sign, as an increase in the availability of draft animals should increase

the need to lease-in land. IRR is expected to have a positive sign. This is based on Skoufias

(1995). He reasons that the percent of irrigated land decreases the risk of cultivation. This should

increase the fruitfulness of self-cultivation, which would increase the returns from the land already

owned for less labor, ceteris paribus. The expected signs of the informal and formal loan variables

are ambiguous.

EDUC and EDUCDUM have ambiguous expected signs. Skoufias (1995) found

education to be negatively correlated with leasing-in land, but it was statistically insignificant.

LARGE has a negative expected sign, as the increased resource endowment of the household

should decrease the participation in the land rental market. SMALL should have a positive sign for

41

the opposite reason. POP has a positive expected sign, as the larger the market, the more likely

individuals will be to participate in it. RAIN1 and RAIN2 both have negative expected signs, in

that a decrease in rainfall would indicate more risk for the farmers, who might respond by entering

into a fixed rental contract in order to hedge their risk. WAGE has a negative expected sign,

because as the wages for labor increases, the opportunity cost of working on the farm increases,

thus making renting land in less likely. INCOME has a negative expected sign, as increased

income should decrease the likelihood to participate in the land rental market. Many of the

expected signs come from Kochar’s (1997b) or Skoufias’ (1995) findings, in addition to economic

theory. The definitions of the variables, as well as their expected signs, are reported in Table 1.

Descriptive Statistics

The mean participation of land leased is 0.102 per household. The average amount of

formal credit borrowed is 7,244.56 rupees per household, the maximum being 400,000 and the

minimum being zero (pre-transformation for the logarithm). For money lenders, the maximum

was 10,875.29 rupees, and the minimum was zero. The mean amount of land owned is 5.76 acres,

with a maximum of 44 acres and a minimum of zero, indicating a wide variance. The average for

SMALL is 0.54, indicating that there are more small farmers than large (which has a mean of

0.14). The average number of years educated is 7. The mean population is 535 households, while

the minimum is 170 and the maximum is 649. This indicates that the average village population is

skewed toward the larger populations. Descriptive statistics for the dependent and all the

independent variables are in Table 2.

ECONOMETRIC MODEL

This study utilizes a pooled cross-sectional analysis to evaluate the effect of formal credit

and other factors on the likelihood of participation in the rental market. Skoufias (1995) analyzed a

similar pooled cross-sectional dataset, though he used Maximum Likelihood Estimation of the

amount of land leased, rather than the decision to participate. The initial equation was constructed

using variables that Kochar (1997b) and Skoufias (1995) used in their regression analyses. The

hypothesized logit model is:

(1)

where z= β1 + β2DRAFTi + β3MALESi + β4FEMALESi + β5IRRi +β6LANDi +β7LOGFORMALi +

β8LOGINCOMEi + β9LOGMONEYi + β10LOGFRFAMi + β11LOGCOOPi + β12LARGEi +

β13SMALLi + β14POPi + β15WAGEi + β16EQUIPi + β17BLDGi + β18RAIN1i +

β19RAIN2i + β20(EDUC or EDUCDUM) i + β212001dumi + β222002dumi (2)

RESULTS

In order to examine the relationship between formal credit and the decision to participate

in the land rental market, two sample groups were used. The first group is the full sample1, and the

second treats borrowers and non-borrowers as separate groups, under the assumption that the

determinants of those with access to formal credit may be different than those without access to

formal credit. This approach also allows for comparison with Kochar (1997b), who conducted this

form of analysis and found access to formal credit not to be significant in any of her regressions.

The results of logit regressions on the full sample are reported in Models 1-3 of Table 5. The

results are remarkably robust across the three models. The difference among the three is whether

the education variable used is the count variable EDUC or the dummy variable ECUCDUM, and

42

whether or not SMALL is excluded. In all three models, LAND is negative and significant,

RAIN1 is negative and significant, and DRAFT is positive and significant, as expected. Other

variables had their expected signs but were insignificant: EQUIP and BLDG were robustly

negative, RAIN2 was negative, and IRR was positive. MALES and FEMALES were also always

positive, though insignificant.

In most of the regressions of this study, COOP has a negative sign, and it is almost

always significant. This result is complicated because the dummy measuring the transformation

necessary to convert the “0” values into a logarithm is also always negative and significant as

often as COOP is. The consistent significance of COOP probably has more to do with the large

number of 0’s that had to be transformed than with empirical significance.

LARGE (positive) and SMALL (negative) did not carry their expected signs, but were

significant at the 1% level. It is difficult to interpret these two variables, which are consistently

significant and robust across all the models tested in this paper. It is especially thorny because the

variable LAND is also consistently significant and negative. On the one hand, LAND would

suggest that, as the size of the farm increases, ceteris paribus, the likelihood to participate in the

land rental market decreases. On the other hand, LARGE suggests that larger farms, relative to the

medium-sized farms, are more likely to participate in the land rental market, and small farmers,

conversely, are less likely to participate.

It could be that LARGE and SMALL capture variation that is not simply limited to the

size of the farms. In other words, large farm households may enjoy advantages in markets that

small farmers do not enjoy. This sort of general selection bias of the land rental market for large

farm households over smaller farm households is suggested in the special case of selection bias in

the formal lending sector by Binswanger and Khandker (1992), but there has been little additional

evidence to suggest such a conclusion for the land rental market. Similar to the explanation offered

above, LARGE and SMALL could also capture comparative differences between larger and

smaller farm households relative to the medium size farm households, which the amount of owned

land does not reflect. A correlation matrix was run, and LARGE and SMALL were not collinear

with any other variable, including LAND. No one variable was substantially collinear with any

other variable.

LOGFORMAL is never significant in these models, and it always carries a positive

coefficient. This suggests that the access to formal credit does not significantly constrain a

household’s decision to participate in the land rental market. Kochar (1997b) found this variable to

be insignificant and positive in her probit of the full sample.

The education variable exhibits unusual behavior. When considered as a count variable, it

carries a negative sign and is not significant; however, when measured as a dummy variable, it

changes to a positive sign and is significant in one of the three models on Table 5, Model 3 (which

drops SMALL). When measured as a dummy (the same way Skoufias (1995) measures

education), the variable does not carry the negative sign Skoufias found. WAGE is only

significant in one model. This could be due to averaging male and female wage rates together.

Skoufias (1995) found that there is a difference between male and female wage rates’ impact of

the land rental market. The McFadden Pseudo R2 measures of the first two models are between

0.147 and .0153, with about 83% total correctly predicted. The model is superior to estimates

generated with just the constant term by over 2%.

Another way of evaluating whether or not access to formal credit influences the land

rental market is to treat those who did receive a loan and those who did not as distinct samples and

running separate regressions. Kochar (1997b) utilizes this technique, and it is one of the primary

reasons that she concludes that access to formal credit does not influence the land rental market.

Descriptive statistics for the borrowers and non-borrowers are reported in tables 3 and 4.

The results from these regressions are reported in Table 6. Models 1-3 use the sample of

formal credit borrowers, and Models 4-6 use the sample of non-borrowers. Similar to Kochar’s

43

(1997b) findings, the sign on POP was negative for borrowers, but positive for non-borrowers.

Also, RAIN1, like Kochar’s (1997b) Monsoon80 variable, was negative in all cases, and

significant for non-borrowers, but not for borrowers. Assets for Kochar (1997b), and EQUIP and

BLDG for this study, were negative for borrowers and non-borrowers, though in neither study

were these variables significant. Like Kochar (1997b), MALES and FEMALES are positive across

borrowers and non-borrowers, though interestingly, MALES is significant at the 5% level for

borrowers, while FEMALES is only significant in one model at the 5% level for non-borrowers.

The COOP loan amount continued to be significant and negative for the non-borrowers, while it

was negative but insignificant for the borrowers.

LARGE and SMALL are robust across the models. LARGE is also significant in all but

one model for both borrowers and non-borrowers. In contrast, SMALL is only statistically

significant for non-borrowers, and then it is negative. The coefficients on SMALL are also much

larger in absolute value for the non-borrowers than for the borrowers, indicating that whether a

household is small relative to medium households is a more influential explanatory variable for

non-borrowers than for borrowers, all other things equal. Also, the coefficient on LARGE

decreases from borrowers to non-borrowers, indicating that larger farm households relative to

medium farm households who borrow are more likely to participate in the land rental market than

large farm households who do not borrow.

The number of draft animals also exhibited significant differences across borrowers and

non-borrowers. The signs on the value of draft animals in the borrowers equations are always

negative, although insignificant, but they are always positive and significant at the 1% level for

non-borrowers. The negative sign on DRAFT for borrowers disagrees with Kochar (1997b), who

found DRAFT to be positive. It also makes sense that DRAFT would be positive, since an

increase in the value of draft animals, which are an imperfectly marketable good, would increase

the likelihood to participate in the land rental market. Another key difference is in the education

variable. Both the dummy and the count variable are negative for borrowers, but positive for non-

borrowers. Also, the count variable is significant at the 10% level for borrowers but not the

dummy variable, and, conversely, the count variables are not significant for the non-borrowers but

the dummy variable is significant at the 5% level. This suggests that the level of education within

a household increases the likelihood to participate for non-borrowers, but decreases it for

borrowers, all things equal. The relevant differences between the borrowers and non-borrowers

that would explain this difference are unclear.

Crucially, LOGFORMAL is never significant in the borrower equations, at any level.

Also, variables such as LOGMONEY, which is significant at the 11% level, exhibit greater

influence. LOGMONEY has an approximate magnitude of 0.195 at the mean, while

LOGFORMAL has a magnitude of 0.05415 at the mean. MALES also has a greater magnitude

(and is significant), and IRR also has a greater magnitude, though it is insignificant in all the

models (in contrast with Kochar (1997b), who found similar magnitudes—0.276 at the mean

versus 0.225 at the mean in this study—and significance at the 5% level). The McFadden Pseudo

R2 measure increased for both borrowers and non-borrower equations (around 0.3 and 0.2,

respectively, depending on the regression), and the percent correctly predicted also increased (to

86.5% and 84.5%, respectively). The percentage increase over a prediction using just the constant

also increased, to around 4.5% and around 3.5% for borrowers and non-borrowers (depending on

the regression).

All of the above discussion must be taken with a caution. The small sample size used for

the estimations of borrowers could result in biased results. Additionally, this study was unable to

incorporate the number of plots per household, a variable which Kochar (1997b) found to be

significant, and a measure of infrastructural development. Finally, Kochar (1997b) and others note

that the credit markets (both formal and informal) are possibly endogenous. This study was unable

to correct for any potential endogeneity. Despite this potential endogeneity concern, there is doubt

44

that it is a concern in this case. When Kochar (1997b) used a two-stage probit estimation to correct

for endogeneity of the formal loan amount within her sample, she found the signs and

significances to remain identical, except for the constant and wage variable. Also, the magnitudes

of the coefficients did not alter substantially from the probit to the two-stage probit. The primary

difference was the magnitude of the formal loan amount, which was overestimated in the initial

probit.

CONCLUSIONS

While there is still much room for analysis on this issue, this study has had a

corroborating effect on Kochar’s (1997b) analysis. The formal loan amount was never found to be

significant in any of the regressions, which is consistent with her findings. Second, it appears that

there are differences between borrowers and non-borrowers, differences that are inexplicable by

the formal loan amount itself. This supports Binswanger and Khandker’s (1992) finding that there

is a selection bias that occurs with respect to the formal credit sector, and that there is a bias

toward larger-endowed households. This overestimates the benefits of a low-interest subsidized

credit program, such as the one in which the government of India is aggressively engaged. This

paper is part of a growing body of literature, including Binswanger and Khandker (1992) and

Kochar (1997b), that supports the notion of selection bias in the formal credit sector, and that

differential access to a number of different inputs can be explained, not by access to formal credit,

but potentially by a selection bias favoring those with a larger resource endowment. This selection

bias is a compelling explanation behind the unexpected signs on the large and small farm dummy

variables, and it is perhaps some of that unobserved selection bias that is being reflected within

those two variables.

In summary, this study suggests that access to formal credit does not constrain

agricultural production decisions, insofar as the land rental market is a good production input.

There is also interesting potential evidence for selection bias toward households that are larger

relative to medium houses, and against households which are smaller than a medium house.

Further research should incorporate the number of plots per household, as well as a

measure of infrastructural development of each village. Also, the years of the study could be

increased, from three years to five, at least. A larger dataset would allow for more precise

estimation, and many of the variables, which were insignificant but theoretically relevant, could

become significant. As panel estimation techniques of logit regressions becomes more standard,

this dataset could be analyzed in that fashion. Finally, the endogeneity of the credit markets should

be addressed.

ENDNOTES

*Corresponding author: Department of Economics, Indiana University of Pennsylvania, Indiana,

PA 15705. Email: [email protected].

1Following Studenmund (2011), we became concerned that only having 10% of 1’s and 90% 0’s

could present a potential bias to the coefficients. We took a subsample of the whole, using all of

the 1’s and 50% of the 0’s, leaving us with a sample of 500. This increased the participation rate in

our subsample to 20%, which was similar to Kochar’s (1997b) percent of 1’s. We were compelled

to use the full sample in order to be able to run regressions of borrowers and non-borrowers. Since

our sample only included 201 observations that used formal credit, we had to incorporate the

whole sample in order to be able to evaluate this. Our subsample coefficients are robust with those

in the full sample regressions, but note that the values of the coefficients are somewhat larger in

45

the full sample. See Table 5 for a few examples. Further regressions are available from the authors

upon request.

46

REFERENCES

Bell, C. (1990) Interactions between institutional and informal credit agencies in rural India.

World Bank Economic Review. 4(3): 297-327.

Binswanger, H.P. (1980) Attitudes toward risk: Experimental measurement in rural India.

American Journal of Agricultural Economics. 111(3): 395-407.

Binswanger, H.P. and Khandker, S.R. (1995) The Impact of formal finance on the rural

economy of India. Journal of Development Studies. 32(2): 234-262.

Golait, R. (2007) Current issues in agricultural credit in India: An assessment. Reserve Bank of

India Occupational Papers. 28(1)

Indiawaterportal.org

Kamajou, Francois and Baker, C.B. (1980) Reforming Cameroon’s government credit program:

Effects on liquidity management by small farm borrowers. American Journal of Agricultural

Economics. 62(4): 708-718.

Kochar, Anjini. (1997a) An empirical investigation of rationing constraints in rural credit

markets in India. Journal of Development Economics. 53: 339-371.

Kochar, Anjini. (1997b) Does lack of access to formal credit constrain agricultural production?

Evidence from the land tenancy market in rural India. American Journal of Agricultural

Economics. 79(3): 754-763.

Jodha, N.S. (1981) Agricultural tenancy: Fresh evidence from dryland areas in India.” Economic

and Political Weekly. December, 1981: A118-A128.

Krause, M.A., Deuson, R.R., Baker, T.G., Preckel, P.V., Lowenberg-DeBoer, K.C., Maliki,

R.K. (1990) Risk sharing versus low-cost credit systems for international development.

American Journal of Agricultural Economics. 72(4): 911-922.

Skoufias, E. (1995) Household resources, transaction costs and adjustment through land tenancy.

Land Economics. 37(8): 42-56.

Studenmund, A. (2011) Using econometrics: A practical guide. Boston, MA: Pearson Education

Inc. 6th

Ed.

Taylor, J., and Adelman, I. (2003) Agricultural household models: Genesis, evolution, and

extensions. Review of Economics of the Household. 1.1.

Village Level Studies. ICRISAT. Accessed Online.

47

APPENDIX

Table 2: Descriptive Statistics, Full Sample

Variable Mean St. Dev Max Min

LEASED 0.102794411 0.30384182 1 0

FORMAL 7244.561876 27671.9715 400000 1

COOP 4258.199601 15174.9083 183000 1

FRFAM 1956.775449 10047.3749 200000 1

MONEY 10875.2984 24768.5349 360000 1

INCOME 44407.11341 49375.9278 416799.403 115

EQUIP 16611.41444 50844.9669 608526.108 0

BLDG 66366.07307 75625.5151 887052.8955 0

LAND 5.76501996 6.03798691 44.65 0

IRR (%) 0.373255 0.423989 1 0

IRR (acres) 2.105369261 3.619910077 33.15 0

LARGE 0.146706587 0.353989863 1 0

SMALL 0.54491018 0.49822767 1 0

POP 535.3203593 138.4703439 649 170

RAIN1 629.358007 109.5613781 846.326 449.221

RAIN2 663.4215569 132.4601462 996.938 449.221

WAGE 39.47716561 7.365419277 52.52631579 30.64150943

MALES 1.906906907 1.082907068 7 0

FEMALES 1.748748749 0.8862883 6 0

DRAFT 14186.90876 23375.51715 242059.7015 0

EDUC 7.18482906 5.304324194 18 0

48

Table 4: Descriptive Statistics, Formal Borrowers Only


LEASED 0.099502 0.300083 1 0

FORMAL 36110.7 1000 52772.72 400000

COOP 2175.025 1 9888.029 100000

FRFAM 1831.184 1 6009.746 50000

MONEY 6998.547 1 25221.19 200000

INCOME 53694.04 1 59277.63 295431.6

EQUIP 26628.29 68875.64 578489.9 0

BLDG 73368.41 83694.35 691437.2 5000

LAND 7.142736 6.845727 44 0

IRR (%) 0.483232 0.441193 1 0

IRR (acres) 2.943632 3.461948 20.5 0

LARGE 0.154229 0.36207 1 0

SMALL 0.631841 0.483509 1 0

POP 477.9254 170.2703 649 170

RAIN1 597.3776 89.6332 846.326 449.221

RAIN2 682.0239 161.5571 996.938 449.221

WAGE 41.35359 6.626274 52.52632 30.64151

MALES 2.044776 1.050231 7 0

FEMALES 1.741294 0.918007 5 1

DRAFT 17715.46 29226.81 242059.7 0

EDUC 9.629442 4.230567 18 0

Table 5: Descriptive Statistics, Formal Non-Borrowers Only


LEASED 0.10362 0.304958 1 0

COOP 4780.944 1 16196.54 183000

FRFAM 1988.291 1 10829.55 200000

MONEY 11848.12 1 24573.7 360000

INCOME 41411.42 1 46199.78 416799.4

EQUIP 14097.82 44913.75 608526.1 0

BLDG 64608.93 73413.73 887052.9 0

LAND 5.419301 5.771015 44.65 0

IRR (%) 0.344842 0.415005 1 0

IRR (acres) 1.895019 3.630281 33.15 0

LARGE 0.144819 0.352138 1 0

SMALL 0.523096 0.499778 1 0

POP 549.7228 125.3322 649 170

RAIN1 637.383 112.6483 846.326 449.221

RAIN2 658.7535 123.7744 996.938 449.221

WAGE 39.0063 7.469676 52.52632 30.64151

MALES 1.87218 1.088875 7 0

FEMALES 1.750627 0.8787 6 0

DRAFT 13301.47 21592.51 219252.3 0

EDUC 6.533153 5.373318 18 0

49

Table 3: Logit Results, Full Sample (1-3) and Subsample (4-6)

Model 1 Model 2 Model 3 Model 4 Model 5 Model 6

Variable

Coefficient

(St. Error)

Coefficient

(St. Error)

Coefficient

(Std.

Error)

Coefficient

(St. Error)

Coefficient

(St. Error)

Coefficient

(St. Error)

C 8.176188

b

(4.151907)

7.703657b

(4.244751)

4.45701

(3.89144)

12.00076b

(5.272179)

11.24298b

(5.275964)

10.27612b

(5.073243)

_2001dum 0.241842

(0.449458)

0.044364

(0.432537)

-0.03932

(0.41353)

-0.063413

(0.466175)

-0.178161

(0.456329)

-0.070588

(0.449205)

_2002dum 0.006398

(0.33799)

-0.038505

(0.331137)

-0.06988

(0.320523)

-0.101019

(0.361086)

-0.100599

(0.35849)

-0.194881

(0.349629)

LOGFORMAL .012386

(.0332)

.0010586

(.033088)

.009092

(.031779)

.025978

(.037585)

.021101

(.037340)

0.025599

(0.036218)

LOGCOOP -0.871921a

(0.361052)

-0.899578b

(0.372403)

-0.736442b

(0.342977)

-1.223365a

(0.483417)

-1.210927a

(0.481258)

-1.137295a

(0.466002)

COOPDUM -7.487433b

(3.269941)

-7.72921b

(3.373567)

-6.265732b

(3.106106)

-10.77699b

(4.429216)

-10.66868b

(4.408654)

-10.02301b

(4.264607)

LOGFRFAM 0.031582

(0.043513)

0.033716

(0.043224)

0.030819

(0.042287)

0.022811

(0.047454)

0.025881

(0.047085)

0.016859

(0.045846)

LOGMONEY 0.110434

(0.164944)

0.128637

(0.166565)

0.202349

(0.160103)

0.069078

(0.19035)

0.084579

(0.191808)

0.116

(0.184294)

MONEYDUM 0.941361

(1.618339)

1.015862

(1.626935)

1.526763

(1.559204)

0.416032

(1.841602)

0.491079

(1.849255)

0.707911

(1.775117)

LOGINCOME 0.06107

(0.162598)

0.035087

(0.161084)

0.054987

(0.158103)

0.07026

(0.16805)

0.067637

(0.165962)

EQUIP -7.14E-6

(5.82E-6)

-7.33E-6

(5.89E-6)

-6.24E-6

(5.03E-6)

-5.16E-6

(7.14E-6)

-5.51E-6

(7.22E-6)

-5.8E-6

(6.44E-6)

BLDG -2.79E-7

(2.54E-6)

-2.44E-6

(2.47E-6)

-1.68E-6

(2.2E-6)

-4.38E-6

(2.85E-6)

-4.14E-6

(2.82E-6)

-3.48E-6

(2.63E-6)

LAND -0.124364a

(0.032906)

-0.133985a

(0.033693)

-0.089841a

(0.030561)

-0.103622a

(0.033516)

-0.111106a

(0.034135)

-0.066599b

(0.030012)

IRR (%) 0.154443

(0.317724)

0.232577

(0.32075)

-0.009977

(0.299814)

0.191471

(0.349216)

0.288754

(0.355448)

-0.062812

(0.326269)

LARGE 1.375798a

(0.33017)

1.388423a

(0.332316)

1.710614a

(0.343143)

1.094157a

(0.375732)

1.131591a

(0.376602)

1.438139a

(0.372915)

SMALL -1.604909a

(0.311851)

-1.576405a

(0.315317)

-1.473875a

(0.333631)

-1.45102a

(0.33749)

POP -0.0000727

(0.001124)

-0.000187

(0.001078)

0.000179

(0.001018)

-0.000372

(0.001197)

-0.000433

(0.001179)

0.000427

(0.001131)

RAIN1 -0.004096b

(0.001801)

-0.00319c

(0.001738)

-0.002571

(0.001669)

-0.002832*

(0.001819)

-0.002252

(0.0018)

-0.00262

(0.001762)

RAIN2 -0.001332

(0.001039)

-0.001003

(0.001034)

-0.000633

(0.001017)

-0.001836*

(0.001144)

-0.001587

(0.001139)

-0.001581

(0.001116)

WAGE -0.008345

(0.023561)

-0.01654

(0.022454)

-0.037407c

(0.022256)

-0.001465

(0.026863)

-0.007795

(0.026334)

-0.023095

(0.026003)

MALES 0.110822

(0.120771)

0.106858

(0.120562)

0.083108

(0.11715)

0.170741

(0.130365)

0.174994

(0.129506)

0.129502

(0.126887)

FEMALES 0.170036 0.202482 0.17078 0.146774 0.171953 0.128152

50

a denotes significance at the 1% level, b at the 5%, and c at the 10% level. * both variables were

significant at the 11% level

(0.142881) (0.14493) (0.138053) (0.158886) (0.161194) (0.150817)

DRAFT 9.28E-6b

(4.76E-6)

9.43E-6c

(4.66E-6)

1.06E-5b

(4.51E-6)

7.77E-6

(5.78E-6)

7.93E-6

(5.69E-6)

1.12E-6b

(05.5E-6)

EDUC -0.003377

(0.031316)

-0.001252

(0.034108)

0.009134

(0.033162)

EDUCDUM 0.59987

(0.33026)

0.734296b

(0.327848)

0.484717

(0.368493)

McFadden R2 0.147467 0.1530 0.1106 0.1537 0.157479 0.110972

Log Likelihood -259.04 -257.36 -270.24 -200.39 -199.5208 -210.5343

RLog

Likelihood -303.85 -303.85 -303.85 -236.81

-236.8140 -236.8140

%Corr Pred 83.41 83.5 71.26 70.8

n 904 904 904 454 454 454

51

Table 6: Logit Results, Borrowers and Non-Borrowers

Borrowers Non-Borrowers

Model 1 Model 2 Model 3 Model 4 Model 5 Model 6

Variable

Coefficient

(St. Error)

Coefficient

(St. Error)

Coefficient

(St. Error)

Coefficient

(St. Error)

Coefficient

(St. Error)

Coefficient

(St. Error)

C 8.367749

(15.98079)

7.792326

(16.59159)

7.147672

(16.19137)

10.53778b

(4.420441)

9.686747b

(4.575896)

5.594368

(3.957521)

_2001dum -0.149731

(1.072603)

-0.005707

(1.092464)

-0.005603

(1.089259)

0.517233

(0.593371)

0.197032

(0.544822)

0.29257

(0.544538)

_2002dum -0.380262

(0.820677)

-0.318281

(0.834489)

-0.359838

(0.823511)

-0.105337

(0.453947)

-0.205063

(0.434713)

-0.108085

(0.427223)

LOGFORMAL 0.21658

(0.491731)

0.201745

(0.490095)

0.230667

(0.485552)

LOGCOOP -1.661119

(1.671929)

-1.699287

(1.700598)

-1.683194

(1.663499)

-0.88162b

(0.391305)

-0.941527b

(0.409209)

-0.615853c

(0.354662)

COOPDUM -14.67097

(14.67739)

-15.04046

(14.91145)

-14.92817

(14.59559)

-7.648318b

(3.53754)

-8.165826b

(3.700291)

-5.293439c

(3.205919)

LOGFRFAM 0.112134

(0.084629)

0.084091

(0.086272)

0.085049

(0.085956)

0.018362

(0.061744)

0.023877

(0.061364)

0.012078

(0.058739)

LOGMONEY 0.787652*

(0.493682)

0.794561*

(0.49287)

0.811696

(0.487155)

-0.005975

(0.191544)

0.010801

(0.194638)

0.10548

(0.179901)

MONEYDUM 7.62116*

(4.80717)

7.840714*

(4.831922)

7.939098c

(4.764944)

0.054141

(1.87454)

0.10813

(1.89593)

0.830415

(1.753283)

LOGINCOME -0.180128

(0.352935)

-0.182276

(0.364058)

-0.176794c

(0.36438)

EQUIP -2.63E-5

(2.04E-5)

-2.69E-5

(2.01E-5)

-2.58E-5

(1.96E-5)

-2.82E-6

(5.54E-6)

-3.44E-6

(5.7E-6)

-2.36E-6

(4.89E-6)

BLDG -6.78E-6

(6.89E-6)

-6.21E-6

(6.73E-6)

-5.87E-6

(6.62E-6)

-1.89E-6

(2.7E-6)

-1.42E-6

(2.6E-6)

-9.05E-7

(2.35E-6)

LAND -0.041988

(0.063776)

0.006522

(0.072259)

0.011076

(0.07126)

-0.252443a

(0.048767)

-0.269002a

(0.049566)

-0.17662a

(0.04393)

IRR (%) 0.898322

(0.838088)

1.149483

(0.866703)

1.04372

(0.82626)

0.065702

(0.366644)

0.175108

(0.369542)

-0.182083

(0.339452)

LARGE 2.448738b

(1.221021)

2.552727b

(1.249262)

2.641544b

(1.232353)

1.408668a

(0.375042)

1.441433a

(0.379192)

1.894807

(0.381245)

SMALL -0.49275

(0.816858)

-0.431977

(0.829874)

-2.132163a

(0.370693)

-2.078061a

(0.377113)

POP -0.001325

(0.003022)

-0.002885

(0.00308)

-0.002937

(0.003065)

0.000171

(0.001489)

-0.000428

(0.001388)

0.000479

(0.001366)

RAIN1 -0.00424

(0.003817)

-0.004316

(0.003889)

-0.004589

(0.003853)

-0.005046b

(0.002443)

-0.003435

(0.002272)

-0.003506*

(0.002254)

RAIN2 -0.004454

(0.002937)

-0.004581

(0.003007)

-0.004604

(0.003011)

-0.000236

(0.001301)

0.000223

(0.001288)

0.000363

(0.001242)

WAGE 0.034763

(0.073162)

0.082164

(0.0771)

0.081792

(0.076815)

-0.018902

(0.028167)

-0.024914

(0.027122)

-0.047686c

(0.026955)

MALES 0.839559b

(0.370698)

0.937853b

(0.389293)

0.926801b

(0.385506)

0.020281

(0.136484)

0.015089

(0.135257)

-0.007459

(0.131512)

52

a denotes significance at the 1% level, b at the 5%, and c at the 10% level. *Significant at the 11%

level.

FEMALES 0.23377

(0.35272)

0.212324

(0.365729)

0.247387

(0.357526)

0.260506

(0.172579)

0.306754b

(0.175326)

0.173194

(0.159104)

DRAFT -6.86E-6

(1.47E-5)

-9.91E-6

(1.57E-5)

-8.48E-6

(1.49E-5)

2.06E-5a

(6.11E-6)

2.05E-5a

(6.06E-6)

1.92E-5a

(5.83E-6)

EDUC -0.191496c

(0.109575)

-0.195365c

(0.108752)

0.010788

(0.03622)

0.036025

(0.034575)

EDUCDUM -0.990826

(1.532128)

0.860049b

(0.364238)

McFadden R2 0.282695 0.304207 0.302150 0.201028 0.212807 0.125354

Log Likelihood

-46.25379 -44.86663 -44.99925 -192.1309

-189.2984 -210.3284

RLog

Likelihood -64.48272 -64.48272 -64.48272 -240.4727

-240.4727 -240.4727

%Corr Pred 86.32 86.79 84.61 84.8

n 195 195 195 719 719 719

53



RETAIL AND POPULATION DENSITY IN SMALLER U.S. CITIES

William K. Bellinger1 and Jue Wang

Dickinson College

ABSTRACT

In this article we test for patterns of retail density across three different dimensions of U.S. inner

cities--geographic centrality, low income levels, and ethnic minority status--using a two equation

recursive model with population density as a second dependent variable. The empirical analysis is

based on zip code level census data for 39 small to mid-sized U.S. cities. Our primary policy-

relevant finding is that neighborhoods with high percentages of African-Americans have

significantly lower retail density, all else equal, while the results for Hispanic, low income, or

otherwise centrally located neighborhoods are generally mixed or positive.

INTRODUCTION

The case has been made many times in several contexts that U.S. inner city

neighborhoods are significantly underserved by retail services. This hypothesized retail gap in

inner city neighborhoods has experienced revival as a research topic and a major component of

urban development strategy since the publication of Michael Porter’s path-altering article “The

Competitive Advantage of the Inner City” (1995). The concept of unrealized inner city retail

opportunity has been verified in large cities such as Denver (Weiler, et. al., 2003), Buffalo NY

(Rogers, et. al.), Newark NJ (ICICc), Brooklyn NY (ICICb), and Atlanta (Boston and Ross,

1997b). Most of these studies involved partnerships between research institutions and local

government development agencies, and all found a significant degree of unrealized development

opportunity in the retail sector.

However, this concept has not been explored with much theoretical depth or empirical

breadth in the urban economics or urban development literature. The only broadly based study we

discovered was conducted by the Department of Housing and Urban Development (U.S…., 1999).

It found that 48 of 539 cities had less estimated retail revenue than estimated purchasing power,

less than 9 percent of the total. Also, it did not separately analyze the inner city areas of these

cities. This combination of selected case studies and one predominantly negative large scale study

leaves open the question of whether the claims of a retail gap represent the tip of a systemic urban

market failure, conclusions with more limited applicability, or political arguments in favor of

increased largesse for urban American business.

Another often overlooked but crucial dimension of the issue is the interpretation of the

phrase “inner city”. Porter and the Institute for a Competitive Inner City have consistently defined

inner city neighborhoods in terms of alternative poverty or income and unemployment thresholds,

but the inner city can also be defined in terms of its relatively central location and its ethnic

characteristics (Mills, 1997, pp. 727-29). Separating these often overlapping and related concepts

may be an imperfect process, but these different dimensions of an inner city have different policy

implications, and should be actively explored.

54

Suggested barriers to inner city retail development include a lack of large scale

commercial sites, high barriers to and costs of construction, high local costs for utilities and taxes

(Porter, pp. 62-65), biased or inadequate demand information (Weiler, et. al.; Weissbourd and

Berry) and discrimination by area or ethnicity (Blanchard, 2008). Biased demand estimates may

exist because of an undercounted population in low income areas2, under-estimated or

underreported income, unawareness of the negative relation between the average propensity to

consume and income level, or a lack of understanding of ethnic differences in taste. The very real

possibility of consumer discrimination based on neighborhood ethnic characteristics may also be a

factor.

Our empirical analysis measures patterns in retail density based on a two equation

recursive model. The first equation analyzes retail density using the inverse of a basic Lösch

market area model. This equation offers retail density hypotheses based on demand factors such

as population density, per capita income, and discrimination by consumers. Cost factors include

measures of crime and taxes, while agglomeration economies and retail land use in the central

business district are also included as location factors.

The second equation provides a multivariate estimator for gross population density based

on the monocentric city model. In addition to a relation between population density and radial

distance, hypotheses are tested regarding non-residential land use, housing market factors, and

ethnic characteristics. The final step in the empirical analysis is the testing of the reduced form

equation for this system, which allows us to test the roles of all three dimensions of the inner city

in determining retail density. In combination, these two equations and their reduced form allow

the consideration of retail density in a multi-dimensional format not previously considered in this

context.

This model is tested using zip code level data for a set of 39 small to mid-sized U.S. cities

from the population census (2000) and business census (2002). Retail density is specified in three

forms; stores per square mile, stores with over $1 million in annual revenue per square mile, and

estimated aggregate revenue per square mile. We find that there is evidence of a significant retail

gap for African-American neighborhoods in smaller U.S. cities, particularly for large stores.

However, no such gap is found for Latino or otherwise low income or centrally located areas.

Other results will be discussed in the empirical section to follow.

RETAIL DENSITY

The first step in developing the empirical model is to specify the model for the geographical

density of retail services. This is a surprisingly easy process since retail density (stores per unit of

area) is the inverse of an average store’s market area. Most hypotheses regarding retail density

can therefore be derived from a basic Lösch market area model in long run equilibrium. For a

typical individual consumer, the demand can be expressed in a linear form

x = a(I) – b(p + tr) (1)

where p is price, I is per capita income, and tr represents two way transportation cost for the

consumer at distance r from the store. To further simplify the model assumes that store price p is

fixed and the slope of the individual demand (b) is 1, a common simplification. In a market with

constant population density D, area A and market area radius R, aggregate demand (X) equals

)2( tRpaADX (2)

where the average commuting distance from the household to the firm is ρ ≈ .7071R. The total cost

function for the firm is:

55

FXkC * (3)

where k is the average total operating cost, X is total output, and F is the firm’s fixed cost

associated with its location. Defining the firm’s profit function and profit maximizing price leads

to the following long run equilibrium market area:3

2

)222

(tka

D

FA

(4)

Since retail density per unit of geography is a direct inverse of the geographic area of the average

firm, equilibrium retail business density (B) equals.

F

tkaD

B

2)

222(

(5)

Equation (5) provides the basis for our hypotheses regarding retail density, while the logarithmic

form in (6) offers additional guidance for the empirical specification to follow.

Ftka

DB ln222

ln2lnln

(6)

The model’s predictions are generally straightforward. The relation between aggregate

demand and business density is expectedly positive. Both the individual demand (via endpoint a)

and population density D have positive relations to business density B.

0)222

(

tka

F

D

a

B (7)

0

2)

222(

F

tka

D

B

(8)

Assuming normal demand, per capita income will have a positive relation to the individual

demand endpoint a, and therefore to business density, all else equal. This relationship between

identifies the first dimension of the inner city to be tested in this paper.

The third component of demand, transportation cost, is a somewhat more complicated

variable, in part because of its long run connection to other components of the model. Within the

basic fixed price model, a rise in consumer transportation cost t lowers effective demand and

profit, increases the zero profit market area, and thereby lowers retail density.

0)222

(2

tka

F

DR

t

B (9)

Note that this result depends on the assumptions of perfect competition and fixed store prices

(Villegas, pp. 895-898). Additionally, the long run effect of transportation cost on the market area

also depends on its relation to population density. When one adds the assumption that population

56

density D is a positive function of t, a common result of the monocentric city model, we find the

following relationship between retail density and transportation cost.

F

Rtka

D

F

tka

t

D

t

B)2)(

222(

2)

222(

(10)

With positive and the second addend of (10) negative, retail density B has an ambiguous

relation to transportation cost. The effect of alternative pricing assumptions noted in Villegas

reinforces this result.

The cost side of the model is unambiguous, with both operating cost k and fixed cost F being

negatively related to retail density.

0)222

(

tka

F

D

k

B (11)

0)222

(2

tka

F

D

F

B (12)

Another implication of equation (12) is that the long run marginal effect of F is likely to be

extremely small. Marginally higher location-related costs are therefore likely to be minor factors

in the long run. However, more absolute barriers related to regulation or zoning may represent

significant policy issues for inner city retail development. The same can be said for factors relating

to operating cost and neighborhood aggregate demand.

The Effect of Ethnic Discrimination on Firm Density

In the retail location context, both demand and cost-related sources of discrimination have

been identified in the literature. The most consistent and well-supported connection between

discrimination and firm density comes from consumer prejudice and its effect on demand.

Shoppers, especially white shoppers from outside a mixed or predominantly minority

neighborhood, may feel reluctant to shop there because of their prejudice or fear. Gallagher

(2003), among many others, shows that whites significantly exaggerate the percentage of

minorities living in mixed neighborhoods. Quillian and Pager (2001) show a positive correlation

between African-American population and crime, but also find that Caucasians systematically

overestimate the crime rate in neighborhoods with higher African-American populations. These

two facts imply that whites will significantly overestimate the danger of shopping in

predominantly Black neighborhoods. Given the generally higher average income for Caucasians,

this type of prejudice among white shoppers implies lower individual and aggregate retail demand

in Black neighborhoods. The same logic may apply to Latino neighborhoods.

The role of prejudice, amplified by the biased perceptions noted above, produces a negative

effect on demand. Following Becker, we will incorporate customer prejudice by specifying

individual demand endpoint (a) as a negative function of the minority population percentage (m)

in the neighborhood of the store.4 From equation (5), the effect of consumer discrimination arises

from its effect on individual demand endpoint a.

57

0)222

(

1

m

atka

F

D

m

a

a

B

m

B (13)

where m1 refers to the effect on retail density of consumer-based ethnic discrimination. Therefore

demand side discrimination will lower retail density in minority neighborhoods, all else equal.

The role of racial and ethnic discrimination on operating cost is more mixed and less directly

related to neighborhood characteristics.5 Yet there is substantial evidence that racial discrimination

exists in capital markets, both in terms of neighborhood characteristics and the race of the

potential entrepreneur. There is more and better evidence of discrimination by lending institutions

against minority entrepreneurs (Blanchard and Zhao, 2008). The connection between this finding

and neighborhood characteristics is less well established, however (Bostic, et. al.). Such ethnic

redlining, if and where it exists, implies lower access to or higher cost for initial capital, and

therefore a higher value for fixed cost F. Therefore F can be assumed to be positively related to

the minority status of the neighborhood, m.

0)222

(2

2

m

Ftka

F

D

m

F

F

B

m

B (14)

where m2 refers to the effect of minority status on retail density caused by financial sector

discrimination. Combining the demand and fixed cost components of discrimination against

minority neighborhood retail produces the following relationship, which is also negative.

02

)]([

2)

222

)((

)()()

222

)(()(

mF

tkmaD

m

mF

m

matkmaDmF

m

B

(15)

Therefore racial discrimination lowers store density when these two hypothesized effects are

combined.

Combining equation (6) with the hypotheses noted above produces the following

estimating model for retail density, which in double log form predicts positive coefficients for

population density and per capita income, negative coefficients for minority status and all cost

related variables, and an ambiguous relationship between business density and consumer

transportation cost.

)(ln)222

),(ln(2lnln mF

tkImaDB

(16)

The first equation of our empirical model of retail density will be based on (16).

POPULATION DENSITY

While the roles of income and ethnicity in determining retail density are established in

the retail density analysis above, the role of location within the metropolitan area, and particularly

its inner city core, is only indirectly related to the previous analysis. Population density D

presents a direct connection to the monocentric city model of urban land use developed by Muth

58

(1969), and others. As with the retail density function above, population density is derived from an

inverse of the model of residential land per person. The basic logic of the equilibrium population

density function is that land is more valuable near the CBD under the assumption of central city

employment, producing a lower level of average land per capita.6

The urban geography literature offers a diverse set of functional forms for the population

density gradient (Martori, McDonald). In preliminary tests we specified the population density-

distance function in two forms. The first form is a first degree negative exponential function with a

dummy variable for the zip code containing the central business district (CBD),

lnD = lnD0 + bu + cCBD dummy, (17)

where u is the radial distance to the city’s central point, and the second form is second degree

negative exponential function,

ln D = lnD0 + βu + δu2 (18)

These functions were tested nationally, for each of four regions, and for nine of the larger cities in

the data set. Overall and in most individual cases, the CBD dummy approach produced much

greater significance for distance and an equal or better overall fit. For example, no individual city

had a significant value for β and only one for δ in equation (18), while the first degree exponential

function with CBD dummy tests produced significant negative values for b in seven of the nine

cities and more mixed results for the CBD coefficient.7 Therefore the negative exponential

function with CBD dummy will provide the basis for the population density function tested in this

paper.

Other research has explored multivariate models of population density tested across

multiple cities. The independent variables in these models may be categorized as factors affecting

net population (residential) density and those that affect the relationship between gross and net

density.8 Among the main variables hypothesized to affect net density include income (Mills and

Price), the age of the city or its housing stock (Johnson and Kau), modal choice related

transportation cost variables such as the percent of commuters using public transit or autos per

capita (Muth, 1961), and the total population of the city (Muth 1969, Mills and Price). In this

equation we add ethnicity variables and additional housing market variables such as group housing

percentage and the housing vacancy rate.

The distinction between net and gross density involves measures of alternative land use.

Census data is limited in this context, but we can specify two variables related to this distinction.

The first is a dummy variable identifying the zip code containing the city’s CBD, which is also

included in the retail density specification. The second is the percentage of the zip code

population categorized as urban versus rural.

EMPIRICAL ANALYSIS

In this section we test the two equation recursive model developed above in two steps.

First we test each equation using OLS. In a two equation recursive system OLS is appropriate

when the residuals of the two equations are not correlated. Testing the residuals of the population

density equation against the three specifications of retail density produced insignificant

correlations in all cases at the 5 % level. We also test the reduced form of the model separately in

order to explore simultaneously all of the three dimensions of the hypothesized inner city retail

gap.

59

Data

This study combines data from the 2000 population census with economic census data

from 2002. Given the recursive structure of our two equation model, this modest lag between

population characteristics and retail density seems appropriate. The data is compiled by zip code

for 39 small to mid-sized U.S. cities ranging in population from slightly over than 53,000 (Battle

Creek MI) to over 223,000 (Greensboro NC), with a mean population of just over 142,000.9 The

selected cities have relatively circular or semi-circular boundaries and are separated from other

urban clusters, although some are satellite or hub cities within larger metropolitan statistical areas.

Zip codes that lie partly or entirely within the city boundary are included. The total land area

covered for all zip codes in the 39 cities is about 7,637 square miles. Excluding two western zip

codes with over 200,000 acres each reduces this total to 6,737 square miles. The total land area

within the 39 city boundaries is about 1,704 square miles. Therefore the included zip codes cover

over 4.3 times the land area of the central cities after excluding the two major outliers.10

Therefore this analysis approximates metropolitan rather than central city retail and population

density patterns.

The Economic Census lists numbers of businesses in each zip code by industry subgroup

and revenue category. Our retail data includes two related categories in the North American

Industry Classification System (NAICS). These are Retail Trade (code 44) and General

Merchandise Stores (code 452). Geographical business patterns are tested using the total number

of firms in each zip code, firms reporting over 1 million in revenue, and an estimated total revenue

figure. The revenue estimate is derived by first assigning mid-point values to the all revenue

categories excluding the open ended category of over 1 million, estimating logarithmic

relationships between these size categories and the average rank of each category using a function

variously labeled as Zipf’s Law, the Pareto distribution, or the rank-size rule, converted to natural

logs.11

Total revenue for firms greater than 1 million was then calculated using the estimated

parameters.12

Finally, aggregate revenue for each zip code was estimated by multiplying the

number of firms in each revenue category by the estimated average revenue, then summing across

the categories. The results of this study are for the most part insensitive to the specific measure of

business activity.

Another essential variable required for this analysis is distance from the city’s central

point. Central point latitudes and longitudes for each zip code are reported by the Census Bureau.

The central point in the CBD was chosen through e-mail correspondence with local officials and

satellite images from Google Earth. Distance calculations require a variation of the Pythagorean

Theorem with parameters to convert degrees to miles and to adjust longitude in miles to the

corresponding latitude. This distance formula is seen in equation (19).

distance in miles = [((latitude2-latitude1)•69.1703234283616)2 + (69.1703234283616• (19)

(longitude2-longitude1)•COSINE(latitude 1 •.0174532925199433))2]

.5,

Testing Retail Density

In this section we test the proposed two equation recursive model using ordinary least

squares with robust standard errors. The market area model above predicts that the number of

firms in a given area will rise with population density D due to a smaller minimum market radius

and higher profits at the fixed maximizing radius, and with higher per capita income. Finally,

transportation cost has an ambiguous effect on business density.

While population density and per capita income are directly measurable, transportation

cost requires indirect estimation. Three variables relate to transportation cost. First, an indirect

estimate of transportation speed, a primary component of transportation cost, may be possible

60

despite the limitations of census data. Average transportation time per worker in a given zip code

is easily calculated from census data. In order to approximate the marginal transportation cost per

geographic unit one must divide by a distance measure that might approximate, or at least vary

positively with, the average commuting distance in a given city. In this case we use mean radial

distance in each city as our geographic measure. This constructed average speed estimate produces

slower than expected average speeds, with a mean estimate of 10.2 m.p.h., but more importantly

behaves relatively well regarding common hypotheses. Regressing our estimate of travel time per

person-mile against population density (D), the percent of commuters using public transportation

(Trans), and two infrastructure related variables, median housing age (HAge) and a dummy

indicating the presence of an internal highway in the city (Hwy) produced the following results. t

statistics are in parentheses.

Time/mi = 5.27 - .012 Trans + .039HAge - .991Hwy + .0001D (20)

(12.56) (-0.19) (3.09) (-3.35) (2.67)

This variable for commuting speed will be combined with a variable measuring the percentage of

households that do not own an automobile. An alternative measure of modal choice and

transportation cost, cars per household, will also be tested.

The specification of the cost variables inevitably raises some questions, in part because

no direct measures of operating cost are available in economic census data. Cost-related variables

include the median property tax rate and the city-wide Uniform Crime Report index crime rate for

total reported crimes. The percentage of adults without a high school degree, a measure of local

human capital, and the labor force participation rate were dropped from the final results due to

collinearity problems and highly insignificant coefficients.

In addition to variables related to the market area model, other dimensions of firm

location decisions relate to agglomeration economies. Due to the tendency of retail firms to

cluster based on agglomeration economies, a dummy variable labeled Cluster takes the value of 1

when the total retail stores open all year are over 1 standard deviation above each city’s mean. A

similar dummy variable is included for the zip code containing each city’s central business district.

Of the 39 CBD zip codes only 6 contain retail clusters by this definition. Half of these are in the

western region with one in each of the other 3 regions. On average, zip codes containing retail

clusters are 3.76 miles from the city’s central point, as opposed to 3.29 miles for non-cluster zip

codes. Overall, areas with concentrated retail activity display a decentralized pattern.

The empirical model of retail density may now be specified as follows:

lnRetail = f(ln D, C,CBD, ln T, ln I, t ,ln Crime, %Black, %Latino,) (21)

where C is the retail cluster dummy, T is the median property tax, I is per capita income, t

represents various specifications of transportation cost, crime is total index crime, and D is the

population density.

The population density equation will be specified as follows, based on the discussion in the

previous section.

lnD = f(u, I, H, %Black, %Latino, Pop, V, G,) (22)

where D is population density, u equals radial distance from the city center, I is per capita income,

H is housing age, Pop is the total population of the City, V is the vacancy rate for housing units,

and G is the percentage of the population in group housing. Results are presented in Table 1 for 3

measures of retail density; total stores, estimated total retail revenue, and total stores with over 1

61

million in revenue (Big Store Density), and in Table 2 for the population density equation with

various specifications of transportation cost.

Retail Density Results

Bruesch-Pagan tests for heteroscedasticity were mixed in the retail density specifications,

with insignificant Χ2 values for total stores and significant values for large stores and estimated

aggregate revenue. As a result the total store tests are HC2 robust standard errors while the others

use HC3 errors.13

Mean variance inflation factors, also reported in Table 1, suggest moderate

multicollinearity. Finally, given the existence of a variable that was fixed across each city’s zip

codes (crime rate), tests were also run using standard errors that are clustered by city.14

Results

are very similar to those reported below.

These empirical estimates of retail density offer relatively good overall fits and somewhat

mixed significance for individual variables. Among the demand variables, population density is

extremely significant, while per capita income fails to achieve significance. It is worth noting that

other household income and poverty measures included in preliminary tests were generally

insignificant as well. The retail cluster dummy is highly positive, if not particularly interesting,

while the CBD dummy borders on significance for total stores and is insignificant for large stores

and aggregate revenue. One interpretation of this mixed result is that small shops serving financial

district clients are likely to dominate the modern small city CBD.

The variables related to retail operating costs are of mixed significance. Median real

estate taxes are significantly negative at the .05 or .10 levels in all specifications, while the city

crime rate is a non-factor. It should also be noted that separate tests of violent and property crime

rates produced insignificant results similar to those for total reported crime. The unavailability of

crime data by neighborhood or zip code is clearly a limiting factor in this analysis. Since the cost

variables are relatively indirect measures of true costs, and are in some cases only available on a

city-wide basis, the hypothesis that cost disadvantages beyond taxes exist in the inner city should

not be rejected on the basis of this evidence alone.

The consumer transportation cost variables produce interesting results. The percent of

households without cars has a significant positive relationship to all retail measures, with higher

coefficients and t statistics for the total store count than for estimated revenue or large store

density. Similarly, the number of cars per household is significantly negative with a slightly

higher coefficient for total stores than other specifications. The estimated travel time per mile

variable highway variable has a negative relation to total store density but insignificant

coefficients for the other specifications. The main conclusion to be gathered from these findings is

that increased aggregate access to automobiles lowers retail density.

The final variables of interest are the ethnicity variables. In all specifications the

percentage of African-Americans is negatively correlated to retail density while the percentage of

Latinos is not significant. This result suggests that the possible existence of a retail gap in minority

neighborhoods is confirmed, but only for African-American neighborhoods. This finding will be

considered again in the reduced form tests to follow.

62

Table 1: Retail Density Function

dependent

variable

(per sq. mile)

ln All

Store

Density

ln All

Store

Density

ln Big

Store

Density

ln Big

Store

Density

ln Est.

Revenue

ln Est.

Revenue

Equation 1-1 1-2 1-3 1-4 1-5 1-6

Constant

-9.11

(-3.03)

-4.94

(-1.73) -8.44

(-2.93)

-4.92

(-1.85) 6.25

(2.27)

9.90

(3.50)

Travel Time/

Mile -.0322

(-2.03)

-.0269

(-1.52)

-.025

(-1.76)

Pct. Without

Car .0368

(6.34)

.0303

(4.97)

.031

(4.93)

Cars per

household -1.74

(-6.66)

-1.57

(-5.08)

-1.55

(-5.08)

CBD dummy

.331

(1.89) .334

(2.08)

.2173

(1.33)

.191

(1.26)

.264

(1.48)

.249

(1.56)

Cluster

.929

(10.60)

.884

(9.73)

1.040

(11.50)

1.002

(10.76)

1.05

(9.55)

1.008

(10.51)

% African-

American -.0097

(-3.10)

-.011

(-3.50)

-.0097

(-3.02)

-.011

(-3.49)

-.009

(-3.42)

-.010

(-3.21)

% Latino

-.0045

(-0.94)

-.0003

(-0.08)

-.0038

(-0.74)

-.0004

(-0.08)

-.003

(-0.66)

-.0005

(-0.11)

ln Per Capita

Income (1,000s)

.516

(1.97)

.364

(1.46)

.4157

(1.58)

.319

(1.31)

.434

(1.96)

.319

(1.25)

ln Population

Density .9548

(22.95)

.934

(22.21)

.9221

(19.21)

.898

(19.34)

.944

(21.50)

.922

(19.39)

ln Total

Crime Rate

-.0416

(-0.35)

-.047

(-0.41)

-.0869

(-0.73)

-.099

(-0.85)

-.087

(-0.66)

-.100

(-0.82)

ln Median

Property Tax -.183

(-2.18)

-.241

(-2.90)

-.1586

(-1.79) -.219

(-2.55)

-.185

(-2.62)

-.238

(-2.71)

R2 .778 .786 .743 .755 .735 .744

Mean V.I.F. 1.89 1.76 1.89 1.76 1.89 1.76

t statistics are in parentheses. Boldfaced results are significant at the 5% level

Population Density Results

Results for the population density tests are reported in Table 2. Bruesch-Pagan tests

indicated significant heteroscedasticity, so HC3 robust standard errors are applied to all equations.

The vacancy and group housing rates are dropped in most reported specifications due to

insignificance, while housing age is omitted in some specifications in order to display the well-

known negative relation between housing age and radial distance in U.S. cities. As in the retail

density tests, alternative specifications of transportation cost are included.

There are few surprises in the population density estimates in Table 2, with population

density being positively related to the urban population percentage in the zip code, the total

population in the central city, and the median housing age. Population density is found to be

negatively related to radial distance in keeping with the predictions of the monocentric model.

Finally, the travel time per mile variable is more significant in the context of population density

than retail density (see Table 1), while the alternate variables measuring automobile access are less

significant.

63

The omission of the median housing age variable from equations 2-3 and 2-5 increases

the significance of both the radial distance and CBD dummy variables. This is undoubtedly due to

the well-known tendency of city housing stock to grow outward over time, producing lower

housing age on the city’s periphery. It is also likely that older housing stock correlates negatively

with automobile access, thereby increasing transportation cost. Finally, the well-known filtering

model of housing use suggests a strong negative correlation between housing age and income,

which provides a reasonable explanation for the significantly negative income coefficient in 2-3.

Overall, while collinearity among the income, housing age, and radial distance variables produces

explainable variations in significance, this population density function provides reasonably strong

explanations of population patterns in smaller U.S. cities.

Table 2: Population Density Function

Dependent Variable: ln Population Density

Equation 2-1 2-2 2-3 2-4 2-5

Constant

3.15

(1.47)

2.22

(1.18) 5.51

(2.77)

1.69

(.90)

3.47

(1.73)

% African-American -.0012

(-.45)

-.0013

(-.55)

-.0025

(-1.06)

-.0027

(-1.37)

-.0029

(-1.39)

% Latino

.0102

(3.11)

.0114

(3.72)

.0083

(2.61)

.0084

(2.90)

.0054

(1.77)

Radial Distance

(100 foot units) -.0015

(-2.19)

-.0015

(-2.14)

-.0028

(-5.39)

-.0011

(-1.62) -.0023

(-4.20)

Median Housing Age

.0217

(4.94)

.0216

(4.87)

.0184

(4.28)

Central City

Population (1000s) .0017

(2.71)

.0016

(2.51)

.0019

(2.92)

.0020

(3.05)

.0025

(3.80)

% Urban Population .0613

(7.67)

.0620

(7.63)

.0669

(7.85)

.0618

(7.70)

.0659

(8.03)

ln Per Capita

Income (thousands)

-.236

(-1.22)

-.166

(-1.07) -.4036

(-2.57)

-.1186

(-.74)

-.21

(-1.58)

Travel Time/Mile .0327

(2.55)

.0456

(3.61)

Pct. Without Car .0071

(1.70) .0106

(2.39)

Cars Per Household .075

(.24)

.161

(.79)

-.146

(-.84)

CBD Dummy -.085

(-.62)

-.133

(1.07) -.326

(-2.86)

-.2067

(-1.77) -.3581

(-3.13)

Vacant Housing

Percentage

-.007

(-.54)

Group Population

Percentage

-.0052

(-.65)

R2 .700 .695 .661 .702 .677

Mean VIF 2.07 1.85 1.63 1.90 1.72


64

Reduced Form Estimates

In this section we utilize the reduced form of this two equation system to consider more

broadly the possible causes of any systematic inner city retail gap. By combining the two

equations in this way all three of the dimensions of the inner city, central location, income, and

ethnicity, are tested in a single equation model. Combining the general specifications in equations

(21) and (22) produces the following reduced form equation for retail density:

lnRetail = f(C,CBD, ln T, ln I, t ,ln Crime, r, I, H, %Black, %Latino, Pop, %Black, %Latino,) (23)

where C is the retail cluster dummy, T is the median property tax, I is per capita income, t

represents various specifications of transportation cost, crime is total index crime, r equals radial

distance from the city center, I is per capita income, H is housing age, and Pop is the total

population of the City. Because of their insignificance, the vacancy and group housing variables

were dropped from the analysis.

Results are presented in Table 3 above. HC3 robust standard errors are applied to all

specifications. The substitution of multiple variables for population density increases the

collinearity of the model somewhat, as noted in the higher variance inflation factors. While two

retail density variables (property tax, CBD dummy) that were partially significant in Table 1 lose

their significance in the reduced form estimates, possibly due to increased collinearity, the main

contributors remain highly significant. These include African American percentage, automobile

access (an indicator of transportation cost), central city population, radial distance, the urban

population percentage, and the cluster dummy variable.

The primary purpose of the reduced form tests was to consider directly the roles of all

three dimensions of the inner city, central location, income, and ethnicity. Of these three

dimensions of the inner city discussed in the introduction, geographically central neighborhoods

display evidence of higher retail density, all else equal, primarily due to the higher population

density found in centrally located neighborhoods. The insignificant income coefficients also fail to

show a clear retail gap based on income. The third dimension of the inner city, ethnicity, indicates

a significant retail gap in store density for African-American communities but no such gap for

Latino neighborhoods. This relative lack of retail stores in African-American neighborhoods is the

primary social factor that verifies the existence of an inner city retail gap. Otherwise central, low-

income, or ethnically diverse neighborhoods show no such pattern in smaller U.S. cities.

CONCLUSIONS

Overall, the combination of a retail density equation based on an inverted market area

model and population density equation based on the monocentric city model proves useful in

explaining retail density in America’s smaller cities. For the most part, the hypothesized existence

of an inner city retail gap is not confirmed by these tests. Per capita income proves to have an

insignificant relationship to retail density, while geographically central neighborhoods have

greater retail density by all measures than those in the periphery. The third dimension of the inner

city, ethnicity, indicates a significant retail gap for African-American communities but no such

gap for Latino neighborhoods.

The identification of African American status as the lone source of a systematic retail gap

represents the most important policy issue in this study. Not surprisingly, a wide range of public

and private development efforts are available for addressing this issue. For example, new or

existing enterprise zones could focus their incentive programs on the development of retail

clusters in underserved areas. Mentoring and financial assistance programs for inner city

65

Table 3: Reduced Form Estimates

Dependent

Variable

(per sq. mile)

ln All

Store

Density

ln All

Store

Density

ln Big

Store

Density

ln Big

Store

Density

ln Est.

Revenue

ln Est.

Revenue

Equation 3-1 3-2 3-3 3-4 3-5 3-6

Constant

-6.32

(-2.06)

-.173

(-.60) -6.10

(-2.18)

-1.86

(-.71) 8.73

(2.98)

12.83

(4.65)

Travel Time/

Mile

-.011

(-0.47)

.001

(.05)

-.002

(-.10)

Pct. Without

Car .040

(5.84)

.035

(5.89)

.347

(5.42)

Cars Per

Household -1.67

(-4.15)

-1.67

(-4.15)

-1.47

(-3.38)

CBD dummy

.081

(.43)

.229

(1.33)

-.085

(-.49)

.032

(.20)

-.012

(-.06)

.113

(.65)

Cluster

1.01

(9.81)

.973

(9.45)

1.09

(10.68)

1.05

(10.50)

1.11

(10.51)

1.08

(10.24)

% African-

American -.011

(-3.56)

-011

(-3.31)

-.012

(-4.07)

-.012

(-3.68)

-.011

(-3.45)

-.010

(-3.11)

% Latino

-.0007

(-.14)

.0047

(1.00)

-.0016

(-.33)

.003

(.72)

.0001

(.02)

.005

(1.08)

ln Per Capita

Income (1,000s)

.117

(.38)

-.171

(-.59)

.032

(.12)

-.232

(-.89)

.028

(.09)

-.232

(-.82)

Radial Distance

(100 foot units) -.0024

(-2.60)

-.0029

(-2.03)

-.0028

(-2.77)

-.002

(-2.56)

-.003

(-2.44)

-.002

(-2.14)

Median

Housing Age

.0087

(1.38)

.0093

(1.59)

.002

(.35)

.0028

(.45)

.006

(.91)

.007

(1.06)

Central City

Population (1000s) .0024

(2.54)

.0028

(3.17)

.0027

(2.97)

.0029

(3.50)

.002

(2.12)

.0023

(2.48)

% Urban

Population .064

(7.27)

.0636

(7.27)

.0655

(7.58)

.065

(7.61)

.065

(7.36)

.0644

(7.35)

ln Median

Property Tax

.018

(.17)

.049

(.48)

.008

(.08)

.049

(.53)

.013

(.13)

.050

(.49)

R2 .667 .665 .660 .660 .637 .636

Mean V.I.F. 2.12 2.03 2.12 2.03 2.12 2.03


entrepreneurs (Johnson, et. al.), infrastructure improvements, and effective (and widely advertised)

crime control efforts may also encourage greater retail opportunity in African-American

communities. More flexible access to transportation could provide another method of reducing the

spatial mismatch created by these retail gaps. The wide range of innovative programs particularly

concerned with urban food retail (USDA, pp. 104-111) provides evidence of the variety of policy

options associated with a particular product category.

However, the main point of this paper is not to promote specific policy solutions. Indeed

with such a diversity of opinion and range of proposals we would barely know where to begin.

66

However, this paper adds two bits of information to our understanding of inner city retail

development. First, the combination of a retail density model based on a Löschian model of

market area and an empirical population density model based on the monocentric model of the city

provide a useful basis for analyzing the possible causes of a central city retail gap. Second, and

more importantly, this study provides evidence of a systematic retail gap in smaller inner cities

that weighs significantly on African-American neighborhoods, rather than on other poor or

centrally located areas. This finding offers additional support for the value of urban retail

development efforts in the African-American neighborhoods of our smaller cities.

ENDNOTES

1. William Bellinger is a Professor at Dickinson College, and Jue Wang is an undergraduate

economics and mathematics major. 2. The hypothesized population underestimate is based on at least two sources. First, there is some

evidence that census counts are understated in poor communities (U.S. Housing.., pp. 11-14).

Also, some business location decisions may not fully consider the implications of population

density when estimating potential demand (Porter, pp. 58-59).

3. Given the demand and cost functions in the text the firm’s profit is defined as

FR

tpakpAD )2

)((

Maximizing with regard to price gives us the profit maximizing price, p*

422

* tRakP

Plugging p* into the profit function gives us the maximum profit

FtRka

AD 2

)422

(

Given a zero profit long run competitive equilibrium the firm’s market area A is as presented in

the text.

4. From the earlier individual demand equation, individual demand with discrimination based on

neighborhood ethnicity will equal x = a(m) - P - tu where ∂a/∂m < 0.

5. For firms that hire predominantly local labor, wage discrimination by employers tends to lower

labor cost in minority neighborhoods. While average human capital may be lower in areas with

poorer schools or higher dropout rates, the likelihood of higher unemployment increases the

selectivity of employers’ choices among available workers. Porter (p. 61) includes low wage

moderate skill labor as a potential advantage of an inner city location.

6. In the monocentric model population density can be derived as follows: Assume an individual

maximizes a Cobb-Douglas utility function with two goods, land L and a composite C subject to a

budget that includes total round trip commuting costs of tr at distance r.

(a)Max LαC

1-α subject to y –tr = ρ(r)•L + Pc•C,

In (a) L = Land, C = the composite good, ρ is land rent, r is radial distance, and y = household

income. From (a) a simple Cobb-Douglas demand for land (b) can be derived, combined with the

land rent gradient (c and d) and differentiated twice with regard to distance to display the model’s

predictions for land demanded per person.

67

(b) L*=α(y-tr)/ρ(r)

(c) ρ(d)= Rrty

try

/1

, where r and R are the distance and rent at the city’s boundary.

(d) Combining (2) and (3), L(u)=α(y-t r )1/α

(y-tr)(α-1)/α

/ρ(r)

(e) ∂L/∂r=t(1- α) •ρ(r)-1

>0

(f) ∂2L/∂r

2= -t(1- α) •ρ(r)

-2•∂ρ/∂r >0 (see Anas and Kim)

Equations (e) and (f) state that land per person increases at an increasing rate with distance from

the central point. Conversely, the population per unit of land will decrease at a decreasing rate.

Equation (f) is positive if ∂ρ/∂r is negative, a common finding noted in the text.

7. More specifically, the 9 cities tested individually were Springfield MA, Hartford CT, Rochester

NY, Akron OH, Grand Rapids MI, Greensboro NC, Shreveport LA, Salt Lake City UT and

Tacoma WA. The choices for these preliminary tests were based on the relatively large numbers of

zip codes and degrees of freedom available for these cities. Salt Lake City and Tacoma had

insignificant distance variables in the CBD dummy specification, while Akron had a significantly

negative coefficient for the square of distance in the second degree log specification. In the first

degree exponential with dummy variable specification, the zip code dummy was significantly

negative in Hartford and Akron, and was insignificantly negative for the other cities. The national

and regional tests were also far more significant for the zip code dummy specification.

8. Net population density describes the population density in residential areas only, while gross

density measures population density for an entire area regardless of primary land use.

9. The Northeastern cities are Springfield, Worcester and Lowell MA; Hartford and Waterbury

CT; Schenectady, Syracuse and Rochester NY; and Erie, Allentown and Reading PA. The

Midwestern cities are Rockford IL; Akron, Youngstown, and Canton OH; Ann Arbor, Flint,

Lansing, Battle Creek, Kalamazoo, and Grand Rapids MI; and Racine WI. The Southern cities are

Macon GA; High Point, Greensboro, and Wilmington NC, Greenville SC; Lake Charles and

Shreveport LA. The Western cities are Fort Collins CO; Boise ID; Billings MT; Salt Lake City

UT; Santa Clarita, Salinas, and Santa Rosa CA; Salem and Eugene OR, and Tacoma WA. The

South is somewhat underrepresented because of fewer circular cities, a lack of cooperation from

some local officials regarding the location of the city center, and a larger percent of the original set

of cities with fewer than 4 zip codes.

10. A related point is that zip code land areas are larger in more peripheral locations. A simple

regression indicated that for each mile of radial distance the average land area of a zip code

increased by 6.008 square miles, or 3,845 acres.

11. Zipf’s law is expressed as follows:

BRank

Kv Re , or ln Rev = ln K – β ln Rank.

12. More specifically, the highest and lowest ranks (one and the average number of firms above a

million, respectively), were then averaged using a geometric mean. For example, for the semi-log

specification of retail the estimated equation was Avg. Revenue = 5,712,622 – 1.285716.12 ln

Avg. Rank. With a mean of 28 firms per zip code reporting revenue above one million, the

revenue values were $5,712,622 for the largest firm and $1,035,719 for the 28th

firm. The

geometric mean of these is $2,432,420. These estimates are likely to quite conservative, as major

chains report revenue per store higher than the maximum estimates produced by this method. The

68

double log specification produced a geometric mean of over $7 million. This and a mean of the

double log and semi-log estimates were tested without meaningful effect on the results.

13. Davidson and MacKinnon (pp. 553-554) recommend these choices. HC2 adjusts standard errors

via the formula 1/(1-hii), , while HC3 uses 1/(1-hii)2,

where hii= xi(X’X)-1

x’. According to Long

and Ervin (2000), who recommend the use of HC3 in all cases, there is little distinction between

robust standard error forms with over 250 observations, as is true in this study. The model was

also tested with standard errors clustered by city.

14. This model and data set sit squarely in an area of uncertainty regarding the appropriateness of

clustered standard errors. The natural choice of clustering by city involves somewhat less than the

ideal minimum number of clusters, which according to Kedzi (2004) is about 50. Also, the

resulting clusters are moderately unbalanced. The number of zip codes per city range from 4 to 21,

with three cities (Rochester NY, Salt Lake City UT, and Tacoma WA) having over 15 zip codes

each and two containing slightly over 5 percent of the total sample of 345 zip codes, which

violates the rule of thumb suggested by Rogers (1993). Finally, the explicit fixed effects in the

model are limited to one insignificant variable in the business density equation (crime) and a

somewhat more significant single variable in the population density equation (city population).

For all of these reasons we report unclustered estimates in the text.

REFERENCES

Anas, A., Kim, I., 1992. “Income Distribution and the Residential Density Gradient,” Journal of

Urban Economics 31, 164-180.

Becker, Gary, The Economics of Discrimination, 2nd

Edition, (University of Chicago Press,

1971), Chapters 5 and 6.

Blanchard, L., Zhao, B., Yinger, J., “Do lenders discriminate against minority and woman

entrepreneurs?” Journal of Urban Economics, V. 63 (2008) pp. 467-497.

Bostic, Raphael and K. Patrick Lampani, 1999, “Racial differences in patterns of small

business finance: The impact of local geography,” Business Access to Capital and Credit:

A Federal Reserve System Research Conference: Proceedings of a Conference Held in

Arlington, VA, March 8-9, 1999, editors, Jackson L. Blanton, Alicia Williams, Sherrie

L.W. Rhine, Chicago: Federal Reserve Bank of Chicago.

Boston, T. D., Ross, C. L., (Eds.), 1997a. The Inner City: Urban Poverty and Economic

Development in the Next Century, Transaction Publishers, New Brunswick NJ.

Boston, T. D., Ross, C. L., 1997b. “Location Preferences of Successful African American-Owned

Businesses in Atlanta,” in: Boston, T. D., Ross, C. L. (Eds.), The Inner City: Urban

Poverty and Economic Development in the Next Century, Transaction Publishers, New

Brunswick NJ, pp. 337-357.

Davidson, R. and J.G. MacKinnon, Estimation and inference in econometrics, (New York:

Oxford University Press, 1993).

Gallagher, Charles A., “Miscounting Race: Explaining Whites’ Misperceptions of Racial Group

Size”, Sociological Perspectives, V. 46, No. 3 (Autumn 2003) pp. 381-396.

69

Initiative for a Competitive Inner City and the Brooklyn Economic Development

Corporation, 2005. Initiative for a Competitive Brooklyn: Seizing Our Moment

(March), (http://www.icic.org/atf/cf/%7BC81898B2-76E9-4A18-B838-

A3F65C9F06B9%7D/05-March-InnForACompBrooklyn.pdf )

Initiative for a Competitive Inner City and the Newark Alliance, 2006. Opportunity Newark:

Jobs and Community Development for the 21st Century.

(http://www.icic.org/atf/cf/%7BC81898B2-76E9-4A18-B838-A3F65C9F06B9%7D/06-

April-OppNewark.pdf )

Johnson, J. H., Farrell, W. C., Henderson, G. R. (1997). Mr. Porter’s “Competitive Advantage”

for Inner –City Revitalization: Exploitation or Empowerment?, in: Boston, T. D., Ross,

C. L. (Eds.), The Inner City: Urban Poverty and Economic Development in the Next

Century, Transaction Publishers, New Brunswick NJ, pp. 259-290.

Johnson, S.R., and Kau, James B. “Urban Spatial Structure: An Analysis with a Varying

Coefficient Model,” Journal of Urban Economics, V. 7 (1980), pp. 141-154.

Kézdi, Gábor, “Robust Standard Error Estimation in Fixed Effects Panel Models,’ Hungarian

Statistical Review, Special Volume 9 (2004), pp. 96-116.

Long, J. S., and Ervin, L., “Using heteroscedasticity-consistent standard errors in the linear

regression model,” The American Statistician, V. 54 (2000), pp. 217-224.

Lösch, A., 1954. The Economics of Location, New Haven: Yale University Press.

Martori, Joan Carles, and Jordi Surinach, “Classical Models of Urban Population Density: The

Case of Barcelona Metropolitan Area” 41st Congress of the European Regional Science

Association, 2001 (http://ideas.repec.org/p/wiw/wiwrsa/ersa01p46.html).

McDonald, John F., “Econometric Studies of Urban Population Density: A Survey,” Journal of

Urban Economics, V. 26 (1989), pp. 361-385.

Mills, E.S., and Price, Richard, “Metropolitan Suburbanization and Central City Problems,”

Journal of Urban Economics, V. 15 (1984), pp. 1-17.

Mills, E. S., Sendẻ Lubuele, L., 1997. “Inner Cities,” Journal of Economic Literature, vol.

XXXV (June), pp. 727-756.

Muth, R. F., 1969. Cities and Housing, Wheaton IL: University of Chicago Press, 1969.

Muth, R.F., “The Spatial Structure of the Housing Market,” Papers and Proceedings of the

Regional Science Association, V. 7 (1961), pp. 207-220.

Porter, M., 1995. “The Competitive Advantage of the Inner City”, Harvard Business Review, vol.

73, pp. 55-71.

Quillian, L., Pager, D., “Black Neighbors, Higher Crime? The Role of Racial Stereotypes in

Evaluations of Neighborhood Crime.” The American Journal of Sociology, Vol. 107, No. 3

(Nov., 2001), pp. 717-767.

Rogers, C. D., Gent, M., Palumbo, G., Wall, R., 2001. “Understanding the Growth and Viability

of Inner City Businesses,” Journal of Developmental Entrepreneurship, 6:3 (December),

237-254.

Rogers, William H., “sg17: Regression standard errors in clustered samples,” Stata Technical

Bulletin, V 13 (1993), pp. 19-23.

U.S. Department of Agriculture, “Access to Affordable and Nutritious Food: Measuring and

Understanding Food Deserts and Their Consequences,” June 2009

http://www.ers.usda.gov/Publications/AP/AP036/AP036.pdf

U.S. Department of Housing and Urban Development, 1999. New Markets: The Untapped

Retail Power of America’s Inner Cities. http://www.huduser.org/publications/

Villegas, Daniel, “Comparative Performance of Spatial Models of Economic Markets with Linear

Household Demand“ Southern Economic Journal, April 1982, pp. 893-908.

http://www.icic.org/atf/cf/%7BC81898B2-76E9-4A18-B838-A3F65C9F06B9%7D/05-March-InnForACompBrooklyn.pdf

http://www.icic.org/atf/cf/%7BC81898B2-76E9-4A18-B838-A3F65C9F06B9%7D/05-March-InnForACompBrooklyn.pdf

http://www.icic.org/atf/cf/%7BC81898B2-76E9-4A18-B838-A3F65C9F06B9%7D/06-April-OppNewark.pdf

http://www.icic.org/atf/cf/%7BC81898B2-76E9-4A18-B838-A3F65C9F06B9%7D/06-April-OppNewark.pdf

http://ideas.repec.org/p/wiw/wiwrsa/ersa01p46.html

http://www.ers.usda.gov/Publications/AP/AP036/AP036.pdf

http://www.huduser.org/publications/

70

Weiler, S., Silverstein, J., Chalmers, K., Lacey, E., Rogers, W., Widner, B., 2003.

„Understanding the Retail Business Potential of Inner Cities,” Journal of Economic

Issues, 37:4 (December), pp. 1075-1105.

Weissbourd, R., Berry, C., 1999. “The Market Potential of Inner-City Neighborhoods, Filling the

Information Gap,” Brookings Institution.

http://www.brookings.edu/es/urban/Weissbourd.PDF

http://www.brookings.edu/es/urban/Weissbourd.PDF

Date post:	25-Jul-2020
Category:	Documents
Upload:	others
View:	3 times
Download:	0 times

PENNSYLVANIA ECONOMIC REVIEWaux.edinboro.edu/pea/pub/PER/PER-V18-2-2011.pdfPennsylvania Economic...

Documents