+ All Categories
Home > Documents > Frontier Functions

Frontier Functions

Date post: 14-Apr-2018
Category:
Upload: yaronbaba
View: 217 times
Download: 0 times
Share this document with a friend

of 45

Transcript
  • 7/29/2019 Frontier Functions

    1/45

    Frontier Functions:

    Stochastic Frontier Analysis (SFA) &

    Data Envelopment Analysis (DEA)

    Sponsored by:The Martin School of Public Policy and Administration

    The Department of Economics

    The Research Office

    University of Kentucky

  • 7/29/2019 Frontier Functions

    2/45

    Production and cost functions

    A researcher wishes to estimate a production function ora cost function.

    The object is to estimate not the average production oraverage cost, but the maximum possible productiongiven a set of inputs or the minimum possible cost of aset of outputs.

    OLS regression estimates the mean of the dependentvariable conditional on the explanatory variables;

    Quantile regression is based on a quantile (e.g. 10th,25th,median, 75th, 90th), not the maximum or minimum;

    The max or min cannot be detected directly and used todefine the sample for selection bias analysis;

    Limited dependent variable models truncate thedependent variable into categories or limits but not themaximum or minimum.

  • 7/29/2019 Frontier Functions

    3/45

    Frontier functions: definition

    None of those standard econometric models isthe answer.

    The answer is frontier functions, econometricstochastic frontier analysis (SFA) or linear

    programming data envelopment analysis (DEA). Frontier functions estimate maxima or minima of

    a dependent variable given explanatoryvariables, usually to estimate production or cost

    functions. All frontier functions come from one paper,Aigner and Chu (1968).

  • 7/29/2019 Frontier Functions

    4/45

    Aigner and Chu (1968)

    D.J. Aigner and S.F. Chu (AER 1968), On Estimatingthe Industry Production Function invented this area.

    A viable distinction between the average and frontierfunctions as predictors of capacityderives from aprobability interpretation of alternative forecasts.thefrontier we construct is truly a surface of maximumpoints. This became Stochastic Frontier Analysis,Stochastic = probability interpretation.

    Estimation, for primary metals production in stateaggregates:

    one stage least squares and two stage least squares, quadratic programming (now rarely estimated), and linear programming, developed into Data Envelopment

    Analysis in Charnes, Cooper, and Rhodes (1978) andsubsequent research.

  • 7/29/2019 Frontier Functions

    5/45

    Varian (1984)

    Varian shows how to estimate and test for theWeak Axiom of Cost Minimization (WACM) andother microeconomic assumptions

    Varian suggests using either regression (SFA) orlinear programming (DEA)

    The WACM applies to for-profit, not-for-profit,private, and public producers

    The only requirement is that minimum inputs are

    intended to be used to produce desired output,or maximum output is intended from inputs used

    Profit maximization is not required

  • 7/29/2019 Frontier Functions

    6/45

    SFA and DEA

    Two large differences and another possibledifference

    SFA has a stochastic frontier with a probabilitydistribution

    DEA has a non-stochastic frontier SFA has one output, or an a priori weighted

    average of multiple outputs

    DEA often has more than one output, no a prioriweights, but assumes input-output separability

    Both can have stochastic inefficiency, SFAalways does, DEA sometimes does

  • 7/29/2019 Frontier Functions

    7/45

    One-sided disturbances

    In frontier functions, the disturbance has adistribution all on one side of zero

    the maximum production must be greater than

    or equal to any value in the sample, the minimum cost must be less than or equal toany value in the sample.

    produced quantities are bounded by the

    maximum, with non-positive disturbances costs are bounded by the minimum, with non-

    negative disturbances

  • 7/29/2019 Frontier Functions

    8/45

    MLE with a one-sided disturbance

    does not work well

    MLE and the Cramr-Rao lower bound(minimum variance of an asymptoticallyunbiased estimator, usually the MLE) arequestionable!

    Begin with a likelihood function L which showsthe probability of the data x given theparameters ,

    The parameters might be the mean andstandard deviation or might just be mathematicalparameters.

  • 7/29/2019 Frontier Functions

    9/45

    Setting up MLE

    Limits are a function of parameters in a non-stochastic frontier function: production function

    (max), cost function (min)

    L is Likelihood, L* is log likelihood. L() is always a probability distribution, so it

    follows that it integrates to 1.0 over the range of

    the data, from lower bound A to upper bound Z.

    AZ L(x | )dx = 1.

    Take the derivative wrt :

  • 7/29/2019 Frontier Functions

    10/45

    MLE: problems

    AZ[dL(x | )/d] dx + [dZ/d]L(Z)[dA/d]L(A) = 0 E(dL*/d) + [dZ/d]L(Z) [dA/d]L(A) = 0. The first derivatives of the log likelihood do not

    have mean 0 if those extra terms stay. Second derivatives add more unwanted

    derivatives if the limits are functions of theparameters.

    The negative inverse Hessian is not the varianceof the MLE.

    This is not working at all.

  • 7/29/2019 Frontier Functions

    11/45

    MLE: possible repairs

    Make the frontier stochastic and limits ofproduction or cost not a function of the

    parameters, completely eliminating the problem.

    Make the probability distribution have pdf of 0and derivatives of 0 at the limits, even though

    the limit itself is a function of the parameters:

    [dZ/d]L(Z) = 0 and [dA/d]L(A) = 0

    The Gamma Distribution can do that (Greene(1980)).

  • 7/29/2019 Frontier Functions

    12/45

    The Gamma Distribution

    The Gamma Distribution describes a non-negative random variable with two parameters (shape) and (spread)

    If ~ (, ), E() = /, V() = /2 pdf() = exp(-)-1/(),

    with different shapes for > 0, in ranges: less

    than 1, 1, between 1 and 2, 2, greater than 2 A graph follows; > 2 is required for the pdf and

    its derivatives to be zero at the limits.

  • 7/29/2019 Frontier Functions

    13/45

    Gamma Distribution: shapes

    0

    .2

    .4

    .6

    .8

    1

    0 1 2 3 4 5x

    alpha=1.0 alpha=1.5

    alpha=2.0 alpha=2.5

    Gamma distributions (lambda =1)

  • 7/29/2019 Frontier Functions

    14/45

    Ok, so a Gamma Distribution?

    No, not really. The parameters are restricted

    mathematically. That really annoys

    researchers. Some other distribution? No, no other one-

    sided distribution has the required

    properties at the limits. This is why no one has just onedisturbance .

  • 7/29/2019 Frontier Functions

    15/45

    Composite disturbances

    The disturbance has two parts

    Stochastic frontier (v), unlimited range as usual.The limits of the production or cost function are

    at infinity, not a function of the parameters Inefficiency (u), one sided, non-positive for

    production, non-negative for cost

    Finally, yj = xj + uj + vj , that is, j = uj + vj So there are two disturbance terms to keep theparameters from affecting the limits

  • 7/29/2019 Frontier Functions

    16/45

    Panel data: Fixed effects

    Panel data researchers would like toinclude fixed or random effects ineverything, so why not frontier models?

    Greene (2005) addresses this in detail. Fixed effects have special problems in

    non-linear models, but they can work

    Random effects are offered by Stata. Now there are three disturbance terms! yjt = xjt + j + ujt + vjt

  • 7/29/2019 Frontier Functions

    17/45

    Fixed effects in non-linear models

    Fixed effects have well known advantages inlinear models but in non-linear models they:

    are inconsistent (too small sample for each

    fixed effect), cannot be differenced out (differences ofnon-linear models are still non-linear),

    spread their inconsistency to other

    coefficients (assuming correlation with otherexplanatory variables, which is the motivation forfixed rather than random effects).

  • 7/29/2019 Frontier Functions

    18/45

    Wait, maybe fixed effects are ok

    With few units and many observations, fixed effects workbecause the sample size for each fixed effect might belarge enough. Greene (2005) points this out.

    Stata refuses to enter fixed effects in the model.

    The user can enter fixed effects. Random effects, normally distributed, are offered by

    Stata. As always, they must be assumed to beuncorrelated with explanatory variables.

    The independence assumption cannot be tested byStata, and there is no Hausman test, but Estimate fixed effects by direct inclusion and regress the

    fixed effects on explanatory variables to test theindependence required for consistent random effects.

  • 7/29/2019 Frontier Functions

    19/45

    Stata: all MLE, all the time

    Stata offers MLE with composite disturbances. The one-sided distribution is half-normal, truncated

    normal, or exponential (restricted Gamma)

    frontier dependent explanatory, d(hn) ord(tn) ord(e)

    In Stata, u is one-sided inefficiency and v is the two-sided stochastic frontier. Stata uses notation fromGreene (1990) in which = ratio of standard deviationsu/v, so that = 0 means there is no inefficiency.

    Fixed effects sneaked in by the user underfrontier, orrandom effects by Stata (normally distributed). xtfrontier dependent explanatory, re i(group_id) For minimization, use the option , cost

  • 7/29/2019 Frontier Functions

    20/45

    Stata: heteroscedasticity

    Stata offers a lot of heteroscedasticity: either uor v can be heteroscedastic, or both.

    Heteroscedastic u (one-sided error, inefficiency)

    Heteroscedastic v (two-sided error, randomvariation)

    The same explanatory variables, or different

    variables, can appear in the frontier and in theheteroscedasticity.

    frontier, uhet(var_name) vhet(var_name)

  • 7/29/2019 Frontier Functions

    21/45

    Stata estimates the inefficiency

    Stata estimates the technical efficiency,the percentage of estimated frontier outputattained or the extra percentage spent

    beyond frontier cost predict var_name, te As usual, many other options exist using

    predict. Successful Stata estimation is illustrated atthis point.

  • 7/29/2019 Frontier Functions

    22/45

    Is MLE necessary?

    If you always use Statas options, yes!

    If not, no!

    Not-MLE (1) Corrected OLS

    Not-MLE (2) Fixed effects in panels

    Not-MLE (3) Gamma-distributed inefficiency

    Note: the Gamma distribution or any other

    distribution of inefficiency is unrestricted if MLEis not used; only MLE has a range problem

  • 7/29/2019 Frontier Functions

    23/45

    Not MLE(1)

    Corrected OLS

    Estimate OLSthats all, just OLS yj = xj+ j Estimate residuals ej and interpret them as

    inefficiency

    Assuming production, inefficiency

  • 7/29/2019 Frontier Functions

    24/45

    Not MLE (2)

    Fixed effects as inefficiency

    Schmidt and Sickles (1984) but not inStatafixed effects required!

    Given panel data and fixed effects,

    assume that inefficiency is the fixed effect Estimate yjt = xjt + j + vjt by xtreg predict the fixed effects j and define the

    most efficient (production) max Inefficiency = max - j Min and reverse sign for cost functions

  • 7/29/2019 Frontier Functions

    25/45

    Not MLE (3)

    Gamma-distributed inefficiency

    Greene (1990), not in Stata

    Not a panel, yj = xj + (uj + vj) by reg andpredict the residuals e

    j

    = uj

    + vj

    Adjust residuals to one side of 0 by themax or min; the constant absorbs emax/min

    Assume v ~ (0, v

    2) and u ~ a Gamma

    distribution and estimate v2, , and

    E(e) isnt useful, fixed to 0 by OLS but

  • 7/29/2019 Frontier Functions

    26/45

    Not MLE (3)

    Gamma-distributed inefficiency

    V(e) = v2+ /2

    Skewness(e) = 2/3

    Kurtosis(e) = V2

    (e) + 6/4

    Three equations in three unknowns: V(v),

    two parameters of the distribution of u

    Standard errors by delta method or GMM But the range of the data is a function ofthe parameters? No problem, not MLE!

  • 7/29/2019 Frontier Functions

    27/45

    Failure of well-specified MLE:

    parameters

    Failure to converge; estimation continues indefinitelythrough many iterations with no sign of stropping.

    Repeated non-concave loglikelihood means the loglikelihood is not maximized, maximum likelihood fails;backed up means the loglikelihood decreases.

    Estimation fails to start, initial values not feasible. OLSstarting values imply negative infinite log likelihood.

    Apparent estimates but the SD of inefficiency (u) is small

    or the ratio ofu to the SD of the stochastic frontier (v),=u/v, is small, e.g. .01; sometimes goes as close tozero as Stata can make it, e.g. 0.00001.

  • 7/29/2019 Frontier Functions

    28/45

    Failure of well-specified MLE:

    distributions The truncated normal distribution of inefficiency has an

    extra parameter, the mean of the normal truncated at 0,which often fails in estimation.

    The exponential distribution slopes down from 0

    smoothly, which leads to initial values not feasible ifinefficiency is not strongly skewed right. The stochastic frontier can disappear from the model,

    leaving one-sided inefficiency that violates the MLErange rule (range not a function of the parameters).

    The half-normal is the most often successful, the mostcommon in the literature, and the default in Stata.

    Unsuccessful Stata estimation is illustrated at this point.

  • 7/29/2019 Frontier Functions

    29/45

    Data Envelopment Analysis (DEA)

    Envelop the m inputs and n outputs in m+nspace, i.e. a graph with points, with hyperplanes,i.e. lines/planes/etc.

    Linear programming Constant returns to scale (CRS) = CCR for

    Charnes, Cooper, and Rhodes (1978),

    Variable returns to scale (VRS) = BCC for

    Banker, Charnes, and Cooper (1984). Aigner and Chu (1968) did it first and also did

    quadratic programming

  • 7/29/2019 Frontier Functions

    30/45

    DEA assumptions

    DMU = decision making unit, business, bank,farm, not-for-profit, government, university, etc.

    All actual observed inputs and outputs of anyDMUs are feasible for all DMUs

    All linear combinations of observed inputs andoutputs are feasible.

    Free disposal of inputs and outputs.

    The production function or cost function ispiecewise linear, implying linear or non-differentiable functions everywhere.

  • 7/29/2019 Frontier Functions

    31/45

    DEA efficiency without prices

    Output-oriented technical efficiency is producing thegreatest possible output in the sense of a linear functionof a set of outputs given the value of a linear function ofinputs. No prices are involved. Efficiency = output thatcould be produced from inputs used, if >100%, inefficient.

    Input-oriented technical efficiency is producing a givenset of outputs with the smallest linear function of inputs.No prices are involved. Efficiency = percentage of actualinputs used that would be needed, if

  • 7/29/2019 Frontier Functions

    32/45

    DEA efficiency with prices

    Allocative efficiency is minimizing the cost of thelinear combination of the outputs produced,using input prices.

    Profit maximization: maximizing the value of

    outputs minus the value of inputs, using bothoutput and input prices

    Scale efficiency is operating at the scale ofoperation maximizing the ratio of the linear sum

    of outputs to the linear sum of inputs. An economically efficient business is technicallyand scale efficient.

  • 7/29/2019 Frontier Functions

    33/45

    DEA: tiny example

    constant returns to scale

    DMU x y y/x efficiency supereffic 1 1 6 6.00 1.0000 1.0909 2 2 8 4.00 0.6667 3 2 11 5.50 0.9167

    4 3 9 3.00 0.5000 5 3 13 4.33 0.7222 6 5 15 3.00 0.5000 DMU#1 has the highest y/x and others are inefficient

    according to their ratios of y/x DMU#1 could drop to 5.5 and still be efficient (see

    DMU#3), DMU#1s superefficiency is 6.00/5.50 = 1.0909

  • 7/29/2019 Frontier Functions

    34/45

    DEA: tiny example

    variable returns to scale

    DMU x y y/x efficiency supereffic 1 1 6 6.00 1.0000 2.0000 2 2 8 4.00 0.7000 3 2 11 5.50 1.0000 1.2143

    4 3 9 3.00 0.5333 5 3 13 4.33 1.0000 1.1667 6 5 15 3.00 1.0000 big DMUs#1,3,5,6 define the frontier

    DMU#2 is inefficient relative to 0.6 X #1 + 0.4 X #3 DMU#4 is inefficient relative to 0.4 X #1 + 0.6 X #3 DMU#1 could use twice the input and still be efficient

  • 7/29/2019 Frontier Functions

    35/45

    DEA graph

    0

    10

    20

    30

    0 1 2 3 4 5x

    crs vrs

    y

    Output-oriented DEA: input x, output y

  • 7/29/2019 Frontier Functions

    36/45

    DEA: standard setup

    N decision making units (DMU). Assume a linear function of n inputs produces m outputs. There is no economic production function or cost

    function in basic DEA.

    Assume the linear function of the inputs is minimizedgiven the linear function of the outputs,

    Equivalent: the linear function of the outputs ismaximized given the linear function of the inputs.

    Call inputs x and outputs y as in regression. Call the coefficients on inputs b and the coefficients onoutputs c. These are shadow prices in economics.

  • 7/29/2019 Frontier Functions

    37/45

    DEA: linear programming,

    input oriented (production)

    Consider DMU t, 1tN, N total producers tostudy, with m outputs and n inputs.

    DEA estimates each DMUs efficiency by itself,not relative to one estimated frontier. Each DMU

    t has an individual input and output function. Max, over c.t and b.t, i=1mcityit/ j=1nbjtxjt s.t. bjt0 and i=1mcityip/j=1nbjtxjp 1, all DMUs p. Linear fractional programming is difficult,

    maximizing the ratio of two linear functions;restate to maximize the numerator minus thedenominator, which is a linear program.

  • 7/29/2019 Frontier Functions

    38/45

    DEA: avoiding

    linear fractional programming

    Max i=1mcityit - j=1nbjtxjt s.t. bjt0, all j, andj=1nbjtxjt = 1, a normalization of total cost, andi=1mcityip - j=1nbjtxjp 0, all DMUs p.

    Note on math: given real z, functions f(z), g(z) all

    >0; substituting max f(z)-g(z) for max f(z)/g(z)implies that f(z) and g(z) are near 1.0, so thatln(f(z)) and ln(g(z)) are approximately linear.

    Setting total costs = 1.0 is a normalization butsetting total output (1) near 1.0 is anassumption that inefficiency is not too large.Linearizing overstates large inefficiencies.

    No standard errors, no statistical tests.

  • 7/29/2019 Frontier Functions

    39/45

    DEA including prices

    Basic DEA has no economic production or cost function,but see Ray (2004, Chapter 9), linear programming, withadditional constraints.

    Add constraints to the production or cost (linear) functionusing the market prices.

    Maximize output given inputs but add the linearconstraint on inputs that cost adds up to a total variablecost budget.

    An explicit production function can be added as aconstraint.

    DEA for profit maximization explicitly maximizes the totalrevenue from outputs minus the variable cost of inputsas a linear function.

  • 7/29/2019 Frontier Functions

    40/45

    DEA: what management

    consultants do

    Rank DMUs by efficiency Benchmark to efficient units Estimate superefficiency Use the coefficients to suggest alterations in

    resource allocations.

    Assumption: the production function that appliesto a particular DMU (farm, hospital, or university,

    e.g.) can be expanded or contracted linearly. DMUs with unusual combinations of inputs can

    appear efficient but be very difficult to emulate.

  • 7/29/2019 Frontier Functions

    41/45

    DEA: attempted standard errors

    Interpretation as MLE on efficiency: estimate aprobability distribution of estimated efficiencies. Thefrontier is still non-stochastic; the probability distributionis descriptive and post-estimation; this is not MLE.

    Chance-constrained linear programming: add adisturbance (maybe Gamma) to the non-stochasticfrontier. The frontier is still non-stochastic; not MLE.

    Bootstrap variances: random sampling variation inestimated efficiencies does not represent behavior of

    DMUs or the observed frontier in the actual data. No method provides econometric standard errors, the

    reason many econometricians just say no.

  • 7/29/2019 Frontier Functions

    42/45

    Comparing DEA and SFA

    Comparing SFA to DEA has not beendone very much

    Some work on hospitals

    The correlation of efficiency estimates isnot very high: 0.13-0.63 in hospitals,

    apparently similar elsewhere

    DEA focuses on individual DMUs, whileSFA focuses on estimating the frontier.

  • 7/29/2019 Frontier Functions

    43/45

    Research on frontier functions

    SFA and DEA results

    What systematic factors are associatedwith failure of SFA models:

    topic (banks, farms, hospitals, states, etc.),

    distribution (exponential, half-normal,truncated normal, gamma),

    explanatory variables, sample size, etc.?

    What systematic factors are associatedwith SFA and DEA results being similar ordifferent?

  • 7/29/2019 Frontier Functions

    44/45

    Research on frontier functions:

    methodology

    No theoretical reason to avoid the Gamma distribution,so use it in research and compare results.

    Apply SFA based on moments and compare with MLE. Quadratic programming (minimizing the sum of squared

    inefficiency terms) in DEA was difficult decades ago, buttoday? The method of Wolfe (1959) can be used.

    Aigner and Chu (1968) estimated quadraticprogramming and had apparently different estimates(with no standard errors) of capital-output elasticity and

    technology-output elasticity.

    Fractional linear programming also might be feasible inDEA given modern computing resources.

  • 7/29/2019 Frontier Functions

    45/45

    Go estimate frontier functions

    Economics and policy are often concerned withefficiency of banks, farms, governments, private andpublic agencies, for-profit and not-for-profit producers.

    The weak axiom of cost minimization is reasonable;

    profit maximization is not required. Statas frontier and xtfrontier are available and Statas

    restrictions can be evaded.

    DEA is used by management consultants, estimated bygeneral and specific linear programming packages.

    Comparative or methodological research is possible.


Recommended