Date post: | 14-Dec-2015 |
Category: |
Documents |
Upload: | lukas-hargis |
View: | 214 times |
Download: | 0 times |
Non-linearities, catastrophic risk and thresholds in resource economics
Eric Nævdal
Purpose of class
• Teach some advanced methods in optimal control theory
• Familiarize students with some applications of these methods to natural resource management.
• Enable students to solve simple problems numerically.
• A very applied course. No theorems; just methods!
Prerequisites
• A decent understanding of ordinary differential equations.
• A decent understanding of deterministic optimal control theory.
• E.g. Maxc 0∫T U(c,x)e-rtdt subject to x(0) = x0
and dx/dt=f(x,c). Here 0 < T ≤ ∞.
Cook book solution in easy steps
1) Define the Hamiltonian:
H(c,x) = U(c,x) + λf(c,x)
2) Find optimality conditions– c = argmax H → c = c(x,λ)– dλ/dt = rλ – ∂H/∂x =rλ – ∂U/∂x – λ∂f/∂x– Insert c from into dx/dt and = dλ/dt
3) You now have two differential equations.
What to do with the differential equations
• Compute steady states x* and λ*.
• f(c(x,λ),x) = 0 and
• rλ – U’x(c(x*,λ*),x*)– λf’x(c(x*,λ*),x*) = 0
• Draw a phase diagram.
x
λ
dx/dt = 0
dλ/dt = 0
x*
λ*
•Paths converging to steady states candidates for optimal solution
Optimal Control cont’d
• The co-state λ(t) has an interesting economic interpretation.
• If somebody at time t gave you a present of 1 unit of x so that x(t) jumps to x(t) +1, then λ(t) is (roughly) the value of that present at time t.
• Entirely analogous to shadow prices in static theory.
• If T < ∞, then we have the transversality condition λ(t) = 0 if x(T) is free.
Transversality conditions for infinite Horizon problems
• The economic literature is full of flawed transversality conditions.
• The reason is that it is hard to find general conditions without using stuff like lim sup.
• For the purposes of this class we are satisfied if we can find at least one path where both the (current value) shadow price and the state variable converges to finite numbers.
• If T = ∞, then we really have a hard time with pinning down good transversality conditions. See Seierstad and Sydsæter (1987) for details.
Alternative approach that will (once) be used later
• Take the c(x,λ) function and differentiate it.
• Gives dc/dt = c’xdx/dt + c’λdλ/dt =
= c’xdx/dt + c’λ(rλ – ∂U/∂x – λ∂f/∂x)
The solve c(x,λ) with respect to λ. Gives λ(c,x).
• Use dc/dt and dx/dt after inserting λ(c,x) as alternative differential equations
Example
• A firm has benefits from pollution given by: (u0 – u)2 • The pollution accumulates in nature according to dx/dt = u – δx • Damages from accumulated pollution given by –x2.• The problem:
maxu∫∞ (-x2 - (u0 – u)2)e-rtdt subject to:
dx/dt = u – δx , x(0) givenSolutionDefine the Hamiltonian:H= -x2 - (u0 – u)2 + λ(u – δx)• The value of u that maximizes the Hamiltonian is given by u = u0 + ½λ for
u0 + ½λ > 0. Else u = 0.• dλ/dt = rλ + 2x + δλ• dx/dt = u0 + ½λ – δx
Computing Steady States gives
• x* = u0(r+δ)/(1+δ(r+δ))
• λ* = –2u0/(1+δ(r+δ))
• From these expressions we see (for example)
• dx*/dr > 0
• dx*/dδ < 0
Phase diagramx ' = u + 0.5 y - d xy ' = r y + 2 x + d y
r = 0.05u = 1
d = 1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
-2
-1.8
-1.6
-1.4
-1.2
-1
-0.8
-0.6
-0.4
-0.2
0
x
y
Phase diagram with pathsx ' = u + 0.5 y - d xy ' = r y + 2 x + d y
r = 0.05u = 1
d = 1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
-2
-1.8
-1.6
-1.4
-1.2
-1
-0.8
-0.6
-0.4
-0.2
0
x
y
Phase diagram with paths and optimal paths for infinite horizon
x ' = u + 0.5 y - d xy ' = r y + 2 x + d y
r = 0.05u = 1
d = 1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
-2
-1.8
-1.6
-1.4
-1.2
-1
-0.8
-0.6
-0.4
-0.2
0
x
y
The same solution seen as a function of time
-1.2
-1
-0.8
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
0 2 4 6 8 10 12 14 16
Time
Sta
te v
ari
ab
le a
nd
sh
ad
ow
pri
ce
xl
Phase diagrams with finite vs inifinite time horizon
x ' = u + 0.5 y - d xy ' = r y + 2 x + d y
r = 0.05u = 1
d = 1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
-2
-1.8
-1.6
-1.4
-1.2
-1
-0.8
-0.6
-0.4
-0.2
0
x
y
Optimal Solution for various time horizons
-10
-8
-6
-4
-2
0
2
4
6
8
10
0 1 2 3 4 5 6 7 8 9 10
Time
x: T =1
λ: T=1
x: T =2
λ: T=2
x: T =4
λ: T=4
x: T =8
λ: T=8
x: T =10
λ: T=10
Crucial insight
• If T is chosen sufficiently large, there will be some value t* such that the optimal solution for the infinite horizon problem and the optimal solution for the finite horizon problem will be numerically indistinguishable over the interval [0,t*]. This allows us to solve infinite horizon problems on the computer
Getting a deeper understanding of Optimal Control
• Some mathematical background: Boundary value problems. A general class of differential equations.
• Consider the problem dL/dt =λL, L(0) = L0. You should all know that the solution is: L0eλt.
• But what about this problem?:
dL/dt =λL, L(T) = LT
Solving a boundary value problem
dL/dt =λL implies that L(t) = Ceλt for some constant C. This constant is found by using the boundary value condition:
CeλT = LT
Gives that C = LTe-λT. Therefore L(t) = LTeλ(t-T) .
Important. We can not independtly specify both L(0) and L(T). There is only one constant!
More boundary value problems
dL/dt =λL, L(0) = 1. dN/dt = N – L, N(1) = 1.Solution: L(t) = Ceλt and N(t) = (λ – 1)-1 (et – eλt)C +etK. We have two constants C and K. Determined by:
Ceλ0 = 1 and (λ – 1)-1 (e1 – eλ1)C +e1K = 1 • C = 1 and K = (e(λ – 1))-1 (1 + e – eλ – λ)• Solution is L(t) = eλt and • N(t) = (λ – 1)-1 ((et – eλt) +et-1 (1 + e – eλ – λ)
Why boundary value problems?
• The solution to an optimal control problem may be written as a boundary value problem. Best seen in finite time problems:
• max 0∫T U(c,x)e-rtdt subject to x(0) = x0 and
dx/dt=f(x,c). Here 0 < T< ∞. x(T) free.
• The maxmimum principle we know, but look at the transversality condition. λ(T)=0.
An even simpler example
Define the Hamiltonian:
H= -ax - ½(u0 – u)2 + λ(u – δx)
1. The value of u that maximizes the Hamiltonian is given by u = u0 + ½λ for u0 + ½λ > 0. Else u = 0.
2. dλ/dt = rλ + a + δλ
3. dx/dt = u0 + ½λ – δx
A Philosophical digression
• The difference between human ecology (AKA economics) and ecology.
• An ecosystem and its inhabitants are unemcumbered by precognition. Humans are not.
• In order to understand an ecosystem we need differential equations and initial values.
• In order to understand human behaviour we need transversality conditions. Humans operate by backwards induction
Numerical methods
• Standard Optimal Control Problems in the literature do the following:– Find explicit solutions. Works for very few
problems. – Phase diagram. Only works for problems with one
state variable.– Steady state analysis. May be hard to do for some
problems. Some times steay states are uninteresting
• Alternative: Numerical analysis
Numerical Methods in finite time - Shooting
• The basic problem; The Maximum Principle gives us a set of differential equations.
• An optimal solution must start with the known and correct initial value of x(0). It must also start with an unknown correct value of λ(0) such that λ(T) = 0.
• Alternatively if x(T) is given, λ(0) must start from a value so that those constraint holds.
• The fundamental problem: Solve a rather complicated equation to find λ(0).
A solution in two steps
• First write computer code that solves the differential equations for arbitrary values of x(0) and λ(0). (As if we are solving an initial value problem.)
• Then write a routine that finds the value of λ(0) that sets λ(T) = 0, (or x(T) to the required value).
• Luckily, there are ways of doing this without writing much code.– Using solver functions in Excel– BVP4C routine in Matlab
• Both methods work well, but may have to be tweaked.
Step 1. The 4th order Runge – Kutta Method
• General formulation. For an OC problem the vector y = [x, λ].
• Let dy/dt = f(t, y). Let h be a small number. Then y(t) is usually well approximated by the following sequence:
• y(t+h) = y(t) +(h/6)×(k1 + 2k2 + 2k3 + k4)
k1= f(t, y(t)), k2=f(t + h/2, y(t) +hk1/2)
k3=f(t + h/2, y(t) +hk2/2), k4= f(t + h, y(t) + hk3)
Starting Example
• Let dy/dt =y (1- y), y(0) = ½. The solution to this differential equation :
• y(t) = Exp(t)/(1 + Exp(t))
• No difference
whatsoever!
0
0.2
0.4
0.6
0.8
1
1.2
0 2 4 6 8 10 12
Time
Runge Kutta
True Solution
Setting up the differential equations for a control problem
• We will return to our previous example. The differential equations are:
• dλ/dt = rλ + 2x + δλ
• dx/dt = u0 + ½λ – δx
1. Start Excel.
2. Click on Sheet Tab to View Code
3. Open a module (Not class module!
4. Write code
Step 2 – Finding λ(0)
• May in principle be done by programming some suitable search algorithm.
• We are going to let Excel take care of it.• Two ways of doing this
– Goal Seek Function. Slow, robust, only handles problems with one state variable
– The Solver. Fast, will stop if the algorithm encounters errors, Handles a large number state variables.
• Let’s do it.
Alternative
• Use Matlab BVC4P function.
• Not really better or more robust.
• Good for when a large number of problems must be solved.
• Also, if the initial guess is far off, all solvers crash. BVC4P is good to generate a sequence of solutions that converges to the problem that one actually wants to solve.
Using Numerical Analysis Optimal Vaccination of non-persistent
epidemics• Very policy relevant
• Economists have made very limited contributions
• Shows the power of numerical analysis when out standard tool kit breaks down
• Solutions programmed in Matlab
Typical trajectory after outbreak – No vaccination
• McKendrick-Kermac model• Suceptibles x, infected y and recovered/dead z
dx/dt = –βxy, x(0) = N – εdy/dt = βxy – γy y(0) = ε = ininital infected
populationdz/dt = γy, z(0) = 0
Essensial paramter γ/β.
Trajectory without vaccination
x
y
Note the effect of reducing suceptibles before outbreak.- More suceptibles at the end of an epidemic episode.- Fewer infected
Model with vaccination
• Individuals may be vaccinated u.dx/dt = –βxy – u, x(0) = N – εdy/dt = βxy – γy y(0) = ε = ininital infected dz/dt = γy + u, z(0) = 0
Objective function:∫(-wy - ½cu2)e-rtdt K is the cost of disease .½cu2 is the cost of vaccination
Must be solved numerically. Standard tools of optimal control useless.
The value of reducing the stock of suceptibles
• The shadow price of x multiplied by -1 is the value of vaccinating one ”population unit.”
• The graph show the shadow price on x for different values of w
• What does it mean that some of the curves are not monotonic?
0.3 0.4 0.5 0.6 0.7 0.8 0.9 1-0.35
-0.3
-0.25
-0.2
-0.15
-0.1
-0.05
0
Antall mottakelige
Mar
gina
lkos
tnad
en a
v m
otta
kelig
e
w2
w4
w1
w3
Succeptibles at outbreak
Explaining “increasing returns”
• ”Brush fire” effekt. At high levels of x, the disease spreads so rapidly that the return on vaccination prior to outbreak is reduced. Flow with the punch (relatively speaking) becomes optimal strategy.
• High disease costs reduces brush fire effect
New Section Multiple Equilibria
• Many systems exhibit non-linear dynamics. May or may represent a technical challenge
• Here we look at convexo-concave differential equations.
• Important to note that systems that naturally exhibit multiple equilibria may not do so when controlled optimally
Example – Eutrophication
• Let x be the nutrient (phosphoros and nitrogen) loading in a lake.
• Let u be the deposition of nutrients.• The ecologists claim that the dynamics of the lake
may be reasonably modeled by:
• Analysis taken from W.A. Brock and D. Starrett and K-G Mäler, A. Xepapadeas, A.de Zeeuw
0,0,1
bx
xbxux
dt
dx
Dynamics with low loading (u)
-1
-0.8
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
0 0.5 1 1.5 2 2.5 3
Time
dx/d
t
Positive derivative
Negative derivative
The effect of increased (constant) loading
-1
-0.5
0
0.5
1
1.5
0 0.5 1 1.5 2 2.5 3
Time
dx/d
t
New unique steady state.Global attractor
The flip is irreversible even if u is set to zero!
-1
-0.5
0
0.5
1
1.5
0 0.5 1 1.5 2 2.5 3
Time
dx/d
t
Point of no return
"Best" long run state of nature if we go past point of no rerurn
Zero loading equation
Management
• For economic analysis we need some evaluation of consequences.
• Instantaneous benefits from nutrient use given by ½ln(u)
• Instantaneous damages from eutrophication given by –cx2.
The optimal management problem
lx
xbxucxuH
xx
xbxux
dt
dxts
dtecxu rtu
1ln
:nHamiltonia
given 0,1
..
lnmax
2
0
2
Transforming into equations in (u,x) space
space ,in equations
aldifferentiget we1
ith Together w
12
:gives for expression theinto and 1
Insert
and 1
22
2
uxx
xbxux
xx
xurubucxuu
u u
uu
ll
lll
Phase diagram – Low loading optimalx ' = y - b x + (xq)/(1 + xq) y ' = 2 c x y2 - y (b + r - ((xq) q)/(x (1 + xq)2))
b = 0.6q = 2
c = 1r = 0.05
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
0
0.05
0.1
0.15
0.2
0.25
0.3
x
y
Phase diagram – High loading optimalx ' = y - b x + (xq)/(1 + xq) y ' = 2 c x y2 - y (b + r - ((xq) q)/(x (1 + xq)2))
b = 0.6q = 2
c = 0.1r = 0.15
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
x
y
Phase diagram –Three steady statesx ' = y - b x + (xq)/(1 + xq) y ' = 2 c x y2 - y (b + r - ((xq) q)/(x (1 + xq)2))
b = 0.6q = 2
c = 0.5r = 0.05
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
0
0.05
0.1
0.15
0.2
0.25
0.3
x
y
Words of warning
• The differential equation for dλ/dt was wrong on the slide (but correct in the calculations). The right one is:
• When doing numerical analysis, you should still use your brain! I didn’t and it took me ages to figure out why numeric methods didn’t work on the eutrophication example. Can you see why it doesn’t work?
2
1
12
lllx
xbcxr
Modify the Eutrophication model
lx
xbxucxuH
xx
xbxux
dt
dxts
dtecxuu rtu
1ln
:nHamiltonia
given 0,1
..
2
1max
2
0
220
High cost of eutrophication – Only oligotrophic Equilibrium
x ' = 5 + y - b x + (xq)/(1 + xq) y ' = 2 c x + (b + r - ((x(q - 1)) q)/((1 + xq)2)) y
b = 0.6q = 2
c = 0.8r = 0.15
0 0.5 1 1.5 2 2.5 3
-8
-7
-6
-5
-4
-3
-2
-1
0
x
y
Intermediate cost – The Skiba story
x ' = 5 + y - b x + (xq)/(1 + xq) y ' = 2 c x + (b + r - ((x(q - 1)) q)/((1 + xq)2)) y
b = 0.6q = 2
c = 0.7r = 0.15
0 0.5 1 1.5 2 2.5
-7
-6
-5
-4
-3
-2
-1
0
x
y
High cost of eutrophication
x ' = 5 + y - b x + (xq)/(1 + xq) y ' = 2 c x + (b + r - ((x(q - 1)) q)/((1 + xq)2)) y
b = 0.6q = 2
c = 0.5r = 0.15
0 0.5 1 1.5 2 2.5 3 3.5
-8
-7
-6
-5
-4
-3
-2
-1
0
x
y
A
B
Oh dear!
• For low values of x we have two solution candidates!
• Which one to choose. I havn’t proved this formally (although it may be worth a try) that as long as we choose a finite horizon, then the we stay close to A manifold in order to satisfy transversality conditions. I therefore believe this is the relevant manifold
Genetic Management
• Many people are concerned with biodiversity.• I am, for various reasons, not. But I am
concerned about outcomes.• Harvesting living resources leads to
evolutionary genetic selection.• How to regulate this?• Also, an example of non-standard analysis.
Objective
Construct a bioeconomic model where we analyze the effect of selective harvesting on genetic frequency for one specific gene in terms of the socially optimal long-term management of the resource. This object is determined solely through the profits generated by harvesting
Genetic Dynamics
• Standard model of Mendelian genetics• Two alleles, A and a of the same gene. • The homozygotes AA and the heterozygote Aa are of
phenotype G (for good) • The homozygote aa are of phenotype B (for bad)• The frequency of a is q. (q = 1 is bad. q = 0 is good.)• Only individuals of type G are of commercial
interest, and harvesting is totally selective
The model
• Determination of phenotype frequency is determined by gene frequency.
• Population dynamics. In this model B will lose in the absence of harvesting.
BGihxK
xxxrx iii
BGiii , ,1
BG xxx rBG rrr GB
Comparing Phenotype Productivity
• Let hG = xGE and hB = 0.
• One can calculate the difference in productivity between B and G. The difference is:
G BB G
G B
x xs E
x x
Genetic Dynamics
• Changes in x = xB + xG is determined by
• Because of evolutionary pressure imposed by selection, the Hardy-Weinberg law may not apply and q will be a non-constant function of time. q is determined by
GBG hxqxqK
xrxx
2211
2
2
1
1
sq
qsqq
Where s is measure of the extent to which natural selection is acting to reduce the relative contribution of a given genotype to the next generation. Note that it is also the productivity difference!
Optimal Management
• Focus on how selection affects the dynamics of q.• Assume a sole owner that wants to maximize the discounted
profits from harvesting the resource. • Harvesting is assumed costless and valued at an exogenous
price • Gives objective function
0
22
0
1dteEEqEr
r
KqpMax
dtephMax
tGBG
E
tG
E
Solution
• We had to invent some stability analysis.
• From the expression for , s = 0 in steady state.
• From the expression for :
• Inserting these expressions into yields a key expression (q), which happens to be a 4th order polynomial.
qdH
dql l
2B B G
pqKr
rl
dH
dE
Results
• No explicit solution, but the structure of the problem enables us to determine steady state values of q.
• Also possible to find the stability properties of the steady states.
• Boils down to the initial magnitude of q and the relationship between:
Results, key expressions
Br
B G
the intrinsic productivity of phenotype B, measures how fast phenotype B regenerates
gives the selection in absence of harvesting or the rate at which nature selects for phenotype G. Can also be interpreted as the internal rate of return on preserving phenotype G
the discount rate measuring the opportunity cost of capital
Case 4
GB
GBBr
Optimal to conserve A at low levels of q because the low productivity of phenotype B, but at some point the low internal rate of return makes the extinction of phenotype G optimal
The value function
• We have our original OC problem:
maxc 0∫T U(c,x)e-rtdt subject to x(0) = x0 and
dx/dt=f(x,c).
The value that we get depends on x0. Thus we have J(0,x0,T) = maxc 0
∫T U(c,x)e-rtdt .
The function J(0,x0,T) is called the value function. It is what we get when our system is run as optimized.
The embedded problem
Now consider the problem:
maxc t∫T U(c,y)e-rsds subject to y(t) = x and dy/dt=f(y,c).
What have we done?• Renamed the state variable from x to y.
• A new arbitrary starting point t rather than zero. (t> 0)
• Have a new initial condition y(t) = x.
• We now have a new value function J(t,x) =J(0,x) e-rt.
What is J(t,x)?
• It is the same problem as the original, except that we have an arbitrary starting point.
• This function has some economically important properties.
• Shadow price:
Interpretation: If somebody gives you a present of one unit of x at time t, what is the value of that present.
tx
TxtJ l
,,
Another property
• The value of living.
• This has an important interpretation. At every point in time you eat the Hamiltonian. This has given rise to the interpretation of the Hamiltonian as the Net National Product. See recent book by Weitzman.
• HΔt = U(c,x) Δt + λf(c,x) Δt
)(),(,,
,)(),(,,
TxTcHT
TxtJtxtcH
t
TxtJ
The value of Consumption today
The marginal value of “capital” The amount of investment
Some useful extensions of deterministic control theory (1)
• Variable final time.
• Maxc,T 0∫T U(c,x)e-rtdt subject to x(0) = x0 and
dx/dt=f(x,c). T is a choice variable.
• Extra condition required to fix T. It turns out that
• Necessary condition is then H(T,c(T), x(T)) = 0
)(),(,,
txtcHT
TxtJ
Very simple example
• Maxu,T ∫T(–9 – ¼u2)dt subject to dx/dt=u and x(0) = 0 and x(T) = 16.
Hamiltonian H=(–9 – ¼u2)+λu.Maximising H w.r.t. implies u = 2λ.dλ/dt = -dH/dx = 0 → λ = constant K.dx/dt=u = 2λ = 2K → x(t) = 2Kt + C.We now have two constants, K and C, that we
need to determine. We cant use λ(T) = 0 as we have constrained x(T).
Therefore…
x(t) = 2Kt + C. → x(0) = 0 → C = 0.x(T) =16 = 2KT → K = 8/T.We then have that λ =K = 8/T and u = 2×K =
16/TNow find optimal T.The Hamiltonian is now H*= -9 - ¼(16/T)2+ 2×(8/T)2 = 0Solving this for T gives T=8/3
Some useful extensions of deterministic control theory (2)
• Scrap value problems - a complicated problem finite horizon problem.
• E.g. Maxc,T 0∫T U(c,x)e-rtdt +S(x(T)) e-rT.
subject to x(0) = x0 and dx/dt=f(x,c). • Conditions:• The same conditions on c and λ. Different
transversality conditions:• λ(T) = S’(x(T)) e-rT. U(c,x)e-rt + λf(x,c) = r
S(x(T)) e-rT .
A closer look at these conditions
• Why are these conditions not surprising?• λ(T) = S’(x(T)). Says that the shadow price of x
should be continuous.
• U(c,x) + λf(x,c) = r S(x(T)) . Says that the change in utility from continuing one more unit of time should be the same as the loss from not cashing in on the scrap value. Remember that U(c,x)e-rT + λe-rTf(x,c) = J’(T,x).
Deterministic Threshold Problems Irreversible case.
• A threshold is here defined to be some curve in state space such that crossing this curve leads to a discrete jump in state-variables.
• Preliminary observation; if the location of the threshold is known one can choose to cross the threshold or not cross the threshold.
• An irreversible threshold effect can be modeled as a scrap value problem with endogenous time horizon
Mathematical description
• From Seierstad and Sydsæter Theorem 3 and Note 2, pp 182.
• Let x be of dimension n• Let the threshold be given by R(x) = 0• Let τ be the first point in time when the threshold is
reached. We assume that is R(x(τ)) = 0.• Let the threshold effect be x(τ+)-x(τ) = g(x(τ))
– x(τ+) =x(τ-) = limt→τx(t) from below
– x(τ+) = limt→τx(t) from above
Formulating the Optimization Problem(s) - 1 Going over the
threshold
• Maxc,τ 0∫τ U(c,x)e-rtdt + S(x(τ)+ g(x(τ))) e-rτ
subject to x(0) = x0, dx/dt=f(x,c) and R(x(τ)) = 0.
Here we should think of S S(x(τ)+ g(x(τ))) as a value function. In fact S(x) is given by
Maxc,τ τ∫∞ U(c,y)e-rtdt subject to y(0) = x and
dy/dt=f(y,c).
Solve the problem recursively
• From Maxc τ∫∞ U(c,y)e-rtdt subject to y(0) = x
and dy/dt=f(y,c) we get the shadow price λ(x|τ).
Optimality conditions (Present value)
• u maximizes the Hamiltonian
• dλ/dt = –∂H/∂x
• λ(τ) = λ(x(τ)|τ)(1+g’(x)) + γ∂R/∂x
Note that all these may be vectors
• If τ lies in (0, ∞), then
• H + ∂(S(x(τ)+ g(x(τ))) e-rτ)/∂t = 0
Optimality conditions (Present value)
• u maximizes the Hamiltonian
• dλ/dt = –∂H/∂x
• λ(τ) = λ(x(τ)|τ)(1+g’(x)) + γ∂R/∂x
Note that all these may be vectors
• If τ lies in (0, ∞), then
• H + ∂(S(x(τ)+ g(x(τ))) e-rτ)/∂t =
H – r(S(x(τ)+ g(x(τ))) e-rτ) = 0.
Formulating the Optimization Problem(s) - 2 Going over the
threshold
• Maxc,τ 0∫τ U(c,x)e-rtdt + S(x(τ)))e-rτ subject to
x(0) = x0, dx/dt=f(x,c) and R(x(τ)) = 0.
Again we should think of S(x(τ)) as a value function. In fact S(x) is given by
Maxc,τ τ∫∞ U(c,y)e-rtdt subject to y(0) = x,
dy/dt=f(y,c) and R(x(t)) = 0.
Solve the problem recursively
• From Maxc τ∫∞ U(c,y)e-rtdt subject to y(0) = x,
dy/dt=f(y,c), and R(x(t)) = 0. We get the shadow price λ*(x|τ).
• Note: Here I have excluded the possibility that R(x(t)) ≠ 0 for some t > τ. It is however perfectly possible that we may “turn away” from the threshold at some point in time. In particular if we are studying a finite horizon problem
Optimality conditions (Present value)
• u maximizes the Hamiltonian
• dλ/dt = –∂H/∂x
• λ(τ) = λ*(x(τ)|τ) + γ∂R/∂x
Note that all these may be vectors
• If τ lies in (0, ∞), then
• H + ∂(S(x(τ))) e-rτ)/∂t = 0
A simple threshold model (Nævdal 2003)
• Let dx/dt = u – δx.
• Let the threshold be x’.
• Let B(u) be maximised at B(u*) for u* > 0.
• Let the let the utility function be A + B(u) if the threshold is not crossed and let the benefit function be B(u) if it is crossed.
Avoid Crossing the threshold?
• Let Te be the time at which the threshold is crossed.
• If we do not cross the threshold, then after getting to x(Te) = x’ we must freeze u. Thereafter there is no need to reduce u so u = δx’ for all t > Te.
• Then Se(x(Te))e-rTe = e-rTeTe∫
∞(A+B(u*))e-rtdt
Crossing the threshold?
• Let Ta be the time at which the threshold is crossed.
• If we cross the threshold, then we accept the damage. Thereafter there is no need to reduce u so u = u* for all t > Ta.
• Then Sa(x(Ta))e-rTa = e-rTaTa∫
∞B(u*)e-rtdt
Optimality conditions prior to hitting the threshold
• Hamiltonian is the same for both problems• H = (A + B(u))e-rt + p(u –δu)
Gives that B’(ui)e-rt + pi = 0
dpi/dt = δpi → pi(t) = Cie-δt
x(Ti) = x’ i = a,e
The shadow prices after hitting or crossing the threshold are both zero.
u is a decreasing function of time. Pollute early and then reduce
Note
• Both are problems with variable final time and the requirement that xi(T) = x’. We need condition to find Ti.
Staying on the edge
• Condition for optimal Te.
• This condition can be used to prove that ue(t) is continuous at Te.
Important to note
• There are kinks (ue) or jumps (ua) in the optimal paths of the control.
• The previous literature on deterministic thresholds had overlooked:
1. That a threshold could either be crossed or observed
2. That optimal controls may be discontinuous at the time the threshold is crossed. (In some models this can happen when the threshold is reached even if it is not crossed.)
Comparing Scenarios
I said no proofs, but this one is rather instructive and a good example of how simple proofs can give interesting results
Incorporating uncertainty into optimal control
• Most processes are subject to uncertainty
• One class is Brownian motion. I will not talk about that at all.
• Catastrophic events.– Tsunamis– Floods– Car breakdown
Categories of catastrophic uncertainty
• Endogenous vs. exogenous risk– Some processes we can (partially) control. This is
endogenous risk.– Some processes are truly beyond our abilities. Exogenous
risk.
• State-space distributed risk vs time-distributed risk.– State space distributed risk requires movement through
state space. (thresholds)– Time distributed risk depends on location in state- space.
For a fixed location in state-space, uncertainty is invariant with respect to time. This will perhaps become clearer. (I hardly understand it myself)
Preliminaries – Poisson processes in continuous time
• We are here concerned with events that occur at random points in time. In order to deal with these problems they must have some kind of distribution that we actually know.
• We shall refer to the point in time when the event occurs as τ (tau) and it is distributed over a subset of the positive real numbers [0, β) where 0 < β ≤ ∞.
• The pdf is g(t) and the cdf is G(t) = 0∫tg(s)ds.
Poisson processes and conditional updating.
• Our process starts t = 0. We make it to t* >0. What is the distribution of τ conditional on us having made it to t*?
Answer:
We need optimality conditions that reflect updating of the distribution as long as τ does not happen.
*
*|
tdssg
tgtttg
The hazard rate - A very useful concept
• We ask the question; What if we have made it to time t? What is the probability that τ occurs in the time interval (t, t + Δt)?
• If dt is small, then that probability is roughly Pr(τ (t, t + Δt))=g(t|τ>t)×dt.
• g(t|τ>1) is of course:
tdssg
tgtttg 1|
Example - the exponential distribution
• Let μ0 be a positive constant number.
• g(t) = μ0e-μ0t. → G(t) = 1 - e-μ0t.• Then μ(t) = μ0. We have a constant hazard rate. This
is only true for the exponential distribution.
• Interesting fact: E(τ - t) = t∫∞sµe-µsds = 1/µ. The
expected conditional waiting time is constant• We can in principle calculate hazard functions for any
g(t). They are frequently messy.
We have the hazard rate – What is the pdf?
• A very useful property. Let μ(t) be any function that is:– continuous and positive for all t
– has the property that 0∫βμ(t)dt=∞.
• Then we can construct a cdf by solving the following differential equation.
μ(t) = y’(t)/(1 – y(t)), y(β) = 1
Finding a pdf
μ(t) = y’(t)/(1 – y(t)), y(β) = 1
implies that y(t) = 1 – C×Exp(-∫tμ(s)ds)
Here C is a constant determined by the boundary condition. For many choices of hazard rate this constant will be equal to 1.
Counter example: μ(t) = Exp(t)
Having found y(t), the cdf, we can find the pdf y’(t) = C×μ(t)×Exp(-∫tμ(s)ds)
Example – linear hazard rate
• Assume that μ(t) = μ0t defined for t € [0, ∞). Then Exp(-∫tμ(s)ds) = ½μ0t2. Therefore
y(t) = 1 – C× Exp(-½μ0t2) . y(∞) = 1 implies that C = 1, so the cdf is 1 – Exp(-½μ0t2) and the pdf is y’(t) = μ0t×Exp(-½μ0t2) .
Hazard rate continued
• The previous results also hold if the hazard rate can be written μ(t) = φ(x(t)) (or φ(t,x(t)) for that matter) where x(t) is any function of t.
• This holds as long as:– φ(x(t)) only takes positive values– The integral ∫φ(x(t))dt does not converge
• Thus we can work with very general density pdfs over time over time: φ(x(t))Exp(- ∫tφ(x(s))ds)
Exogenous vs endogenous uncertainty
• Slightly confusing literature. Here the difference is as follows.
• If φ(x(t))is the hazard rate and x(t) is determined by a controllable differential equation, then the stochastic process is endogenous.
• If not, the process is exogenous• his is not clear cut. Hurricanes may be (in part)
endogenous to US policymakers but exogenous New Orleans
Controlling exogenous catastrophic uncertainty
• Basic problem: Nature or (somebody we can’t affect) triggers a catastrophic event.
• We can not control the probability of the event occurring, but we can control:– preparedness (what to do before the event)– consequences (how to act after the event)
• No strict boundary between preparedness and consequence management.
The mathematical formulation
• In infinite time:Max E{∫∞(U(x,u)e-rt)dt + h(x(τ-))subject to dx/dt=f(x,u) for all t ≠ τ
x(τ+)-x(τ-) = g(x(τ-))τ distributed μe-μt over [0, ∞)
g(x(τ-)) is the physical description of the eventh(x(τ-)) is the instantaneous value of the shock.
x(τ-) = limt→τx(t) from belowx(τ+) = limt→τx(t) from above
Extreme example
• A tsunami in a community– We have utility before the disaster. Depends on
consumption and stock of capital. – The instantaneous cost of the disaster, depends on
stochasticity and the stock of preparedness capital.– Utility after the disaster, depends on consumption
and the stock of capital that survived the disaster
Solving the problem a recursive algorithm
• First find the optimal policy after the disaster.
• Find optimal policy before the disaster.
• Sounds pretty simple…
Post-disaster control
• There is no stochasticity. Simply solve the following problem:
J(t,x|τ = t)ert = maxu ertt∫
∞U(u,y)e-rsds subject to y(0) = x
and dy/dt=f(y,u). • This looks like our old friend the embedded problem. • Here t and x are arbitrary. The indicate the possible
state(s) of the world after a disaster.• The notation J(t,x|τ = t) indicates that J is evaluated
conditional on τ happening at time t.
Pre-disaster control
• From the post disaster control we will need J’x(t,x|τ = t)ert = λ(x|τ). This is the shadow price of x at the instant after the catastrophic event occurs.
• In this problem, that is really all we need from the post-disaster program.
Pre-disaster program- Necessary conditions Hazard rate depends on t.
• Define the Hamiltonian:
H = U(u,x) + λf(u,x) + µ(t)(J(x+g(x)|τ) –J(x) + h(x))
Utility today
Marginal value ofX × increase in x
Hazard rateChange in utility if Catastrophic event occurs
Optimality Conditions
• Apply the maximum principle to the Hamiltonian.
• u =argmax H
• dλ/dt = rλ - ∂H/∂x
• dx/dt=f(x,u)
• Transversality condition as in deterministic models.
The differential equation for λ
• dλ/dt = rλ - ∂H/∂x
= rλ - ∂U/∂x - λ∂f/∂x
- µ((∂J(x+g(x)|τ)/∂x –∂J(x)/∂x)+h’(x))
But ∂J(x+g(x)|τ)/∂x = λ(x|τ)(1 + g’(x)) and ∂J(x)/∂x = λ, so:
dλ/dt = rλ - ∂U/∂x - λ∂f/∂x + µ(λ - λ(x|τ)(In + g’(x)) – h’(x))
Note: In is the identiy matrix where n is the number of elements in x(t)
Example
Let U(x,u) = -ax - ½(u0 – u)2 and f(x,u) + u – δx
Let τ be exponentially distributed with hazard rate µ. Let x(τ+)-x(τ-) = βx(τ-). h(x) = 0.
Could be a model of pollution where there is a possibility that a shock increases the stock of pollutants.
Optimality conditions after event
H= -ax - ½(u0 – u)2 + λ(u – δx)1. The value of u that maximizes the
Hamiltonian is given by u = u0 + λ for u0 + λ > 0. Else u = 0.
2. dλ/dt = rλ + a + δλ 3. dx/dt = u0 + λ – δxdλ/dt = rλ + a + δλ → λ(t|τ) = -a/(r + δ) for all t.
We make note of λ(t|τ) and proceed to the pre-event control
Pre event conditions
1. The value of u that maximizes the Hamiltonian is given by u = u0 + ½λ for u0 + ½λ > 0. Else u = 0. Same as before!
2. dx/dt = u0 + ½λ – δx. Same as before!
3. dλ/dt = rλ + a + δλ +µ(λ – (–a(1+β)/(r + δ)))
Here is the difference in the conditions. Obviously this implies that u and x will be affectedby the risk
We can solve for λ
• Solution is:
• Here K is a constant that must be determine by transversality conditions. We just not that K ≠ 0 implies no convergence to steady state so we just set K = 0. Note that if β = 0, then pre-event solution is same as post-event
trKerr
rat
l
1
The we have that….
• The optimal value of u(t) is found by inserting λ(t) into u0 + ½λ.
• The optimal value of x(t) is fund by integrating dx/dt = u0 + ½λ – δx
• I will not do this as the resulting expression is a bugger.
• Numeric solutions are straightforward to find
The effect of uncertainty – Let us increase µ
-6
-4
-2
0
2
4
6
8
10
0 0.5 1 1.5 2 2.5 3 3.5
Time
0.5l 0.5 1l 1 2l 2
Weird hazard rates
• The setup is good enough to include time dependent hazard rates. That is, the hazard rate depends on time.
• Let us consider a hazard rate a×(Sin(t) + 1). This implies a cdf given by 1 – exp(-1-t+Cos(t)). Cdf looks like this:
2 4 6 8
0.965
0.97
0.975
0.98
0.985
0.99
0.995
Pre-disaster program- Necessary conditions Hazard rate depends on t.
• Define the Hamiltonian:
H = U(u,x) + λf(u,x) + µ(t)(J(x+g(x)|τ) –J(x) + h(x))
Utility today
Marginal value ofX × increase in x
Hazard rateChange in utility if Catastrophic event occurs
Optimality Conditions
• Apply the maximum principle to the Hamiltonian.
• u =argmax H
• dλ/dt = rλ - ∂H/∂x
• dx/dt=f(x,u)
• Transversality condition as in deterministic models.
NO CHANGE!
The differential equation for λ
• dλ/dt = rλ - ∂H/∂x
= rλ - ∂U/∂x - λ∂f/∂x
- µ(t)((∂J(x+g(x)|τ)/∂x –∂J(x)/∂x)+h’(x))
But ∂J(x+g(x)|τ)/∂x = λ(x|τ)(1 + g’(x)) and ∂J(x)/∂x = λ, so:
dλ/dt = rλ - ∂U/∂x - λ∂f/∂x + µ(t)(λ - λ(x|τ)(In + g’(x)) – h’(x))
Note: In is the identiy matrix where n is the number of elements in x(t)
Endogenous risk – Time distributed
• Recall that we know face a controllable hazard rate.
• The hazard rate depends on a state variable
Endogenous time distributed risk
• Here the hazard rate will depend on the state variable.
µ = µ(x).
Example µ(x) = µ0x
• Remember that x is a function of t because it is determined by an differential equation
1 2 3 4 5
0.1
0.2
0.3
0.4
0.5
0.6
This is the pdf
Endogenous risk continued
• Say that x(t) is driven by dx/dt = - ax + b, x(0) = X. Then x(t) = e-at(X – b/a) + b/a.
• The hazard rate is then transformed from a function of x to a function of time.
µ = µ0×(e-at(X – b/a) + b/a)
1 2 3 4
1
2
3
4
Illustration for different starting points of x(0)
Recursivity - Working yourself backwards
• As before. First solve the post disaster problem. Get the shadow prices
Post-disaster control (Copy of previous slide)
• There is no stochasticity. Simply solve the following problem:
J(t,x|τ = t)ert = maxu ertt∫
∞U(u,y)e-rsds subject to y(0) = x
and dy/dt=f(y,u). • This looks like our old friend the embedded problem. • Here t and x are arbitrary. The indicate the possible
state(s) of the world after a disaster.• The notation J(t,x|τ = t) indicates that J is evaluated
conditional on τ happening at time t.
Pre Disaster Solution
• Define the risk Augmented Hamiltonian
• Then apply the maximum principle• Note: λ is here the hazard rate and µ is the
shadow price
Applying the Maximum Principle
• Find the control that maximizes the Hamiltonian and use d
• Compared to the case with exogenous risk we have an additional term. If risk is exogenous, we have that λ’(x) = 0
A closer look at dµ/dt
• As in the exogenous risk case, we have that
• We have J(x+g(x)|τ) from the post-disaster problem. But what about J(x)?
I turns out that:
• J(x) = z is given by:
• So we can rewrite dµ/dt as:
• Additional transversality condition z(T) = 0 (Why?)
A pedagogic problem
• After inserting for the optimal u, we have three differential equations in x, µ and z. Tough to do.
• In fact, I know of no application where one can find a closed form solution or even a reasonably manageable steady state.
• Numerical methods are all that is left. Oh, and general statements.
The Problem we are going to solve
,
. rate hazard with variablerandom
.00,0given, 0,..
2
1max
0
0
20
K
tx
xxuxts
dteuuET
rt
The post event solution
• Really simple. No more damage from x so post event shadow price is zero.
• This implies that u = u0 for all t > τ.
• J(x|t) = -K/r
The pre-event problem
• Hamiltonian given by
• H= α – ½(u0-u)2 + λ(u – δx) + µ×x×(– K/r - z)
• Apply maximum principle to get:
• u=argmax H = u0 + λ
• dλ/dt = (r+δ)λ + µxλ – µ(–K/r – z)
• dz = rz + ½(u0-u)2 – µ(-K/r – z)
• Transversality conditions λ(T) = z(T) = 0
Endogenous, state-space distributed risk – AKA stochastic
threshold
• We move a state variable through time. If we move in the “right” direction we may trigger a threshold effect.
Possible thresholdlocations
t
x(t)
Risky time segment
Preliminary math.
• We all know the chain rule.
• We should also know that xgxgUxFxgUxF '
tx
x
t
dxxfdttxtxf
dxxfdttxtxf
00
Let f(x) be a distribution
• If the event x = x’ is distributed f(x) over [a, b] and…• x(t) is such that x’(t) 0 for all t and Range[x(t)] =
[a, b]• Then the cdf for the event x = x’ and the event x(τ) =
x are interchangeable. The pdf for the event is distributed over time is f(x(t))x’(t).
• Regardless of whether Range[x(t)] = [a, b], the hazard rate is
txF
txtxftx
1
The hazard rate continued
• Note how the hazard rate depends on the rate of increase in x. dx/dt = 0 implies that the hazard rate is zero.
txF
txtxftx
1
Example. The hazard rate in state-space is linear
• The hazard rate is linear in state space. This is the pdf:
• Assume that x’(t) = -ax(t) + b
1 2 3 4 5
0.1
0.2
0.3
0.4
0.5
0.6
However, the hazard rate in time behaves differently
2 4 6 8 10
0.2
0.4
0.6
0.8
1 x(t)
Hazard rate as a funtion of time
t
Properties of the hazard rate
• If we move away from the threshold, the hazard rate is zero
• If we let x take values that have happened before, then the hazard rate is also zero.
0,max
1x
txF
txftx
Optimal control
• Use exactly the same conditions as with endogenous time distributed problems, but with modified hazard rate.
• Technical note: Previously the hazard rate depended only on state variables. Now it will in general also depend on control variables. (Why?) This does not matter as long as we can restrict ourselves to look at continuous controls,
The Problem we are going to solve
,
tby x geredEvent trig
,0xover ddistributelly exponentia is
x Implies . rate hazard with variablerandom
.00,0given, 0,..
2
1max
0
0
20
K
x
x
xxuxts
dteuuET
rt
The post event solution
• Same as before. No more damage from x so post event shadow price is zero.
• This implies that u = u0 for all t > τ.
• J(x|t) = -K/r
The pre-event problem
• Hamiltonian given by
• H= α – ½(u0-u)2 + λ(u – δx) +
µ× (u-δx)(– K/r - z)
• Apply maximum principle to get:
• u=argmax H = u0 + λ - µ(K/r + z)
• dλ/dt = (r+δ)λ + µ(u – δx)λ +µδ(– K/r - z)
• dz = rz + ½(u0-u)2 – µ(u – δx)(-K/r – z)
Note the difference
Hazard rate nowdirectly affectscontrol!
Derivativeof hazard rate
Optimal u from maximization
Analytics
• One can actually compute steady states here. In steady state x’(t) is zero so lots of stuff disappears. To whit:
• u= u0 + λ - µ(K/r + z)
• dλ/dt = (r+δ)λ - µδ( K/r + z)=0
• dz/dt = rz + ½(u0-u)2 = 0
• dx/dt = u – δx = 0
Steady state solution
Krr
uu
Krrux
ss
ss
2
2
2
0
2
20
Note the following: K = 0 implies x and u equal to unregulated levelsOtherwise K > 0 implies small x and u. K large enough implies x and u less than zero in steady state.
A caveat
• This solution assumed that x(0) ≤ xss. Otherwise we get negative hazard rate. What happens if we have x(0) > xss
• Answer: Freeze x at x(0) indefinitely.
Application – Disintegration of the Western Antarctic Ice
Sheet• Oppenheimer (1998) estimates that a
WAIS disintegration could increase sea levels by as much as 4-6 meters.
• Oppenheimer evaluates that there is a threshold temperature increase above pre-industrial levels where this event could occur. This threshold lies in the range 2.5° to 8° degrees Celsius.
• What does this imply for human welfare?
Consequences of WAIS disintegration
Considerable impact on real state markets.
For some areas such as Bangladesh, the picture is even grimmer
There are several GHGs that contribute to global warming. Methane (1/5) and Carbon (3/5) being the most important. Because of different retention periods in the atmosphere they must be treated separately.Let c be carbon and m be methane. Then there is a threshold defined by: ac(t) + bm(t) = A + .
The functions c and m are determined by
and
A + is the temperature threshold, is a random variable. Emissions are given by ui
Some relevant geoscience
mum mm
cuc cc
Possible paths of the system
The Main Technical Problem. How to translate the event that the threshold is crossed from a probability distribution in State-space to a probability distribution over time?
Answer: Change of variable techniques from integration theory.
We can now state society’s optimization problem as follows:
The Economy
The instantanous cost of emission reduction
The cost of crossing Wais disintegration: = G if threshold is crossed, = 0 otherwise
Steady states are calculated in the paper
miiuuK
uk iii
ii ,,2
20
0
2020, 22
max dtuuK
uuK
E mmm
ccc
uu mc
Some numerical comparative dynamics
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.80.04
0.06
0.08
0.1
0.12
0.14
0.16
0.18
0.2
CO2
CH
4
Optimal path when c = 5.4
Steady state when c = 5.4
Optimal path when c = 5
Steady state when c = 5
Optimal path when c = 4.6
Steady state when c = 4.6
Some Unpleasant Facts and Possibilities
• Historical Geophysical data suggests that the Thermohaline Circulation has been disrupted in the past.
• Scientific results suggests that this may be triggered by Global Warming.
• The consequences, although uncertain, will be a bugger. They include but are not limited to:– Regional disruptions in weather patterns– Permanent regional climate change– May trigger other global catastrophic events.
Some Science
• Global Warming will affect ocean temperatures and salinity which are the main drivers of the thermohaline circulation.
• THC disruption depends on temperature levels and the rate of change in temperature.
• If temperature levels or rates of temperature change exceed certain thresholds a shutdown may occur. The location of these thresholds are unknown.
A Simple Model of Global Warming
c u c= - d&
F ac F= - g&
c – Atmospheric carbon concentration F – Increase in global temperature averageu – CO2 emissions
If F or increase above certain thresholds, the THC collapses.
F
The Economic Problem(s)
• What is the optimal CO2 emission pattern in the presence of this risk?
• How to optimize such a system? Piecewise Deterministic Optimal Control!
• Some technical issues must be resolved before optimization can be done.– The events distributed in state space must be
transformed so that they are distributed over time.
– A distribution for the minimum of these events must be derived.
The Optimization Problem
Climate cost. 0 if THC is OK, G if THC collapses
Cost of reducing emissions below unregulated emission level u0.
0
20
2max dtuu
KEu
The Optimization Problem contnd.
Subject to the differential equations and two concurrent stochastic processes.
As long as both F and dF/dt are increasing the hazard rate is the sum of two hazard rates. When only F is increasing, only this hazard rate is active
The Hamiltonian for the present problem
First terms are the standardHamiltonian
This term is the expected loss/gain from disaster in the interval (t, t + dt)
ztcJcm
mucuuuK
H mmcc
l
|,,2
20
Optimal stopping
• We have a risky process and we wonder when to stop.
• In deterministic problems, we use H = 0. Here too.• In Finite time:
Max E{∫T(U(x,u)e-rt)dt + h(x(τ-))e-rτ + G(x(T))e-rT
subject to dx/dt=f(x,u) for all t ≠ τ
x(τ+)-x(τ-) = g(x(τ-))
τ distributed μe-μt over [0, ∞)
A quick word about the value function
• If we have a scrap value, then we have that J(T,x(T)) = Scrapvalue.
• Intuitively obvious, but needs to be pointed out
Optimal Stopping (Eksogenous Risk)
• Apply the maximum principle to the Hamiltonian:
H = U(u,x) + λf(u,x) + µ(t)(h(x)e-rt +J(x+g(x)|τ) –J(x))
• u =argmax H
• dλ/dt = rλ - ∂H/∂x λ(T) = ∂G(x(T))e-rT/∂x
• dx/dt=f(x,u)
Optimality Conditions
• The Stopping Condition
U(u,x) + λf(u,x) +∂ G(x(τ-))e-rτ/∂t
– µ(t)(h(x)e-rt +J(x+g(x)|τ) –G(x(T))e-rT)
Nice ting to know about hazard rates
• If we have two stochastic events τ1 and τ2, with hazard rates λ1 and λ2, then….
• The event τ = (τ1 , τ2) has a hazard rate
λ = λ1 + λ2.
Practical as it enables us to treat two stochastic processes as one!
Optimal harvesting of illegal drugs
• A drug producer grows marihuana. At optimally chosen time T he sells the stuff at a price q, unless…– He gets raided by the cops, in which case he
receives a punishment –c per weight unit of weed
– The local college boys finds the farm and nicks a fraction 1 – b of the drugs. The farmer then sells the remaining.
Mathematical formulation
• Equations of motion
• Maximization problem
• max E((λ1(-cx1e-rτ)+λ2bqx1e-rτ)/λ) +qx1(T)e-rT