An intensity-based approach for equity modeling

Ryerson Applied Mathematics Laboratory.

Technical Report.

Title: An intensity-based approach for equity modeling.

Author: M. Escobar, T. Friederich, M. Krayzler, L. Seco, R. Zagst.

2010: # 5

1

An intensity-based approach

for equity modeling

Authors: M. Escobar, T. Friederich, M. Krayzler, L. Seco, R. Zagst.

Abstract

This paper analyzes an intensity-based approach for equity modeling.We use the Cox-Ingersoll-Ross (CIR) process to describe the intensity of thefirm’s default process. The intensity is purposely linked to the assets of thefirm and consequently is also used to explain the equity. We examine twodifferent approaches to link assets and intensity and derive closed-form ex-pressions for the firms’ equity in both models. We use the Kalman filter toestimate the parameters of the unobservable intensity process. The applica-bility of the presented methods is demonstrated on real data working withhistorical time series of Merrill Lynch.

Key Words: Equity modeling, reduced-form model, default process, Kalman

filter.

Marcos Escobar is Assistant Professor at Ryerson University, Toronto, Canada.

Tim Friederich is a PhD student at the HVB Institute for Mathematical Finance

of the Technische Universitat Munchen, Germany.

Mikhail Krayzler is a PhD student at the HVB Institute for Mathematical Fi-

nance of the Technische Universitat Munchen, Germany.

Luis Seco is the Director of RiskLab Toronto, and Professor at the University of

Toronto, Canada.

Rudi Zagst is the Director of the HVB Institute for Mathematical Finance, and

Professor at the Technische Universitat Munchen, Germany.

2 1 Introduction

1 Introduction

In the last decades several approaches for equity modeling were proposed by aca-

demics and practitioners in the financial industry. Starting from a standard Geo-

metric Brownian Motion and the assumption of normality of log-returns as in [1],

a large number of models were developed. Some famous examples include models

capturing different volatility structures (see e.g. [2] or [3]) and models allowing

jumps in the equity price processes as in [4]. One of the characteristics of this type

of models is the fact that most of them concentrate on modeling equity returns

ignoring the possibility of bankruptcy of the underlying firm.

An alternative approach that provides a link to the underlying firm’s financial

quality can be found in the credit literature and is often referred to as a structural

model. This type of models, according to the first structural model provided by

Merton in [5], has the economic interpretation that default occurs when the value

of the assets falls below the value of the company’s debt at the time of servicing

the debt. Equity is then modeled as a call option on the underlying firm’s assets.

Thereafter, a lot of extensions were developed, such as by Black and Cox in [6]

allowing the default at any time before the debt matures, or by Finkelstein et al.

in [7] where the default threshold is modeled as a random variable.

Structural models in their turn have often been criticized for the underestimation

of credit spreads (see e.g. [8]) and the assumption of complete information, which

leads to a predictable default time [9]. To overcome these issues the so-called

reduced-form (or intensity-based) models have been developed, where default is

determined as the first jump of an exogenously given jump process. Thus, a firm’s

default time is inaccessible and driven by a default intensity, a function of some

latent state variables. However, as none of the models is perfect, the absence of

the economic and financial interpretation of the intensity-based models remains to

be their major drawback.

This paper aims to bridge the gap and combine the reduced-form model with the

Merton approach, and therefore gain from their advantages and try to eliminate

their disadvantages at the same time. In the presented setup equity is modeled as a

derivative on the firm’s assets that are purposely linked to the company’s economic

strengths with respect to its default risk. The default is triggered by the underlying

intensity process. This basic model is then extended to include an additional source

of default, which occurs when the assets fall below some predefined debt threshold.

https://www.researchgate.net/publication/4766519_Valuing_Corporate_Securities_Some_Effect_on_Bond_Indenture_Provisions?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/5217313_A_Closed-Form_Solution_for_Options_With_Stochastic_Volatility_With_Applications_to_Bond_and_Currency_Options?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/222451587_Option_Prices_When_Underlying_Stock_Returns_Are_Discontinuous?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/4766298_On_The_Pricing_of_Corporate_Debt_The_Risk_Structure_of_Interest_Rates?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/228289841_An_Examination_of_the_Classical_Theory_of_Corporate_Security_Valuation?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/228170783_Structural_Versus_Reduced_Form_Models_A_New_Information_Based_Perspective?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

3

Both models presented shed some light into an interesting fact that could be ob-

served during the recent financial crisis, the apparently high leverage and its effect.

Up to now, there are still a lot of discussions about the reasons that led to the

very severe problems for almost all market participants. It was often argued that

“the end of the investment banking”, as this period was called in several financial

magazines, was caused to a large extent by the so-called leverage effect. Due to

the nature of business former investment banks always used to have a very high

leverage, which we define in this work as a portion of debt respectively to the assets

of the company. According to companies’ annual reports this leverage was above

95% for most of the investment banks in the time of the crisis.

However, an interesting observation can be made if one compares these balance

sheets over a longer period. The leverage, though relatively high, was fluctuat-

ing around the same level throughout the decade. Thus, it becomes questionable

whether the leverage alone can explain such dramatic worsening of the firms’ credit

qualities, as its percentage with respect to the firm’s assets did not increase sub-

stantially during the crisis. To answer this question, we analyze an explanatory

effect of both our models, where in the first case the default is only triggered by an

unobservable intensity and the second approach explicitly includes the influence of

leverage as well.

This paper is organized as follows. In Section 2 we present our two general models

for pricing a firm’s equity. In the first model we do not include the debt of the firm

explicitly, whereas in the second one the payoff to the shareholders depends directly

on the company’s assets and debt. We use the theory of ordinary differential

equations to derive a solution in the first model and the mechanism of Green’s

function in the second. In Section 3 we concentrate on the numerical methods

in our analysis, especially on the Kalman Filter used to estimate the parameters

of the CIR process. Thereafter, in Section 4.1, we describe the data used in this

study. Section 4.2 is dedicated to the parameter estimation and the application

of the presented models on real data. Finally, we summarize the most important

findings in Section 5.

2 Model setup

In this section we present two models that allow us to express the equity of a

company as a derivative on its assets within a reduced-form methodology. We will

start with a simplified version and then proceed to the more general case, capturing

4 2 Model setup

the leverage effect and the complete behavior of the equity price.

For a mathematical formulation, let T ∗ be an arbitrary but finite planning horizon

and (Ω,F ,F,P) be a filtered probability space. Furthermore, let N = N(t)t≥0

be a counting process, i.e. a non-decreasing, integer-valued process with N(0) =

0 , and let the default time τD be defined as the first jump of N , i.e. τD =

inf t > 0 : N(t) > 0 . We assume that the filtration F is generated by the count-

ing process N and satisfies the usual conditions of completeness and right-continuity.

Setting F = Ftt≥0 , we assume that the counting process N = N(t)t≥0 admits

a non-negative Ft -predictable intensity λ = λ(t)t≤0 . For a detailed definition and

a proof of the existence see [10]. Following Bremaud et al. [10] and Duffie et al.

[11], the conditional default probability PD (t, T |Ft) for the maturity time T at

time t is given by

PD (t, T |Ft) = 1− IEP

[e−

∫ Tt λ(s)ds

∣∣∣Ft] . (1)

The corresponding conditional survival probability

PS (t, T |Ft) = P (N(T ) = 0|Ft) = P (τD > t|Ft)

is given by

PS (t, T |Ft) = IEP

[e−

∫ Tt λ(s)ds

∣∣∣Ft] .We assume the CIR process as in [12] for the intensity time series

dλ(t) = α(β − λ(t))dt+ σ√λ(t)dW (t) (2)

and that the interest rate r is constant. The choice of the CIR process to describe

the behavior of the default intensity was motivated by the fact that it leads to a

so-called affine model. One of the most important features of these models that

have been thoroughly investigated in the literature (see for example [13] or [14]) is

the existence of the analytical formulas for the default probabilities. This fact helps

to avoid time-consuming simulations and thus simplifies the calibration procedure.

Having originally been proposed by Cox, Ingersoll and Ross in [12] to describe the

short-rate dynamics, the square-root process (CIR process) is nowadays often used

in the reduced-form models (see e.g. [15]). The square root term in the volatil-

ity part assures positive intensity given that the parameters satisfy the condition

2αβ ≥ σ2 . Furthermore, the possibility of direct sampling from the non-central

https://www.researchgate.net/publication/4895408_A_Theory_of_The_Term_Structure_of_Interest_Rates?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/4895408_A_Theory_of_The_Term_Structure_of_Interest_Rates?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/238363637_Transform_analysis_and_asset_pricing_for_ane_jump-diusions?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/226786879_On_Cox_Processes_and_Credit_Risky_Securities?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/243785221_Point_Processes_and_Queues_-_Martingale_Dynamics?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/243785221_Point_Processes_and_Queues_-_Martingale_Dynamics?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/38340088_Affine_Processes_and_Applications_in_Finance?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

https://www.researchgate.net/publication/38340010_Recursive_Valuation_of_Defaultable_Securities_and_the_Timing_of_Resolution_of_Uncertainty?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

5

chi-squared distribution without the need of discretization makes the CIR model

even more appealing.

For the pricing of financial derivatives, we furthermore assume the existence of a

risk-neutral probability measure Q . We introduce the risk premium as γ(t) =

γσ√λ(t) and change the Wiener process via dW (t) = dW (t) + γ(t)dt as well as

the measure from IP to Q using the Radon-Nikodym derivative

dQdP

∣∣∣Ft = exp

(−∫ t

0

γ(s)dW (s)− 1

2

∫ t

0

γ2(s)ds

).

Thus, the intensity dynamics under Q can be described by

dλ(t) = (αβ − λ(t)(α + γσ2))dt+ σ√λ(t)dW (t).

Denoting η = γσ2 , we get

dλ(t) = (αβ − λ(t)(α + η))dt+ σ√λ(t)dW (t). (3)

Hence, besides the real-world parameters (α, β, σ) one additional parameter η has

to be estimated. Equation (3) still represents the CIR process and can be rewritten

in more natural form:

dλ(t) = α(β − λ(t))dt+ σ√λ(t)dW (t), (4)

where α = α+η and β = αβα+η

are the risk-neutral parameters of the CIR process.

We treat the firm’s equity as a derivative on the value A of the firms’ assets with

payoff f (A(T )) at time T and some well-defined function f . Expectations under

the risk-neutral measure are denoted by E , i.e. we skip the subscript Q of EQ .

Thus, we conclude (see, e.g. [13]):

S(t) = E[e−r(T−t)1τD>Tf(A(T ))|Ft] = e−r(T−t)E[e−∫ Tt λ(s)dsf(A(T ))|Ft]. (5)

To specify the model, we link the expression f(A(T )) to the underlying intensity.

We differentiate between two models: one with and one without leverage constraint.


6 2 Model setup

2.1 Model without leverage constraint

For simplification purposes we first assume

f(A(T )) = A(T ) = a · e−bλ(T ),

where a and b are two positive constants, representing the fact that the higher

the intensity (the higher default probability) the lower the value of the assets.

In this model we do not explicitly take the firm’s liabilities into account. We

apply the framework of Duffie et al. [13], to derive the corresponding closed-form

expressions for the necessary expectations. Setting τ = T − t , we rewrite the

equations provided in [13] in our notation and evaluate for the firm’s equity by

S(t) = a · eθ0(τ)+θ1(τ)·λ(t) · e−rτ (6)

where θ0(τ) and θ1(τ) satisfy the following ODEs:

θ1(τ) = −1− αθ1 +1

2θ2

1σ2,

θ0(τ) = αβθ1,

with boundary conditions θ1(0) = −b and θ0(0) = 0 . Solving this system of

differential equations we get:

θ0(τ) = αβ · α− hσ2

τ − 2αβ

σ2· ln(c+ e−hτ

c+ 1

)(7)

and

θ1(τ) =cehτ (α− h) + (α + h)

σ2(cehτ + 1), (8)

where

h =√α2 + 2σ2 (9)

and

c =h+ σ2b+ α

h− σ2b− α. (10)

We furthermore show in Appendix 6.1 that these formulas represent the generalized

case of a well-known pricing formula for zero-coupon bonds when the short-term

interest rate follows a CIR process.



2.2 Model with leverage constraint 7

Using Equation (6), the firm’s equity and its assets have the following reverse

dependence:

λ(t) =rτ + ln(S(t)/a)− θ0(τ)

θ1(τ).

Another important result can be perceived from Equations (5) and (6). We let

a = 1, b = 0 and assume no discounting factor. Then, the closed-form solution for

the risk-neutral probability of default (compare with (1)) can be derived via

PD(t, T ) = 1− eθ0(τ)+θ1(τ)·λ(t), (11)

where θ0(τ) and θ1(τ) can be calculated as above but with c defined as:

c =h+ α

h− α.

Corresponding real-world probabilities are given by the same formula where we

take the real-world parameters instead of the risk-neutral ones.

2.2 Model with leverage constraint

In this section we extend the previous model by introducing the leverage constraint

X . We define f(A(T )) by

f (A(T )) = max (A(T )−X, 0) ,

where

A(T ) = a · e−λ(T )b

as in the first model corresponds to the assets of the company and X to the

company’s debt that is assumed to be constant. The structure of this payoff has a

key financial implication. It mixes a Merton style structural payoff with a reduced

form default scenario. Therefore, it implies that equity prices are affected not only

by an intermediate default, which could be defined exogenously, but also in the

event that assets fall below the debt level at the time of servicing the debt. Hence,

8 2 Model setup

we can calculate the firm’s equity by taking the following expectation

S(t) = e−rτE[e−∫ Tt λ(s)dsmax(ae−λ(T )b −X, 0)|Ft] (12)

Using Green’s function, G(λ, t, λ′, T ) , we derive the closed-form solution for the

firm’s equity defined above. The explicit form of Green’s function for the CIR

process, as presented in [16], is

GCIR(λ, t, λ′, T ) = P (λ, τ)2p(τ)H(2p(τ)λ′|2k + 2, f(λ, τ)), (13)

with

τ = T − t,

h =2αβ

σ2− 1,

p(τ) =d2(τ)

σ2d1(τ),

f(λ, τ) =8h2ehτλ

σ2d2(τ)d1(τ),

P (λ, τ) = B1(τ)e−B2(τ)λ,

B1(τ) =

(2he(α+h) τ

2

d2(τ)

) 2αβ

σ2

, (14)

B2(τ) =2d1(τ)

d2(τ),

h =√α2 + 2σ2,

d1(τ) = ehτ − 1,

d2(τ) = (h+ α)ehτ + (h− α),

H(x|v, w) =1

2

( xw

)(v−2)/4

e−(w+x)/2I(v−2)/2(√wx)

where Iy(·) denotes the modified Bessel function of the first kind of order y .

Given this definition of the Green’s function, Equation (12) can be represented via

2.2 Model with leverage constraint 9

two integrals over the Green’s function and f(A(T )) :

S(t) = e−rτ∫ λ∗

0

GCIR(λ, t, λ′, T )ae−λ′bdλ′

−e−rτX∫ λ∗

0

GCIR(λ, t, λ′, T )dλ′

= e−rτ (I1 − I2) .

The integral limit λ∗ is defined such that ae−λ∗b − X = 0 , i.e. λ∗ = 1

bln(aX

).

Substituting (13) in the expression for the second integral, we get

I2 = X

∫ λ∗

0

P (λ, τ)2p(τ)H(2p(τ)λ′|2k + 2, f(λ, τ))dλ′

= XP (λ, τ)

∫ 2p(τ)λ∗

0

H(y|2k + 2, f(λ, τ))dy

= XP (λ, τ)χ2(2p(τ)λ∗|2k + 2, f(λ, τ))

where we use the fact that H(x|v, w) is the density function of a non-central

Chi-square distribution χ2(x|v, w) . Analogously, for the first integral we have

I1 = aP (λ, τ)

∫ λ∗

0

2p(τ)H(2p(τ)λ′|2k + 2, f(λ, τ))e−λ′bdλ′

= aP (λ, τ)

∫ 2p(τ)λ∗

0

H(y|2k + 2, f(λ, τ))e−yb

2p(τ)dy

= aP (λ, τ)

∫ 2p(τ)λ∗

0

1

2

(y

f(λ, τ)

)k/2e−

f(λ,τ)2− y

2p(τ)+bp(τ) Ik(

√f(λ, τ)y)dy

Setting m(λ, τ) = f(λ, τ) · p(τ)p(τ)+b

and l = y · p(τ)+bp(τ)

, we get

I1 = aP (λ, τ)

(p(τ)

p(τ) + b

)k+1

e−f(λ,τ)

2b

p(τ)+b

∫ 2(p(τ)+b)λ∗

0

1

2

(l

m

)k/2e−

m+l2 Ik(

√ml)dl

= aP (λ, τ)

(p(τ)

p(τ) + b

)k+1

e−f(λ,τ)

2b

p(τ)+bχ2(2(p(τ) + b)λ∗|2k + 2,m(λ, τ)).

10 3 Procedure

Summarizing our findings, the firm’s equity is given by

S(t) = e−rτP (λ, τ)

(a

(p(τ)

b+ p(τ)

)k+1

e−f2

bp(τ)+bχ2(2(p(τ) + b)λ∗|2k + 2,m(λ, τ))

−Xχ2(2p(τ)λ∗|2k + 2, f(λ, τ))

)(15)

with k = 2αβσ2 − 1 and λ∗ = 1

bln(aX

).

The model derived above also includes the first model (without leverage constraint).

To see this, let X = 0 and receive f(A(T )) = a ·e−bλ(T ) . The closed-form solution

of the second model then simplifies to

S(t) = a · e−rτP (λ, τ)

(p(τ)

b+ p(τ)

)k+1

e−f2

bp(τ)+b . (16)

Here, we used the fact that λ∗ tends towards infinity when X → 0 and that

the probability function of the Chi-square distribution equals 1 when the argu-

ment reaches infinity. In Appendix 6.2 we show that for this case both analytical

expressions for the first and the second model coincide.

3 Procedure

In the previous section we have shown how we can model the firm’s equity within

a reduced-form model. The remaining question is how to estimate the unknown

parameters including the three coefficients from the CIR process, α, β and σ , as

well as the coefficients a and b that we use to link the assets with the intensity.

Hereby, we assume the leverage level to be known and constant. For this estimation

we use an extended Kalman filter. We first formulate the state-space problem where

the equity time series is an observable process and the underlying intensity is an

unobservable state process that should be estimated. We base our derivation on

the state-space approach described by Geyer and Pichler in [17].

We use real-world observations of the equity process which is why we describe the

intensity dynamics under the real-world measure P . However, the pricing should

be done under the risk-neutral measure Q . Hence, besides the real-world param-

eters (α, β, σ) one additional parameter η must be estimated. To summarize, we

https://www.researchgate.net/publication/227345812_A_State-Space_Approach_to_Estimate_and_Test_Multifactor_Cox-Ingersoll-Ross_Models_of_the_Term_Structure?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

11

will use the real-world measure P and Equation (2) in the prediction step of the

Kalman filter and the risk-neutral probability measure Q and Equation (3) or (4)

for the correction step. Since the use of the CIR process leads to a non-Gaussian

model, the standard Kalman filter is no longer optimal and cannot be applied in

its original form. To overcome this issue we replace the exact transition density

(non-central χ2 ) by a normal density (as in [17]):

λt|λt−1 ∼ N(µt, Qt).

We define µt and Qt within a moment matching method, so that the first two

moments of the exact and approximated density function coincide:

µt = β(1− e−α∆t) + e−α∆tλt−1

and

Qt = σ2 1− e−α∆t

α

[β

1− e−α∆t

2+ e−α∆tλt−1

]+ e−2α∆tQt−1.

Both equations help to construct the so-called prediction step. The value of the

unobservable factor λ at time t based on its value at time t− 1 is given by

λt|t−1 = β(1− e−α∆t) + e−α∆tλt−1|t−1,

with ∆t representing the time step between t− 1 and t . Analogously, we get for

the variance:

Qt|t−1 = σ2 1− e−α∆t

α

[β

1− e−α∆t

2+ e−α∆tλt−1|t−1

]+ e−2α∆tQt−1|t−1.

Observed and unobserved variables are linked by the so-called observation density

p(yt|λt) that can be described as a measurement equation

yt = At +Btλt + εt, εt ∼ N(0, q) (17)

Thus, the expected values for the state space variables conditional on the informa-

tion at time t− 1 are calculated as

yt = At +Btλt|t−1.

The formulas for these two coefficients At and Bt can be derived from the pricing

formulas for the firm equity, an observable process in our case, as will be shown in

https://www.researchgate.net/publication/227345812_A_State-Space_Approach_to_Estimate_and_Test_Multifactor_Cox-Ingersoll-Ross_Models_of_the_Term_Structure?el=1_x_8&enrichId=rgreq-b137edc6-2336-4e50-ac34-a38c773ea45b&enrichSource=Y292ZXJQYWdlOzIyNzY5MDU5ODtBUzo5ODU1MzcxMjA4NzA1OEAxNDAwNTA4NDQ0OTE0

12 3 Procedure

the sequel.

Model without leverage constraint. In the first model we use Equation (6)

to find the value of the equity when the intensity is given. We can transform this

equation so that we get an observable process linearly depending on the intensity.

We take the logarithm of both sides and get

ln(S(t)) = ln(a) + θ0(τ)− rτ + θ1(τ)λ(t),

where θ0 and θ1 depend on the risk-neutral parameters α , β and σ . Hence,

in terms of the parameters of the measurement Equation (17), yt = ln(S(t)) ,

At = ln(a) + θ0(τ)− rτ and Bt = θ1(τ) .

Model with leverage constraint. For the second model the whole procedure

is slightly more complicated because of the non-linear dependence between the

observable and the hidden process. For this purpose we apply an extended Kalman

filter and linearize Equation (15) by means of a first-order Taylor approximation

around the best prediction λt|t−1 :

S(t, λ(t)) = S(t, λt|t−1) +∂

∂λ(t)S(t, λ(t))|λ(t)=λt|t−1

· (λ(t)− λt|t−1). (18)

Comparing Equation (18) and the measurement Equation (17), we define At and

Bt as

At = S(t, λt|t−1)− ∂


and

Bt =∂


.

Thus, in both models we can calculate the state space variables yt based on the

predicted values and formulas derived above. In the next step we can use an

additional information provided by the observed variables yt via the so-called

correction step and update our estimations for λ(t) .

λt|t = max(λt|t−1 +K · [yt − yt], 0) (19)

13

and for the variance

Qt|t = [1−KBt]Qt|t−1,

where K is the so-called Kalman gain matrix which is given as

K =Qt|t−1Bt

Qt|t−1B2t + q

.

The update Equation (19) of the state estimate for the CIR model differs from the

standard Kalman filter because of the non-negativity condition for the underlying

process.

Finally, we calculate the quasi log-likelihood function for each parameter set

(α, β, σ, η, a, b, q) for both models:

logL = −1

2(N − 1)log(2π)− 1

2

N∑t=2

log|Ft| −1

2

N∑t=2

v2t

Ft,

with Ft = B2tQt + q and vt = yt − yt representing the error term. This function

is further maximized with respect to the unknown parameters.

4 Empirical results

As an application of the models presented in the previous sections we’ve chosen one

of the former investment banks, Merrill Lynch. For a very long period the company

was one of the leading and most renowned banks on Wall Street. However, due to

the sub-prime crises, as many other banks, Merrill Lynch had enormous losses, in

total about 40 billion USD. The company was even among the three banks that

suffered most from the financial crisis. As a result, in September 2008, Merrill

Lynch was taken over by the Bank of America. This acquisition seemed to be the

only possibility for Merrill Lynch not to follow the tragic fate of Lehman Brothers

which defaulted almost the same day.

14 4 Empirical results

4.1 Data

For our analysis we use the equity time series of Merrill Lynch as provided by

Bloomberg in the time span from September 2001 to September 2008. Figure 1

shows the equity values of Merrill Lynch in the period from September 2001 to

September 2008. Until the beginning of 2007 there were almost no signs for the

coming financial market crisis and the company’s equity was fluctuating around a

positive trend. Since then we can see the negative dynamics in the firm’s value

that became very strong after May 2007.

40

60

80

100

120

0

20

40

60

80

100

120

ML Equity

Figure 1: Equity time series of Merrill Lynch (September 2001 - September 2008).

The examined period consists of 1770 daily observations. Moreover, we assume that

the average maturity of Merrill Lynch’s debt equals T = 5 years. We also assume a

flat interest rate at the level of 4% , which corresponds to the average of 5-year US

Government interest rates in the considered time span. Furthermore, we scale the

observable equity time series by dividing it by the initial asset value A(0) . For the

second model we assume a constant leverage of X = D(0)/A(0) . This information

was extracted from the consolidated financial data of Merrill Lynch, as presented

in Table 11 which shows some main figures characterizing the creditworthiness of

the company and its overall economic and financial conditions for the time period

between 2001 and 2008. This table justifies our assumption of constant leverage2

1 Extracted from annual reports of Merrill Lynch for the corresponding years.2 Fluctuations in leverage are less than 3% .

4.2 Application of the model 15

which we set to the initial value X = 0.95 (which also corresponds to the average

level in the examined period).

Main figure 2001 2002 2003 2004 2005 2006 2007 2008(in bn USD)EBIT 1.38 3.76 5.65 5.84 7.23 10.43 -12.83 -41.83Net Revenues 21.88 18.61 20.15 22.02 26.01 34.66 11.25 -12.59Total Assets 419.42 447.93 494.52 648.06 681.02 841.30 1,020.05 667.54Total Liabilities 396.72 422.40 464.20 616.69 645.42 802.26 988.12 647.54Stockholders’ Equity 20.01 22.88 27.65 31.37 35.60 39.04 31.93 20.00Leverage 0.95 0.94 0.94 0.95 0.95 0.95 0.97 0.97

Table 1: Consolidated financial data of Merrill Lynch for 2001-2008

4.2 Application of the model

The equity time series in our model without leverage constraint is calculated via

(6) with the following parameter set Θ1 = (α, β, σ, η, a, b) . Estimating these pa-

rameters via the Kalman filter method described in Section 3, we get

α = 0.3143

β = 0.0126

σ = 0.0560

a = 6.9968

b = 506.0191

η = −0.1096.

We use the same technique of the Kalman filter for the model with leverage con-

straint X that is set to the constant value of 0.95 and get the following parameters

α = 0.2585

β = 0.0140

σ = 0.0505

a = 4.5633

b = 336.8172

η = −0.0108.

One of the main advantages of the Kalman filter is that, additional to parameter

estimations, it gives the values of the unknown state process, i.e. the intensity time

16 4 Empirical results

series in our case. Knowing the intensity and the corresponding parameters, it is

possible to calculate the default probabilities by applying Equation (11). We use

both parameter sets for our models and the corresponding intensity time series to

calculate the default probabilities and compare them in Figure 2. We see similar

evolutions of both time series. However, the default probabilities derived in the

second model are slightly higher.

3.0%

4.0%

5.0%

6.0%

7.0%

8.0%

9.0%

10.0%

0.0%

1.0%

2.0%

3.0%

4.0%

5.0%

6.0%

7.0%

8.0%

9.0%

10.0%

First Model Second Model

Figure 2: 5-year default probabilities derived in both models.

Inappropriately high leverage that major investment banks ran in the last couple of

years was often named as one of the major reasons of subsequent default as in the

case of Lehman Brothers or fire sales as of Merrill Lynch and Bear Sterns. However,

if one just looks at the balance sheet information as in Table 1, there was actually

no acute reason for that: though the leverage ratio was quite high, it remained

almost constant over the entire examined period starting from 2001. At the same

time we can observe changing default probabilities (see Figure 2) and even their

triplication from beginning of 2007 to September 2008 when Lehman collapsed (on

both models). The reason for this at first glance odd phenomenon can be seen

in the fact that these changes in the firm’s creditworthiness cannot be attributed

to the leverage effect only. Some other economic factors might play an even more

essential role and should necessarily be taken into consideration. In the presented

modeling approach an unobservable intensity accounts for this exogenous effect.

The resulting relatively small differences between default probabilities implied by

both models can be seen as an evidence of this major impact of the unobservable

default intensity process on the firm’s credit quality.

4.2 Application of the model 17

We furthermore want to compare the empirical distribution of Merrill Lynch equity

returns with those implied by our models. For this purpose we use the parameter

sets determined above, simulate the underlying intensity process and transform

it to an equity time series. On Figure 3 the histogram presents the empirical

density function of Merrill Lynch returns as well as the density functions from

our simulation with 100 scenarios with 1770 time steps for the first (light grey

line - without leverage constraint) and the second (dark grey line - with leverage

constraint) model.

10

15

20

25

30

35

40

0

5

10

15

20

25

30

35

40

Hisorical First Model Second Model Normal Distribution

Figure 3: Empirical histogram and model density functions.

As we can see from the graph, simulated distributions in both cases are very

similar to the observed distribution. This fact can be verified via a two-sample

Kolmogorov-Smirnov test as described in [18]. The null hypothesis in our case

is that empirical log-returns of Merrill Lynch and simulated returns are samples

from the same continuous distribution. We test this hypothesis at the 1% and 5%

significance level. The corresponding statistics can be found in Table 2. The first

column (ksstat) represents the value of the Kolmogorov-Smirnov statistic itself,

which is the maximum difference between two cumulative distribution functions.

The second column gives the p-value for the corresponding test. In order to reject

the null hypothesis the p-value should be below some predefined confidence level.

We can conclude from Table 2 that in both models the null hypothesis can not be

rejected at the 1% significance level. At the same time, we reject the first model

at the 5% significance level.

18 5 Conclusion

Model ksstat pvalue 1% 5%First Model 0.0358 0.0217 not rejected rejectedSecond Model 0.0304 0.0766 not rejected not rejectedNormal Distribution 0.122 0 rejected rejected

Table 2: Kolmogorov-Smirnov Test

It is often claimed that equity returns are normally distributed. Thus, in order to

test this assumption, we add the normal distribution to our comparison analysis.

For this purpose we calculate mean and standard deviation of the Merrill Lynch

equity log-returns and then plot the density function of the corresponding normal

distribution. As before, the fitting quality can be analyzed on the histogram (see

the dotted line on Figure 3) and via Kolmogorov-Smirnov test. This time we

sample from the normal distribution with a given mean and standard deviation.

The corresponding test results are presented in the last row of Table 2. We can

observe that the value of the test statistic (ksstat column) doubled in comparison

with distributions provided by our models. In this case the corresponding null

hypothesis (normal distribution of the Merrill Lynch log-returns) can be rejected

at the 5% and 1% significance level.

5 Conclusion

In this paper we proposed two modeling approaches for the firm’s equity within an

intensity setup. In both cases, the company’s equity is treated as a credit derivative

with two different payoff structures - with and without leverage constraint. For the

first model, the payment of the derivative only depends on the corresponding value

of the intensity. The second model takes into account the liabilities of the firm by

including them into the payoff function. For both cases we were able to obtain

analytical formulas based on the paper of Duffie et al. [13] for the first model and

Buettler, Waldvogel [16] for the second one. We showed that the second model is a

generalization of the first one and thus we provide another mechanism for deriving

a closed-form solution for this type of affine models - Green’s function instead of

ODEs.

We used the Cox-Ingersoll-Ross process to describe the stochastic dynamics of

the intensity. However, other common modeling frameworks such as Vasicek [19]

or Hull-White [20] and even more general affine models are also applicable and

represent possible directions for further research. At this point one should bear in

19

mind, that some of these models might lead to negative intensities, which are not

accessible in our approach. Furthermore, the closed-form solution in case of a more

complicated model, allowing jumps for example, might not be achieved anymore.

This becomes a particularly complicated task for the second modeling approach,

where an expectation of a maximum function should be calculated explicitly. At

this stage we refer the Duffie’s paper mentioned above and his joint work with

Filipovic and Schachermayer [14] for more detailed information on the theory of

affine models.

Another way to generalize our approach and overcome the limitation of modeling

the firm’s assets depending only on the default intensity process is to include

some additional stochastic factor. This would lead to the two factor model, where

one factor accounts for the idiosyncratic part of the assets volatility and other

represents the external influence of the financial market in general. This paper can

be seen as a first step in the direction of such a generalized model that can better

capture the firm’s equity behavior and its credit quality.

We applied both presented methods to the case of Merrill Lynch and used its

equity time series to fit the parameters of the model. The estimation procedure is

based on the extended Kalman filter. We then were able to calculate the default

probabilities, which provide investors with additional insight on the firm’s credit

quality. This approach is especially valuable for those companies that do not have

liquid CDS or bond spreads and the only source of information is the equity time

series. Furthermore, using estimated parameters we ran a simulation with both

our models in order to compare historical equity returns with those implied by

the models. We observed that the resulting distributions are much closer to the

historical distribution than a normal one. This conclusion is also supported by the

Kolmogorov-Smirnov test that we performed at the end.

6 Appendix

6.1 Note on the model without constraint

It is well known that the value of a zero-coupon bond with stochastic interest rate

r is given by

ZB(t, T ) = E[e−

∫ Tt r(s)ds|Ft

].

20 6 Appendix

Moreover, there is a closed-form solution (see e.g. [21]) when the underlying stochas-

tic process follows a CIR process:

ZB(t, T ) = A(τ)e−B(τ)r(t),

where τ = T − t ,

A(τ) =

(2h e0.5(α+h)τ

2h+ (α + h)(ehτ − 1)

) 2αβ

σ2

and

B(τ) =2(ehτ − 1)

2h+ (α + h)(ehτ − 1). (20)

It can be shown that these formulas are a particular case of those derived in Section

2.1. There we got:

E[e−∫ Tt λ(s)dse−bλ(T )|Ft] = eθ0(τ)+θ1(τ)·λ(t), (21)

where θ0 and θ1 were defined in Equations (7) and (8). Setting b = 0 and

replacing λ by r , we receive:

θ1(τ) =c(α− h) + (α + h)e−hτ

σ2(c+ e−hτ ), (22)

where

c =h+ α

h− α(23)

and

h =√α2 + 2σ2.

Substituting (23) in (22) results in:

θ1(τ) =(α + h)(1− e−hτ )(α− h)

σ2(α + h+ e−hτh− e−hτ α)

which after some transformations and using the fact that

α2 − h2 = −2σ2

6.2 Note on the model with constraint 21

leads to:

θ1(τ) = − 2(ehτ − 1)

2h+ (α + h)(ehτ − 1).

The expression above coincides with −B(τ) from (20). Analogously, for θ0(τ) ,

we first calculate the expression

eθ0 = eαβα−hσ2 τ

(c+ 1

c+ e−hτ

) 2αβ

σ2

,

which after some simple transformations gives

eθ0 =

((c+ 1)e0.5(α+h)τ

cehτ + 1

) 2αβ

σ2

.

Substituting

c =h+ α

h− αleads us to

eθ0 =

(2he0.5(α+h)τ

2h+ (α + h)(ehτ − 1)

) 2αβ

σ2

.

Hence, eθ0(τ) = A(τ) and, consequently, Equation (20) is a particular case of the

generalized Equation (21).

6.2 Note on the model with constraint

As we have already mentioned in the Section 2, the closed-form solution from

the second model with leverage constraint represents a generalized version of the

first model. In the following we show that both analytical formulas coincide when

X = 0 . We recall the closed-form solution for the second model - Equation (16):

S(t) = a · e−rτP (λ, τ)

(p

b+ p

)k+1

e−f2

bp+b .

22 6 Appendix

We rewrite this equation using Equation set (14) in order to combine all con-

stituents with and without λ :

S(t) = ae−rτQ0(τ)e−Q1(τ)λ,

where

Q0(τ) =

(2he(α+h)τ/2

d2

) 2αβ

σ2(

p

p+ b

) 2αβ

σ2

and

Q1(τ) =4h2ehτ

σ2d2d1

b

p+ b+

2d1

d2

For the first model without leverage constraint (Equation (6)) we have

S(t) = ae−rτ θ0(τ)e−θ1(τ)λ,

where, according to Equation (7),

θ0(τ) = eθ0 =

((c+ 1)e0.5(α+h)τ

cehτ + 1

) 2αβ

σ2

,

and, according to Equation (8),

θ1(τ) = −cehτ (α− h) + (α + h)

σ2(cehτ + 1).

In order to verify the equivalence of both equations for S(t) , we compare the

parts including λ and not-including λ . We start with the terms without λ .

Substituting p = d2σ2d1

, we rewrite the expression for Q0 as

Q0(τ) =

(2he0.5(α+h)τ

d2

· d2

σ2d1b+ d2

) 2αβ

σ2

=

(2he0.5(α+h)τ

σ2d1b+ d2

) 2αβ

σ2

.

6.2 Note on the model with constraint 23

Using d1 = ehτ − 1 , d2 = (h + α)ehτ + (h − α) , and c = h+σ2b+αh−σ2b−α we further

conclude that

Q0(τ) =

(2he0.5(α+h)τ

σ2d1b+ d2

) 2αβ

σ2

=

(2he0.5(α+h)τ

(h+ α)ehτ + (h− α) + σ2b(ehτ − 1)

) 2αβ

σ2

=

(2he0.5(α+h)τ

(h+ σ2b+ α)ehτ + (h− σ2b− α)

) 2αβ

σ2

=

((c+ 1)e0.5(α+h)τ

cehτ + 1

) 2αβ

σ2

= θ0(τ).

In the next step we compare the coefficients of λ : θ0 and Q1 . We start again

from the one of the second model and use the fact, that p(τ) = d2σ2d1

:

Q1(τ) =4h2ehτ

σ2d2d1

b

p+ b+

2d1

d2

=4h2ehτ

d2

b

d2 + σ2d1b+

2d1

d2

(24)

=4h2ehτb+ 2d2

1σ2b+ 2d1d2

d2(d2 + σ2d1b).

Using Equations (9) and (10) we can also simplify the expression for θ1 :

24 References

θ1(τ) = −cehτ (α− h) + (α + h)

σ2(cehτ + 1)

= −(h+ σ2b+ α)ehτ (α− h) + (α + h)(h− σ2b− α)

σ2((h+ σ2b+ α)ehτ + (h− σ2b− α))

= −σ2b(ehτ (α− h)− (α + h))− (h2 − α2)ehτ + (h2 − α2)

σ2((h+ σ2b+ α)ehτ + (h− σ2b− α))

= − b(ehτ (α− h)− (α + h))− 2ehτ + 2

ehτ (h+ α) + (h− α) + σ2b(ehτ − 1)

=−b(ehτ (α− h)− (α + h)) + 2d1

d2 + σ2d1b

=−b(ehτ (α− h)− (α + h))d2 + 2d1d2

d2(d2 + σ2d1b).

Now, using Equation set (14), we conclude

−(ehτ (α− h)− (α + h))d2 = (ehτ (h− α) + (h+ α))((ehτ (h+ α) + (h− α))

= e2hτ (h2 − α2) + (h+ α)2ehτ + ehτ (h− α)2 + (h2 − α2)

= 2e2hτσ2 + 4α2ehτ + 4σ2ehτ + 2σ2

= 4α2ehτ + 8σ2ehτ + 2e2hτσ2 − 4σ2ehτ + 2σ2

= 4(α2 + 2σ2)ehτ + 2(ehτ − 1)2σ2

= 4h2ehτ + 2d21σ

2

and thus Q1(τ) = θ1(τ) , i.e. we have shown that both expression for the firm’s

equity coincide, and consequently, the first model represents a particular case of

the second model, when the leverage constraint is set to zero.

References

[1] Black F, Scholes M. The pricing of options and corporate liabilities. Journal

of Political Economy 1973; 81(3): 637:654. DOI: 10.1086/260062.

[2] Heston S. A closed-form solution for options with stochastic volatility with

applications to bond and currency options. The Review of Financial Studies

1993; 6(2): 327-343.

References 25

[3] Derman E, Kani I. The volatility smile and its impliead tree. Working Paper,

Goldman Sachs January 1994.

[4] Merton RC. Option pricing when underlying stock returns are discountinu-

ous. Journal of Financial Economics 1976; 3: 125-144.

[5] Merton RC. On the pricing of corporate debt: the risk structure of interest

rates. Journal of Finance 1974; 29(2): 449-470. DOI: 10.2307/2325877.

[6] Black F, Cox JC. Valuing corporate securities: some effects of bond indenture

provisions. Journal of Finance 1976; 31(2): 351-367.

[7] Finkelstein V, Pan G, Lardy J, Ta T, and Tierney J. CreditGrades Technical

Document, RiskMetrics Group, 2001.

[8] Lyden S, Saraniti D. An empirical examination of the classical theory of

corporate security valuation. Research Paper, Barclays Global Investors.

[9] Jarrow R, Protter P. Structural versus reduced form models: a new informa-

tion based perspective. Working Paper, Cornell University 2004.

[10] Bremaud P. Point processes and queues: martingale dynamics, Springer:

Berlin, 1981; 354.

[11] Duffie D, Schroder M, and Skiadas C. Recursive valuation of defaultable

securities and the timing of resolution of uncertainty. The Annals of Applied

Probability 1996; 6(4): 1075-1090. DOI: 10.1214/aoap/1035463324.

[12] Cox JC, Ingersoll JE, and Ross SA. A theory of the term structure of interest

rates. Econometrica 1985; 53(2): 385-407. DOI:10.2307/1911242.

[13] Duffie D, Pan J, and Singleton K. Transform analysis and asset pricing

for affine jump-diffusions. Econometrica 2000; 68(6): 1343-1376. DOI:

10.1111/1468-0262.00164.

[14] Duffie D, Filipovic D, Schachermayer W. Affine processes and applications

in finance. Annals of Applied Probability 2003; 13: 984-1053.

[15] Lando D. On Cox Processes and Credit Risky Securities. Review of Deriva-

tives Research 1998; 2: 99-102.

[16] Buettler HJ, Waldvogel J. Pricing callable bonds by means of green’s

function. Mathematical Finance 1996; 6(1): 53-88. DOI: 10.1111/j.1467-

9965.1996.tb00112.x.

26 References

[17] Geyer A, Pichler S. A state-space approach to estimate and test multifac-

tor Cox-Ingersoll-Ross models of the term structure. Journal of Financial

Research 1999; 22(1): 107-130.

[18] Massey FJ. The Kolmogorov-Smirnov test for goodness of fit. Jour-

nal of the American Statistical Association 1951; 46(253): 68-78. DOI:

10.2307/2286009.

[19] Vasicek O. An equilibrium characterization of the term structure. Journal of

Financial Economics 1977; 5(2): 177-188. DOI: 10.1234/12345678.

[20] Hull J, White A. Pricing interest-rate derivative securities. The Review of

Financial Studies 1990; 3(4): 573-592. DOI: 10.1234/12345678.

[21] Brigo D, Alfonsi A. Credit Default Swap calibration and derivatives pricing

with the SSRD stochastic intensity model. Finance and Stochastic 2005,

9(1): 29-42. DOI: 10.1007/s00780-004-0131-x.

Date post:	26-Nov-2023
Category:	Documents
Upload:	independent
View:	0 times
Download:	0 times

An intensity-based approach for equity modeling

Documents