Numerical Solution of Fractional PDEs

Numerical Solution of Fractional PDEs —Beijing Computational Science Research Center,

November 2015

William McLeanThe University of New South Wales

Updated November 27, 2015

Part I

Fractional integrals and derivatives

Introduction

This lecture provides some key definitions and results fromfractional calculus, needed for our study of fractional PDEs. Theliterature contains several concepts of fractional differentiation, butwe focus only on the Riemann–Liouville and Caputo definitions,with a brief mention of the Grunwald–Letnikov approach.

In the sequel, we work almost exclusively with theRiemann–Liouville fractional integral and derivative.

Outline

Fractional integration

Fractional differentiation

Grunwald–Letnikov definition

Fractional integration

Motivation: consider the n-fold integration operator Ina+ basedat a, defined recursively by

I0a+f (x) = f (x)

and

Ina+f (x) =

∫ x

aIn−1f (y) dy for n ≥ 1.

We claim

Ina+f (x) =

∫ x

a

(x − y)n−1

(n − 1)!f (y) dy for n ≥ 1.

The formula holds for n = 1 because (x − y)0/0! = 1 and

I1a+f (x) =

∫ x

af (y) dy .

Easy proof by induction on n

Let n ≥ 1 and assume

Ina+f (x) =

∫ x

a

(x − y)n−1

(n − 1)!f (y) dy .

Then

In+1a+ f (x) =

∫ x

aIna+f (z) dz =

∫ x

a

∫ z

a

(z − y)n−1

(n − 1)!f (y) dy dz

=

∫ x

a

∫ x

y

(z − y)n−1

(n − 1)!dz f (y) dy

=

∫ x

a

(x − y)n

n!f (y) dy .

Gamma function

Recall that

Γ(α) =

∫ ∞0

e−ttα−1 dt for α > 0,

andΓ(n + 1) = n! for any integer n ≥ 0.

For any real α > 0, we define the left-sided, Riemann–Liouvillefractional integration operator of order α by

Iαa+f (x) =

∫ x

a

(x − y)α−1

Γ(α)f (y) dy for x > a.

This definition is consistent with our earlier definition of Ina+

when α = n.

Transposed operator

Putting

〈f , g〉 =

∫ b

af (x)g(x) dx

we find that ⟨Iαa+f , g

⟩=⟨f , Iαb−g

⟩,

where the right-sided, Riemann–Liouville fractional integrationoperator of order α is given by

Iαb−g(x) =

∫ b

x

(y − x)α−1

Γ(α)g(y) dy for x < b.

Semigroup property

From the recursive definition, we see that

Ima+Ina+ = Im+na+ for all integers m ≥ 0, n ≥ 0.

Key question: does

Iαa+Iβa+ = Iα+β

a+ for all α > 0 and β > 0?

Consider

Iαa+Iβa+f (x) =

∫ x

a

(x − z)α−1

Γ(α)

∫ z

a

(z − y)β−1

Γ(β)f (y) dy dz

=

∫ x

a

∫ x

y

(x − z)α−1

Γ(α)

(z − y)β−1

Γ(β)dz f (y) dy .

Putting t = (z − y)/(x − y), we have

z = y + t(x−y), x− z = (1− t)(x−y), z−y = t(x−y),

so the Beta function identity∫ 1

0(1− t)α−1tβ−1 dt = B(α, β) =

Γ(α)Γ(β)

Γ(α + β),

gives∫ x

y

(x − z)α−1

Γ(α)

(z − y)β−1

Γ(β)dz

=(x − y)(α−1)+(β−1)+1

Γ(α)Γ(β)

∫ 1

0(1− t)α−1tβ−1 dt

=(x − y)α+β−1

Γ(α + β),

implying the desired result.

Gel’fand–Shilov function

Let

Υα(x) =xα−1

+

Γ(α),

where x+ = max(x , 0), and use the abbreviation

Iα = Iα0+ for α > 0.

The fractional integral is given by the Laplace convolution

Iαf (x) =

∫ x

0Υα(x − y)f (y) dy = Υα ∗ f (x), x > 0.

We easily see that

Υα ∗Υβ = Υα+β for α > 0, β > 0.

In fact, since ∗ is associative,

(Υα ∗Υβ) ∗ f = Υα ∗ (Υβ ∗ f ) = Iα(Iβf ) = Iα+βf = Υα+β ∗ f

for every continuous f .

It follows thatIαΥβ = Υα+β,

generalising the identity

Im xn

n!=

xm+n

(m + n)!.

Shifted version

LetΥβ,a(x) = Υβ(x − a).

Using the substitution y = a + t,

Iαa+Υβ,a(x) =

∫ x

aΥα(x − y)Υβ(y − a) dy

=

∫ x−a

0Υα(x − a− t)Υβ(t) dt

= (Υα ∗Υβ)(x − a) = Υα+β(x − a),

or in other words,

Iαa+Υβ,a(x) = Υα+β,a(x), x > a.

Fractional differentiationAssume that

n − 1 < α ≤ n for some n ∈ 1, 2, 3, . . .,

and write Dn = (d/dx)n.

The Riemann–Liouville fractional derivative is defined by

Dαa+f (x) = DnIn−αa+ f (x) for x > a.

whereas the Caputo fractional derivative is defined by

CDαa+f (x) = Iα−na+ Dnf (x) for x > a.

LemmaFor x > a and β > 0,(

DIβa+ − Iβa+D

)f (x) = f (a)Υβ(x − a).

Proof

Note that DΥβ(x) = Υβ−1(x) and Υ1(x) = 1.

By the fundamental theorem of calculus,

Ia+Df (x) =

∫ x

af ′(y) dy = f (x)− f (a),

sof (x) = Ia+Df (x) + f (a)Υ1(x − a),

Thus,Iβa+f (x) = Iβ+1

a+ Df (x) + f (a)Υβ+1(x − a),

and finally

DIβa+f (x) = DIa+ Iβa+Df (x) + f (a)Υβ(x − a).

Relation between Dα and CDα

TheoremIf n − 1 < α < n, then

Dαa+f (x) = CDαa+f (x) +n−1∑k=0

Dk f (a)(x − a)k−α

Γ(k + 1− α), x > a.

Proof.In the case n = 2, we have 1 < α < 2 and the Lemma gives

Dαa+f (x) = D2I2−αa+ f (x) = D

(I2−αa+ Df (x) + f (a)Υ2−α(x − a)

)= I2−α

a+ D2f (x) +Df (a)Υ2−α(x − a) + f (a)Υ1−α(x − a)

= CDαa+f (x) + f (a)(x − a)−α

Γ(1− α)+Df (a)

(x − a)1−α

Γ(2− α).

The general case follows in the same way.

Differentiating a shifted Gel’fand–Shilov function

LemmaIf α > 0, β > 0 and x > a, then

Dαa+Υβ,a(x) = Υβ−α,a(x).

Proof.If n − 1 < α < n then

Dαa+Υβ,a(x) = DnIn−αa+ Υβ,a(x) = DnΥn−α+β,a(x) = Υβ−α,a(x).

In particular, since Υ1,a(x) ≡ 1, if x > a then

Dαa+1(x) = Υ1−α,a(x) =(x − a)−α

Γ(1− α)whereas CDαa+1(x) = 0.

Relation between Dα and CDα restated

Since(x − a)k−α

Γ(k + 1− α)= Υk+1−α,a(x) = Dαa+Υk+1,a(x)

the relation

Dαa+f (x) = CDαa+f (x) +n−1∑k=0

Dk f (a)(x − a)k−α

Γ(k + 1− α), x > a,

may be re-stated in the form

CDαa+f (x) = Dαa+

(f (x)−

n−1∑k=0

Dk f (a)(x − a)k

k!

), x > a,

where n − 1 < α < n, as before.

Alternative representation

LemmaIf 0 < α < 1 and x > a, then

Dαa+f (x) = f (x)Υ1−α(x−a)+

∫ x

aΥ−α(x−y)

[f (y)−f (x)

]dx .

Proof.Differentiate the identity

I1−αa+ f (x) = f (x)Υ2−α(x − a) +

∫ x

aΥ1−α(x − y)

[f (y)− f (x)

]dy ,

noting that the derivative of the integral on the right is∫ x

aΥ−α(x − y)

[f (y)− f (x)

]dx −Υ2−α(x − a)f ′(x).

Representation as a Hadamard finite-part integral

Assume 0 < α < 1 and x > a. Then∫ x

aΥ−α(x − y)

[f (y)− f (x)

]dy =

∫ x

x−ε· · · dy

+

∫ x−ε

aΥ−α(x − y)f (y) dy + f (x)

[Υ1−α(ε)−Υ1−α(x − a)

],

so

Dαa+f (x) =f (x)ε−α

Γ(1− α)+

∫ x−ε

aΥ−α(x − y)f (y) dy + O(ε1−α).

and therefore

Dαa+f (x) = “I−αa+ f (x)” = fpε↓0

∫ x−ε

aΥ−α(x − y)f (y) dy .

Grunwald–Letnikov definitionCan we define a fractional derivative (or integral) directly, withoutusing integer-order derivatives and integrals?Denote the backward difference by

∆hf (x) = f (x)− f (x − h).

Can check by induction on k that

∆khf (x) =

k∑j=0

(k

j

)(−1)j f (x − jh) for k ∈ 0, 1, 2, . . . .

Hence define the fractional backward difference of order α by

∆αh,nf (x) =

n∑j=0

(α

j

)(−1)j f (x − jh),

with (α

j

)=α

1

α− 1

2· · · α− j + 1

j=

Γ(α + 1)

Γ(j + 1)Γ(α− j + 1).

An induction on k shows that

∆khf (x) =

∫ h

0· · ·∫ h

0f (k)(x − t1 − · · · − tk) dt1 · · · dtk ,

and thus

f (k)(x) = limh→0

∆khf (x)

hk.

Given x and a, we therefore define

GLDαa+f (x) = lim∆α

h,nf (x)

hα,

where the limit is obtained by sending n→∞ and h→ 0+ keeping

h =x − a

n,

so that nh = x − a is constant.

Can show that if m − 1 < α < m, then

GLDαa+f (x) =m−1∑k=0

f (k)(a)(x − a)k−α

Γ(k + 1− α)

+

∫ x

a

(x − y)m−α−1

Γ(m − α)f (m)(y) dy ,

which means that

GLDαa+f (x) = CDαa+f (x) +m−1∑k=0

f (k)(a)(x − a)k−α

Γ(k + 1− α)= Dαa+f (x).

Furthermore,

GLD−αa+ f (x) = Iαa+f (x), α > 0.

Part II

Useful tools

Introduction

We will make extensive use of the Laplace transform (and someuse of the Fourier transform), first to derive the fractional diffusionequation and then to study properties of the solution. Laplacetransformation also plays a large role in some of the numericalmethods we study, either as part of the method itself or for theerror analysis.

This lecture also introduces some special functions that are arise inthe study of fractional initial-boundary value problems.

Outline

Laplace transforms

Mittag–Leffler function

Wright functions

Laplace transforms

Notation:

f (z) = (Lf )(z) =

∫ ∞0

e−zt f (t) dt.

If f is locally integrable on [0,∞), and if

|f (t)| ≤ Ceλt for t > 0,

then f (z) exists and is analytic for <z > λ, and we have theinversion formula

f (t) =1

2πi

∫ a+i∞

a−i∞ezt f (z) dz , a > λ.

Transform of a Gel’fand–Shilov function

For α > 0 and z > 0, the substitution y = tz gives

Υα(z) =1

Γ(α)

∫ ∞0

e−zttα−1 dt =1

Γ(α)

∫ ∞0

e−y(

y

z

)α−1 dy

z

=z−α

Γ(α)

∫ ∞0

e−yyα−1 dy ,

that is,Υα(z) = z−α,

consistent with

z−α−β = Υα+β(z) = L(Υα ∗Υβ) = Υα(z)Υβ(z) = z−αz−β.

Laplace transform of an integral

Since

If (t) =

∫ t

0f (s) ds = (Υ1 ∗ f )(t)

we haveLIf (t) = Υ1(z)f (z) = z−1f (z).

In general, Inf = Υn ∗ f so

LInf (t) = z−n f (z) for n ∈ 0, 1, 2, . . ..

Laplace transform of a derivative

Integration by parts shows

LDf (t) =

∫ ∞0

e−ztDf (t) dt

=[e−zt f (t)

]∞t=0−∫ ∞

0(−z)e−zt f (t) dt

= 0− f (0) + z

∫ ∞0

e−zt f (t) dt,

soLDf (t) = zf (z)− f (0).

Easily verify by induction on n that

LDnf (t) = zn f (z)−n−1∑k=0

zn−1−kDk f (0).

Laplace transform of a Caputo fractional derivative

If n − 1 < α < n, then

CDαf (t) = In−αg(t) where g(t) = Dnf (t),

so

LCDαf (t) = z−(n−α)g(z)

= zα−n(

zn f (z)−n−1∑k=0

zn−1−kDk f (0)

)and so

LCDαf (t) = zαf (z)−n−1∑k=0

zα−1−kDk f (0).

Laplace transform of a Riemann–Liouville fractionalderivative

If n − 1 < α < n, then

Dαf (t) = CDαf (t) +n−1∑k=0

Dk f (0)Υk+1−α(t),

and since Υk+1−α(z) = z−(k+1−α) we have

LDαf (t) = LCDαf (t)+n−1∑k=0

Dk f (0)zα−1−k ,

that is,LDαf (t) = zαf (z).

Mittag–Leffler function

Problem: find f (t) satisfying

CDαf (t) = f (t) for t > 0, with f (0) = 1.

If α = 1 then f (t) = et .

If 0 < α < 1, then we claim

f (t) =∞∑k=0

Υ1+kα(t) =∞∑k=0

tkα

Γ(1 + kα).

That is,f (t) = Eα(tα),

where the Mittag–Leffler function is

Eα(z) =∞∑k=0

zk

Γ(1 + kα).

Proof

In fact, CDαΥ1 = 0 and for k ≥ 1,

CDαΥ1+kα = I1−αDΥ1+kα = I1−αΥkα = Υ1+(k−1)α,

so

CDαf = CDα(Υ1 + Υ1+α + Υ2+α + Υ3+α + · · ·

)= 0 + Υ1 + Υ1+α + Υ2+α + · · · = f .

Also,Υ1(t) ≡ 1 and Υ1+kα(0) = 0 for k ≥ 1,

so f (0) = 1.

Convergence?

We claim Eα(z) is an entire function of z . By the ratio test, itsuffices to show that as k →∞,∣∣∣∣ zk+1

Γ(1 + (k + 1)α

)/ zk

Γ(1 + kα)

∣∣∣∣ = |z | Γ(1 + kα)

Γ(1 + kα + α)→ 0.

In fact, using Stirling’s approximation,

Γ(x) =

√2π

x

(x

e

)x(1 + O(x−1)

)as x →∞,

we find

Γ(1 + kα)

Γ(1 + kα + α)=

(1

1 + kα + α

)α(1 + O(k−1)

).

Mittag–Leffler function Eα(x) for 0 < α ≤ 1

Mittag–Leffler function Eα(x) for 1 < α ≤ 2

Special choices of α

E0(x) =∞∑k=0

zk =1

1− z.

E1/2(x) = exp(x2) erfc(−x) =1√2π

∫ ∞−x

exp(x2 − t2) dt.

E1(x) =∞∑k=0

xk

k!= exp(x).

E2(x) =∞∑k=0

xk

(2k)!= cosh(

√x), x ≥ 0.

E2(−x) =∞∑k=0

(−1)kxk

(2k)!= cos(

√x), x ≥ 0.

Fractional relaxation equationProblem: find u satisfying

CDαu + λu = 0 for t > 0, with u(0) = 1.

We claim that the solution is

u(t) = Eα(−λtα) =∞∑k=0

(−λ)ktkα

Γ(1 + kα).

In fact,

CDαu(t) = CDα∞∑k=0

(−λ)kΥ1+kα(t)

= 0 +∞∑k=1

(−λ)kΥ1+(k−1)α(t)

= −λ∞∑k=1

(−λ)k−1Υ1+(k−1)α(t) = −λu(t).

Equivalent formulation

The function u(t) = Eα(−λtα) satisfies

I1−αDu(t) + λu(t) = 0.

To obtain an equation involving a Riemann–Liouville fractionalderivative, apply DIα and obtain

DI1Du(t) + λDIαu(t) = 0.

Therefore, since DI1f (t) = f (t),

Du(t) + λD1−αu(t) = 0.

Taking Laplace transforms, zu(z)− u(0) + λz1−αu(z) = 0, andthus (z + λz1−α)u(z) = 1, showing that

u(z) = LEα(−λtα) =1

z + λz1−α .

Fractional relaxation: u(t) = Eα(−tα)

Integral representation

The reciprocal Gamma function has the representation

1

Γ(a)=

1

2πi

∫ 0+

−∞eww−a dw ,

where∫ 0+−∞ means integration around a Hankel contour that

encircles the negative real axis and has a counterclockwiseorientation.

TheoremThe Mittag–Leffler function admits the integral representation

Eα(z) =1

2πi

∫ 0+

−∞

ewwα−1

wα − zdw

provided the Hankel contour encloses the disc |w | ≤ |z |1/α.

Proof

Sincezk

Γ(1 + αk)=

zk

2πi

∫ 0+

−∞eww−1−αk dw ,

we see that

Eα(z) =∞∑k=0

zk

2πi

∫ 0+

−∞eww−1−αk dw

=1

2πi

∫ 0+

−∞

ew

w

∞∑k=0

(z

wα

)k

dw (if |z | < |w |α)

=1

2πi

∫ 0+

−∞

ew

w

1

1− zw−αdw

=1

2πi

∫ 0+

−∞

ewwα−1

wα − zdw .

Positivity

Another integral representation follows from the Laplace inversionformula,

Eα(−λtα) =1

2πi

∫ 0+

−∞

ezt dz

z + λz1−α , t > 0.

By collapsing the Hankel contour onto the negative real axis, wefind

Eα(−λtα) =1

π

∫ ∞0

e−rtλrα sinαπ dr(rα + λ cosαπ

)2+ λ2 sin2 απ

,

which shows that

Eα(−λtα) > 0 andd

dtEα(−λtα) < 0 for all t > 0.

Asymptotic behaviour

Since

1

z + z1−α =zα−1

1 + zα=

1

z

∞∑n=0

(−1)nz(n+1)α, |z | < 1,

we find that as t →∞,

Eα(−tα) ∼∞∑n=0

(−1)n

2πi

∫ 0+

−∞eztz(n+1)α dz

z,

and a substitution gives

1

2πi

∫ 0+

−∞eztz(n+1)α dz

z=

t−(n+1)α

2πi

∫ 0+

−∞ezz(n+1)α dz

z

=t−(n+1)α

Γ(1− (n + 1)α

) .

Thus,

Eα(−tα) ∼∞∑n=0

(−1)nt−(n+1)α

Γ(1− (n + 1)α

) ,that is,

Eα(−tα) ∼∞∑n=1

(−1)n+1t−nα

Γ(1− nα)=

t−α

Γ(1− α)− t−2α

Γ(1− 2α)+ · · · .

The identity1

Γ(1− z)=

1

πΓ(z) sinπz

yields an alternative form,

Eα(−tα) ∼ 1

π

∞∑n=1

(−1)n+1t−nαΓ(nα) sin nπα.

Notice what happens as α→ 1.

Wright functions

The Wright function is defined by

Wλ,µ(z) =∞∑k=0

zk

k!Γ(λk + µ), λ > −1, µ ∈ C.

This series converges for all z ∈ C so Wλ,µ is an entire function.

TheoremThe Wright function has the integral representation

Wλ,µ(z) =1

2πi

∫ 0+

−∞exp(w + zw−λ

) dw

wµ.

Proof

We once again use

1

Γ(a)=

1

2πi

∫ 0+

−∞eww−a dw ,

and find

Wλ,µ(z) =∞∑k=0

zk

k!

1

2πi

∫ 0+

−∞eww−λk−µ dw

=1

2πi

∫ 0+

−∞eww−µ

∞∑k=0

(zw−λ)k

k!dw

=1

2πi

∫ 0+

−∞exp(w + zw−λ)

dw

wµ.

Wright M-functionOur main concern is with the function

Mα(t) = W−α,1−α(−t) =∞∑k=0

(−1)ktk

k!Γ(1− (k + 1)α

) ,0 < α < 1, introduced by F. Mainardi in 1994. The identity

1

Γ(1− z)=

1

πΓ(z) sinπz

yields an alternative expression

Mα(t) =1

π

∞∑k=0

(−1)ktk

k!Γ((k + 1)α

)sinπ(k + 1)α.

Two special cases: (Ai = Airy function)

M1/2(t) =exp(−t2/4)√

πand M1/3(t) = 32/3 Ai(t),

Wright M-function Mα(t)

Integral representation and Laplace transform

Putting λ = −α and µ = 1− α in the integral representation ofthe Wright function, and replacing t by −t, gives

Mα(t) =1

2πi

∫ 0+

−∞exp(w − twα)

dw

w 1−α ,

which allows us to prove the following result.

TheoremFor 0 < α < 1, the Laplace transform of Mα is

Mα(z) = Eα(−z).

Proof

Mα(z) =

∫ ∞0

e−ztMα(t) dt

=

∫ ∞0

e−zt1

2πi

∫ 0+


dw

w 1−α dt

=1

2πi

∫ 0+

−∞

ew

w 1−α

∫ ∞0

e−(z+wα)t dt dw

=1

2πi

∫ 0+

−∞

ew

w 1−α1

z + wαdw

=1

2πi

∫ 0+

−∞

ewwα−1

wα − (−z)dw

= Eα(−z).

Fourier transform

Notation:

f (ξ) = Ff (x) =

∫ ∞−∞

e−iξx f (x) dx .

If f ∈ L1(R) then f is continuous on R and f (ξ)→ 0 as |ξ| → ∞.

Plancherel theorem: Fourier transform extends uniquely to abounded linear operator F : L2(R)→ L2(R) satisfying

1

2π

∫ ∞−∞F f (ξ)Fg(ξ) dξ =

∫ ∞−∞

f (x)g(x) dx .

Inversion formula:

f (x) =1

2πlim

R→∞

∫ R

−Re iξx f (ξ) dξ, −∞ < x <∞.

Symmetric Wright M-function

We will require the Fourier transform of Mα(|x |).

LemmaFor κ > −1 and 0 < α < 1,∫ ∞

0tκMα(t) dt =

Γ(κ+ 1)

Γ(ακ+ 1).

TheoremFor 0 < α < 1,

FMα(|x |) = 2E2α(−ξ2).

Proof of the lemma

∫ ∞0

tκMα(t) dt =

∫ ∞0

tκ1

2πi

∫ 0+


dw

w 1−α dt

=1

2πi

∫ 0+

−∞ew∫ ∞

0tκe−tw

αdt

dw

w 1−α

and∫ ∞0

tκe−twα

dt = Γ(κ+ 1)

∫ ∞0

e−twα

Υκ+1(t) dt

= Γ(κ+ 1)Υκ+1(wα) = Γ(κ+ 1)(wα)−(κ+1)

so∫ ∞0

tκMα(t) dt =Γ(κ+ 1)

2πi

∫ 0+

−∞eww−(ακ+1) dw =

Γ(κ+ 1)

Γ(ακ+ 1).

Proof of the theorem

FMα(|x |) =

∫ ∞−∞

e−iξxMα(|x |) dx

= 2

∫ ∞0

Mα(x) cos ξx dx

= 2

∫ ∞0

Mα(x)∞∑k=0

(−1)k(ξx)2k

(2k)!dx

= 2∞∑k=0

(−1)kξ2k

(2k)!

∫ ∞0

x2kMα(x) dx

= 2∞∑k=0

(−1)kξ2k

(2k)!

Γ(2k + 1)

Γ(2αk + 1)

= 2∞∑k=0

(−ξ2)k

Γ(1 + 2αk)= 2 E2α(−ξ2).

Part III

Anomalous diffusion

Introduction

The classical diffusion equation,

ut − K∇2u = 0,

describes how the concentration u of a substance evolves over timeif, at the microscopic scale, its particles exhibit Brownian motion.The equation can also be derived from a purely macroscopicargument based on conservation of mass and Fick’s law, whichstates that the mass flux vector equals −K∇u.

In this lecture, we study continuous-time random walks, whichprovide a generalization of Brownian motion. When thewaiting-time distribution obeys a power law, the particlesexperience trapping and their macroscopic behaviour is said to besubdiffusive. The mean-square displacement of such a particle isproportional to tα for a characteristic exponent in therange 0 < α < 1.

With the help of Fourier and Laplace transformation, we show thatthe macroscopic concentration obeys a time-fractional PDE,

ut − ∂1−αt Kα∇2u = 0,

where it is convenient to write ∂1−αt = D1−α

0+ for theRiemann–Liouville fractional derivative with respect to t. Theconstant Kα > 0 is a generalized diffusivity, and the classicaldiffusion equation then arises as the limiting special casewhen α→ 1.

Outline

Continuous-time random walks

Rescaling

Subdiffusion

Continuous-time random walks

A walker moves along the x-axis, starting at position x0 attime t0 = 0. At time t1 the walker jumps to x1, then at time t2

jumps to x2, and so on. We assume that the increments

∆tn = tn − tn−1 and ∆xn = xn − xn−1

are independent, identically distributed random variables withprobability density functions ψ(t) and λ(x), respectively. That is,

P(a < ∆tn < b) =

∫ b

aψ(t) dt for 0 < a < b <∞,

and

P(a < ∆xn < b) =

∫ b

aλ(x) dx for −∞ < a < b <∞.

Aim: determine the probability that the particle lies in a givenspatial interval at time t.

An example

Suppose that the waiting-time distribution is exponential withparameter τ > 0,

ψ(t) = τ−1e−t/τ for 0 < t <∞,

and that the jump-length distribution is normal with mean 0 andvariance σ2,

λ(x) =1

σ√

2πexp

(− x2

2σ2

)for −∞ < x <∞.

Thus,

E(∆tn) = τ, E(∆t2n) = τ2, E(∆xn) = 0, E(∆x2

n ) = σ2.

The position x(t) of the walker is a step function.

Typical path (τ = 1, σ = 1, x0 = 2)

Random walk in 3D (σx = σy = σz = 1)

Convolutions and probability

We denote the Fourier convolution of f and g by

f ~ g(z) =

∫ ∞−∞

f (z − y)g(y) dy , for −∞ < z <∞.

If f (x) = 0 for x < 0 and g(y) = 0 for y < 0 then this integralequals the Laplace convolution

f ∗ g(z) =

∫ z

0f (z − y)g(y) dy for z > 0.

TheoremIf X and Y are independent random variables with probabilitydensity functions f and g, respectively, then the sum Z = X + Yhas probability density function f ~ g.

Probability distribution of tn

Let ψn(t) denote the probability density function of the randomvariable

tn = ∆t1 + ∆t2 + · · ·+ ∆tn,

that is,

P(a < tn < b) =

∫ b

aψn(t) dt for 0 < a < b <∞.

By the theorem quoted above,

ψn(t) = (ψn−1 ∗ ψ)(t) =

∫ t

0ψn−1(s)ψ(t − s) ds,

soψn = ψ ∗ ψ ∗ · · · ∗ ψ︸︷︷︸

n factors

with ψ0 = δ.

Survival probability

Let Ψ(t) denote the survival probability, that is, the probability ofthe walker not jumping within a time t, or equivalently, theprobability of remaining stationary for at least a duration t. Then,

Ψ(t) =

∫ ∞t

ψ(s) ds = 1−∫ t

0ψ(s) ds for 0 < t <∞.

It follows that the probability of taking exactly n steps up to time tis

χn(t) =

∫ t

0ψn(s)Ψ(t − s) ds for 0 < t <∞.

Probability distribution of xn − x0

Let λn(x) denote the probability density function of the randomvariable

xn − x0 = ∆x1 + ∆x2 + · · ·+ ∆xn,

that is,

P(a < xn − x0 < b) =

∫ b

aλn(x) dx for −∞ < a < b <∞.

Since

λn(x) = (λn−1 ~ λ)(x) =

∫ ∞−∞

λn−1(y)λ(x − y) dy ,

we haveλn = λ~ λ~ · · ·~ λ︸︷︷︸

n factors

with λ0 = δ.

Characteristic functionsTerminology from probability theory: the characteristic function ofψ(t) is just its Laplace transform,

ψ(z) = Lψ(z) =

∫ ∞0

e−ztψ(t) dt,

whereas the characteristic function of λ(x) is its Fourier transform,

λ(ξ) = Fλ(ξ) =

∫ ∞−∞

e−iξxλ(x) dx .

Since ψn and λn are n-fold convolutions of ψ and λ, respectively,

ψn(z) = ψ(z)n and λn(ξ) = λ(ξ)n for n ≥ 0.

The characteristic function for the survival probabilityΨ = 1− I1ψ is

Ψ(z) = z−1 − z−1ψ(z) =1− ψ(z)

z.

Probability density

Let p(x , t) denote the probability density function for the positionof the particle at time t, that is,

P(a < x(t)− x0 < b) =

∫ b

ap(x , t) dx .

Since χn(t) is the probability of taking n steps up to time t,

p(x , t) =∞∑n=0

λn(x)χn(t).

Recalling that χn = ψn ∗Ψ,

χn(z) = ψn(z)Ψ(z) = ψ(z)n1− ψ(z)

z.

Characteristic function

Denote the Fourier–Laplace transform of p by

ˆp(ξ, z) = LFp(ξ, z) =

∫ ∞0

e−zt∫ ∞−∞

e−iξxp(x , t) dx dt.

Using the results derived above,

ˆp(ξ, z) =∞∑n=0

λn(ξ)χn(z)

=∞∑n=0

λ(ξ)nψ(z)n1− ψ(z)

z

=1− ψ(z)

z

∞∑n=0

[λ(ξ)ψ(z)

]n.

Geometric series

Since λ is a probability density function,

λ(0) =

∫ ∞−∞

λ(x) dx = 1,

and likewise

ψ(0) =

∫ ∞0

ψ(t) dt = 1.

But if ξ 6= 0 or z > 0, then∣∣λ(ξ)ψ(z)

∣∣ < 1 so

∞∑n=0

[λ(ξ)ψ(z)

]n=

1

1− λ(ξ)ψ(z)

and hence

ˆp(ξ, z) =1− ψ(z)

z

1

1− λ(ξ)ψ(z).

Earlier example

If

ψ(t) = τ−1e−t/τ and λ(x) =1

σ√

2πexp

(− x2

2σ2

),

we find that

ψ(z) =1

1 + τzand λ(ξ) = e−σ

2ξ2/2,

so

1− ψ(z)

z=

τ

1 + τz,

1− ψ(z)λ(ξ) =1 + τz − exp(−σ2ξ2/2)

1 + τz,

and thusˆp(ξ, z) =

τ

1 + τz − exp(−σ2ξ2/2).

Uncertain initial position

Instead of assuming x(0) = x0 is known, we can treat x0 as arandom variable with probability density p0(x), so that

P(a < x0 < b) =

∫ b

ap0(x) dx for −∞ < a < b <∞.

Let λn denote the probability density function for xn (rather thanxn − x0, as before), so

P(a < xn < b) =

∫ b

aλn(x) dx for −∞ < a < b <∞.

Sincexn = x0 + ∆x1 + · · ·+ ∆xn,

we haveλn = p0 ~ λ~ λ~ · · ·~ λ︸︷︷︸

n factors

.

In particular, λ0 = p0 (rather than δ),

Using[λ(ξ)]np0(ξ)

in the preceding analysis leads to

ˆp(ξ, z) =1− ψ(z)

z

p0(ξ)

1− λ(ξ)ψ(z),

that is, the only change is to introduce a factor p0(ξ).

Formally, put p0(x) = δ(x − x0) to recover the case when x0 isknown with certainty.

Rescaling

Assume now that the probability density functions ψ(t) and λ(x)are normalized to satisfy∫ ∞

0tψ(t) dt = 1,

∫ ∞−∞

xλ(x) dx = 0,

∫ ∞−∞

x2λ(x) dx = 1.

Let τ > 0 and σ > 0, and let the random variables ∆tn and ∆xnnow have the rescaled probability density functions

ψτ (t) =1

τψ

(t

τ

)and λσ(x) =

1

σλ

(x

σ

),

so that

E(∆tn) = τ, E(∆xn) = 0, E(∆x2n ) = σ2.

We want to investigate what happens as τ and σ tend to zero.

Typical path (σ = 0.1, τ = 0.005)

Detailed view of inset

MomentsWe saw earlier that ψ(0) = 1 = λ(0). Since

dk ψ

dzk=

∫ ∞0

e−zt(−t)kψ(t) dt

we have

ψ′(0) = −∫ ∞

0tψ(t) dt = −1.

Similarly,dk λ

dξk=

∫ ∞−∞

e−iξx(−ix)kλ(x) dx

so

λ′(0) = −i

∫ ∞−∞

xλ(x) dx = 0,

λ′′(0) = −∫ ∞−∞

x2λ(x) dx = −1.

Characteristic functions

Sinceψτ (z) = ψ(τz) and λσ(ξ) = λ(σξ),

we have

ˆp(ξ, z ;σ, τ) =1− ψ(τz)

z

1

1− ψ(τz)λ(σξ).

The Taylor expansion

ψ(z) = ψ(0) + ψ′(0)z + · · · = 1− z + O(z2) as z → 0,

implies that

1− ψ(τz)

z=τz + O(τ2z2)

z= τ

(1 + O(τz)

)as τ → 0.

Assume for simplicity that λ(−x) = λ(x). Then λ′′′(0) = 0 and

λ(ξ) = λ(0) + λ′(0)ξ + 12 λ′′(0)ξ2 + · · · = 1− 1

2ξ2 + O(ξ4).

Thus,

ψ(τz)λ(σξ) =[1− τz + O(τ2z2)

][1− 1

2σ2ξ2 + O(σ4ξ4)

]= 1− τz − 1

2σ2ξ2 + O(τ2z2 + σ4ξ4)

and

1− ψ(τz)λ(σξ) =(τz + 1

2σ2ξ2)[

1 + O(τz + σ2ξ2)],

so

ˆp(ξ, z ;σ, τ) =τ

τz + 12σ

2ξ2× 1 + O(τz)

1 + O(τz + σ2ξ2).

Limiting probability densityNow send σ → 0 and τ → 0 while keeping

σ2

2τ= K ,

for a fixed K > 0, and obtain

ˆp(ξ, z) = limτ

τz + 12σ

2ξ2=

1

z + Kξ2.

Inverting the Laplace transform, we find

p(ξ, t) =1

2πi

∫ a+i∞

a−i∞

ezt dz

z + Kξ2= e−Kξ

2t ,

and then inverting the Fourier transform,

p(x , t) =1√

4πKtexp

(− x2

4Kt

).

Snapshots of p(x , t)

Partial differential equation for p

Notice that

LF(pt − Kpxx

)= z ˆp(ξ, z)− p(ξ, 0) + Kξ2ˆp(ξ, z)

= (z + Kξ2)ˆp(ξ, z)− p(ξ, 0)

= 1− p(ξ, 0) = 0,

where the final step follows because p(x , 0) = δ(x) and sop(ξ, 0) = 1.

Therefore, p(x , t), the probability density for x(t)− x0, theposition (relative to x0) of the walker at time t, satisfies the partialdifferential equation

pt − Kpxx = 0 for 0 < t <∞ and −∞ < x <∞.

Uncertain initial position

When x0 is uncertain,

ˆp(ξ, z ;σ, τ) =τ p0(ξ)

τz + 12σ

2ξ2× 1 + O(τz)

1 + O(τz + σ2ξ2),

so in the scaling limit,

ˆp(ξ, z) =p0(ξ)

z + Kξ2

and thus

LF(pt − Kpxx

)= z ˆp(ξ, z)− p(ξ, 0) + Kξ2ˆp(ξ, z)

= (z + Kξ2)ˆp(ξ, z)− p(ξ, 0)

= p0(ξ)− p(ξ, 0) = 0,

since p(x , 0) = p0(x).

Subdiffusion

Let 0 < α < 1, and suppose now that the waiting time probabilitydensity function is a power law:

ψ(t) ∼ A

t1+αas t →∞,

for some constant A > 0. It follows that∫ ∞0

tψ(t) dt = +∞,

so the preceding analysis of the random walk breaks down.

We make no change to our assumptions on λ(x).

Example

ψ(t) =α

(1 + t)1+αand λ(x) =

1

σ√

2πexp

(− x2

2σ2

).

Power law (α = 0.75) vs Exponential (τ = 1) distributions

Typical path (α = 0.75, σ = 1)

Behaviour of the characteristic function as z → 0Assume there exists T > 0 such that

|t1+αψ(t)− A| ≤ Ct−1 for T ≤ t <∞,

and let 0 < z ≤ T−1 (so Tz ≤ 1). Since we know ψ(0) = 1,consider

1− ψ(z) =

∫ ∞0

(1− e−zt)ψ(t) dt = I1 + I2 + I3,

where

I1 =

∫ T

0(1− e−zt)ψ(t) dt,

I2 =

∫ ∞T

(1− e−zt)(ψ(t)− At−1−α) dt,

I3 =

∫ ∞T

(1− e−zt)At−1−α dt.

Since0 ≤ 1− e−y ≤ min(1, y) for 0 ≤ y ≤ 1,

we immediately see that

0 ≤ I1 ≤∫ T

0ztψ(t) dt ≤ zT

∫ ∞0

ψ(t) dt = Tz .

Also, the substitution t = y/z gives

|I2| ≤∫ ∞T

(1− e−zt)Ct−2−α dt ≤ Cz1+α

∫ ∞Tz

(1− e−y )y−2−α dy

≤ Cz1+α

(∫ 1

Tzy−1−α dy +

∫ ∞1

y−2−α dy

)= Cz1+α

((Tz)−α − 1

α+

1

1 + α

)≤ Cα,T z .

The same substitution t = y/z yields

I3 =

∫ ∞T

(1− e−zt)At−1−α dt = Azα∫ ∞Tz

1− e−y

y 1+αdy = Bαzα + I4,

where

Bα = A

∫ ∞0

1− e−y

y 1+αdy and I4 = −Azα

∫ Tz

0

1− e−y

y 1+αdy .

Since

|I4| ≤ Azα∫ Tz

0y−α dy =

AT−αz

1− α,

we have shown that∣∣1− ψ(z)− Bαzα∣∣ = |I1 + I2 + I4| ≤ CT ,αz for 0 < z ≤ T−1.

Integrating by parts,∫ ∞0

1− e−y

y 1+αdy =

∫ ∞0

(1− e−y ) d(−α−1y−α

)=

[− 1− e−y

αyα

]∞0

+ α−1

∫ ∞0

e−yy−α dy

= α−1Γ(1− α) = −Γ(−α),

which completes the proof of the following result.

TheoremIf

ψ(t) = At−1−α + O(t−2−α) as t →∞,

then, with Bα = Aα−1Γ(1− α),

ψ(z) = 1− Bαzα + O(z) as z → 0.

Rescaling

As before, normalize λ(x) so that∫ ∞−∞

xλ(x) dx = 0 and

∫ ∞−∞

x2λ(x) dx = 1,

but now suppose

ψ(t) ∼ A

t1+αas t →∞.

Define the rescaled probability density functions

ψτ (t) =1

τψ

(t

τ

)and λσ(x) =

1

σλ

(x

σ

),

and notice ψτ is again a power law,

ψτ (t) ∼ Aτα

t1+αas t →∞.

Characteristic functions

Hence,

E(∆tn) = +∞, E(∆xn) = 0, E(∆x2n ) = σ2.

As before,

ˆp(ξ, z ;σ, τ) =1− ψ(τz)

z

1

1− ψ(τz)λ(σξ),

withλ(ξ) = 1− 1

2ξ2 + O(ξ4) as ξ → 0,

but this time

ψ(z) = 1− Bαzα + O(z) as z → 0,

where Bα = Aα−1Γ(1− α).

Thus,

1− ψ(τz)

z=

Bαταzα + O(τz)

z= Bατ

αzα−1[1 + O(τ1−αz1−α)

].

Likewise,

ψ(τz)λ(σξ) =[1− Bατ

αzα + O(τz)][

1− 12σ

2ξ2 + O(σ4ξ4)]

= 1− Bαταzα − 1

2σ2ξ2 + O(τz + τ2αz2α + σ4ξ4)

so

1− ψ(τz)λ(σξ) = Bαταzα + 1

2σ2ξ2 + O(τz + τ2αz2α + σ4ξ4)

=[Bατ

αzα + 12σ

2ξ2][

1 + O(τ1−αz1−α + ταzα + σ2ξ2)].

Limiting probability density

Therefore,

ˆp(ξ, z ;σ, τ) =Bατ

αzα−1

Bαταzα + 12σ

2ξ2

1 + O(τ1−αz1−α)

1 + O(τ1−αz1−α + ταzα + σξ

) .Once again, send σ → 0 and τ → 0, but now keep

σ2

2Bατα= Kα,

for a fixed Kα > 0, and obtain

ˆp(ξ, z) = limBατ

αzα−1

Bαταzα + 12σ

2ξ2=

zα−1

zα + Kαξ2.

Notice that we recover the earlier formula by putting α = 1.

Typical path (α = 0.75, σ = 0.1, τα = σ2/2, N = 800)

Detailed view of inset

Recall that

Lt→zEα(−λtα) =1

z + λz1−α

andFx→ξMα/2(|x/µ|) = 2µEα(−µ2ξ2).

Since

ˆp(ξ, z) =zα−1

zα + Kαξ2=

1

z + λz1−α if λ = Kαξ2,

we see that

p(ξ, t) = Eα(−Kαtαξ2) = Eα(−µ2ξ2) if µ =√

Kαtα.

Thus,

p(x , t) =1

2√

KαtαMα/2

(|x |√Kαtα

).

Snapshots of p(x , t) when α = 2/3.

Fractional partial differential equation for p

We have

LFpt − KαD1−αpxx = z ˆp(ξ, z)− p(ξ, 0) + Kαz1−αξ2ˆp(ξ, z)

=(z + Kαz1−αξ2

)ˆp(ξ, z)− p(ξ, 0) = 0

since

ˆp(ξ, z) =zα−1

zα + Kαξ2=

1

z + Kαz1−αξ2and p(ξ, 0) = δ(ξ) = 1.

Thus, p satisfies the time-fractional diffusion equation,

pt − KαD1−αpxx = 0 for 0 < t <∞ and −∞ < ξ <∞.

Mean-square displacement

Let

V (t) = E((x(t)− x0)2

)=

∫ ∞−∞

x2p(x , t) dx for t > 0.

Since

V (z) =

∫ ∞−∞

x2p(x , z) dx = − d2

dξ2ˆp(ξ, z)

∣∣∣∣ξ=0

= − d2

dξ2

(z + Kαz1−αξ2

)−1∣∣∣∣ξ=0

= 2Kαz−1−α,

it follows that V (t) = 2KαΥ1+α(t), that is,

E((x(t)− x0)2

)=

2Kαtα

Γ(1 + α)∝ tα.

Part IV

The time-fractional diffusion equation

Introduction

We have shown that in 1D the probability density function for thelocation of a subdiffusive particle at time t obeys a time-fractionalPDE. The concentration u = u(x , t) of a large number of suchparticles evolves in the same way. Moreover, the 1D analysis canbe generalized to higher dimensions to yield the time-fractionaldiffusion equation

ut − Kα∂1−αt ∇2u = 0.

As in the classical case α = 1, the solution u in a bounded spatialdomain (and subject to homogeneous boundary conditions) can beconstructed by separation of variables to yield a series expansioninvolving the eigenfunctions of −∇2. We use this representationof u(x , t) to investigate its behaviour.

Outline

Initial-boundary value problem

Smoothing property of fractional diffusion

Positivity

Initial-boundary value problem

Let Ω denote a bounded, Lipschitz domain in R2 or R3. We seeku = u(x , t) satisfying

ut − Kα∂1−αt ∇2u = f (x , t) for x ∈ Ω and t > 0,

u = u0(x) for x ∈ Ω, when t = 0,

and impose homogeneous boundary conditions, either Dirichlet

u = 0 for x ∈ ∂Ω and t > 0,

or else Neumann,

∂u

∂n= 0 for x ∈ ∂Ω and t > 0,

where n is the outward unit normal to Ω.

Abstract initial-value problem

Let A be a linear operator with dense domain D(A) in a Hilbertspace H with inner product 〈u, v〉 and norm ‖u‖ =

√〈u, u〉. Given

u0 ∈ H and f : [0,∞)→ H we seek u : [0,∞)→ H satisfying

u + ∂1−αt Au = f (t) for t > 0,

with u(0) = u0, where u = ut = ∂u/∂t.

Standard example:

H = L2(Ω), Au = −Kα∇2u, D(A) = H2(Ω) ∩ H10 (Ω).

Integrate to obtain an equivalent Volterra equation in H,

u(t) +

∫ t

0Υα(t − s)Au(s) ds = u0 +

∫ t

0f (s) ds, t > 0.

Eigenfunction expansionAssume that A is self-adjoint and positive-semidefinite, with acomplete orthonormal eigensystem, say

Aφm = λmφm for m = 0, 1, 2, . . . ,

with 〈φm, φn〉 = δmn. Number the eigenvalues so that

0 ≤ λ0 ≤ λ1 ≤ λ2 ≤ · · · .

(These assumptions hold for our standard example.) Thus,

u(t) =∞∑

m=0

um(t)φm where um(t) = 〈u(t), φm〉.

Likewise, putting fm(t) = 〈f (t), φm〉 and u0m = 〈u0, φm〉, we have

f (t) =∞∑

m=0

fm(t)φm and u0 =∞∑

m=0

u0mφm.

Dirichlet boundary conditions in 1D

Take Ω = (0, L) and A = −Kαd2/dx2 with homogeneous Dirichletboundary conditions. Then,

φm(x) =

√2

Lsin

mπ

Lx and λm = Kα

(mπ

L

)2

,

for m ∈ 1, 2, 3, . . ., so

v(x) =∞∑

m=1

vmφm(x)

is just the sine series expansion of v , where

vm = 〈v , φm〉 =

√2

L

∫ L

0v(x) sin

mπ

Lx dx .

Neumann boundary conditions in 1D

Again take Ω = (0, L) and A = −Kαd2/dx2, but now imposehomogeneous Neumann boundary conditions. Then,

φ0(x) =1√L

and λ0 = 0,

with

φm(x) =

√2

Lcos

mπ

Lx and λm = Kα

(mπ

L

)2

for m ≥ 1,

so

v(x) =∞∑

m=0

vmφm(x)

is just the cosine series expansion of v .

Separation of variables

The function u satisfies

u + ∂1−αt Au = f (t) for t > 0, with u(0) = u0.

iff the mth eigenmode satisfies

um + λm∂1−αt um = fm(t) for t > 0, with um(0) = u0m,

for m = 0, 1, 2, . . . . Laplace transformation gives

zum(z)− um(0) + λmz1−αum(z) = fm(z)

so

um(z) =u0m + fm(z)

z + λmz1−α .

Duhamel formula and the mild solution

Recall that

LEα(−λtα) =1

z + λz1−α ,

so

um(t) = Eα(−λmtα)u0m +

∫ t

0Eα(−λm(t − s)α

)fm(s) ds.

Define the solution operator for the homogeneous problem,

E(t)v =∞∑

m=0

Eα(−λmtα)〈v , φm〉φm for t > 0 and v ∈ H,

then the mild solution of the abstract initial-value problem is

u(t) = E(t)u0 +

∫ t

0E(t − s)f (s) ds, t > 0.

Caution: E(t + s) 6= E(t)E(s) if 0 < α < 1.

Stability in HRecall that Eα(−λtα) is positive and decreasing for t > 0, andequals 1 at t = 0, so

0 < Eα(−λtα) ≤ 1 for 0 ≤ t <∞ and any λ ≥ 0.

Thus, using Parseval’s identity,

‖E(t)v‖2 =∞∑

m=0

Eα(−λmtα)2〈v , φm〉2 ≤∞∑

m=0

〈v , φm〉2 = ‖v‖2,

and therefore

‖E(t)v‖ ≤ ‖v‖ for t ≥ 0 and v ∈ H.

Hence, the mild solution satisfies the stability estimate

‖u(t)‖ ≤ ‖u0‖+

∫ t

0‖f (s)‖ ds for t ≥ 0.

Smoothing property of fractional diffusion

For 0 ≤ r <∞, define the norm

‖v‖2r =

∥∥(I + A)r/2v‖2 =∞∑

m=0

(1 + λm)r 〈v , φm〉2

and the corresponding closed subspace

Hr = v ∈ H : ‖v‖r <∞,

which is a Hilbert space with respect to the inner product thatinduces ‖ · ‖r .

For our standard example H = L2(Ω) and A = −Kα∇2, writeHr = Hr

D(Ω) or HrN(Ω) to indicate the choice of Dirichlet or

Neumann boundary conditions.

Dirichlet boundary conditions and Sobolev spaces

Can prove the following via interpolation and elliptic regularity.

TheoremSuppose that ∂Ω is C∞. If 0 ≤ r < 1

2 , then HrD(Ω) = H r (Ω),

however if 2j − 32 < r < 2j + 1

2 for j ∈ 1, 2, 3, . . ., then

HrD(Ω) = v ∈ H r (Ω) : v = Av = · · · = Aj−1v = 0 on ∂Ω .

In the exceptional case r = 2j − 32 , the condition Aj−1v = 0 on ∂Ω

must be replaced by Aj−1v ∈ H1/2(Ω).

If Ω is Lipschitz, then the conclusions still hold for r ≤ 1, and inparticular H1

D = H10 (Ω) = u ∈ H1(Ω) : u = 0 on ∂Ω .

If Ω is convex or C 1,1, then r ≤ 2 is OK, and in particularH2

D = H2(Ω) ∩ H10 (Ω) = u ∈ H2(Ω) : u = 0 on ∂Ω .

Neumann boundary conditions and Sobolev spaces

TheoremSuppose that ∂Ω is C∞. If 0 ≤ r < 3

2 , then HrN(Ω) = H r (Ω),

however if 2j − 12 < r < 2j + 3

2 for j ∈ 1, 2, 3, . . ., then

HrN(Ω) = v ∈ H r (Ω) :

∂nv = ∂nAv = · · · = ∂nAj−1v = 0 on ∂Ω .

In the exceptional case r = 2j − 12 , the condition ∂nAj−1v = 0

on ∂Ω must be replaced by ∂nAj−1v ∈ H1/2(Ω).

If Ω is Lipschitz, then the conclusions still hold if r ≤ 1, and inparticular H1

N(Ω) = H1(Ω).

If Ω is convex or C 1,1, then r ≤ 2 is OK, and in particularH2

N(Ω) = u ∈ H2(Ω) : ∂nu = 0 on ∂Ω .

A 1D example

Consider v(x) = 1 for x ∈ Ω = (0, L). Since∫ L

0v(x) sin

mπ

Lx dx = L

1− (−1)m

mπ

and (1 + λm)r ∼ (1 + m2)r , we see that

‖v‖r <∞ ⇐⇒∞∑p=0

(1 + 2p)2r−2 <∞,

so v ∈ HrD(Ω) iff 2r − 2 < −1, that is, r < 1

2 . However,v ∈ Hr

N(Ω) for all r ≥ 0 because∫ L

0v(x) cos

mπ

Lx dx = 0 for all m ≥ 1.

Smoothing property of classical diffusion

If α = 1 then Eα(−tα) = e−t so

E(t)v =∞∑

m=0

e−λmt〈v , φm〉φm

and thus

‖E(t)v‖2r+µ =

∞∑m=0

(1 + λm)r+µ(e−λmt〈v , φm〉

)2.

If 0 < t ≤ T and λ ≥ 0, then

(1 + λ)µ(e−λt)2 ≤ t−µ(T + λt)µe−2λt ≤ CT ,µt−µ

and so‖E(t)v‖r+µ ≤ CT ,µt−µ/2‖v‖r for µ ≥ 0.

Weaker smoothing property for subdiffusion

The theorem below shows that if v ∈ Hr then E(t)v ∈ Hr+2 foreach t > 0, but ‖E(t)v‖r+2 may blow up as t → 0+.

Lemma0 < Eα(−tα) ≤ C min(1, t−α) for 0 < t <∞.

Proof.Follows because

Eα(−tα) =

1 + O(tα) as t → 0+,

t−α/Γ(1− α) + O(t2−α) as t →∞.

TheoremLet 0 ≤ µ ≤ 2 and 0 ≤ r <∞. If v ∈ Hr , then

‖E(t)v‖r+µ ≤ CT t−αµ/2‖v‖r for 0 < t ≤ T .

Proof of theoremPut g(t) = Eα(−tα). Since g(λ1/αt) = E (−λtα), we have

‖E(t)v‖2r+µ =

∞∑m=0

(1 + λm)r+µg(λ1/αm t)2〈v , φm〉2.

The lemma implies that (assuming 0 ≤ µ ≤ 2)

0 < g(t) ≤ C (1 + tα)−µ/2 for 0 < t <∞,

so, for 0 < t ≤ T ,

g(λ1/αt)2 ≤ C (1 + λtα)−µ = Ct−µα(t−α + λ)−µ

≤ CT t−µα(1 + λ)−µ

and thus

‖E(t)v‖2r+µ ≤ CT

∞∑m=0

(1 + λm)r 〈v , φm〉2 = CT t−αµ‖v‖2r .

Regularity in time

Let q ∈ 1, 2, 3, . . .. Similar arguments yield the followingestimates.

LemmaThe function g(t) = Eα(−tα) satisfies

tq|g (q)(t)| ≤ Cq min(tα, t−α) for 0 < t <∞.

TheoremLet −2 ≤ µ ≤ 2, 0 ≤ r <∞ and q ∈ 1, 2, 3, . . .. If v ∈ Hr , then

tq‖E(q)(t)v‖r+µ ≤ Cq,T t−αµ/2‖v‖r for 0 < t ≤ T .

Detailed behaviour as t → 0+

Since

Eα(−λtα) =M−1∑p=0

(−1)ptαp

Γ(1 + αp)λp + O(λMtαM) as t → 0+,

and λpm〈v , φm〉 = 〈Apv , φm〉, we can show the following.

TheoremLet 0 ≤ r <∞ and M ∈ 1, 2, 3, . . .. If v ∈ Hr+2M , then

E(t)v = v +M−1∑p=1

(−1)ptαp

Γ(1 + αp)Apv + RM(t)AMv ,

where, given 0 ≤ µ ≤ 2, the remainder operator satisfies

‖RM(t)v‖r+µ ≤ CM,T tMα−αµ/2‖v‖r for 0 < t ≤ T .

Behaviour of an eigenmode

If the initial data is an eigenfunction of A, say u0 = φm, then thesolution of the homogeneous problem is

u(t) = E(t)φm = Eα(−λmtα)φm,

so

u(x , t) =

(1− λmtα

Γ(1 + α)+ O(t2α)

)φm(x) as t → 0+.

This example makes clear the fact that the time derivativeu = O(tα−1) is unbounded as t → 0+ no matter how regular theinitial data (so long as it is not zero, but in that case u ≡ 0).

Contrast this behaviour with that of the classical diffusion equation(α = 1): if u0 = φm then u(x , t) = e−λmtφm(x) is C∞ for t ≥ 0.

The inhomogeneous problemThe function u(t) = E(t)u0 solves the homogeneous problem

u + ∂1−αt Au = 0 for t > 0, with u(0) = u0.

Now consider

u(t) = E ∗ f (t) =

∫ t

0E(t − s)f (s) ds,

which solves the inhomogeneous problem with vanishing initialdata:

u + ∂1−αt Au = f (t) for t > 0, with u(0) = 0.

TheoremFor 0 ≤ r <∞ and q ∈ 0, 1, 2, . . ., the function u = E ∗ fsatisfies

tq‖u(q)(t)‖r ≤ Cq

q∑j=0

∫ t

0s j‖f (j)(s)‖r ds for 0 < t <∞.

Positivity

If we write

u+(t) =

0, t < 0,

u(t), t ≥ 0,,

then the Laplace transform of u is related to the Fourier transformof u+ by

u(iy) = u+(y) =

∫ ∞0

e−iytu(t) dt.

Thus, the Parseval–Plancherel identity,∫ ∞−∞

f (t)g(t) dt =1

2π

∫ ∞−∞

f (ξ)g(ξ) dξ,

implies that ∫ ∞0

u(t)v(t) dt =1

2π

∫ ∞−∞

u(iy)v(iy) dy .

We can now show that the operator ∂1−αt is positive semidefinite.

TheoremIf 0 < β < 1 and if u is real-valued, then∫ ∞

0(∂βt u)u dt =

cos 12πβ

π

∫ ∞0

yβ|u(iy)|2 dy ≥ 0.

Proof.Since L∂βt u(t) = zβ u(z),∫ ∞

0(∂βt u)u dt =

1

2π

∫ ∞−∞

(iy)β|u(iy)|2 dy .

The result follows because u(iy) = u(−iy) and for y > 0,

(±iy)β = (e±iπ/2y)β = yβ(cos 1

2πβ ± i sin 12πβ

).

H-valued case

LemmaFor 0 < α < 1 and suitable u : (0,∞)→ H,∫ ∞

0〈∂1−α

t Au, u〉 dt =sin 1

2πα

π

∫ ∞0

y 1−α‖A1/2u(iy)‖2 dy ≥ 0.

Proof.

∫ ∞0〈∂1−α

t Au, u〉 dt =

∫ ∞0

∞∑m=0

λm(∂1−αt um)um dt

=∞∑

m=0

λmcos 1

2π(1− α)

π

∫ ∞0

y 1−α|um(iy)|2 dy .

Digression: fractional derivative at a jump discontinuity

Suppose that

v(t) =

v1(t), 0 ≤ t < a,

v2(t), t > a,

where v1 : [0, a]→ R and v2 : [a,∞)→ R are C 1 functions. If0 < t < a, then differentiating the formula

Iαv(t) = Υα ∗ v(t) =

∫ t

0Υα(s)v(t − s) ds

gives

∂1−αt v(t) = DIαv1(t) = v1(0)Υα(t) +

∫ t

0Υα(s)v ′1(t − s) ds

= v(0+)Υα(t) +

∫ t

0Υα(t − s)v ′(s) ds.

However, if t > a then

Iαv(t) =

∫ a

0Υα(t − s)v1(s) ds +

∫ t

aΥα(t − s)v2(s) ds

=

∫ t

t−aΥα(s)v1(t − s) ds +

∫ t−a

0Υα(s)v2(t − s) ds

so

∂1−αv(t) = v1(0)Υα(t)− v1(a)Υα(t − a) + v2(a)Υα(t − a)

+

∫ t

t−aΥα(s)v ′1(t − s) ds +

∫ t−a

0Υα(s)v ′2(t − s) ds

and therefore, with [v ]a = v2(a)− v1(a) = v(a+)− v(a−),

∂1−αv(t) = v(0+) Υα(t) + [v ]aΥα(t − a)

+

∫ a

0Υα(t − s)v ′(s) ds +

∫ t

aΥα(t − s)v ′(s) ds.

Example

Stability via an energy argument

Consider the homogeneous equation,

u(t) + ∂1−αt Au(t) = 0.

Take the inner product with u(t) and integrate to obtain∫ T

0〈u, u〉 dt +

∫ T

0〈∂1−α

t Au, u〉 dt = 0.

Letting

u∗(t) =

u(t), 0 < t < T ,

0, t > T ,

we have ∫ T

0〈∂1−α

t Au, u〉 dt =

∫ ∞0〈∂1−α

t Au∗, u∗〉 dt ≥ 0.

Thus, ∫ T

0〈u, u〉 dt ≤ 0.

But ∫ T

0〈u, u〉 dt =

[12〈u, u〉

]Tt=0

= 12‖u(T )‖2 − 1

2‖u(0)‖2,

so ‖u(T )‖ ≤ ‖u(0)‖, which again shows that

‖E(t)u0‖ ≤ ‖u0‖ for t > 0.

Part V

Simple finite difference schemes

Introduction

We begin our study of numerical methods for fractional diffusionproblems by considering simple explicit and implicit finite difference(and quadrature) schemes in the 1D case:

ut − Kα∂1−αt uxx = f (x , t).

These schemes generalize the forward and backward Euler methodsfor the heat equation. As in the classical setting, the explicitscheme is stable only if the time step is sufficiently small, but theimplicit scheme is unconditionally stable.

Unlike the classical Euler methods, the grid stencils extend backthrough all preceding time levels resulting in a dramatically highercomputational cost.

Outline

Explicit Euler method for a fractional ODE

Explicit Euler method for a fractional PDE

Implicit Euler method

Explicit Euler method for a fractional ODE

Consider first the scalar problem (A = λ > 0)

u + λ∂1−αt u = f (t) for 0 < t < T , with u(0) = u0,

where λ > 0. Put ∆t = T/N and define grid points

tn = n∆t for 0 ≤ n ≤ N.

We want to computeUn ≈ u(tn).

Time-stepping

Integrating the ODE gives

u(tn+1)− u(tn) + λ

∫ tn+1

tn

∂1−αt u(t) dt =

∫ tn+1

tn

f (t) dt,

which suggests the time-stepping scheme

Un+1 − Un + λ

∫ tn+1

tn

∂1−αt U(t) dt = f n ∆t,

where f n = f (tn) and U is the piecewise-constant function

U(t) = Un for tn ≤ t < tn+1.

Approximation of the fractional derivative

Recalling ∂1−αt v = (Υα ∗ v)t , we have∫ tn+1

tn

∂1−αt U(t) dt = (Υα ∗ U)(tn+1)− (Υα ∗ U)(tn)

=

∫ tn+1

tn

Υα(tn+1 − t)Un dt

+n−1∑j=0

∫ tj+1

tj

[Υα(tn+1 − t)−Υα(tn − t)

]U j dt

=∆tα

Γ(α + 1)

n∑j=0

wn−jUj .

Weights

Here, ∫ tn+1

tn

Υα(tn+1 − t) dt = Υα+1(∆t) =∆tα

Γ(α + 1)

so w0 = 1, and for 0 ≤ j ≤ n − 1,

∆tα

Γ(α + 1)wn−j =

∫ tj+1

tj

[Υα(tn+1 − t)−Υα(tn − t)

]U j dt.

Find that

wj = (j + 1)α − 2jα + (j − 1)α for j ≥ 1.

Note that wj < 0 for j ≥ 1, with wj ≈ α(α− 1)jα−2 for large j .

Weights when α = 1/2

Implementation

In this way, we arrive at

Un+1 − Un +λ∆tα

Γ(α + 1)

n∑j=0

wn−jUj = f n ∆t.

Thus, starting from U0 = u0 we compute

Un+1 = Un + f n ∆t − λ∆tα

Γ(α + 1)

n∑j=0

wn−jUj

for n = 0, 1, 2, . . . , N.

Classical Euler method

In the limiting case α = 1 we have wj = 0 for j ≥ 1 so

λ∆tα

Γ(α + 1)

n∑j=0

wn−jUj = λ∆t Un

and therefore

Un+1 − Un + λ∆t Un = f n ∆t,

or equivalently,Un+1 − Un

∆t+ λUn = f n.

Conditional stabilitySince w0 = 1 we have

Un+1 =

(1− λ∆tα

Γ(α + 1)

)Un + f n ∆t − λ∆tα

Γ(α + 1)

n−1∑j=0

wn−jUj ,

and we can prove the following discrete analogue of the stabilityestimate for the continuous problem:

|u(t)| ≤ |u0|+∫ t

0|f (s)| ds for t > 0.

TheoremIf

λ∆tα

Γ(α + 1)≤ 1

then

|Un| ≤ |U0|+n−1∑j=0

|f n|∆t for 1 ≤ n ≤ N.

Proof

Put ρ = λ∆tα/Γ(α + 1) ≤ 1, then

|Un+1| ≤ (1− ρ)|Un|+ |f n|∆t + ρ

n−1∑j=0

|wn−j ||U j |

and we find

n−1∑j=0

|wn−j | = −n∑

j=1

wj = 1 + nα − (n + 1)α ≤ 1,

so

|Un+1| ≤ (1− ρ)|Un|+ |f n|∆t + ρ max0≤j≤n−1

|U j |

≤ |f n|∆t + max1≤j≤n

|U j |.

The desired estimate now follows using induction on n.

Example: α = 1/2, λ = 1, f ≡ 0, N = 25, ρ = 0.3192

Example: α = 1/2, λ = 4, f ≡ 0, N = 25, ρ = 1.2766

Explicit Euler method for a PDE

Consider Ω = (0, L) in 1D with boundary ∂Ω = 0, L. We seeku = u(x , t) satisfying

ut − Kα∂1−αt uxx = f (x , t) for x ∈ Ω and 0 < t < T ,


u = 0 for x ∈ ∂Ω and 0 < t < T .

Put ∆x = L/P and ∆t = T/N, and define grid points

(xp, tn) = (p ∆x , n ∆t) for 0 ≤ p ≤ P and 0 ≤ n ≤ N.

We want to computeUnp ≈ u(xp, tn).

Second central difference in spaceUsing the approximation

uxx(xp, tn) ≈Unp+1 − 2Un

p + Unp−1

∆x2

and letting f np = f (xp, tn), we discretize in time as before and

arrive at the scheme

Un+1p − Un

p −Kα ∆tα

Γ(α + 1)

n∑j=0

wn−jU jp+1 − 2U j

p + U jp−1

∆x2= f n

p ∆t

for 0 ≤ n ≤ N − 1 and 1 ≤ p ≤ P − 1, with the initial conditions

U0p = u0(xp) for 0 ≤ p ≤ P,

and boundary conditions

Un0 = 0 = Un

P for 1 ≤ n ≤ N.

Stencil

x

t

Matrix–vector formulationLet

Un =

Un

1

Un2...

UnP−2

UnP−1

and A =Kα

∆x2

2 −1−1 2 −1

. . .. . .

. . .

−1 2 −1−1 2

so that

Un+1 −Un +∆tα

Γ(α + 1)

n∑j=0

wn−jAUj = ∆t fn

and hence

Un+1 = Un + ∆t fn − ∆tα

Γ(α + 1)

n∑j=0

wn−jAUj .

Eigenvectors

Recall that

φm(x) = sinmπ

Lx and λm = Kα

(mπ

L

)2

satisfy

−Kαd2

dx2φm = λmφm for x ∈ Ω = (0, L),

with φm(0) = 0 = φm(L). Putting

Φm =

φm(x1)φm(x2)

...φm(xP−1)

and Λm =Kα

∆x2

(2 sin

mπ

2L∆x

)2

,

we find thatAΦm = ΛmΦm.

Discrete L2-inner product and -norm

For U, V ∈ RP−1 define

〈U,V〉 =P−1∑p=1

UpVp ∆x and ‖U‖ =√〈U,U〉.

We find that Φ1, Φ2, . . . . . . , ΦP−1 form an orthogonal basisfor RP−1,

〈Φm,Φm′〉 = 0 if m 6= m′ and m, m′ ∈ 1, 2, . . . ,P − 1,

and, with θ = mπ/P,

‖Φm‖2 =P−1∑p=1

(sin

mπ

Lxp

)2

∆x =L

P

P∑p=1

sin2 pθ

=L

P

(P

2− cos(P + 1)θ sin Pθ

2 sin θ

)=

L

2.

Stability of the discrete Fourier modesDefine the discrete Fourier coefficients

Unm =

〈Un,Φm〉‖Φm‖2

, 1 ≤ m ≤ P − 1,

so that

Un =P−1∑m=1

UnmΦm.

For 1 ≤ m ≤ P − 1,

Un+1m − Un

m +Λm ∆tα

Γ(α + 1)

n−1∑j=0

wn−j Ujm = f n

m ∆t,

with U0m = u0m, so our earlier analysis gives

|Unm| ≤ |U0

m|+n−1∑j=0

|f nm|∆t provided

Λm ∆tα

Γ(α + 1)≤ 1.

Stability of the full solution

Since

‖U‖2 =P−1∑m=1

|Um|2‖Φm‖2 =L

2

P−1∑m=1

|Um|2,

and Λm ≤ 4Kα/∆x2, we can show the following.

TheoremIf

ρ ≡ 4Kα ∆tα

Γ(α + 1) ∆x2≤ 1

then

‖Un‖2 ≤ 2‖U0‖2 + 2tn

n−1∑j=0

‖f j‖2 ∆t for 1 ≤ n ≤ N.

Problem if α is small

The stability restriction ρ ≤ 1 means that the time step must bechosen so that

∆tα ≤ Γ(α + 1)

4Kα∆x2.

This is a severe restriction if α is small.

Example

Suppose α = 1/5, Kα = Γ(α + 1) and ∆x = 2× 10−3, then werequire

∆t ≤ (∆x/2)10 = 10−30.

Therefore natural to consider implicit methods.


Again start with the ODE

u + λ∂1−αt u = f (t) for 0 < t < T , with u(0) = u0,

but now integrate over (tn−1, tn) to obtain

u(tn)− u(tn−1) + λ

∫ tn

tn−1

∂1−αt u(t) dt =

∫ tn

tn−1

f (t) dt,

and compute Un ≈ u(tn) via

Un − Un−1 + λ

∫ tn

tn−1

∂1−αt U(t) dt = f n ∆t,

whereU(t) = Un for tn−1 < t ≤ tn.

Weights

We find that∫ tn

tn−1

∂1−αt U(t) dt = (Υα ∗ U)(tn)− (Υα ∗ U)(tn−1)

=

∫ tn

tn−1

Υα(tn − t)Un dt

+n−1∑j=1

∫ tj

tj−1

[Υα(tn − t)−Υα(tn−1 − t)

]U j dt

=∆tα

Γ(α + 1)

n∑j=1

ωn−jUj ,

where, as before ω0 = 1 and

ωj = (j + 1)α − 2jα + (j − 1)α for j ≥ 1.

Unconditional stability

In this way,

Un − Un−1 + ρ

n∑j=1

wn−jUj = f j ∆t, ρ =

λ∆tα

Γ(α + 1).

Thus, starting from U0 = u0, we compute Un for n = 1, 2, . . . , Nby solving

(1 + ρ)Un = Un−1 + f n ∆t − ρn−1∑j=1

wn−jUj .

Theorem

|Un| ≤ |U0|+n∑

j=1

|f j |∆t for 1 ≤ n ≤ N.

Proof

We have

n−1∑j=1

|wn−j | =n−1∑j=1

|wj | = −n−1∑j=1

wj = 1 + (n − 1)α − nα ≤ 1

so(1 + ρ)|Un| ≤ |Un−1|+ |f n|∆t + ρ max

1≤j≤n−1|U j |

and therefore

|Un| ≤ |f n|∆t +|Un−1|1 + ρ

+ρ

1 + ρmax

1≤j≤n−1|U j |

≤ |f n|∆t + max1≤j≤n−1

|U j |.

The desired estimate follows using induction on n.

Example: α = 1/2, λ = 1, f ≡ 0, N = 25

Convergence behaviour

Put

E1(N) = max1≤n≤N

|Un − u(tn)|,

E2(N) = max0.5≤tn≤T

|Un − u(tn)|.

N E1(N) E2(N)

80 1.344e-02 3.951e-03

160 8.303e-03 0.6944 2.023e-03 0.9654

320 5.005e-03 0.7303 1.028e-03 0.9771

640 2.911e-03 0.7821 5.196e-04 0.9844

1280 1.650e-03 0.8186 2.617e-04 0.9892

2560 9.159e-04 0.8495 1.316e-04 0.9925

5120 4.998e-04 0.8739 6.602e-05 0.9947

10240 2.688e-04 0.8949 3.310e-05 0.9963

Remark on computing the weights

Recall that if we compute a sum with M terms,

S =M∑

m=1

Am

in a system of floating-point arithmetic with unit roundoff ε, then

| fl(S)− S | ≤ Mε

1−Mε

M∑m=1

|Am|.

Thus, for wj = (j + 1)α − 2jα + (j − 1)α and j large,

| fl(wj)− wj | . εjα whereas wj ≈ α(1− α)jα−2

so the relative rounding error | fl(wj)− wj |/|wj | is of order εj2.

By writing wj = ∆j −∆j−1 and using expm1 and log1p toevaluate

∆j ≡ (j + 1)α − jα = jα[(1 + j−1)α − 1

]= jα

(exp[α log(1 + j−1)

]− 1),

we can reduce somewhat the rounding error in fl(wj) for large j .

When we compute

S = ρ

n∑j=1

wn−jUj

the estimates above yield

| fl(S)− S | . nεn∑

j=1

(n − j)α|U j | . nα+2ε max1≤j≤n

|U j |,

which suggests that roundoff might become a problem oncenα+2 ≥ ε−1.

Fractional diffusion equation

For Ω = (0, L), we again consider the initial-boundary valueproblem

ut − Kα∂1−αt uxx = f (x , t) for x ∈ Ω and 0 < t < T ,


u = 0 for x ∈ ∂Ω and 0 < t < T .

The implicit time-stepping scheme leads to

Unp − Un−1

p − Kα ∆tα

Γ(α + 1)

n∑j=1

wn−jU jp+1 − 2U j

p + U jp−1

∆x2= f n

p ∆t

for 1 ≤ n ≤ N and 1 ≤ p ≤ P − 1, with

U0p = u0(xp) and Un

0 = 0 = UnP .

Stencil

x

t

Matrix–vector formulation

We have

Un −Un−1 +∆tα

Γ(α + 1)

n∑j=1

wn−jAUj = ∆t fn

so

(I + B)Un = Un−1 + ∆t fn −n−1∑j=1

wn−jBUj

where

B =∆tα

Γ(α + 1)A =

Kα ∆tα

Γ(α + 1)∆x2

2 −1−1 2 −1

. . .. . .

. . .

−1 2 −1−1 2

.

Computational cost

At the nth time step, evaluation of the RHS costs O(nP) flops,and the elliptic solve costs O(P) flops. Since

N∑n=1

n ≈ N2

2,

the overall cost of N time steps is O(N2P).

Also, the nth time step requires O(nP) active memory locations.

Contrast this with using the implicit Euler method to solve theclassical diffusion equation (the case α = 1): each time steprequires O(P) flops and O(P) active memory locations, and Ntime steps requires O(NP) flops.

Conclusion: cost when 0 < α < 1 is N times the cost when α = 1.

Comparison with direct simulation

Let u = u(x , t) be the solution of

ut − Kα∂1−αt uxx = f (x , t) for x ∈ Ω = (−L, L) and 0 < t < T ,

u = δ(x) for x ∈ Ω, when t = 0,

u = 0 for x ∈ ∂Ω = −L, L and 0 < t < T .

We can approximate u using the implicit Euler method, or bysimulating CTRWs with

ψ(t) =α

(1 + t)αand λ(x) =

1√2π

exp(−x2/2).

In this case ψ(t) ∼ A/t1+α as t →∞, with A = α, soBα = Aα−1Γ(1− α) = Γ(1− α) and we must rescale in such away that

σ2

2Γ(1− α)τα= Kα

Implicit Euler solutions (α = 2/3)

Probability densities of CTRWs (15, 000 samples)

Part VI

Spatial discretization via finite elements

Introduction

The finite element method provides the simplest approach fordiscretization of a fractional diffusion problem on a spatial domainof general shape. In the classical method of lines for the heatequation, such a spatial discretization leads to a large, stiff systemof first-order ODEs in time, that can be integrated using anappropriate black-box routine. For a time-fractional diffusionproblem, we instead obtain a system of fractional-order ODEs.

In this lecture, we seek to estimate the errors from the spatialdiscretization assuming that the time integration is performedexactly. Suitable approaches for the time discretization include theimplicit Euler method described previously and the more accurateschemes addressed in subsequent lectures.

Outline

Method of lines

Error estimates

Non-smooth data

Method of lines

Spatial domain Ω ⊆ Rd (d = 1, 2 or 3); for simplicity a convexpolygon or polyhedron so the elliptic problem is H2-regular.

With 0 < α < 1, let u = u(x , t) be the mild solution of

ut − Kα∂1−αt ∇2u = f (x , t) for x ∈ Ω and t > 0,


subject to homogeneous Dirichlet boundary conditions

u = 0 for x ∈ ∂Ω and t > 0.

We wish (for now) to discretize in space only.

Weak formulation

Put

Au = −Kα∇2u and a(u, v) =

∫Ω

Kα∇u · ∇v dx .

First Green identity:∫Ω

Au v dx = a(u, v)−∫∂Ω

Kα∂u

∂nv ds.

Thus,

〈ut , v〉+ a(∂1−αt u, v) = 〈f , v〉 for v ∈ H1

0 (Ω),

and also

〈ut , v〉+ ∂1−αt a(u, v) = 〈f , v〉 for v ∈ H1

0 (Ω).

Finite element space

Family of triangulations Th of Ω, where as usual

h = maxK∈Th

diam(K ).

Let Vh denote the space of real-valued functions on Ω that arecontinuous piecewise polynomials of degree at most p ≥ 1 withrespect to Th, and which vanish on ∂Ω. Hence,

Vh ⊆ H10 (Ω).

Seek a finite element solution uh : [0,∞)→ Vh satisfying

〈uht , χ〉+ ∂1−αt a(uh, χ) = 〈f , χ〉 for χ ∈ Vh and t > 0,

with uh(0) = u0h ≈ u0 for a suitable u0h ∈ Vh.

Alternative formulation

Define the discrete elliptic operator Ah : Vh → Vh by

〈Ahψ, χ〉 = a(ψ, χ) for ψ, χ ∈ Vh,

and the L2-projector Ph : L2(Ω)→ Vh by

〈Phv , χ〉 = 〈v , χ〉 for v ∈ L2(Ω) and χ ∈ Vh.

Since

∂1−αt a(uh, χ) = ∂1−α

t 〈Ahuh, χ〉 = 〈∂1−αt Ahuh, χ〉,

we see that

〈uht + ∂1−αt Ahuh, χ〉 = 〈f , χ〉 = 〈Phf , χ〉 for all χ ∈ Vh,

and thusuht + ∂1−α

t Ahuh = Phf for t > 0.

Nodal equationsConstruct a nodal basis χ1, χ2, . . . , χN for Vh so that

χj(xk) = δjk ,

where x1, x2, . . . , xN are the free (interior) nodes. Thus,

uh(x , t) =N∑

k=1

Uk(t)χk(x) where Uk(t) = uh(xk , t).

Define the mass matrix M, stiffness matrix S and load vector f by

Mjk = 〈χk , χj〉, Sjk = a(χk , χj), fk(t) = 〈f (t), χk〉

then the nodal vector U = [Uk(t)] satisfies the system of ordinaryintegro-differential equations

MdU

dt+ ∂1−α

t SU = f(t) for t > 0.

Discrete eigensystem

The finite dimensional linear operator Ah : Vh → Vh is symmetricand positive-definite, so Vh has an orthonormal basis φh1, φh2, . . . ,φhN consisting of eigenfunctions of Ah. Thus,

Ahφhm = λhmφ

hm for 1 ≤ m ≤ N,

with 〈φhm, φhn〉 = δmn, and consequently

uh(t) = Eh(t)u0h +

∫ t

0Eh(t − s)Phf (s) ds,

where the discrete solution operator for the homogeneous problemis defined by

Eh(t)χ =N∑

m=1

Eα(−λhmtα)〈χ, φhm〉φhm for χ ∈ Vh.

Stability

TheoremThe finite element solution is stable in L2(Ω):

‖uh(t)‖ ≤ ‖u0h‖+

∫ t

0‖Phf (s)‖ ds for t ≥ 0.

Proof.Since 0 < Eα(−s) ≤ 1 for 0 ≤ s <∞,

‖Eh(t)χ‖2 =N∑

m=1

∣∣Eα(−λhmtα)〈χ, φhm〉∣∣2 ≤ N∑

m=1

∣∣〈χ, φhm〉∣∣2 = ‖χ‖2,

so ‖Eh(t)χ‖ ≤ ‖χ‖ and the desired estimate follows from theDuhamel formula above.

Error estimates

We wish to estimate the error uh(t)− u(t) in L2(Ω) and in H10 (Ω).

Our assumption that Ω is convex implies that the elliptic problem,

−∇2u = f in Ω, with u = 0 on ∂Ω,

is H2-regular, that is, the weak solution u ∈ H10 (Ω), given by

a(u, v) = 〈f , v〉 for all v ∈ H10 (Ω),

necessarily belongs to H2(Ω) and

‖u‖2 ≤ C‖f ‖.

Caution: in this lecture, ‖v‖r denotes the norm in H r (Ω), so‖v‖r <∞ does not guarantee v ∈ Hr

D(Ω) unless v vanishesappropriately on ∂Ω.

Error in the elliptic problem

Let uh ∈ Vh be the finite element solution of the elliptic problemabove, so that

a(uh, χ) = 〈f , χ〉 for all χ ∈ Vh.

Since the symmetric bilinear form a(u, v) is bounded and coerciveon H1

0 (Ω), and using an appropriate quasi-interpolant, we have

‖uh − u‖1 ≤ C infχ∈Vh

‖χ− u‖1 ≤ Chr−1‖u‖r for 1 ≤ r ≤ p + 1.

The usual duality argument (which relies on H2-regularity) thenimplies that

‖uh − u‖ ≤ Ch infχ∈Vh

‖χ− u‖1 ≤ Chr‖u‖r for 1 ≤ r ≤ p + 1.

Ritz projector

It is convenient to define Rh : H10 (Ω)→ Vh by Rhu = uh, or

equivalently,

a(Rhv , χ) = a(v , χ) for all χ ∈ Vh.

It follows that R2h = Rh, with

‖v − Rhv‖ ≤ Chr‖v‖r and ‖v − Rhv‖1 ≤ Chr−1‖v‖r

for 1 ≤ r ≤ p + 1. Also, since

〈PhAv , χ〉 = 〈Av , χ〉 = a(v , χ) = a(Rhv , χ) = 〈AhRhv , χ〉

for all v ∈ H10 (Ω) and χ ∈ Vh, we see that

PhA = AhRh : H10 (Ω)→ Vh.

Equation for the error

Returning to the time-dependent case, we split the error into twoterms

uh(t)− u(t) = ϑ(t) + %(t), ϑ = uh − Rhu, % = Rhu − u.

Then for all χ ∈ Vh,

〈ϑt , χ〉+ ∂1−αt a(ϑ, χ)

= 〈uht , χ〉+ ∂1−αt a(uh, χ)− 〈Rhut , χ〉 − ∂1−α

t a(Rhu, χ)

= 〈f , χ〉+ 〈Rhut , χ〉 − ∂1−αt a(u, χ)

= 〈ut , χ〉+ ∂1−αt a(u, χ)− 〈Rhut , χ〉 − ∂1−α

t a(u, χ)

so ϑ : [0,∞)→ Vh satisfies an equation of the same form as theone for uh (with ut − Rhut playing the role of f )

〈ϑt , χ〉+ ∂1−αt a(ϑ, χ) = 〈ut − Rhut , χ〉.

Estimate for ϑ = uh − Rhu

LemmaFor 1 ≤ r ≤ p + 1,

‖ϑ(t)‖ ≤ ‖u0h − Rhu0‖+ Chr

∫ t

0‖ut‖r ds

Proof.Stability in L2(Ω) gives

‖ϑ(t)‖ ≤ ‖ϑ(0)‖+

∫ t

0‖Ph(ut − Rhut)‖ ds,

and here ϑ(0) = uh0 − Rhu0 with

‖Ph(ut − Rhut)‖ ≤ ‖ut − Rhut‖ ≤ Chr‖ut‖r .

Estimate for % = Rhu − u

LemmaFor 1 ≤ r ≤ p + 1,

‖%(t)‖ ≤ Chr

(‖u0‖r +

∫ t

0‖ut‖r ds

).

Proof.Write

%(t) = %(0) +

∫ t

0%t(s) ds

and use‖%t‖ = ‖ut − Rhut‖ ≤ Chr‖ut‖r

together with

‖%(0)‖ = ‖Rhu0 − u0‖ ≤ Chr‖u0‖r .

Quasi-optimal error bound in L2(Ω)

TheoremThe finite element solution of the time-fractional diffusion equationsatisfies

‖uh(t)− u(t)‖ ≤ ‖u0h − Rhu0‖+ Chr

(‖u0‖r +

∫ t

0‖ut(s)‖r ds

)for 1 ≤ r ≤ p + 1.

In practice, this bound is unlikely to be useful for r > 2 because uwill not be sufficiently smooth for

(· · ·)

on the RHS to be finite.

Realistic smoothness

Consider the homogeneous problem with f ≡ 0 so thatu(t) = E(t)u0, and choose p = 1 (piecewise-linears).

For sufficiently small ε > 0, if u0 ∈ H2+ε(Ω) and u0 = 0 on ∂Ω,then u0 ∈ H2+ε

D (Ω) so by our earlier regularity theorem,

‖tut‖2 ≤ Ctεα/2‖u0‖2+ε,

implying that ‖ut‖2 = O(tεα/2−1) as t → 0. Choosing u0h = Rhu0

for simplicity, we obtain

‖uh(t)− u(t)‖ ≤ Ch2‖u0‖2+ε for 0 ≤ t ≤ T ,

with C = C (T , α, ε,Ω).

Estimate for ‖ϑ(t)‖1

We saw that

〈ϑt , χ〉+ a(∂1−αt ϑ, χ) = 〈−ρt , χ〉 for all χ ∈ Vh.

Choosing χ = Ahϑ(t), we have

〈ϑt ,Ahϑ〉 =1

2

d

dt〈ϑ,Ahϑ〉 =

1

2

d

dta(ϑ, ϑ)

witha(∂1−α

t ϑ,Ahϑ) = 〈∂1−αt Ahϑ,Ahϑ〉

and

〈−%t ,Ahϑ〉 = −〈%t ,PhAhϑ〉 = −〈Ph%t ,Ahϑ〉 = −a(Ph%t , ϑ).

Thus,

1

2

d

dta(ϑ, ϑ) + 〈∂1−α

t Ahϑ,Ahϑ〉 = −a(Ph%t , ϑ).

Integrating from t = 0 to t = T and using∫ T

0〈∂1−α

t Ahϑ,Ahϑ〉 dt ≥ 0,

we have

a(ϑ(T ), ϑ(T )

)− a(ϑ(0), ϑ(0)

)≤ −2

∫ T

0a(Ph%t , ϑ) dt

Since a(v , v) is equivalent to ‖v‖21,

‖ϑ(T )‖1 ≤ C

(‖ϑ(0)‖1 +

∫ T

0‖Ph%t‖1 dt

).

Choose t∗ such that

‖ϑ(t∗)‖1 = max0≤t≤T

‖ϑ(t)‖1,

then

‖ϑ(T )‖1‖ϑ(t∗)‖1 ≤ ‖ϑ(t∗)‖21

≤ C‖ϑ(0)‖21 + C

∫ t∗

0‖Ph%t‖1‖ϑ‖1 dt

≤ C

(‖ϑ(0)‖1 +

∫ T

0‖Ph%t‖1 dt

)‖ϑ(t∗)‖1

so

‖ϑ(T )‖21 ≤ C‖ϑ(0)‖2

1 + C

∫ T

0‖Ph%t(t)‖1‖ϑ(t)‖1 dt

Quasi-optimal error bound in H1(Ω)

Assume now that Th is such that the L2-projector Ph is stablein H1(Ω), that is,

‖Phv‖1 ≤ C‖v‖1 for v ∈ H1(Ω).

For instance, it suffices to assume that Th is quasi-uniform.

TheoremThe finite element solution of the time-fractional diffusion equationsatisfies

‖uh(t)− u(t)‖1 ≤ C‖u0h − Rhu0‖1

+ Chr−1

(‖u0‖r +

∫ t

0‖ut(s)‖r ds

)for 2 ≤ r ≤ p + 1.

ProofRecall that uh − u = ϑ+ % and

‖ϑ(t)‖1 ≤ C

(‖ϑ(0)‖1 +

∫ t

0‖Ph%t(s)‖1 dt

).

Here,‖ϑ(0)‖1 = ‖u0h − Rhu0‖1,

and since Ph is stable in H1(Ω),

‖Ph%t(s)‖1 ≤ C‖ut(s)− Rhut(s)‖1 ≤ Chr−1‖ut(s)‖r .

The error bound follows because

‖%(t)‖1 ≤ ‖%(0)‖1 +

∫ t

0‖%t(s)‖1 ds

= ‖u0 − Rhu0‖1 +

∫ t

0‖ut(s)− Rhut(s)‖1 ds

≤ Chr−1

(‖u0‖1 +

∫ t

0‖ut(s)‖1 ds

).

Non-smooth data

Define the closed sector

Σψ = z ∈ C : z 6= 0 and | arg z | ≤ ψ ∪ 0.

Our aim now is to prove the following error bound, which does notrequire any spatial regularity for u0 or f .

TheoremAssume that u0h = Phu0, and fix ϕ such that 0 < ϕ < π/2. Then,

‖uh(t)− u(t)‖ ≤ C t−αh2(‖u0‖+ sup

z∈∂Σπ−ϕ

‖f (z)‖)

for 0 < t ≤ T .

Under Laplace transformation, our problem

ut + ∂1−αt Au = f (t) for t > 0, with u(0) = u0,

becomeszu(z)− u0 + z1−αAu(z) = f (z),

so

(zαI + A)u(z) = zα−1g(z) where g(z) = u0 + f (z).

Similarly, for uh we have

(zαI + Ah)uh(z) = zα−1gh(z) where gh(z) = u0h + Ph f (z).

Notice that gh(z) = Phg(z) because we assume u0h = Phu0.

Resolvent estimate for A

Recall that Aφm = λmφm with 0 ≤ λ1 < λ2 < · · · .

TheoremLet ϕ satisfy 0 < ϕ ≤ π/2 and put

M =1

sinϕ.

If z ∈ Σπ−ϕ then

‖(zI + A)−1‖ ≤ M

|z |≤(

1 +2

λ1

)M

1 + |z |.

ProofLet z = re iθ with |θ| ≤ π − ϕ and r > 0.

If 0 ≤ |θ| ≤ π/2 then |z | ≤ |z + λm| because

0 ≤ <z ≤ <(z + λm) and =z = =(z + λm)

If π/2 ≤ |θ| ≤ π − ϕ then |z | ≤ M|z + λm| because

|z | sinϕ ≤ |z | sin |θ| = |=z | = |=(z + λm)| ≤ |z + λm|.

Therefore, since M ≥ 1,

1

|z + λm|≤ M

|z |

and so (zI + A)−1 exists with

(zI + A)−1v =∞∑

m=1

〈v , φm〉z + λm

φm for v ∈ L2(Ω).

Hence, Parseval’s identity yields the first estimate:

‖(zI + A)−1v‖2 =∞∑

m=1

∣∣∣∣〈v , φm〉z + λm

∣∣∣∣2≤(

M

|z |

)2 ∞∑m=1

|〈v , φm〉|2 =

(M

|z |‖v‖)2

.

If |z | ≤ λ1/2, then |z + λm| ≥ λ1 − |z | ≥ λ1/2 so

1 + |z ||z + λm|

≤ 1 + λ1/2

λ1/2= 1 +

2

λ1.

If |z | ≥ λ1/2, then

1 + |z ||z + λm|

≤ 2|z |/λ1 + |z ||z | sinϕ

= M

(1 +

2

λ1

),

and the second estimate follows at once.

Resolvent estimate for Ah

TheoremLet ϕ satisfy 0 < ϕ ≤ π/2 and put M = 1/ sinϕ. If z ∈ Σπ−ϕ then

‖(zI + Ah)−1‖ ≤ M

|z |≤(

1 +2

λ1

)M

1 + |z |.

Proof.The same proof shows that the conclusion holds with λh1 in placeof λ1. But

λh1 = minχ∈Vh

〈Ahχ, χ〉‖χ‖2

= minχ∈Vh

〈Aχ, χ〉‖χ‖2

≥ minv∈H1

0 (Ω)

〈Av , v〉‖v‖2

= λ1,

so 2/λh1 ≤ 2/λ1.

Integral representations

Sinceu(z) = zα−1(zαI + A)−1g(z)

anduh(z) = zα−1(zαI + Ah)−1Phg(z)

the Laplace inversion formula gives

u(t) =1

2πi

∫Γ

eztzα−1(zαI + A)−1g(z) dz

and

uh(t) =1

2πi

∫Γ

eztzα−1(zαI + Ah)−1Phg(z) dz ,

where

Γ = a + iy : −∞ < y <∞ for any a > 0.

Error representation

Thus,

uh(t)− u(t) =1

2πi

∫Γ

eztzα−1Gh(z)g(z) dz

where

Gh(z) = (zαI + Ah)−1Ph − (zαI + A)−1 = G 1h (z) + G 2

h (z),

with

G 1h (z) = (zαI + Ah)−1Ph − Ph(zαI + A)−1,

G 2h (z) = (Ph − I )(zαI + A)−1.

LemmaFor z ∈ Σπ−ϕ,

‖A(zαI + A)−1v‖ ≤ C‖v‖ and ‖(zαI + Ah)−1Ahχ‖ ≤ C‖χ‖.

Proof.The identity

A(zαI + A)−1 = (zαI + A− zαI )(zαI + A)−1 = I − zα(zαI + A)−1

implies that

‖A(zαI + A)−1‖ ≤ 1 + |zα| M

|zα|≤ C .

Similarly,

(zαI + Ah)−1Ah = I − zα(zαI + Ah)−1.

Estimate for G 1h (z)

Lemma‖G 1

h (z)v‖ ≤ Ch2‖v‖ for z ∈ Σπ−ϕ.

Proof.Recall that PhA = AhRh, so

G 1h (z) = (zαI + Ah)−1Ph − Ph(zαI + A)−1

= (zαI + Ah)−1[Ph(zαI + A)− (zαI + Ah)Ph

](zαI + A)−1

= (zαI + Ah)−1Ah(Rh − Ph)(zαI + A)−1,

H2-regularity of the elliptic problem gives

‖G 1h (z)v‖ ≤ C‖(Rh − Ph)(zαI + A)−1v‖ ≤ Ch2‖(zαI + A)−1v‖2

≤ Ch2‖A(zαI + A)−1v‖ ≤ Ch2‖v‖.

Estimate for G 2h (z)

Lemma‖G 2

h (z)v‖ ≤ Ch2‖v‖ for z ∈ Σπ−ϕ.

Proof.As above,

‖G 2h (z)v‖ = ‖(Ph − I )(zαI + A)−1v‖

≤ Ch2‖(zαI + A)−1v‖2

≤ Ch2‖A(zαI + A)−1v‖≤ Ch2‖v‖.

Error estimate

Thus, we have shown that

uh(t)− u(t) =1

2πi

∫Γ

eztzα−1Gh(z)g(z) dz

with ‖Gh(z)v‖ ≤ Ch2‖v‖ for z ∈ Σπ−ϕ.

Deform the integration contour Γ to ∂Σπ−ϕ = Γ+ − Γ−, where

Γ± = se±i(π−ϕ) : 0 < s <∞,

so that uh(t)− u(t) = I+ − I− where

I± =1

2πi

∫Γ±

eztzα−1Gh(z)g(z) dz .

The substitution z = se±i(π−ϕ) = s(− cosϕ± i sinϕ) gives

‖I±‖ ≤ Ch2(‖u0‖+ sup

z∈Γ±

‖f (z)‖)∫ ∞

0e−st cosϕsα

ds

s.

A second substitution s = w(t cosϕ)−1 shows that the integral onthe right equals∫ ∞

0e−w

(w

t cosϕ

)α dw

w= Cα,ϕt−α.

Part VII

Methods based on the Laplace transform

Introduction

The Laplace inversion formula yields a contour integralrepresentation of the finite element solution uh(t) to thetime-fractional diffusion equation. Applying a quadratureapproximation [Lopez-Fernandez+Palencia-2004] leads to a fullydiscrete solution UN,h(t). The main cost of the method is thecomputation of uh(zj) at the quadrature point zj for |j | ≤ N,which requires the solution of a system of finite element equationsinvolving the complex parameter zj .

The main advantages of the this approach are high accuracy andeasy parallel implementation. A key disadvantage is that themethod imposes severe limitations on form of the source term f (t).We outline a modified approach [McLean+Thomee-2010] thatsacrifices some accuracy to relax the requirements on f (t).

Outline

Contour integral and quadrature

Parallel-in-time algorithm

A more flexible approach

Contour integral and quadratureOnce again consider

u + ∂1−αt Au = f (t) for t > 0, with u(0) = u0,

and recall that

u(t) =1

2πi

∫Γ

eztzα−1(zαI + A)−1g(z) dz ,

whereg(z) = u0 + f (z).

We now choose for Γ a contour of the form

z = z(ξ) = µ(1− sin(δ − iξ)

)for −∞ < ξ <∞,

where the parameters µ and δ satisfy

µ > 0 and 0 < δ <π

2.

Hyperbola

Sincesin(δ − iξ) = sin δ cosh ξ − i cos δ sinh ξ,

we have

x(ξ) = <z(ξ) = µ(1− sin δ cosh ξ

),

y(ξ) = =z(ξ) = µ cos δ sinh ξ,

so Γ is the left branch of the hyperbola(x − µµ sin δ

)2

−(

y

µ cos δ

)2

= 1.

Notice that the asymptotes are

y = ±(x − µ) cot δ.

Parameterised integral

We have

u(t) =1

2πi

∫Γ

eztzα−1(zαI + A)−1g(z) dz

=1

2πi

∫ ∞−∞

ez(ξ)tz(ξ)α(z(ξ)αI + A

)−1g(z(ξ)

)z ′(ξ)

z(ξ)dξ

with

z(ξ) = µ(1− sin(δ − iξ)

)= µ

(1− sin δ cosh ξ + i cos δ sinh ξ

),

z ′(ξ) = iµ cos(δ − iξ) = µ(− sin δ sinh ξ + i cos δ cosh ξ

).

Find that∣∣∣∣z ′(ξ)

z(ξ)

∣∣∣∣2 =cosh2 ξ − sin2 δ

(cosh ξ − sin δ)2=

cosh ξ + sin δ

cosh ξ − sin δ≤ 1 + sin δ

1− sin δ.

Double exponential decay

Key factor in the behaviour of the integrand is

|ez(ξ)t | = ex(ξ)t = exp(µt(1− sin δ cosh ξ)

),

and since cosh ξ ≥ 12 exp(|ξ|),

|ez(ξ)t | ≤ exp(µt − 1

2µt sin δ exp(|ξ|)).

Discretisation errorFor ∆ξ > 0, let

Q∞(v) =∞∑

j=−∞v(j ∆ξ) ∆ξ.

Theorem ([Trefethen+Weideman-2014])

Let r± > 0 and assume

1. v(ζ) is analytic on the strip −r− ≤ =ζ ≤ r+;

2.∫ r+

−r− |v(ξ + iη)| dη → 0 as |ξ| → ∞;

3.∫∞−∞ |v(ξ ± ir±)| dξ ≤ M±.

Then ∣∣∣∣Q∞(v)−∫ ∞−∞

v(ξ) dξ

∣∣∣∣ ≤ DE+ + DE−

where

DE± =M±

exp(2πr±/∆ξ)− 1.

Conformal mapping

The formulaz = Φ(ζ) = µ

(1− sin(δ − iζ)

)defines conformal mapping that takes the line <ζ = η to the leftbranch of the hyperbola(

x − µµ sin(δ + η)

)2

−(

y

µ cos(δ + η)

)2

= 1.

Need to ensure0 < δ + η <

π

2

so that the hyperbola crosses into the left half-plane.

Truncation error

In practice, compute

QN(v) =N∑

j=−Nv(j ∆ξ) ∆ξ,

so need to estimate

TE =∑|j |>N

v(j ∆ξ) ∆ξ

then the triangle inequality gives∣∣∣∣QN(v)−∫ ∞−∞

v(ξ) dξ

∣∣∣∣ ≤ |DE+ |+ |DE− |+ |TE |.

In our application, if ζ = ξ + iη then

|v(ζ)| ≤ C |eΦ(ζ)t | ≤ C exp(µt(1− sin(δ + η) cosh ξ

).

Lemma ([Lopez-Fernandez+Palencia-2004])

For a > 0 and 0 < b ≤ 1,∫ ∞0

ea(1−b cosh ξ) dξ ≤ Cea(1−b)L(ab)

where

L(x) =

1, x ≥ 1,

loge

x, x ≤ 1.

Proof

Since a(1− b cosh ξ) = a(1− b) + ab(1− cosh ξ) it suffices toestimate

I ≡∫ ∞

0eab(1−cosh ξ) dξ =

∫ ∞0

e−aby dy√y(y + 2)

=

∫ ∞0

e−x dx√x(x + 2ab)

,

where we used the substitution y = cosh ξ− 1 followed by x = aby .If ab ≥ 1 then I ≤

∫∞0 x−1/2e−x dx <∞, whereas if ab ≤ 1, then

I ≤∫ 1

0

dx√x(x + 2ab)

+

∫ ∞1

e−x dx

and the substitution x = abt/e gives∫ 1

0

dx√x(x + 2ab)

=

∫ e/(ab)

0

dt√t(t + 2e)

≤ C + loge

ab.

Quadrature error [Weideman+Trefethen-2007]

Thus, taking a = µt and b = sin(δ ± r±) in the lemma,∫ ∞−∞|v(ξ ± ir±)| dξ ≤ M± = Ceµt(1−sin(δ±r±))L

(µt sin(δ ± r±)

),

so

|DE± | ≤M±

e2πr±/∆ξ − 1≤ M±e−2πr±/∆ξ

≤ CL(µt sin(δ ± r±)

)exp[µt(1− sin(δ ± r±)

)− 2πr±/∆ξ

].

At the same time,

|TE | ≤∑|j |>N

|v(j ∆ξ)|∆ξ . 2|v(N ∆ξ)|∆ξ

≤ C exp(µt(1− sin δ cosh(N ∆ξ)

)∆ξ.

Choice of parameters

To balance TE+, TE− and DE, we want

µt(1− sin(δ + r+)

)− 2πr+

∆ξ= µt

(1− sin(δ − r−)

)− 2πr−

∆ξ

= µt(1− sin δ cosh(N ∆ξ)

),

while satisfying

0 < δ − r− < δ + r+ <π

2.

The limiting choices r+ = π/2− δ and r− = δ give

− 2π

∆ξ

(π

2− δ)

= µt − 2πδ

∆ξ= µt

(1− sin δ cosh(N ∆ξ)

),

so

µt =π

∆ξ(4δ − π) and µt sin δ cosh(N ∆ξ) =

2πδ

∆ξ.

Eliminating µt,

sin δ cosh(N ∆ξ) =2δ

4δ − π,

so we defineb(δ) = cosh−1

(2δ

(4δ − π) sin δ

)and obtain

∆ξ =b(δ)

Nand µ =

π(4δ − π)

b(δ)

N

t.

In this case,

− 2π

∆ξ

(π

2− δ)

= −B(δ)N where B(δ) =π(π − 2δ)

b(δ),

and the error bound for the quadrature rule is

|DE+ |+ |DE− |+ |TE | ≤ CL(· · · )e−B(δ)N .

Optimal choice of δ

We find that B(δ) has a unique maximum in theinterval π/4 < δ < π/2, namely, at

δ∗ = 1.1721 0423, (near 3π/8 = 1.1780 9724)

and the maximum value is

B(δ∗) = 2.3156 5403.

Given N and t, the corresponding optimal parameters are

∆ξ∗ =1.0817 9214

Nand µ∗ = 4.4920 7528

N

t,

leading to an error bound with the decay factor

e−B(δ∗)N = e−2.3157N = 10.1315−N .

Scalar test problem

For λ > 0, recall that

Le−λt =

∫ ∞0

e−(z+λ)t dt = (z + λ)−1.

We consider

e−λt =1

2πi

∫Γ

ezt(z + λ)−1 dz

=1

2πi

∫ ∞−∞

ez(ξ)t(z(ξ) + λ

)−1z ′(ξ) dξ

≈ QN =1

2πi

N∑j=−N

ezj t(zj + λ)−1z ′j ∆ξ,

wherezj = z(j ∆ξ) and z ′j = z ′(j ∆ξ).

Convergence behaviour

Quadrature points for different N (with t = 1)

Quadrature points for different t (with N = 15)

Fixed points with varying t

Given N and τ > 0, suppose we choose ∆ξ∗ and µ∗ as abovefor t = τ . What if we use the approximation

e−λt ≈ 1

2πi

N∑j=−N

ezj t(zj + λ)−1z ′j ∆ξ

for t near τ?

In the fractional PDE case, we can solve one set of ellipticproblems for uh(zj) at the zj optimized for t = τ and use them notjust at t = τ but for several values of t in an interval around τ .The zj are then slightly sub-optimal but we avoid having tocompute a new set of uh(zj) for each t.

Error for the scalar example with τ = 1

Parallel-in-time algorithm

Combining the above approach to numerical inversion of theLaplace transform with a spatial discretisation by finite elementsleads to a fully discrete numerical method that involves no timestepping.

The method is particularly suited to applications in which thesolution is required for only a few values of t.

In addition to its high, spectral-order accuracy in time, the methodis embarassingly parallel.

Method-of-lines solution

Recall that the semidiscrete finite elementsolution uh : [0,∞)→ Vh satisfies

〈uht , χ〉+ ∂1−αt a(uh, χ) = 〈f (t), χ〉 for χ ∈ Vh and t > 0,

or equivalently,

uht + ∂1−αt Ahuh = Phf (t) for t > 0,

with uh(0) = u0h ≈ u0.

Under Laplace transformation in time, we obtain

zuh + z1−αAhuh = gh(z)

where gh(z) = u0h + Ph f (z).

Fully-discrete solution

If we solve the (complex) finite element equations for uh(z) ateach quadrature point z = zj , then we can compute

UN,h(t) =1

2πi

N∑j=−N

ezj t uh(zj)z ′j ∆ξ

as an approximation to

uh(t) =1

2πi

∫Γ

ezt uh(z) dz =1

2πi

∫ ∞−∞

ez(ξ)t uh

(z(ξ)

)z ′(ξ) dξ.

The computational cost is dominated by solving the ellipticproblems, and a key advantage of the method is that (unlike in atime-stepping scheme) these elliptic solves can easily be performedin parallel.

Halving the computational cost

Assuming that u0 and f are real-valued, it follows that uh isreal-valued and so

uh(z) = uh(z).

Sincez(−ξ) = z(ξ) and z ′(−ξ) = −z ′(ξ),

soz−j = zj , uh(z−j) = uh(zj), z ′−j/i = z ′j/i

and therefore

1

2πi

−1∑j=−N

ezj t uh(zj)z ′j ∆ξ =1

2πi

N∑j=1

ezj t uh(zj)z ′j ∆ξ.

Since y0 = 0 and x ′0 = 0, it follows that

UN,h(t) =1

2πex0t uh(x0)y ′0 ∆ξ +

1

π

N∑j=1

<(ezj t uh(zj)z ′j/i

)∆ξ.

In particular, it suffices to compute uh(zj) for 0 ≤ j ≤ N.

In matrix terms, we solve the N + 1 linear systems

(zαj M + S)U(zj) = zα−1j G(zj), 0 ≤ j ≤ N,

where M and S are the mass and stiffness matrices, U(z) is thevector of nodal values of uh(z), and G(zj) is the load vectorfor gh(z).

Quadrature error

To estimate DE± we must bound

M± =1

2π

∫ ∞−∞

∥∥ez(ξ+iη)t uh

(z(ξ + iη)

)z ′(ξ + iη)

∥∥ dξ

for η = δ ± r± where 0 < r− < δ < r+ < π/2. We saw earlier that

‖(zI + Ah)−1v‖ ≤ ‖v‖|z | sinϕ

for z ∈ Σπ−ϕ and v ∈ Vh,

and ∣∣∣∣z ′(ξ + iη)

z(ξ + iη)

∣∣∣∣ ≤√

1 + sin(δ + η)

1− sin(δ + η)=

2

cos(δ + η).

Since(zαI + Ah)uh(z) = zα−1gh(z),

we have

‖uh(z)‖ =∥∥zα−1(zαI + Ah)−1gh(z)

∥∥ ≤ ‖gh(z)‖|z | sinϕ

for zα ∈ Σπ−ϕ, that is, for z ∈ Σ(π−ϕ)/α and hence for z ∈ Σπ−ϕ.

Therefore,

M± ≤1

2π sinϕ

∫ ∞−∞

∣∣ez(ξ±ir±)t∣∣∥∥gh

(z(ξ ± ir±)

)∥∥ |z ′(ξ ± ir±)||z(ξ ± ir±)|

dξ

≤ 1

π sinϕ cos(δ ± r±)

∫ ∞−∞

∣∣ez(ξ±ir±)t∣∣∥∥gh

(z(ξ ± ir±)

)∥∥ dξ,

and similarly for TE.

Conclusion: if f (z) is analytic with

‖f (z)‖ ≤ Cf ,ϕ for z ∈ Σπ−ϕ,

so that

‖gh(z)‖ ≤ ‖u0‖+ ‖Ph f (z)‖ ≤ ‖u0‖+ Cf ,ϕ for z ∈ Σπ−ϕ,

then we may estimate the quadrature error as before and obtain

‖UN,h(t)− uh(t)‖ ≤ Ce−B(δ)NL(ct).

Thus, by the triangle inequality,

‖UN,h(t)− u(t)‖ ≤ ‖UN,h(t)− uh(t)‖+ ‖uh(t)− u(t)‖= O

(L(ct)e−B(δ)N

)+ O(t−αh2).

Roundoff

In practice, roundoff means that we compute

U?N,h(t) =

1

2πi

N∑j=−N

ezj t uh(zj)z ′j (1 + εj) ∆ξ

with ‖εj‖L∞(Ω) ≤ ε, leading to an additional perturbation

‖U?N,h(t)− UN,h(t)‖ ≤ ε

2π

N∑j=−N

|ezj t |‖uh(zj)‖|z ′j |∆ξ

≤ ε

π sinϕ cos δ

N∑j=−N

|ezj t |‖gh(zj)‖∆ξ

Since

N∑j=−N

|ezj t |∆ξ =N∑

j=−Nexp(µt(1− sin δ cosh(j ∆ξ)

)∆ξ

≈∫ N ∆ξ

−N ∆ξeµt(1−sin δ cosh ξ dξ ≤ Ceµ(1−sin δ)L(µt sin δ)

we conclude that

‖U?N,h(t)− UN,h(t)‖ ≤ Cεeµt(1−sin δ)L(µt sin δ) max

−N≤j≤N‖g(zj)‖.

The optimal choices

δ∗ = 1.1721 and µ∗ = 4.4921N/t

lead toeµ∗t(1−sin δ∗) = e1.4224N = 1.422N .

Example

Suppose h = 10−3 and ε = 2−52 ≈ 2.22× 10−16. Then

ε 1.422N ≥ h2 = 10−6

when

N ≥ log(10−6/ε)

log 1.422= 63.14.

So roundoff probably not an issue.

However, “optimal” parameter values might be problematic if ahigh-accuracy spatial discretisation (say a spectral method) wereused. We do not want µ to be too large.

Behaviour of f (z)

Example

If f (x , t) = g(x)e−at cosωt, then

f (x , z) = g(x)z + a

(z + a)2 + ω2,

which has simple poles at z = −a± iω.

Example

If

f (x , t) =

g(x), 1 < t < 2,

0, otherwise,

then

f (x , z) =

∫ 2

1e−ztg(x) dt = g(x)

e−z − e−2z

z,

and ∣∣e−2z(ξ)∣∣ = eµ(sin δ cosh ξ−1).

A more flexible approach [McLean+Thomee-2010]

Now drop the assumption that f (z) is analytic and boundedfor z ∈ Σπ−ϕ. We will describe a method based on Duhamel’sformula,

u(t) = E(t)u0 +

∫ t

0E(t − s)f (s) ds, t > 0.

Recall that E(t), the solution operator for the homogeneousfractional diffusion equation has the series representation

E(t)v =∞∑

m=0

Eα(−λmtα)〈v , φm〉φm, Aφm = λmφm,

and the integral representation

E(t)v =1

2πi

∫Γ

ezt E(z)v dz , E(z) = zα−1(zαI + A)−1.

Noting that∫ t

0E(t − s)f (s) ds =

∫ t

0

1

2πi

∫Γ

ez(t−s)E(z)f (s) dz ds

=1

2πi

∫ΓE(z)

∫ t

0ez(t−s)f (s) ds dz ,

we defineg(z , t) = eztu0 +

∫ t

0ez(t−s)f (s) ds

and deduceu(t) =

1

2πi

∫ΓE(z)g(z , t) dz .

Notice that g(z , t) is an entire function of z , with

‖g(z , t)‖ ≤ ‖u0‖+

∫ t

0‖f (s)‖ ds for <z ≤ 0 and t ≥ 0.

Since E(z) ∼ z−1I as |z | → ∞ with z ∈ Σπ−ϕ, we expect

E0(z) = E(z)− z−1I

to decay more rapidly than E(z), which is needed to compensatefor the disappearance of the factor ezt in the integrand.

TheoremIf 0 ≤ σ ≤ 1 and v ∈ D(Aσ), then

‖E0(z)v‖ ≤ Cϕ,σ‖Aσv‖|z |1+ασ

for z ∈ Σπ−ϕ,

where

Cϕ,σ =

(1 +

1

sinϕ

)1−σ( 1

sinϕ

)σ.

Proof

We have

E0(z) = E(z)− zI

= zα−1(zαI + A)−1 − z−1(zαI + A)(zαI + A)−1

=[zα−1I − z−1(zαI + A)

](zαI + A)−1

= −z−1A(zαI + A)−1

= −z−1[A(zαI + A)−1

]1−σ[(zαI + A)−1

]σAσ

and

A(zαI + A)−1 =[(zαI + A)− zα

](zαI + A)−1

= I − zα(zαI + A)−1.

For z ∈ Σπ−ϕ, our resolvent estimate gives

‖(zαI + A)−1‖ ≤ 1

|z |α sinϕ,

so ∥∥[(zαI + A)−1]σ∥∥ ≤ ∥∥(zαI + A)−1

∥∥σ ≤ ( 1

|z |α sinϕ

)σand ∥∥[A(zαI + A)−1

]1−σ∥∥ ≤ (1 +1

sinϕ

)1−σ.

Thus,

‖E0(z)v‖ ≤ 1

|z |

(1 +

1

sinϕ

)1−σ( 1

|z |α sinϕ

)σ‖Aσv‖.

Modified representation

Since E(z) = z−1I + E0(z) and

1

2πi

∫Γ

z−1g(z , t) dt = resz=0

g(z , t)

z= g(0, t) = u0 +

∫ t

0f (s) ds.

we have

u(t) = g(0, t) +1

2πi

∫Γ

w0(z , t) dz

where

w0(z , t) = E0(z)g(z , t) = w(z , t)− z−1g(z , t).

and w(z , t) = E(z)g(z , t) denotes the solution of the (complex)elliptic problem

(zαI + A)w(z , t) = zα−1g(z , t).

Similarly, if we define the spatially discrete operators

Eh(z) = zα−1(zI + Ah)−1 and E0h(z) = Eh(z)− z−1I

and put

gh(z , t) = eztu0h +

∫ t

0ez(t−s)Phf (s) ds,

then

uh(t) = gh(0, t) +1

2πi

∫ΓE0h(z)gh(z , t) dz

= u0h +

∫ t

0Phf (s) ds +

1

2πi

∫Γ

w0h(z , t) dz

where w0h(z , t) = E0h(z)gh(z , t) = wh(z , t)− z−1gh(z , t), andwh(z , t) is computed by solving the (complex) finite elementequations

(zαI + Ah)wh(z , t) = zα−1gh(z , t).

Fully-discrete schemeThe equal-weight quadrature approximation∫

Γw0h(z , t) dz ≈

N∑j=−N

w0h(zj , t)z ′j ∆ξ

leads to

UN,h(t) = gh(0, t) +1

2πi

N∑j=−N

w0h(zj , t)z ′j ∆ξ.

LemmaIf z = z(ξ + iη) and z ′ = z ′(ξ + iη), then for0 < σ1 + α−1 = σ ≤ 1,

‖wh(z , t)z ′‖ ≤ 2Cϕ,σ,σ1

cos(δ + η)

eµt(1−sin δ)

|z |ασ

(‖Aσhu0h‖

+ ‖Aσ1h Phf (0)‖+

∫ t

0‖Aσ1

h Phf ′(s)‖ ds

).

Outline of Proof

Integrating by parts, we find that

wh(z , t) = E0h(z)(eztu0h

)+ E0h(z)

(ezt − 1

zPhf (0) +

∫ t

0

ez(t−s) − 1

zPhf ′(s) ds

)with

‖E0h(z)u0hz ′‖ ≤ Cϕ,σ|z ′|‖Aσhu0h‖|z |1+ασ

= Cϕ,σ|z ′||z |‖Aσhu0h‖|z |ασ

and, since 2 + ασ1 = 1 + α(σ1 + α−1) = 1 + ασ,

‖z−1E0h(z)Phf (0)z ′‖ ≤ Cϕ,σ1 |z ′|‖Aσ1

h u0h‖|z |2+ασ1

= Cϕ,σ1

|z ′||z |‖Aσ1

h u0h‖|z |ασ

.

Now recall that |z ′/z | ≤ 2/ cos(δ + η).

Quadrature error

We saw previously that

|z(ξ + iη)| = µ(cosh ξ − sin(δ + η)

),

and since

cosh ξ − sin δ12 e |ξ|

= 1 + e−2|ξ| − 2e−|ξ| sin δ

= (1− e−|ξ| sin δ)2 + e−2|ξ| cos2 δ

≥ (1− sin δ)2,

it follows that

1

|z(ξ + iη)|ασ≤(

2

1− sin(δ + η)

)ασ e−ασ|ξ|

µασ.

Set r = r± such that 0 < δ − r < δ + r < π/2, and estimate∫ ∞−∞

∥∥w(z(ξ ± ir)

)z ′(ξ ± ir)

∥∥ dξ ≤ Ceµt(1−sin δ)

µασ

∫ ∞0

e−ασξ dξ

so

|DE± | ≤ Ceµt(1−sin δ)

µασe−2πr/∆ξ.

At the same time,

∑|j |>N

∥∥wh(zj , t)z ′j∥∥∆ξ ≤ C

eµt(1−sin δ)

µασ

∞∑j=N+1

e−ασj ∆ξ ∆ξ

so

|TE | ≤ Ceµt(1−sin δ)

µασe−ασN∆ξ.

Setting 2πr/∆ξ = ασN∆ξ, and choosing µ > 0 to minimiseeµt(1−sin δ)/µασ, we arrive at the following estimate.

TheoremFor the flexible scheme described above, if

∆ξ =

√2πr

ασNand µ =

ασ

t(1− sin δ),

then‖UN,h(t)− uh(t)‖ ≤ Ctασ exp

(−√

2πrασN).

The error bound suggests choosing δ = π/4 and r slightly lessthan π/4.

Example

Taking

r =π

4, α =

1

2, σ = 1

gives 2πrασ = π2/4 so the decay factor in the error bound is oforder

e−12π√N = e−1.5708

√N ,

compared to

e−B(δ∗)N = e−2.3157N = 10.1315−N

for our earlier method.

Part VIII

Convolution Quadrature

Introduction

Convolution quadrature [Lubich-1988, Lubich-2004] refers to anapproximation of the form∫ tn

0K (tn − s)f (s) ds ≈

n∑j=0

wn−j f (tj), tj = j ∆t,

where the convolution weights wn = wn(∆t) are computed directlyfrom the Laplace transform K (z) rather than the kernel K (t). Thisapproach can be advantageous if K (z) is simpler than K (t).

Convolution quadrature can be used to approximate any functionof the form K ∗ f , and in particular the fractionalintegral Iαf = Υα ∗ f .

Outline

Contour integrals again

Operational calculus

Application to fractional diffusion

Contour integrals again

Assume that K (z) is analytic and satisfies

|K (z)| ≤ C |z |−µ for z ∈ Σπ−ϕ, where µ > 0.

Then, as before,

K (t) =1

2πi

∫Γ

eztK (z) dz , t > 0.

Since∫ t

0K (t − s)f (s) ds =

∫ t

0

(1

2πi

∫Γ

ez(t−s)K (z) dz

)f (s) ds,

by reversing the order of integration we obtain

K ∗ f (t) =1

2πi

∫Γ

K (z)

∫ t

0ez(t−s)f (s) ds dz

The associated ODE

Put

y(t; z) =

∫ t

0ez(t−s)f (s) ds

so that

K ∗ f (t) =1

2πi

∫Γ

K (z)y(t; z) dz .

Notice thatdy

dt= f (t) + z

∫ t

0ez(t−s)f (s) ds

so y is the solution of the initial-value problem

dy

dt− zy = f (t) for t > 0, with y(0) = 0.


Compute

Y n = Y n(z) ≈ y(tn; z) where tn = n ∆t,

by solving

Y n − Y n−1

∆t− zY n = F n for n ≥ 1, with Y 0 = 0,

where F n = f (tn). Then,

K ∗ f (tn) =1

2πi

∫Γ

K (z)y(tn; z) dz

≈ 1

2πi

∫Γ

K (z)Y n(z) dz .

Generating functionLet

Y (ζ) = Y (ζ; z) =∞∑n=0

Y n(z)ζn.

Putting Y−1 = 0, we have

∞∑n=0

(Y n − Y n−1)ζn =∞∑n=0

Y nζn −∞∑

n=−1

Y nζn+1 = (1− ζ)Y (ζ),

so the finite difference equation implies that

1− ζ∆t

Y (ζ)− zY (ζ) = F (ζ).

Thus,

Y (ζ; z) =F (ζ)

δ(ζ)∆t−1 − zwhere δ(ζ) = 1− ζ.

Recall

(K ∗ f )(tn) ≈ 1

2πi

∫Γ

K (z)Y n(z) dz .

For ∆t and |ζ| sufficiently small, Γ passes to the left of the poleat z = δ(ζ)∆t−1, and

∞∑n=0

(K ∗ f )(tn)ζn ≈ 1

2πi

∫Γ

K (z)Y (ζ; z) dz

= − 1

2πi

∫Γ

K (z) dz

z − δ(ζ)∆t−1F (ζ).

Here, the integrand is O(|z |−1−µ) so Cauchy’s theorem gives

− 1

2πi

∫Γ

K (z) dz

z − δ(ζ)∆t−1= K

(δ(ζ)∆t−1

).

Thus,K ∗ f (ζ) ≈ K

(δ(ζ)∆t−1

)F (ζ).

WeightsDefine wn = wn(∆t) by

K(δ(ζ)∆t−1

)= w(ζ) =

∞∑n=0

wnζn,

then because

K(δ(ζ)∆t−1

)F (ζ) =

( ∞∑n=0

wnζn

)( ∞∑m=0

Fmζm)

=∞∑n=0

( n∑j=0

wn−jFj

)ζn,

we conclude that

(K ∗ f )(tn) ≈n∑

j=0

wn−jFj =

n∑j=0

wjFn−j .

Example

Suppose K (t) = Υα(t) and so K (z) = z−α. Then

K(δ(ζ)∆t−1

)=((1− ζ)∆t−1

)−α= ∆tα(1− ζ)−α

= ∆tα∞∑n=0

(−αn

)(−1)nζn,

showing that

wn(∆t) = ∆tα(−αn

)(−1)n.

Note that wn > 0 because w0 = ∆tα and, for n ≥ 1,(−αn

)(−1)n =

−α1× −α− 1

2× · · · × −α− n + 1

n(−1)n

=α

1× α + 1

2× · · · × α + n − 1

n=

(α + n − 1

n

).

Weights when α = 1/2 and ∆t = 1

Higher-order methods

To improve on the implicit Euler method, recall that the polynomial

Y n +∆Y n

∆t(t − tn) +

1

2

∆2Y n

∆t2(t − tn)(t − tn−1) + · · ·

+1

p!

∆pY n

∆tp(t − tn) · · · (t − tn−p+1)

of degree p takes the value Y j at t = tj for n − p ≤ j ≤ n, where∆Y n = Y n − Y n−1 denotes the backward difference.Differentiating with respect to t and setting t = tn leads to thebackward differentiation formula (BDF)

y ′(tn) ≈ ∆Y n

∆t+

1

2

∆2Y n

∆t2∆t + · · ·

+1

p!

∆pY n

∆tp(∆t)(2∆t) . . .

((p − 1)∆t

).

Simplifying:

y ′(tn) ≈ ∆Y n

∆t+

1

2

∆2Y n

∆t+ · · ·+ 1

p

∆pY n

∆t=

1

∆t

p∑`=1

∆`Y n

`.

Compute Y n(z) ≈ y(tn; z) by solving

1

∆t

p∑`=1

∆`Y n

`− zY n = F n for n ≥ 1,

with starting values Y 0 = Y−1 = · · · = Y−p = 0. Generatingfunction still satisfies

δ(ζ)

∆tY (ζ)− zY (ζ) = F (ζ) but now δ(ζ) =

p∑`=1

(1− ζ)`

`.

A(α)-Stability

The BDF of order p has the property that

| arg δ(ζ)| ≤ π − α for |ζ| < 1,

for the following values of α.

p α

1 90

2 90

3 86

4 73

5 51

6 17

Operational calculus

Notation:

K (∂)f (t) = K ∗ f (t) =

∫ t

0K (t − s)f (s) ds for t > 0.

In this way, sinceLK ∗ f = K (z)f (z),

we have

K (∂)f (t) =1

2πi

∫Γ

eztK (z)f (z) dz .

Explanation: if K (t) = 1 then K (z) = z−1 so

∂−1f (t) =

∫ t

0f (s) ds and L∂−1f = z−1f (z).

Example

For K (t) = Υα(t) = tα−1/Γ(α) we have K (z) = z−α and so

∂−αf (t) = Υα ∗ f (t) = Iαf (t)

is the fractional integral of order α > 0, with

L∂−αf = z−αf (z).

Example

If K (t) = eat then K (z) = (z − a)−1 so

(∂ − a)−1f (t) =

∫ t

0ea(t−s)f (s) ds

withL(∂ − a)−1f = (z − a)−1f (z).

Theorem

K (∂) =1

2πi

∫Γ

K (z)(∂ − z)−1 dz .

Proof.

K (∂)f (t) = (K ∗ f )(t) =

∫ t

0K (t − s)f (s) ds

=

∫ t

0

1

2πi

∫Γ

ez(t−s)K (z) dz f (s) ds

=1

2πi

∫Γ

K (z)

∫ t

0ez(t−s)f (s) ds dz

=1

2πi

∫Γ

K (z)(∂ − z)−1f (t) dz .

Discrete operational calculus

Notation:

K (∂∆t)f (t) =∑

0≤tj≤twj(∆t)f (t − tj) for t > 0.

In particular, at t = tn,

K (∂∆t)f (tn) =n∑

j=0

wj f (tn − tj) =n∑

j=0

wj f (tn−j),

or equivalently,

K (∂∆t)F n =n∑

j=0

wjFn−j =

n∑j=0

wn−jFj .

Example

If K (t) = 1 then K (z) = z−1 so

K(δ(ζ)∆t−1

)=

(1− ζ

∆t

)−1

=∆t

1− ζ= ∆t

∞∑n=0

ζn

for |ζ| < 1, showing that wj = ∆t and hence

∂−1∆t f (t) =

∑0≤tj≤t

f (t − tj) ∆t

≈∫ t

0f (t − s) ds =

∫ t

0f (s) ds = ∂−1f (t).

Example

The function

y(t) = (∂ − a)−1f (t) =

∫ t

0ea(t−s)f (s) ds

is the solution of the initial-value problem

y − ay = f (t) for t > 0, with y(0) = 0.

The BDF solution Y n satisfies(δ(ζ)∆t−1 − a

)Y (ζ) = F (ζ), F n = f (tn),

so if (δ(ζ)∆t−1 − a

)−1=∞∑n=0

wnζn,

thenY n =

n∑j=0

wn−jFj = (∂∆t − a)−1f (tn).

Example

For the BDF of order p = 2,

y ′(tn) ≈ ∆Y n

∆t+

1

2

∆2Y n

∆t

andδ(ζ) = (1− ζ) + 1

2 (1− ζ)2 = 32 (1− ζ)(1− 1

3ζ).

Recalling that

(1− ζ)−α =∞∑n=0

(α + n − 1

n

)ζn

we see that the weights for ∂−α∆t are

wn =

(2

3

)α n∑j=0

(α + n − j − 1

n − j

)(α + j − 1

j

)3−j .

Integral representation of the weights

Since

K(δ(ζ)∆t−1

)=∞∑n=0

wnζn,

for any sufficiently small ε > 0 we have

wn =1

2πi

∮|ζ|=ε

K(δ(ζ)∆t−1

)ζn+1

dζ

We can use this representation to show the following result.

Theorem

K (∂∆t) =1

2πi

∫Γ

K (z)(∂∆t − z)−1 dz .

Proof

We have

wn =1

2πi

∫|ζ|=ε

K(δ(ζ)∆t−1

)ζn+1

dζ

and

K(δ(ζ)∆t−1

)=

1

2πi

∫Γ

K (z) dz

δ(ζ)∆t−1 − zdζ

so

wn =1

2πi

∫Γ

K (z)1

2πi

∫|ζ|=ε

(δ(ζ)∆t−1 − z

)−1

ζn+1dζ︸︷︷︸

w?n (∆t; z)

dz .

Since (δ(ζ)∆t−1 − z

)−1=∞∑n=0

w?n (∆t; z)ζn,

it follows that

K (∂∆t)f (t) =∑

0≤tj≤twj f (t − tj)

=∑

0≤tj≤t

1

2πi

∫Γ

K (z)w?j (∆t; z) dzf (t − tj)

=1

2πi

∫Γ

K (z)∑

0≤tj≤tw?j (∆t; z)f (t − tj) dz

=1

2πi

∫Γ

K (z)(∂∆t − z)−1f (t) dz .

Summary

K (∂)f (t) =

∫ t

0K (t − s)f (s) ds =

1

2πi

∫Γ

eztK (z)f (z) dz

=1

2πi

∫Γ

K (z)y(t; z) dz =1

2πi

∫Γ

K (z)(∂ − z)−1f (t) dz .

w(ζ) =∞∑n=0

wnζn = K

(δ(ζ)∆t−1

).

K (∂∆t)F n =n∑

j=0

wn−jFj =

1

2πi

∮|ζ|=ε

w(ζ)F (ζ)

ζn+1dζ

=1

2πi

∫Γ

K (z)Y n(z) dz =1

2πi

∫Γ

K (z)(∂∆t − z)−1f (t) dz .

Accuracy of convolution quadrature

Since

K (∂∆t)f (t) =∞∑j=0

wj f (t − tj)χ(t − tj), χ(t) =

1, t > 0,

0, t < 0,

we have, with ζ = e−z ∆t ,

LK (∂∆t)f =

∫ ∞0

e−zt∞∑j=0

wj f (t − tj)χ(t − tj) dt

=∞∑j=0

wj

∫ ∞tj

e−zt f (t − tj) dt

=∞∑j=0

wj

∫ ∞0

e−z(t+tj )f (t) dt =∞∑j=0

wjζj

∫ ∞0

e−zt f (t) dt

= K(δ(e−z ∆t)∆t−1

)f (z).

The BDF of order p satisfies

δ(e−h)h−1 = 1 + O(hp) as h→ 0,

so

δ(e−z∆t)∆t−1 = zδ(e−h)h−1, h = z∆t,

= z + O(zp+1∆tp)

and thus

LK (∂∆t)f = K(z + O(zp+1∆tp)

)f (z).

Notice also

K (∂∆t)f (t)− K (∂)f (t)

=1

2πi

∫Γ

K (z)[(∂∆t − z)−1 − (∂ − z)−1

]f (t) dz .

(A1) There exist 0 < ϕ < π/2 and −∞ < µ <∞ such that thefunction G (z) is analytic with |G (z)| ≤ C |z |−µ for | arg z | < π−ϕ.

(A2) The linear multistep method is strongly A-stable oforder p ≥ 1, that is,

I δ(ζ) is analytic in a neighbourhood of the closed unitdisk |ζ| ≤ 1,

I for ζ in this neighbourhood, δ(ζ) = 0 iff ζ = 1,

I there exists ϕ1 > ϕ such that | arg δ(ζ)| ≤ π − ϕ1 for |ζ| < 1,

I h−1δ(e−h) = 1 + O(hp) as h→ 0.

Theorem ([Lubich-2004])

If assumptions (A1) and (A2) hold, then for 0 < t <∞,

∣∣G (∂∆t)tβ−1 − G (∂)tβ−1∣∣ ≤ Ctµ−1+β−p∆tp, p ≤ β,

Ctµ−1∆tβ, 0 < β ≤ p.

Multiplication property

Notice that since

K1 ∗ (K2 ∗ f ) = (K1 ∗ K2) ∗ f and LK1 ∗ K2 = K1(z)K2(z),

we haveK1(∂)K2(∂) = (K1K2)(∂),

so in particular K1(∂) commutes with K2(∂).

The analogous identity holds in the discrete case.

Theorem

K1(∂∆t)K2(∂∆t) = (K1K2)(∂∆t).

Proof

On the one hand,

K1(∂∆t)K2(∂∆t)f (t) =∑

0≤tj≤tw 1j K2(∂∆t)f (t − tj)

=∑

0≤tj≤tw 1j

∑0≤tk≤t−tj

f (t − tj − tk)

=∑

0≤tn≤t

∑j+k=n

w 1j w 2

k f (t − tj+k)

=∑

0≤tn≤t

( n∑j=0

w 1j w 2

n−j︸︷︷︸wn

)f (t − tn).

On the other hand,

K1

(δ(ζ)∆t−1

)K2

(δ(ζ)∆t−1

)=∞∑j=0

w 1j ζ

j∞∑k=0

w 2k ζ

k

=∞∑n=0

∑j+k=n

w 1j w 2

k ζj+k

=∞∑n=0

( n∑j=0

w 1j w 2

n−j

)ζn

=∞∑n=0

wnζn.

Example

∂−α∆t ∂−β∆t = ∂

−(α+β)∆t .

Associativity

Associativity of the Laplace convolution means that

K ∗ (f ∗ g) = (K ∗ f ) ∗ g ,

or equivalently,

K (∂)(f ∗ g) =(K (∂)f

)∗ g .

The analogous identity holds in the discrete case,

K (∂∆t)(f ∗ g) =(K (∂∆t)f

)∗ g ,

because

L

K (∂∆t)(f ∗ g)

= K(δ(e−z∆t)∆t−1

)f (z)g(z).

RemarkTaylor expansion gives

f (t) =

p−1∑k=0

f (k)(0)

k!tk +

1

(p − 1)!

∫ t

0(t − s)p−1f (p)(s) ds

=

p−1∑k=0

f (k)(0)Υk+1(t) + (Υp ∗ f (p))(t),

so

G (∂)f (t) =

p−1∑k=0

f (k)(0)G (∂)tk +(G (∂)Υp

)∗ f (p)(t),

and the same formula holds with ∂ replaced by ∂∆t . Hence,

∣∣[G (∂∆t)− G (∂)]f (t)

∣∣ ≤ Ctµ−1p−1∑k=0

|f (k)(0)|∆tk+1

+ C ∆tp∫ t

0(t − s)µ−1|f (p)(s)| ds.

Correction terms

If p ≥ 2, put

G (∂∆t)∼f (tn) = G (∂∆t)f (tn) +

p−2∑j=0

w∼nj f (tj)

and choose the extra weights w∼nj so that the modified quadraturerule is exact for polynomials up to degree p − 2:

p−1∑j=1

w∼njΥk(tj) = G (∂)Υk(tn)− G (∂∆t)Υk(tn)

for 1 ≤ k ≤ p − 1. Unfortunately, the matrix [Υk(tj)] is badlyconditioned.

An alternative approach works if tn is bounded away from 0.

Application to fractional diffusion

For simplicity, we suppose f ≡ 0 so that, after integrating in time,our initial-value problem takes the form

u + ∂−αAu = u0,

or equivalently,

(I + ∂−αA)u = (I + ∂−αA)u0 − ∂−αAu0.

Thus,u = u0 − (I + ∂−αA)−1∂−αAu0,

which suggests seeking U(t) ≈ u(t) such that

U = u0 − (I + ∂−α∆t A)−1∂−αAu0.

Thus,

U = u0 + W where (I + ∂−α∆t A)W = −∂−αAu0,

which leads to the implicit scheme

W n +n∑

j=0

wn−jAW j = −Υ1+α(tn)Au0 for n ≥ 1,

with W 0 = 0, where the weights wn = wn(∆t) are given by

[δ(ζ)∆t−1

]−α=∞∑n=0

wnζn.

Fully-discrete version: Uh = u0h + Wh where W 0h = 0 and

W nh +

n∑j=0

wn−jAhW jh = −Υ1+α(tn)Ahu0h for n ≥ 1.

Error bound for nonsmooth initial dataSince

u − u0 = −(I + ∂−αA)∂−αAu0,

U − u0 = −(I + ∂−α∆t A)∂−αAu0,

the error from the time discretization is

U − u =[G (∂)− G (∂∆t)

]∂−αu0,

whereG (z) = (I + z−αA)−1A.

Theorem ([Cuesta+Lubich+Palencia-2006])

For t > 0,

‖U(t)− u(t)‖ ≤

Ct−1∆t‖u0‖, p = 1,

Ct−1−α∆t1+α‖u0‖, p ≥ 2.

Proof

Since

G (z) = (I + z−αA)−1A = zα(zαI + A)−1A

= zα(zαI + A)−1[(zαI + A)− zαI

]= zα

[I − zα(zαI + A)−1

]and ‖(zαI + A)−1‖ ≤ C |z |−α, we have

‖G (z)‖ ≤ C |z |α for z ∈ Σπ−ϕ.

Noting that ∂−αu0 = Υ1+α(t)u0 = tαu0/Γ(1 + α), we apply thetheorem with µ = −α and β = 1 + α to conclude

‖U(t)− u(t)‖ =∥∥[G (∂)tβ − G (∂∆t)tβ

]u0

∥∥/Γ(1 + α)

≤ C‖u0‖ ×

t−p∆tp, p ≤ 1 + α,

t−α−1∆t1+α, p ≥ 1 + α.

Error bound for smooth initial dataRecall that

u(t) = E(t)u0 = u0 −tα

Γ(1 + α)Au0 + · · · as t → 0,

and observe that

(I + ∂−αA)[u(t)− u0 + Υ1+α(t)Au0

]= u0 − (I + ∂−αA)u0 + (I + ∂−αA)Υ1+α(t)Au0

= Υ1+2α(t)A2u0,

so

u(t) = u0 −Υ1+α(t)Au0 + (I + ∂−αA)−1Υ1+2α(t)A2u0.

We therefore consider

U(t) = u0 −Υ1+α(t)Au0 + (I + ∂−α∆t A)−1Υ1+2α(t)A2u0.

The error is

U(t)− u(t) =[(I + ∂−α∆t A)−1 − (I + ∂−αA)−1

]Υ1+2α(t)A2u0

=[G (∂∆t)− G (∂)

]Υ1+2α(t)Au0,

where, once again, G (z) = (I + z−αA)−1A. Applying the theoremwith µ = −α and β = 1 + 2α we have

‖U(t)− u(t)‖ ≤ C‖Au0‖ ×

tα−p∆tp, p ≤ 1 + 2α,

t−1−α∆t1+2α, p ≥ 1 + 2α.

For instance, if p = 2 and 1/2 ≤ α < 1, then

‖U(t)− u(t)‖ ≤ Ctα−2∆t2‖Au0‖.

Part IX

Discontinuous Galerkin methods for

time stepping

Introduction

We consider a class of time-stepping methods in which u(t) isapproximated by a piecewise polynomial U in t. Continuity acrossthe time levels is enforced only weakly. These fully implicitmethods are flexible and robust, and can achieve high accuracy,but are rather complicated to implement in general and have asomewhat higher computational cost than many simplertime-stepping schemes. Crucially, they allow the use of highlynon-uniform grids.

Once again, we largely ignore the spatial discretization.

Outline

Discontinuous piecewise-polynomial approximation in time

Stability

Convergence

Discontinuous piecewise-polynomial approximation in time

Let t = (tn)Nn=0 be a vector of time levels satisfying

0 = t0 < t1 < t2 < · · · < tN = T ,

and denote the nth open subinterval and its length by

In = (tn−1, tn) and ∆tn = tn − tn−1 for 1 ≤ n ≤ N.

For each n, choose a closed subspace Sn ⊆ H10 (Ω) and an

integer pn ≥ 0, and write S = (Sn)Nn=0 and p = (pn)Nn=1.

We define our trial space W =W(t,S,p) to consist of thosefunctions U : (0,T )→ H1

0 (Ω) such that U|In is a polynomial ofdegree at most pn in t with coefficients in Sn for 1 ≤ n ≤ N.

Example

Choose Sn = H10 (Ω) and pn = p independent of n.

Example

Choose Sn = Vh to be the usual continuous piecewise-linear finiteelement space with respect to a triangulation Th of Ω andenforcing a homogeneous Dirichlet boundary condition.

Thus, our methods are conforming in space but non-conforming intime.

For U ∈ W, we denote the one-sided limits and the jumps at tn by

Un± = U(t±n ) = lim

t→t±n

U(t) and [U]n = Un+ − Un

−

for 0 ≤ n ≤ N, with the convention that U0− ∈ S0 (even though I0

is undefined).

Weak formulation

Recall that the mild solution u of our initial-boundary valueproblem for the fractional diffusion equation satisfies

〈u(t), v〉+ a(∂1−αt u(t), v

)= 〈f (t), v〉 for all v ∈ H1

0 (Ω),

where u = ut = ∂u/∂t. Hence, for suitable v : In → H10 (Ω),∫

In

[〈u(t), v(t)〉+ a

(∂1−αt u(t), v(t)

)]dt =

∫In

〈f (t), v(t)〉 dt.

In the discontinuous Galerkin (DG) method we seek U ∈ Wsatisfying, for all X ∈ W and for 1 ≤ n ≤ N,

〈Un−1+ ,X n−1

+ 〉+

∫In

[〈U(t),X (t)〉+ a

(∂1−αt U(t),X (t)

)]dt

= 〈Un−1− ,X n−1

+ 〉+

∫In

〈f (t),X (t)〉 dt.

In addition, we require that U0− = U0 for a suitable

approximation U0 ∈ S0 to the given initial data u0.

Example

Take pn = 0 and Sn = Vh for all n, so that U and X are piecewiseconstant in time. Writing Un = Un

− and χ = X n−, we have

U(t) = Un = Un−1+ and X (t) = χ = X n−1

+ for all t ∈ In,

with U = 0 on In, so

〈Un − Un−1, χ〉+

∫In

a(∂1−αt U(t), χ

)dt

=

∫In

〈f (t), χ〉 dt for all χ ∈ Sn,

which is essentially the implicit Euler scheme we considered earlier,but using finite elements instead of finite differences in space.

Fully implicit time stepping

If ψn0 , ψn

1 , . . . , ψnpn is a basis for the space of polynomials of degree

at most pn, and if χ1, χ2, . . . , χMn is a basis for Sn (soMn = dim Sn), then we can write

U(x , t) =

pn∑r=0

Mn∑l=1

Unrl ψ

nr (t)χl(x) for x ∈ Ω and t ∈ In.

Starting from the known (approximate) initial data

U0−(x) = U0(x) =

M0∑l=1

U0l χ

0l (x) for x ∈ Ω,

we compute the unknown Unrl by solving the

[(pn + 1)Mn]× [(pn + 1)Mn] linear system determined by the DGequations on In for successive n = 1, 2, . . . , N.

Stability

We now prove a series of technical lemmas that will establishunconditional stability of DG time stepping. This robustness is akey benefit of the method, and helps justify its relatively highcomputational cost. The stability estimate below also shows thatmost of the jumps [U]j must be small for large N.

TheoremIf U0

− = U0 ∈ H and f ∈ L2

((0,T );H

)then there exists a unique

DG solution U ∈ W, and for 1 ≤ n ≤ N,

‖Un−‖2 +

n−1∑j=1

‖[U]j‖2 +

∫ tn

0a(∂1−α

t U,U) dt

≤ ‖U0‖2 +

∫ tn

0〈A−1f , I1−αf 〉 dt.

Global bilinear form

By rewriting the nth DG equation as

〈[U]n−1,X n−1+ 〉+

∫In

[〈U,X 〉+ a

(∂1−αt U,X

)]dt =

∫In

〈f ,X 〉 dt,

and summing over n, we see that U ∈ W is the DG solution iff

GN(U,X ) = 〈U0,X 0+〉+

∫ tN

0〈f (t),X (t)〉 dt for all X ∈ W,

where the bilinear form GN is defined by

GN(U,X ) = 〈U0−,X

0+〉+

N∑n=1

〈[U]n−1,X n−1+ 〉

+N∑

n=1

∫In

[〈U(t),X (t)〉+ a

(∂1−αt U(t),X (t)

)]dt.

Lemma

〈[U]n−1,Un−1+ 〉+

∫In

〈U,U〉 dt =1

2

(‖[U]n−1‖2 +‖Un

−‖2−‖Un−1− ‖2

).

Proof.Since 〈U,U〉 = (d/dt) 1

2‖U‖2, twice the LHS equals

2〈[U]n−1,Un−1+ 〉+ ‖Un

−‖2 − ‖Un−1+ ‖2

= 〈[U]n−1, [U]n−1 + Un−1+ + Un−1

− 〉+ ‖Un−‖2 − ‖Un−1

+ ‖2

= ‖[U]n−1‖2 + ‖Un−1+ ‖2 − ‖Un−1

− ‖2 + ‖Un−‖2 − ‖Un−1

+ ‖2

= ‖[U]n−1‖2 + ‖Un−‖2 − ‖Un−1

− ‖2.

Lemma

GN(U,U) = 12‖U

0+‖2 + 1

2‖UN−‖2 + 1

2

N−1∑n=1

‖[U]n‖2

+

∫ tN

0a(∂1−α

t U,U) dt.

Proof.

〈U0−,U

0+〉+

N∑n=1

(〈[U]n−1,Un−1

+ 〉+

∫In

〈U,U〉 dt

)

= 〈U0−,U

0+〉+ 1

2 [U]0 − 12‖U

0−‖2︸︷︷︸

12‖U

0+‖2

+ 12‖U

N−‖2 + 1

2

N−1∑n=1

‖[U]n‖2.

To prove the next lemma, recall the identities∫ ∞0

(∂βt u)v dt =

∫ ∞−∞

(iy)β u(iy)v(iy) dy

and, when u is real-valued,∫ ∞0

(∂βt u)u dt =cos 1

2πβ

π

∫ ∞0

yβ|u(iy)|2 dy ≥ 0.

LemmaFor 0 < β < 1, and real-valued u and v,∣∣∣∣∫ ∞

0(∂βt u)v dt

∣∣∣∣≤ 1

cos 12πβ

(∫ ∞0

(∂βt u)u dt

)1/2(∫ ∞0

(∂βt v)v dt

)1/2

.

Proof

Noting that u(−iy) = u(iy) and v(−iy) = v(iy), and using theCauchy–Schwarz inequality, we have∣∣∣∣∫ ∞

0(∂βt u)v dt

∣∣∣∣ ≤ 1

2π

∫ ∞−∞|y |β|u(iy)||v(iy)| dy

=1

π

∫ ∞0

(yβ/2|u(iy)|)(yβ/2|v(iy)|) dy

≤ 1

π

(∫ ∞0

yβ|u(iy)|2 dy

)1/2(∫ ∞0

yβ|v(iy)|2 dy

)1/2

=1

cos 12πβ

(∫ ∞0

(∂βt u)u dt

)1/2(∫ ∞0

(∂βt v)v dt

)1/2

.

LemmaFor 0 < β < 1,

2

∣∣∣∣∫ T

0〈∂βt u, v〉 dt

∣∣∣∣ ≤ ∫ T

0〈∂βt u, u〉 dt +

1

cos2 12πβ

∫ T

0〈∂βt v , v〉 dt.

Proof.Extend u and v by zero. For any µ > 0, the mth Fouriercoefficients satisfy

2

∫ T

0(∂βt um)vm dt = 2

∫ ∞0

(∂βt um)vm dt

≤ 1

cos 12πβ

(µ

∫ T

0(∂βt um)um dt +

1

µ

∫ T

0(∂βt vm)vm dt

),

and the result follows by summing over m, using Parseval’s identityand choosing µ = cos 1

2πβ.

Proof of the stability theorem

The DG solution U satisfies

GN(U,X ) = 〈U0,X 0+〉+

∫ tN

0〈f (t),X (t)〉 dt for all X ∈ W,

and by choosing X = U the second lemma gives

12‖U

0+‖2 + 1

2‖UN−‖2 + 1

2

N−1∑n=1

‖[U]n‖2

+

∫ tN

0a(∂1−α

t U,U) dt = 〈U0,U0+〉+

∫ tN

0〈f (t),U(t)〉 dt.

Now use 〈U0,U0+〉 ≤ 1

2‖U0‖2 + 1

2‖U0+‖2 and cancel the

term 12‖U

0+‖2.

Thus,

‖UN−‖2 +

N−1∑n=1

‖[U]n‖2

+ 2

∫ tN

0a(∂1−α

t U,U) dt = ‖U0‖2 + 2

∫ tN

0〈f (t),U(t)〉 dt.

Write 〈f (t),U(t)〉 = 〈∂1−αt g(t), v(t)〉 where

g(t) = I1−αA−1/2f (t) and v(t) = A1/2U(t),

so that

2

∫ tN

0〈f (t),U(t)〉 dt ≤

∫ tN

0a(∂1−α

t U,U) dt

+1

cos2 12πβ

∫ tN

0〈A−1f (t), I1−αf (t)〉 dt.

Piecewise-constant case

If pn = 0 then ‖U(t)‖ ≤ ‖Un∗− ‖ = max0≤n≤N ‖Un‖ for 0 ≤ t ≤ tN .

Since

‖Un∗− ‖2 ≤ ‖Un∗

− ‖2 +n∗−1∑n=1

‖[U]n‖2

+ 2

∫ tn∗

0a(∂1−α

t U,U) dt = ‖U0‖2 + 2

∫ tn∗

0〈f (t),U(t)〉 dt

and U0 = U0−, we have

‖Un∗− ‖‖UN

−‖ ≤ ‖Un∗− ‖2 ≤

(‖U0‖+

∫ tn∗

0‖f (t)‖ dt

)‖Un∗− ‖,

so

‖Un−‖ ≤ ‖U0‖+ 2

∫ tn

0‖f (t)‖ dt, 0 ≤ n ≤ N.

Piecewise-linear case

If pn = 1 for all n, then

‖U‖In ≡ supt∈In‖U(t)‖ = max

‖Un−1

+ , ‖Un−‖.

For the first subinterval,

‖U‖2I1 ≤ ‖U

0+‖2 + ‖U1

−‖2 ≤ 2〈U0,U0+〉+ 2

∫ t1

0〈f (t),U(t)〉 dt

so

‖U‖I1 ≤ 2‖U0‖+ 2

∫ t1

0‖f (t)‖ dt.

For n ≥ 2, since Un−1+ = Un−1

− + [U]n−1,

‖Un−1+ ‖2 ≤ 2‖Un−1

− ‖2 + 2‖[U]n−1‖2

≤ (2 + 2)

(2〈U0,U0

+〉+ 2

∫ tn

0〈f (t),U(t)〉 dt

),

and by choosing n∗ such that ‖U‖In∗ = max1≤n≤N ‖U‖In we see

‖U‖In ≤ 8

(‖U0‖+

∫ tn

0‖f (t)‖ dt

)for 1 ≤ n ≤ N.

However, for p ≥ 2 we have not been able to prove such an L∞(L2)stability bound that mimics the one for the continuous problem:

‖u(t)‖ ≤ ‖u0‖+

∫ t

0‖f (s)‖ ds, 0 ≤ t ≤ T .

Convergence

For simplicity, assume now that Sn = H10 (Ω) for all n (so no spatial

discretization). We decompose the DG error as

U − u = ϑ+ %, ϑ = U − Πu, % = Πu − u,

where the quasi-interpolant Πu ∈ W(t,S,p) is defined by theconditions

(Πu)n− = u(t−n ) and

∫In

(u − Πu)tq−1 dt = 0

for 1 ≤ q ≤ pn and 1 ≤ n ≤ N, with (Πu)0− = u(0).

Example

If pn = 0 then (Πu)(t) = u(tn) for t ∈ In.

Example

If pn = 1 then

(Πu)(t) = u(tn) +u(tn)− avgIn(u)

∆tn/2(t − tn) for t ∈ In,

where avgIn(u) = ∆t−1n

∫In

u dt. Can show that

(Πu)(t)− u(t) =

∫ tn

tu′(s) ds − 2

tn − t

∆t2n

∫In

(s − tn−1)u′(s) ds

=

∫ tn

t(t − s)u′′(s) ds +

tn − t

∆t2n

∫In

(s − tn−1)2u′′(s) ds,

for t ∈ In, and hence

‖u − Πu‖In ≤ 2

∫In

‖u′(t)‖ dt ≤ 2∆tn‖u′‖In ,

‖u − Πu‖In ≤ 3∆tn

∫In

‖u′′(t)‖ dt ≤ 3∆t2n‖u′′‖In .

In the general case we have the following estimate.

Theorem ([Schoetzau+Schwab-2000])

For 0 ≤ q ≤ pn,∫In

‖(u − Πu)′(t)‖2 dt ≤ Cε(pn, q)

(∆tn

2

)2q ∫In

‖u(q+1)(t)‖2 dt,

where

ε(p, q) =(p − q)!

(p + q)!.

Notice that

ε(p, p) =1

(2p)!.

Recall

GN(U,X ) = 〈U0+,X

0+〉+

N−1∑n=1

〈[U]n,X n+〉

+N∑

n=1

∫In

[〈U(t),X (t)〉+ a

(∂1−αt U(t),X (t)

)]dt.

Integration by parts yields a dual representation

GN(U,X ) = 〈UN− ,X

N− 〉 −

N−1∑n=1

〈Un−, [X ]n〉

+N∑

n=1

∫In

[−〈U(t), X (t)〉+ a

(∂1−αt U(t),X (t)

)]dt.

For all X ∈ W(t,p,S), the DG solution satisfies

GN(U,X ) = 〈U0,X 0+〉+

∫ t

0〈f (t),X (t)〉 dt,

and, since [u]n = 0, the exact solution satisfies

GN(u,X ) = 〈u0,X0+〉+

∫ t

0〈f (t),X (t)〉 dt.

Furthermore, the construction of Π ensures

%n− = 0 and

∫In

〈%, X 〉 dt = 0 for 1 ≤ n ≤ N,

so

GN(%,X ) =

∫ tN

0a(∂1−α

t %,X ) dt.

Thus,GN(U − u,X ) = 〈U0 − u0,X

0+〉,

and since U − u = ϑ+ % we have

GN(ϑ,X ) = GN(U − u − %,X ) = GN(U − u,X )− GN(%,X ).

Therefore,

GN(ϑ,X ) = 〈U0 − u0,X0+〉+

∫ tN

0〈−A∂1−α

t %,X 〉 dt

for all X ∈ W(t,p,S), showing that ϑ is the DG solution withinitial data U0 − u0 and source term −∂1−α

t A%. By applying thestability result to ϑ it is possible to prove the following errorestimate.

h-Version accuracy

Theorem ([Mustapha-2015])

Let γ ≥ 1 and σ > 0. If

tn = (n/N)γT and p = (1, p, p, . . . , p)

and

‖u(j)(t)‖1 ≤ Ctσ−j for 0 < t ≤ T and 1 ≤ j ≤ p + 1,

then

‖U(t)− u(t)‖ ≤ C ∆trN ≤ CN−r for 0 ≤ t ≤ T ,

where

r =

minγ(σ + 1

2α−12 ), 2 + 1

2α−12, p = 1,

minγ(σ + 12α−

12 ), p + 1 + 1

2α−12 −

12 , p ≥ 2.

Piecewise-constants on a uniform mesh

Theorem ([McLean+Mustapha-2015])

Suppose f ≡ 0, tn = n ∆t and pn = 0 for all n.

1. For 0 < α < 1, if u0 ∈ H then

‖Un − u(tn)‖ ≤ Ct−1n ∆t‖u0‖.

2. for 0 < α ≤ 1/2, if A2u0 ∈ H then

‖Un − u(tn)‖ ≤ Ct2α−1n ∆t‖A2u0‖.

3. for 1/2 ≤ α < 1, if A1/αu0 ∈ H then

‖Un − u(tn)‖ ≤ C ∆t‖A1/αu0‖.

Corollary

‖Un − u(tn)‖ ≤ Ctrα−1n ∆t‖Aru0‖ for 0 ≤ r ≤ min2, 1/α.

Part X

Further reading

Eduardo Cuesta, Christian Lubich, and Cesar Palencia.Convolution quadrature time discretization of fractionaldiffusion-wave equations.Applied Numerical Mathematics, 75:673–696, 2006.

M. Lopez-Fernandez and C. Palencia.On the numerical inversion of the laplace transform of certainholomorphic mappings.Applied Numerical Mathematics, 51:289–303, 2004.

C. Lubich.Convolution quadrature and discretized operational calculus. i.Numerische Mathematik, 54:129–145, 1988.

Christian Lubich.Convolution quadrature revisited.BIT Numerical Mathematics, 44:503–514, 2004.

William McLean and Kassem Mustapha.Time-stepping error bounds for fractional diffusion problemswith non-smooth initial data.

Journal of Computational Physics, 293:201–217, 2014.

William McLean and Vidar Thomee.Numerical solution via laplace transforms of a fractional orderevolution equation.Journal of Integral Equations and Applications, 22:57–94,2010.

Kassem Mustapha.Time-stepping discontinuous galerkin methods for fractionaldiffusion problems.Numerische Mathematik, 130:497–516, 2015.

Dominik Schotzau and Christoph Schwab.Time discretization of parabolic problems by the hp-version ofthe discontinuous galerkin method.SIAM Journal on Numerical Analysis, 38:837–875, 2000.

Lloyd N. Trefethen and J. A. C. Weideman.The exponentially convergent trapezoidal rule.SIAM Review, 56:385–458, 2014.

J. A. C. Weideman and L. N. Trefethen.

Parabolic and hyperbolic contours for computing the bromwichintegral.Mathematics of Computation, 76:1341–1356, 2007.

Date post:	11-Sep-2021
Category:	Documents
Upload:	others
View:	10 times
Download:	1 times

Numerical Solution of Fractional PDEs

Documents