Stochastic Differential Equations (SDE)

Stochastic Diﬀerential

Equations

Do not worry about your problems with mathematics, I assure you

mine are far greater.

Albert Einstein.

Florian Herzog

2013

Stochastic Diﬀerential Equations (SDE)

A ordinary diﬀerential equation (ODE)

dx(t)

= f (t, x) , dx(t) = f(t, x)dt , (1)

with initial conditions x(0) = x

can be written in integral form

x(t) = x



f(s, x(s))ds , (2)

where x(t) = x(t, x

, t

) is the solution with initial conditions x(t

) = x

. An

example is given as

dx(t)

= a(t)x(t) , x(0) = x

. (3)

Stochastic Systems, 2013 2

Stochastic Diﬀerential Equations (SDE)

When we take the ODE (3) and assume that a(t) is not a deterministic parameter

but rather a stochastic parameter, we get a stochastic diﬀerential equation (SDE). The

stochastic parameter a(t) is given as

a(t) = f(t) + h(t)ξ(t) , (4)

where ξ(t) denotes a white noise process.

Thus, we obtain

dX(t)

= f (t)X(t) + h(t)X(t)ξ(t) . (5)

When we write (5) in the diﬀerential form and use dW (t) = ξ(t)dt, where dW (t)

denotes diﬀerential form of the Brownian motion,we obtain:

dX(t) = f(t)X(t)dt + h(t)X(t)dW (t) . (6)

Stochastic Systems, 2013 3

Stochastic Diﬀerential Equations (SDE)

In general an SDE is given as

dX(t, ω) = f (t, X(t, ω))dt + g(t, X(t, ω))dW (t, ω) , (7)

where ω denotes that X = X(t, ω) is a random variable and possesses the initial

condition X(0, ω) = X

with probability one. As an example we have already

encountered

dY (t, ω) = µ(t)dt + σ(t)dW (t, ω) .

Furthermore, f(t, X(t, ω)) ∈ R, g(t, X(t, ω)) ∈ R, and W (t, ω) ∈ R. Similar as

in (2) we may write (7) as integral equation

X(t, ω) = X



f(s, X(s, ω))ds +



g(s, X(s, ω))dW (s, ω) . (8)

Stochastic Systems, 2013 4

Stochastic Integrals

For the calculation of the stochastic integral



g(t, ω)dW (t, ω), we assume that

g(t, ω) is only changed at discrete time points t

(i = 1, 2, 3, ..., N − 1), where

0 = t

< t

< . . . < t

N−1

< t

< T . We deﬁne the integral

S =



g(t, ω)dW (t, ω) , (9)

as the Riemannßum

(ω) =



i=1

g(t

i−1

, ω)



W (t

, ω) − W (t

i−1

, ω)



. (10)

with N → ∞.

Stochastic Systems, 2013 5

Stochastic Integrals

A random variable S is called the Itˆo integral of a stochastic process g(t, ω) with

respect to the Brownian motion W (t, ω) on the interval [0, T ] if

lim

N→∞



S −



i=1

g(t

i−1

, ω)



W (t

, ω) − (W (t

i−1

, ω)



= 0 , (11)

for each sequence of partitions (t

, t

, . . . , t

) of the interval [0, T ] such that

max

− t

i−1

) → 0. The limit in the above deﬁnition converges to the stochastic

integral in the mean-square sense. Thus, the stochastic integral is a random variable,

the samples of which depend on the individual realizations of the paths W (., ω).

Stochastic Systems, 2013 6

Stochastic Integrals

The simplest p ossible example is g(t) = c for all t. This is still a stochastic

process, but a simple one. Taking the deﬁnition, we actually get



c dW (t, ω) = c lim

N→∞



i=1



W (t

, ω) − W (t

i−1

, ω)



= c lim

N→∞

[(W (t

, ω)−W (t

, ω)) + (W (t

, ω)−W (t

, ω)) + . . .

+(W (t

, ω)−W (t

N−1

, ω))

= c (W (T, ω) − W (0, ω)) ,

where W (T, ω) and W (0, ω) are standard Gaussian random variables. With

W (0, ω) = 0, the last result becomes



c dW (t, ω) = c W (T, ω) .

Stochastic Systems, 2013 7

Stochastic Integrals

Example: g(t, ω) = W (t, ω)



W (t, ω) dW (t, ω) =

= lim

N→∞



i=1

W (t

i−1

, ω)



W (t

, ω) − W (t

i−1

, ω)



= lim

N→∞





i=1

, ω) − W

i−1

, ω)) −



i=1

(W (t

, ω) − W (t

i−1

, ω))



= −

lim

N→∞



i=1

(W (t

, ω) − W (t

i−1

, ω))

(T, ω) , (12)

where we have used the following algebraic relationship y(x − y) = yx − y

−

(x − y)

Stochastic Systems, 2013 8

Stochastic Integrals

We take now a detailed look at :lim

N→∞



i=1

(W (t

, ω) − W (t

i−1

, ω))

E[ lim

N→∞



i=1

(W (t

, ω) − W (t

i−1

, ω))

] = lim

N→∞



i=1

E[(W (t

, ω) − W (t

i−1

, ω))

]

= lim

N→∞



i=1

− t

i−1

)

= T

Var[ lim

N→∞



i=1

(W (t

, ω) − W (t

i−1

, ω))

] = lim

N→∞



i=1

Var[(W (t

, ω) − W (t

i−1

, ω))

]

= 2 lim

N→∞



i=1

− t

i−1

)

Stochastic Systems, 2013 9

Stochastic Integrals

By reducing the partition, the variance becomes zero,

lim

N→∞



i=1

− t

i−1

)

≤ max

− t

i−1

) lim

N→∞



i=1

− t

i−1

)

= max

− t

i−1

) T

= 0 , (13)

since t

i−1

− t

→ 0. Since the expected value of



i=1

− t

i−1

)

is T and the

variance becomes zero, we get



i=1

(W (t

, ω) − W (t

i−1

, ω))

= T (14)

Stochastic Systems, 2013 10

Stochastic Integrals

The stochastic integral has the solution



W (t, ω) dW (t, ω) =

(T, ω) −

T (15)

This is in contrast to our intuition from standard calculus. In the case of a deterministic

integral



x(t)dx(t) =

(t), whereas the Itˆo integral diﬀers by the term −

T .

— This example shows that the rules of diﬀerentiation (in particular the chain rule)

and integration need to be re-formulated in the stochastic calculus.

Stochastic Systems, 2013 11

Stochastic Integrals

Properties of Itˆo Integrals.



g(t, ω) dW (t, ω)] = 0 .

Proof:



g(t, ω)dW (t, ω)] = E[ lim

N→∞



i=1

g(t

i−1

, ω)



W (t

, ω) − W (t

i−1

, ω)



]

= lim

N→∞



i=1

E[g(t

i−1

, ω)] E[



W (t

, ω) − W (t

i−1

, ω)



]

= 0 .

The expectation of stochastic integrals is zero. This is what we would expect anyway.

Stochastic Systems, 2013 12

Stochastic Integrals

Properties of Itˆo Integrals.

Var





g(t, ω)dW (t, ω)





E[g

(t, ω)]dt . (16)

Proof:

Var





g(t, ω)dW (t, ω)



= E



(



g(t, ω)dW (t, ω))



= E



lim

N→∞



i=1

g(t

i−1

, ω)



W (t

, ω) − W (t

i−1

, ω)





Stochastic Systems, 2013 13

Stochastic Integrals

= lim

N→∞



i=1



j=1

E[g(t

i−1

, ω)g(t

j−1

, ω)

(W (t

, ω) − W (t

i−1

, ω))(W (t

, ω) − W (t

j−1

, ω))]

= lim

N→∞



i=1

E[g

i−1

, ω)] E[



W (t

, ω) − W (t

i−1

, ω)



]

= lim

N→∞



i=1

E[g

i−1

, ω)] (t

− t

i−1

)



E[g

(t, ω)]dt . (17)

Stochastic Systems, 2013 14

Stochastic Integrals

The calculation of the variance of the Itˆo Integrals shows two important properties:

• E





g(t, ω)dW (t, ω)









(t, ω)



•



E[g

(t, ω)]dt < ∞

The second property is the condition of existence for Itˆo integrals. The next property is

the linearity of Itˆo integrals:



(t, ω) + a

(t, ω)]dW (t, ω)

= a



(t, ω)dW (t, ω) + a



(t, ω)dW (t, ω) , (18)

for numbers a

, a

and stochastic functions g

(t, ω), g

(t, ω).

Stochastic Systems, 2013 15

Itˆo’s lemma

As mentioned shown in the second example, the rules of classical calculus are not valid

for stochastic integrals and diﬀerential equations. It is the equivalent to the chain rule

in classical calculus. The problem can be stated as follows:

Given a stochastic diﬀerential equation

dX(t) = f(t, X(t))dt + g(t, X(t))dW (t) , (19)

and another process Y (t ) which is a function of X(t),

Y (t) = ϕ(t, X(t)) ,

where the function ϕ(t, X(t)) is continuously diﬀerentiable in t and twice continuously

diﬀerentiable in X, ﬁnd the stochastic diﬀerential equation for the process Y (t):

dY (t) =

f(t, X(t))dt + ˜g(t, X(t))dW (t) .

Stochastic Systems, 2013 16

Itˆo’s lemma

In the case when we assume that g(t, X(t)) = 0, we know the result: the chain rule

for standard calculus. The result is given by

dy(t) = (ϕ

(t, x) + ϕ

(t, x)f(t, x))dt . (20)

In the case of stochastic problems, we reason as follows: The Taylor expansion of

ϕ(t, X(t)) yields

dY (t) = ϕ

(t, X)dt +

(t, X)dt

+ ϕ

(t, X)dX(t)

(t, X)(dX(t))

+ h.o.t . (21)

Stochastic Systems, 2013 17

Itˆo’s lemma

We use (19) for dX(t) and get

dY (t) = ϕ

(t, X)dt + ϕ

(t, X)[f (t, X(t))dt + g(t, X(t))dW (t)]

+ϕ

(t, X)dt

(t, X)



(t, X(t))dt

+ g

(t, X(t))dW

(t)

+2f(t, X(t))g(t, X(t))dt dW (t)



+ h.o.t . (22)

The diﬀerentials of higher order (dt, dW ) become fast zero, dt

→ 0 and

dtdW (t) → 0. The stochastic term dW

(t) according to the rules of Brownian

motion is given as

(t, ω) = dt . (23)

Stochastic Systems, 2013 18

Itˆo’s lemma

Omitting higher order terms and using the properties of Brownian motion, we arrive at

dY (t) = [ϕ

(t, X) + ϕ

(t, X)f (t, X(t)) +

(t, X)g

(t, X(t))]dt

+ϕ

(t, X)g(t, X(t))dW (t) . (24)

Reordering the terms yields the scalar version of Itˆo’s Lemma:

dY (t) =

f(t, X(t))dt + ˜g(t, X(t))dW (t) , (25)

f(t, X(t)) = ϕ

(t, X) + ϕ

(t, X)f (t, X(t))

(t, X)g

(t, X(t)) , (26)

˜g(t, X(t)) = ϕ

(t, X)g(t, X(t)) . (27)

Stochastic Systems, 2013 19

Itˆo’s lemma

The term

(t, X)g

(t, X(t)) is often called the Itˆo corretion term, since this

does not occur in the det. case.

We apply Itˆos formula for the following problem: ϕ(t, X) = X

with the SDE

dX(t) = dW (t). From the SDE, we get X(t) = W (t) and calculate the partial

derivatives of

∂ϕ(t,X)

∂X

= 2X,

∂

ϕ(t,X)

∂X

= 2, and

∂ϕ(t,X)

∂t

= 0. The Itˆo lemma yields

d(W

(t)) = 1dt + 2W (t)dW (t) . (28)

We rewrite the equation and use W (0) = 0

(t) = 1t + 2



W (t)dW (t) ,



W (t)dW (t) =

(t) −

t . (29)

Stochastic Systems, 2013 20

Itˆo’s lemma

We now allow that the process X(t) is in R

. We let W (t) be an m-dimensional

standard Brownian motion and f (t, X(t)) ∈ R

and g(t, X(t)) ∈ R

n×m

. Consider

a scalar process Y (t) deﬁned by Y (t) = ϕ(t, X(t)), where ϕ(t, X) is a scalar

function which is continuously diﬀerentiable with respect to t and twice continuously

diﬀerentiable with respect to X. The Itˆo formula can be written in vector notation as

follows:

dY (t) =

f(t, X(t))dt + ˜g(t, X(t))dW (t) , (30)

f(t, X(t)) = ϕ

(t, X(t)) + ϕ

(t, X(t)) · f(t, X(t))



(t, X(t))g(t, X(t))g

(t, X(t)))



, (31)

˜g(t, X(t)) = ϕ

(t, X(t)) · g(t, X(t)) , (32)

where “tr” denotes the trace operator.

Stochastic Systems, 2013 21

Itˆo’s lemma

Consider the following stochastic diﬀerential equation:

dS(t) = µ S(t)dt + σ S(t)dW (t) , (33)

We want to ﬁnd the SDE for the process Y related to S as follows: Y (t) = ϕ(t, S) =

ln(S(t)) . The partial derivatives are:

∂ϕ(t,S)

∂S

∂

ϕ(t,S)

∂S

= −

, and

∂ϕ(t,S)

∂t

= 0.

Therefore, according to Itˆo we get,

dY (t) =



∂ϕ(t, S)

∂t

∂ϕ(t, S)

∂S

µS(t) +

∂

ϕ(t, S)

∂S

(t)





∂ϕ(t, S)

∂S

σS(t)



dW (t) , (34)

dY (t) = (µ −

)dt + σdW (t) . (35)

Stochastic Systems, 2013 22

Itˆo’s lemma

Since the right hand side of (35) is independent of Y (t), we are able to compute the

stochastic integral:

Y (t) = Y



(µ −

)dt +



σdW , (36)

Y (t) = Y

+ (µ −

)t + σW (t) . (37)

Since Y (t) = ln S(t) we have found a solution for S(t) :

ln(S(t)) = ln(S(0)) + (µ −

)t + σW (t) , (38)

S(t) = S(0)e

(µ−

)t+σW (t)

, (39)

where W (t) is a standard BM.

Stochastic Systems, 2013 23

Itˆo’s lemma

Show for U(t) = X

(t)X

(t) with

(t) = f

(t, X

)dt + g

(t, X

)dW (t) ,

(t) = f

(t, X

)dt + g

(t, X

)dW (t) ,

that following formula is valid:

dU(t) = dX

(t)X

(t) + X

(t)dX

(t) + g

(t, X

)dt (40)

We show that we obtain the same result as in the previous formula by apply Itˆo’s

lemma. By (40) liefert

dU(t) = [ X

(t)f

(t, X

) + X

(t)f

(t, X

) + g

(t, X

)]dt

+[X

(t)g

(t, X

) + X

(t)g

(t, X

)]dW (t)

Stochastic Systems, 2013 24

Itˆo’s lemma

The partial derivatives of U are :

∂U

∂X

= (X

(t), X

(t))

∂

∂X



0 1

1 0



and

∂U

∂t

= 0.

dU(t) = [

∂U

∂t

∂U

∂X

(t, X

), f

(t, X

)]



∂

∂X



(t, X

)

(t, X

)

(t, X

) g

(t, X

)





]dt

∂U

∂X

(t, X

), g

(t, X

)]

dW (t)

= [X

(t)f

(t, X

) + X

(t)f

(t, X

) + g

(t, X

)]dt

+[X

(t)g

(t, X

) + X

(t)g

(t, X

)]dW (t)

Stochastic Systems, 2013 25

Stochastic Diﬀerential Equations (SDE)

We classify SDEs into two large groups, linear SDEs and non-linear SDEs. Furthermore,

we distinguish between scalar linear and vector-valued linear SDEs.

We start with the easy case, the scalar linear linear SDEs. An SDE

dX(t) = f(t, X(t))dt + g(t, X(t))dW (t) , (41)

for a one-dimensional stochastic process X(t) is called a linear (scalar) SDE if and

only if the functions f(t, X(t)) and g(t, X(t)) are aﬃne functions of X(t) ∈ R and

thus

f(t, X(t)) = A(t)X(t) + a(t) ,

g(t, X(t)) = [B

(t)X(t) + b

(t), ··· , B

(t)X(t) + b

(t)] ,

where A(t), a(t) ∈ R, W (t) ∈ R

is an m-dimensional Brownian motion, and

(t), b

(t) ∈ R, i = 1, ··· , m. Hence, f (t, X(t)) ∈ R and g(t, X(t)) ∈ R

1×m

Stochastic Systems, 2013 26

Stochastic Diﬀerential Equations (SDE)

The linear SDE possesses the following solution

X(t) = Φ(t)





−1

(s)



a(s) −



i=1

(s)b

(s)





i=1



−1

(s)b

(s)dW

(s)



, (42)

where we denote Φ(t) as the fundamental matrix, which we obtain from

Φ(t) = exp







A(s) −



i=1

(s)



ds +



i=1



(s)dW

(s)



, (43)

The solution is similar to the solution of ODEs.

Stochastic Systems, 2013 27

Stochastic Diﬀerential Equations (SDE)

Let us assume that W (t) ∈ R, a(t) = 0, b(t) = 0, A(t) = A, B(t) = B. We

want to compute the solution of the SDE

dX(t) = AX(t)dt + BX(t)dW (t) , X(t) = x

, (44)

We can solve it using (42) and (43):

Φ(t) = e

(A−

)t+BW (t)

, (45)

and (42) is easy to calculate since

x(t) = Φ(t)x

= x

(A−

)t+BW (t)

. (46)

Stochastic Systems, 2013 28

Stochastic Diﬀerential Equations (SDE)

The expectation m(t) = E[X(t)]and the second moment P (t) = E[X

(t)] for

dX(t) = (A(t)X(t) + a(t))dt +



i=1

(t)X(t) + b(t))dW

(t) . (47)

can be calculated by solving the following system of ODEs:

˙m(t) = A(t)m(t) + a(t) , m(0) = x

, (48)

P (t) =



2A(t) +



i=1

(t)



P (t) + 2m(t)



a(t) +



i=1

(t)b

(t)





i=1

(t)



, P (0) = x

. (49)

Stochastic Systems, 2013 29

Stochastic Diﬀerential Equations (SDE)

The ODE for the expectation is derived by applying the expectation operator on both

sides of (42).

E[dX(t)] = E[(A(t)X(t) + a(t))dt +



i=1

(t)X(t) + b

(t))dW

(t) ]

E[dX(t)]

  

dm(t)

= (A(t) E[X(t)]

  

=m(t)

+a(t))dt



i=1

E[(B

(t)X(t) + b

(t))] E[dW

(t) ]

  

dm(t) = (A(t)m(t) + a(t))dt . (50)

Stochastic Systems, 2013 30

Stochastic Diﬀerential Equations (SDE)

In order to compute the second moment, we need to derive the SDE for Y (t) = X

(t):

dY (t) =



2X(t)(A(t)X(t) + a(t)) +



i=1



(t)X(t) + b

(t)





+2X(t)



i=1



(t)X(t) + b

(t)



(t) (51)

dY (t) =



2A(t)X

(t) + 2X(t)a(t) +



i=1



(t)X

(t) + 2B

(t)b

(t)X(t)

(t)



dt + 2X(t)



i=1



(t)X(t) + b

(t)



(t) (52)

Stochastic Systems, 2013 31

Stochastic Diﬀerential Equations (SDE)

Furthermore, we apply the expectation operator to (52) and use P (t) = E[X

(t)] =

E[Y (t)] and m(t) = E[X(t)].

E[dY (t)] =



2A(t)E[X

(t)] + 2a(t)E[X(t)] +



i=1



(t)E[X

(t)]

+2B

(t)b

(t)E[X(t)] + b

(t)





2X(t)



i=1



(t)X(t) + b

(t)



(t)



dP (t) =



2A(t)P (t) + 2a(t)m(t)



i=1



(t)P (t) + 2B

(t)b

(t)m(t) + b

(t)



Stochastic Systems, 2013 32

Stochastic Diﬀerential Equations (SDE)

In the case that B

(t) = 0, i = 1, . . . , m, we are able to directly compute the

distribution. The scalar linear SDE

dX(t) = (A(t)X(t) + a(t))dt +



i=1

(t)dW

(t), (53)

with X(0) = x

is normaly distributed

P (X(t)|x

) ∼ N(m(t), V (t)) with expected value m(t) and variance V (t), which

are solutions of the following ODEs,

˙m(t) = A(t)m(t) + a(t) , m(0) = x

, (54)

V (t) = 2A(t)V (t) +



i=1

(t) , V (0) = 0 . (55)

Stochastic Systems, 2013 33

Stochastic Diﬀerential Equations (SDE)

There are some speciﬁc scalar linear SDEs which are found to be quite useful in practice.

The simplest case of SDE is where the drift and the diﬀusion coeﬃcients are independent

of the information received over time

dS(t) = µdt + σdW (t) , S(0) = S

. (56)

This model has been used to simulate commodity prices, such as metals or agricultural

products.

The mean is E[S(t)] = µt + S

and the variance Var[S(t)] = σ

t. S(t) possesses

a behavior of ﬂuctuations around the straight line S

+ µt.The process is normally

distributed with the given mean and variance.

Stochastic Systems, 2013 34

Stochastic Diﬀerential Equations (SDE)

The standard model of stock prices is the geometric Brownian motion as given by

dS(t) = µS(t)dt + σS(t)dW (t, ω) , S(0) = S

The mean is given by E[S(t)] = S

µt

and its variance by Var[S(t)] = S

2µt

−

1). This model forms the starting point for the famous Black-Scholes formula for option

pricing. The geometric Brownian motion has two main features which make it popular

for stock

The ﬁrst property is that S(t) > 0 for all t ∈ [0, T ] and the second is that all returns

are in scale with the current price. This process has a log-normal probability density

function.

Stochastic Systems, 2013 35

Stochastic Diﬀerential Equations (SDE)

Another very popular class of SDEs are mean reverting linear SDEs. The model is

obtained by

dS(t) = κ[µ − S(t)]dt + σ dW (t, ω) , S(0) = S

. (57)

A special case of this SDE where µ = 0 is called Ohrnstein-Uhlenbeck process.

Equation (57) models a process which naturally falls back to its equilibrium level of µ.

The expected price is E[S(t)] = µ − (µ − S

−κ t

and the variance is

Var[S(t)] =

2κ



1 − e

−2κ t



Stochastic Systems, 2013 36

Stochastic Diﬀerential Equations (SDE)

In the long run, the following (unconditional) approximations are valid

lim

t→∞

E[S(t)] = µ

and

lim

t→∞

Var[S(t)] =

2κ

This analysis shows that the process ﬂuctuates around µ and has a variance of

2κ

which depends on the parameter κ: the higher κ, the lower the variance.

This is obvious since the higher κ, the faster the process reverts back to its mean

value.

This process is a stationary process which is normally distributed.

Stochastic Systems, 2013 37

Stochastic Diﬀerential Equations (SDE)

A popular extension is where the diﬀusion term is in scale with the current value, i.e.,

the geometric mean reverting process:

dS(t) = κ[µ − S(t)]dt + σS(t)dW (t, ω) , S(0) = S

In this model S(t) ≥ 0, if S

≥ 0, µ > 0, and κ > 0.

The ﬁrst mean reversion model(57) may produce negative values even for µ > 0.

Since the second mean-reversion model has always positive realizations, it is also

called log-normal mean reversion. This type of model is used to model interest rate or

volatilities.

Stochastic Systems, 2013 38

Stochastic Diﬀerential Equations (SDE)

In control engineering science, the most important (scalar) case is

dX(t) = (A(t)X(t) + C(t)u(t)) dt +



i=1

(t) dW

. (58)

In this equation, X(t) is normally distributed because the Brownian motion is just

multiplied by time-dependent factors.

When we compute an optimal control law for this SDE, the deterministic optimal control

law (ignoring the Brownian motion) and the stochastic optimal control law are the same.

This feature is called certainty equivalence. For this reason, the stochastics are often

ignored in control engineering.

Stochastic Systems, 2013 39

Stochastic Diﬀerential Equations (SDE)

The logical extension of scalar SDEs is to allow X(t) ∈ R

to be a vector. The rest of

this section proceeds in a similar fashion as for scalar linear SDEs. A stochastic vector

diﬀerential equation

dX(t) = f(t, X(t))dt + g(t, X(t))dW (t)

with the initial condition X(0) = x

∈ R

for an n-dimensional stochastic process

X(t) is called a linear SDE if the functions f (t, X(t)) ∈ R

and g(t, X(t)) ∈ R

n×m

are aﬃne functions of X(t) and thus

f(t, X(t)) = A(t)X(t) + a(t) ,

g(t, X(t)) = [B

(t)X(t) + b

(t), ··· , B

(t)X(t) + b

(t)] ,

where A(t) ∈ R

n×n

, a(t) ∈ R

, W (t) ∈ R

is an m-dimensional Brownian motion,

and B

(t) ∈ R

n×n

, b

(t) ∈ R

Stochastic Systems, 2013 40

Stochastic Diﬀerential Equations (SDE)

Alternatively, the vector-valued linear SDE can be written as

dX(t) = (A(t)X(t) + a(t))dt +



i=1

(t)X(t) + b

(t))dW

(t) . (59)

A common extension of the above equation is the following form of a controlled

stochastic diﬀerential equation as given by

dX(t) = (A(t)X(t) + C(t)u(t) + a(t)) dt



i=1

(t)X(t) + D

(t)u(t) + b

(t)) dW

, (60)

where u (t) ∈ R

, C(t) ∈ R

n×k

, D

(t) ∈ R

n×k

Stochastic Systems, 2013 41

Stochastic Diﬀerential Equations (SDE)

The linear SDE (59) has the following solution:

X(t) = Φ(t)





−1

(s)



a(s) −



i=1

(s)b

(s)





i=1



−1

(s)b

(s)dW

(s)



, (61)

where the fundamental matrix Φ(t) ∈ R

n×n

is the solution of the homogenous

stochastic diﬀerential equation.

Stochastic Systems, 2013 42

Stochastic Diﬀerential Equations (SDE)

The fundamental matrix Φ(t) ∈ R

n×n

is the solution of the homogenous stochastic

diﬀerential equation:

dΦ(t) = A(t)Φ(t)dt +



i=1

(t)Φ(t)dW

(t) , (62)

with initial condition Φ(0) = I, I ∈ R

e now prove that (61) and (62) are

solutions of (59). We rewrite (61) as

X(t) = Φ(t)





−1

(t)dY (t)



dY (t) =



a(t) −



i=1

(t)b

(t)



dt +



i=1

(t)dW

(t) .

Stochastic Systems, 2013 43

Stochastic Diﬀerential Equations (SDE)

X(t) = Φ(t)Z(t) , Z(t) =





−1

(t)dY (t)



dZ(t) = Φ

−1

(t)dY (t)

We use the Itˆo formula to calculate X(t) = Φ(t )Z(t):

dX(t) = Φ(t)dZ(t) + dΦ(t)Z(t) +



i=1

(t)Φ(t)Φ(t)

−1

(t)dt

= dY (t) + A(t)Φ(t)Z(t)dt +



i=1

(t)Φ(t)Z(t)dW

(t) +



i=1

(t)b

(t)dt

Stochastic Systems, 2013 44

Stochastic Diﬀerential Equations (SDE)

Noting that Z(t) = Φ

−1

(t)X(t) and using the SDE for Y (t), we get

dX(t) = dY (t) + A(t)Φ(t)Z(t)dt +



i=1

(t)Φ(t)Z(t)dW

(t) +



i=1

(t)b

(t)dt



a(t) −



i=1

(t)b

(t)



dt +



i=1

(t)dW

(t) + A(t)X(t)dt



i=1

(t)X(t)dW

(t) +



i=1

(t)b

(t)dt

= [a(t ) + A(t)X(t)]dt +



i=1

(t)X(t) + b

(t))dW

(t) .

This completes the proof.

Stochastic Systems, 2013 45

Stochastic Diﬀerential Equations (SDE)

The expectation m(t) = E[X(t)] ∈ R

and the second moment matrix P (t) =

E[X(t)X

(t)] ∈ R

n×n

can be computed as follows:

˙m(t) = A(t)m(t) + a(t) , m(0) = x

, (63)

P (t) = A(t)P (t) + P (t)A

(t) + a(t)m

(t) + m(t)a

(t)



i=1

(t)P (t)B

(t) + B

(t)m(t)b

(t)

(t)m

(t)B

(t) + b

(t)b

(t)

] , P (0) = x

. (64)

The covariance matrix for the system of linear SDEs is given by als

V (t) = Var{x(t)} = P (t) − m(t)m

(t) . (65)

Stochastic Systems, 2013 46

Stochastic Diﬀerential Equations (SDE)

The special case

dX(t) = (A(t)X(t) + a(t))dt +



i=1

(t)dW

(t)

with the initial condition X(0) = x

∈ R

is normally distributed, i.e.,

P (X(t)|x

) ∼ N(m(t), V (t))

where

˙m(t) = A(t)m(t) + a(t) m(0) = x

V (t) = A(t)V (t) + V (t)A

(t) +



i=1

(t) V (0) = 0 .

Stochastic Systems, 2013 47

Stochastic Diﬀerential Equations (SDE)

As ﬁrst example of a linear vector valued SDE, we consider a two dimensional geometric

Brownian motion:

(t) = µ

(t)dt + S

(t)



(t) + σ

(t)



, (66)

(t) = µ

(t)dt + S

(t)



(t) + σ

(t)



. (67)

Written in matrix form S = (S

, S

)

, the same SDE is given as:

A(t) =



0 µ



a(t) =





(t) =



0 σ



(t) =



0 σ



Both processes S

(t) and S

(t) are correlated if σ

= σ

̸= 0. This model can be

easily extended to n processes.

Stochastic Systems, 2013 48

Stochastic Diﬀerential Equations (SDE)

The observed volatility for real existing price processes, such as stocks or bonds is itself

a stochastic process. The following model describes this observation:

dP (t) = µdt + σ(t)dW

(t) , P (0) = P

dσ(t) = κ(θ − σ(t))dt + σ(t)σ

(t) , σ(0) = σ

where θ is the average volatility, σ

a volatility, and κ the mean reversion rate of

the volatility process σ(t). If this model is used for stock prices, the transformation

P (t) = ln(S(t)) is useful. The two Brownian motions dW

(t) and dW

(t) are

correlated, hence corr[dW

(t), dW

(t)] = ρ. This model captures the behavior of

real existing prices better and its distribution of returns shows “fatter tails”.

Stochastic Systems, 2013 49

Stochastic Diﬀerential Equations (SDE)

Die system (68) can be rewritten as linear SDE:

A(t) =



0 0

0 −κ



a(t) =



κθ



(t) =



0 1

0 σ



(t) =



0 0

0 σ



1 − ρ



wobei x(t) = (P (t), σ(t))

. The system (68) has the property, that the variance

of P (t) depends on the initial condition σ

For the parameters µ = 0.1, κ = 2,

θ = 0.2, σ

= 0.5 and ρ = 0.5, we calculate the standard deviation of P (t) with

= 0.1 and alternatively with σ

= 0.8. The expected value of σ(t) has the

following evaluation over time m(t) = θ + (σ

− θ)e

−κt

and thus the variance of

P (t) depends on σ

Stochastic Systems, 2013 50

Stochastic Diﬀerential Equations (SDE)

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5

0.1

0.2

0.3

0.4

0.5

0.6

0.7

time

Standardabweichung

=0.1

=0.8

Abbildung 1: Stand. dev. of P (t) for diﬀerent initial conditions of σ(t)

Stochastic Systems, 2013 51

Stochastic Diﬀerential Equations (SDE)

In comparison with linear SDEs, nonlinear SDEs are less well understood. No general

solution theory exists. And there are no explicit formulae for calculating the moments.

In this section, we show some examples of nonlinear SDEs and their properties.

In general, a scalar square root process can be written as

dX(t) = f(t, X(t))dt + g(t, X(t))dW (t)

with

f(t, X(t)) = A(t)X(t) + a(t)

g(t, X(t)) = B(t)



X(t) ,

where A(t), a(t), and B(t) are real scalars. The nonlinear mean reverting SDEs diﬀer

from the linear scalar equations by their nonlinear diﬀusion term. For this process, the

distribution and moments can be calculated.

Stochastic Systems, 2013 52

Stochastic Diﬀerential Equations (SDE)

For a speciﬁc square root process with A(t) = 0, a(t) = 1 and B(t) = 2 we are

able to derive the analytical solution: The SDE

dX(t) = 1dt + 2



X(t)dW (t) , X(0) = x

has the solution X(t) = (W (t) + x

)

We verify the solution using Itˆo formula. We

use Φ(t) = X(t) = (Y (t) + x

)

and dY (t) = dW (t). The partial derivatives are

= 0, Φ

= 2(Y (t) + x

), and Φ

Y Y

= 2. Thus

dΦ(t) = [Φ

+ Φ

· 0 +

Y Y

· 1]dt + Φ

· 1dW (t) ,

dΦ(t) = 1dt + 2(Y (t) + x

)dW (t) , ⇒ dX(t) = 1dt + 2



X(t)dW (t) ,

since



X(t) = Y (t) + x

Stochastic Systems, 2013 53

Stochastic Diﬀerential Equations (SDE)

Another widely used mean reversion model is obtained by

dS(t) = κ[µ − S(t)]dt + σ



S(t)dW (t) , S(0) = S

. (68)

This model is also known as the Cox-Ross-Ingersol processes.The process shows a

less volatile behavior than its linear geometric counterpart and it has a non-central

chi-square distribution. The process is often used to model short-term interest rates or

stochastic volatility processes for stock prices. Another often used square root process

is similar to the geometric Brownian motion, but with a square root diﬀusion term

instead of the linear diﬀusion term. Its model is given by

dS(t) = µS(t)dt + σ



S(t)dW (t) , S(0) = S

. (69)

Stochastic Systems, 2013 54

Stochastic Diﬀerential Equations (SDE)

The expected value for (69) is E[S(t)] = S

µt

and the variance is obtained by

Var[S(t)] =



2µt

− e

µt



Another widely used mean reversion model is obtained by

dS(t) = κS(t)[µ − ln(S(t))]dt + S(t)σdW (t) . (70)

Using the transformation P (t) = ln(S(t)) yields the linear mean reverting and

normally distributed process P (t):

dP (t) = κ[(µ −

2κ

) − P (t)]dt + σdW (t) , (71)

Because of the transformation, S(t) is log-normally distributed. This model is used

to model stock prices, stochastic volatilities, and electricity prices. Because S(t) is

log-normally distributed, S(t) is always positive.

Stochastic Systems, 2013 55

Stochastic Diﬀerential Equations (SDE)

In this part, we introduce three major methods to compute solution of SDEs.

• The ﬁrst method is based on the Itˆo integral and has already been used for linear

solutions.

• We introduce numerical methods to compute path-wise solutions of SDEs.

• The third method is based on partial diﬀerential equations, where the problem of

ﬁnding the probability density function of the solution is transformed into solving a

partial diﬀerential equation.

Stochastic Systems, 2013 56

Stochastic Diﬀerential Equations (SDE)

The stochastic process X(t) governed by the stochastic diﬀerential equation

dX(t) = f(t, X(t))dt + g(t, X(t))dW (t)

X(0) = X

is explicitly described by the integral form

X(t, ω) = X



f(s, X(s)) ds +



g(s, X(s)) dW (s) ,

where the ﬁrst integral is a path-wise Riemann integral and the second integral is an

Itˆo integral.

In this deﬁnition, it is assumed that the functions f(t, X(t)) and g(t, X(t)) are

suﬃciently smooth in order to guarantee the existence of the solution X(t).

Stochastic Systems, 2013 57

Stochastic Diﬀerential Equations (SDE)

There are several ways of ﬁnding analytical solutions. One way is to guess a soluti-

on and use the Itˆo calculus to verify that it is a solution for the SDE under consideration.

We assume that the following nonlinear SDE

dX(t) = dt + 2



X(t) dW (t) ,

has the solution

X(t) = (W (t) +



)

In order to verify this claim, we use the Itˆo calculus. We have X(t) = ϕ(W ) where

ϕ(W ) = (W (t) +

√

)

, so that ϕ

′

(W ) = 2(W (t) +

√

) and ϕ

′′

(W ) = 2.

Stochastic Systems, 2013 58

Stochastic Diﬀerential Equations (SDE)

Using Itˆo’s rule, we get

dX(t) =



f(t, X)dt + g(t, X)dW (t)



f(t, X) = ϕ(W )

′

1 +

′′

(W )(2



X(t))

= 1

g(T, X) = ϕ

′

(W )(2



X(t)) = 2(W (t) +



) .

Since X(t) = (W (t) +

√

)

we know that (W (t) +

√

) =



X(t) and thus

the Itˆo calculation generated the original SDE where we started at.

Stochastic Systems, 2013 59

Stochastic Diﬀerential Equations (SDE)

For some classes of SDEs, analytical formulas exist to ﬁnd the solution, e.g. consider

the following SDE:

dX(t) = f(t, X(t))dt + σ(t)dW (t) , X(0) = x

(72)

where X(t) ∈ R

, f(t, X(t)) ∈ R

is an arbitrary function, σ(t) ∈ R

n×m

and

dW (t) ∈ R

. This class of SDEs has the following general solution:

X(t) = Y (t) + F (t) (73)

dY (t) = f (t, Y (t) + F (t))dt , Y (0) = x

(74)

dF (t) = σ(t)dW (t) , F (0) = 0 . (75)

The SDE for F (t) can be integrated, i.e. F (t) =



σ(s)dW (s). When σ(t) = σ

than F (t) = σW (t).

Stochastic Systems, 2013 60

Stochastic Diﬀerential Equations (SDE)

SinceF (t) is know,, we are able to solve for Y (t) in in function of F (t).

Using Itˆo lemman, we show that X(t) = Y (t) + F (t) and this solves the SDE

dX(t) = dY (t) + dF (t) = f (t, Y (t) + F (t))dt + σ(t)dW (t)

= f (t, X(t))dt + σ(t)dW (t) (76)

This solution is not very suprising, since X(t) is the sum of the process of Y (t) and

the BM of F (t).

Stochastic Systems, 2013 61

Stochastic Diﬀerential Equations (SDE)

For another class of SDEs, exist an analytical formula for their solution:

dX(t) = f(t, X(t))dt + c(t)X(t)dW (t) , X(0) = x

, (77)

where f (t, X(t)) ∈ R, c(t) ∈ R and dW ∈ R. DThe solution can be derived as

follows:

X(t) = F

−1

(t)Y (t) (78)

dF (t) = F (t)c

(t)dt − F (t)c(t)dW (t) , F (0) = 1 (79)

dY (t) = F (t)f(t, F

−1

Y (t))dt (80)

The proof is similar to the ﬁrst case, sice the diﬀusion is linear.

Stochastic Systems, 2013 62

Stochastic Diﬀerential Equations (SDE)

Calculate the analytical solution for

dX(t) =

X(t)

+ αX(t)dW (t) , X(0) = x

F (t) = e

t−αW (t)

, dY (t) =

F (t)

−1

(t)Y

dt =

(t)

dY (t)Y (t) = F

(t)dt ,

(t) =



(s)ds + C

Y (t) =



+ 2



s−2αW (s)



X(t) = e

−

t+αW (t)



+ 2



s−2αW (s)



Stochastic Systems, 2013 63

Stochastic Diﬀerential Equations (SDE)

However, most SDEs, especially nonlinear SDEs, do not have analytical solutions so

that one has to resort to numerical approximation schemes in order to simulate sample

paths of solutions to the given equation.

The simplest scheme is obtained by using a ﬁrst-order approximation. This is called the

Euler scheme

X(t

) = X(t

k−1

) + f(t

k−1

, X(t

k−1

))∆t + g(t

k−1

, X(t

k−1

))∆W (t

) .

The Brownian motion term can be approximated as follows:

∆W (t

) = ϵ(t

)

√

∆t ,

where the ϵ(.) is a discrete-time Gaussian white process with mean 0 and standard

deviation 1.

Stochastic Systems, 2013 64