last updated: September 19, 2024

Stochastic Processes and Financial Mathematics
(part one)

\(\newcommand{\footnotename}{footnote}\) \(\def \LWRfootnote {1}\) \(\newcommand {\footnote }[2][\LWRfootnote ]{{}^{\mathrm {#1}}}\) \(\newcommand {\footnotemark }[1][\LWRfootnote ]{{}^{\mathrm {#1}}}\) \(\let \LWRorighspace \hspace \) \(\renewcommand {\hspace }{\ifstar \LWRorighspace \LWRorighspace }\) \(\newcommand {\mathnormal }[1]{{#1}}\) \(\newcommand \ensuremath [1]{#1}\) \(\newcommand {\LWRframebox }[2][]{\fbox {#2}} \newcommand {\framebox }[1][]{\LWRframebox } \) \(\newcommand {\setlength }[2]{}\) \(\newcommand {\addtolength }[2]{}\) \(\newcommand {\setcounter }[2]{}\) \(\newcommand {\addtocounter }[2]{}\) \(\newcommand {\arabic }[1]{}\) \(\newcommand {\number }[1]{}\) \(\newcommand {\noalign }[1]{\text {#1}\notag \\}\) \(\newcommand {\cline }[1]{}\) \(\newcommand {\directlua }[1]{\text {(directlua)}}\) \(\newcommand {\luatexdirectlua }[1]{\text {(directlua)}}\) \(\newcommand {\protect }{}\) \(\def \LWRabsorbnumber #1 {}\) \(\def \LWRabsorbquotenumber "#1 {}\) \(\newcommand {\LWRabsorboption }[1][]{}\) \(\newcommand {\LWRabsorbtwooptions }[1][]{\LWRabsorboption }\) \(\def \mathchar {\ifnextchar "\LWRabsorbquotenumber \LWRabsorbnumber }\) \(\def \mathcode #1={\mathchar }\) \(\let \delcode \mathcode \) \(\let \delimiter \mathchar \) \(\def \oe {\unicode {x0153}}\) \(\def \OE {\unicode {x0152}}\) \(\def \ae {\unicode {x00E6}}\) \(\def \AE {\unicode {x00C6}}\) \(\def \aa {\unicode {x00E5}}\) \(\def \AA {\unicode {x00C5}}\) \(\def \o {\unicode {x00F8}}\) \(\def \O {\unicode {x00D8}}\) \(\def \l {\unicode {x0142}}\) \(\def \L {\unicode {x0141}}\) \(\def \ss {\unicode {x00DF}}\) \(\def \SS {\unicode {x1E9E}}\) \(\def \dag {\unicode {x2020}}\) \(\def \ddag {\unicode {x2021}}\) \(\def \P {\unicode {x00B6}}\) \(\def \copyright {\unicode {x00A9}}\) \(\def \pounds {\unicode {x00A3}}\) \(\let \LWRref \ref \) \(\renewcommand {\ref }{\ifstar \LWRref \LWRref }\) \( \newcommand {\multicolumn }[3]{#3}\) \(\require {textcomp}\) \(\newcommand {\intertext }[1]{\text {#1}\notag \\}\) \(\let \Hat \hat \) \(\let \Check \check \) \(\let \Tilde \tilde \) \(\let \Acute \acute \) \(\let \Grave \grave \) \(\let \Dot \dot \) \(\let \Ddot \ddot \) \(\let \Breve \breve \) \(\let \Bar \bar \) \(\let \Vec \vec \) \(\DeclareMathOperator {\var }{var}\) \(\DeclareMathOperator {\cov }{cov}\) \(\def \ra {\Rightarrow }\) \(\def \to {\rightarrow }\) \(\def \iff {\Leftrightarrow }\) \(\def \sw {\subseteq }\) \(\def \wt {\widetilde }\) \(\def \mc {\mathcal }\) \(\def \mb {\mathbb }\) \(\def \sc {\setminus }\) \(\def \v {\textbf }\) \(\def \p {\partial }\) \(\def \E {\mb {E}}\) \(\def \P {\mb {P}}\) \(\def \R {\mb {R}}\) \(\def \C {\mb {C}}\) \(\def \N {\mb {N}}\) \(\def \Q {\mb {Q}}\) \(\def \Z {\mb {Z}}\) \(\def \B {\mb {B}}\) \(\def \~{\sim }\) \(\def \-{\,;\,}\) \(\def \|{\,|\,}\) \(\def \qed {$\blacksquare $}\) \(\def \1{\unicode {x1D7D9}}\) \(\def \cadlag {c\`{a}dl\`{a}g}\) \(\def \p {\partial }\) \(\def \l {\left }\) \(\def \r {\right }\) \(\def \F {\mc {F}}\) \(\def \G {\mc {G}}\) \(\def \H {\mc {H}}\) \(\def \Om {\Omega }\) \(\def \om {\omega }\)

3.3 Martingales

In this section we introduce martingales, which are the mathematical representation of a ‘fair game’. As usual, let \((\Om ,\F ,\P )\) be a probability space. We refer to a sequence of random variables \((S_n)_{n=0}^\infty \) as a stochastic process. In this section of the course we only deal with discrete time stochastic processes.

We have previously discussed the idea of gradually learning more and more information about the outcome of some experiment, through seeing the information visible from gradually larger \(\sigma \)-fields. We formalize this concept as follows.

  • Definition 3.3.1 A sequence of \(\sigma \)-fields \((\F _n)_{n=0}^\infty \) is known as a filtration if \(\F _0\subseteq \F _1\ldots \subseteq \F .\)

  • Definition 3.3.2 We say that a stochastic process \(X=(X_n)\) is adapted to the filtration \((\mc {F}_n)\) if, for all \(n\), \(X_n\) is \(\F _n\) measurable.

We should think of the filtration \(\mc {F}_n\) as telling us which information we have access too at time \(n=1,2,\ldots \). Thus, an adapted process is a process whose (random) value we know at all times \(n\in \N \).

We are now ready to give the definition of a martingale.

  • Definition 3.3.3 A process \(M=(M_n)_{n=0}^\infty \) is a martingale if

    • 1. if \((M_n)\) is adapted,

    • 2. \(M_n\in L^1\) for all \(n\),

    • 3. \(\E [M_{n+1}\|\F _{n}]=M_{n}\) almost surely, for all \(n\).

    We say that \(M\) is a submartingale if, instead of 3, we have \(\E [M_n\|\F _{n-1}]\geq M_{n-1}\) almost surely.
    We say that \(M\) is a supermartingale if, instead of 3, we have \(\E [M_n\|\F _{n-1}]\leq M_{n-1}\) almost surely.

  • Remark 3.3.4 The second condition in Definition 3.3.3 is needed for the third to make sense.

  • Remark 3.3.5 \((M_n)\) is a martingale iff it is both a submartingale and a supermartingale.

A martingale is the mathematical idealization of a fair game. It is best to understand what we mean by this through an example.

Let \((X_n)\) be a sequence of i.i.d. random variables such that

\[\P [X_i=1]=\P [X_i=-1]=\frac {1}{2}.\]

Define \(\mc {F}_n=\sigma (X_1,\ldots ,X_n)\) and \(\mc {F}_0=\{\emptyset ,\Omega \}\). Then \((\mc {F}_n)\) is a filtration. Define

\[S_n=\sum \limits _{i=1}^n X_i\]

(and \(S_0=0\)). We can think of \(S_n\) as a game in the following way. At each time \(n=1,2,\ldots \) we toss a coin. We win if the \(n^{th}\) round if the coin is heads, and lose if it is tails. Each time we win we score \(1\), each time we lose we score \(-1\). Thus, \(S_n\) is our score after \(n\) rounds. The process \(S_n\) is often called a simple random walk.

We claim that \(S_n\) is a martingale. To see this, we check the three properties in the definition. (1) Since \(X_1,X_2,\ldots ,X_n\in m\sigma (X_1,\ldots ,X_n)\) we have that \(S_n\in \mc {F}_n\) for all \(n\in \N \). (2) Since \(|S_n|\leq n\) for all \(n\in \N \), \(\E [|S_n|]\leq n\) for all \(n\), so \(S_n\in L^1\) for all \(n\). (3) We have

\begin{align*} \E \l [S_{n+1}\|\mc {F}_{n}\r ]&=\E [X_{n+1}\|\mc {F}_n]+\E [S_n\|\mc {F}_n]\\ &=\E [X_{n+1}]+S_n\\ &=S_n. \end{align*} Here, in the first line we used the linearity of conditional expectation. To deduce the second line we used the relationship between independence and conditional expectation (for the first term) and the measurability rule (for the second term). To deduce the final line we used that \(\E [X_{n+1}]=(1)\frac 12+(-1)\frac 12=0\).

At time \(n\) we have seen the result of rounds \(1,2,\ldots ,n\), so the information we currently have access to is given by \(\mc {F}_n\). This means that at time \(n\) we know \(S_1,\ldots ,S_n\). But we don’t know \(S_{n+1}\), because \(S_{n+1}\) is not \(\mc {F}_n\)-measurable. However, using our current information we can make our best guess at what \(S_{n+1}\) will be, which naturally is \(\E [S_{n+1}|\mc {F}_n]\). Since the game is fair, in the future, on average we do not expect to win more than we lose, that is \(\E [S_{n+1}|\mc {F}_n]=S_n\).

In this course we will see many examples of martingales, and we will gradually build up an intuition for how to recognize a martingale. There is, however, one easy sufficient (but not necessary) condition under which we can recognize that a stochastic process is not a martingale.

  • Lemma 3.3.6 Let \((\mc {F}_n)\) be a filtration and suppose that \((M_n)\) is a martingale. Then

    \[\E [M_n]=\E [M_0]\]

    for all \(n\in \N \).

Proof: We have \(\E [M_{n+1}|\mc {F}_n]=M_n\). Taking expectations and using the ‘taking \(\E \)’ property from Proposition 3.2.1, we have \(\E [M_{n+1}]=\E [M_n]\). The result follows by a trivial induction.   ∎

Suppose, now, that \((X_n)\) is an i.i.d. sequence of random variables such that \(\P [X_i=2]=\P [X_i=-1]=\frac {1}{2}\). Note that \(\E [X_n]>0\). Define \(S_n\) and \(\mc {F}_n\) as before. Now, \(\E [S_n]=\sum _1^n\E [X_n]\), which is not constant, so \(S_n\) is not a martingale. However, as before, \(S_n\) is \(\mc {F}_n\)-measurable, and \(|S_n|\leq 2n\) so \(S_n\in L^1\), essentially as before. We have

\begin{align*} \E [S_{n+1}|\mc {F}_n]&=\E [X_{n+1}|\mc {F}_n]+\E [S_n|\mc {F}_n]\\ &=\E [X_{n+1}]+S_n\\ &\geq S_n. \end{align*} Hence \(S_n\) is a submartingale.

In general, if \((M_n)\) is a submartingale, then by definition \(\E [M_{n+1}\|\mc {F}_n]\geq M_n\), so taking expectations gives us \(\E [M_{n+1}]\geq \E [M_n]\). For supermartingales we get \(\E [M_{n+1}]\leq \E [M_n]\). In words: submartingales, on average, increase, whereas supermartingales, on average, decrease. The use of super- and sub- is counter intuitive in this respect.

  • Remark 3.3.7 Sometimes we will want to make it clear which filtration is being used in the definition of a martingale. To do so we might say that \((M_n)\) is an \(\mc {F}_n\)-martingale’, or that \((M_n)\) is a martingale with respect to \(\mc {F}_n\)’. We use the same notation for super/sub-martingales.

    Our definition of a filtration and a martingale both make sense if we look at only a finite set of times \(n=1,\ldots , N\). We sometimes also use the terms filtration and martingale in this situation.

We end this section with two important general examples of martingales. You should check the conditions yourself, as exercise 3.3.

  • Example 3.3.8 Let \((X_n)\) be a sequence of i.i.d. random variables such that \(\E [X_n]=1\) for all \(n\), and there exists \(c\in \R \) such that \(|X_n|\leq c\) for all \(n\). Define \(\mc {F}_n=\sigma (X_1,\ldots ,X_n)\). Then

    \[M_n=\prod _{i=1}^n X_n\]

    is a martingale.

  • Example 3.3.9 Let \(Z\in L^1\) be a random variable and let \((\mc {F}_n)\) be a filtration. Then

    \[M_n=\E [Z|\mc {F}_n]\]

    is a martingale.