Study notes for Lawler's Stochastic Calculus (Chapter 1 — Martingales in Discrete Time).
| Symbol | Meaning |
|---|---|
| \(E[M_n^2] < \infty\) | Square integrability — the hypothesis of §1.5 |
| \(L^2(\Omega, \mathcal{F}, P)\) | Hilbert space of square-integrable random variables with inner product \((X,Y) = E[XY]\) |
| \((X, Y) = E[XY]\) | Inner product on \(L^2\) — orthogonality means \((X,Y) = 0\) |
| \(\Delta M_n = M_n - M_{n-1}\) | Increment of \(M_n\) at step \(n\) |
| \(E[\Delta M_{n+1} \cdot \Delta M_{m+1}] = 0,\; m \neq n\) | Orthogonality of martingale increments |
| \(J_n\) | Predictable integrand — \(J_n\) is \(\mathcal{F}_{n-1}\)-measurable |
| \(Z_n = \sum_{j=1}^n J_j X_j\) | Discrete stochastic integral of \(J\) with respect to the random walk \(S_n\) |
| \(\sigma^2 = E[X_j^2]\) | Variance of each i.i.d. increment |
| \(\mathrm{Var}[Z_n] = \sigma^2 \sum_{j=1}^n E[J_j^2]\) | Variance rule for the discrete stochastic integral |
| \(\bar{Y}_n = \max\{Y_0, Y_1, \ldots, Y_n\}\) | Running maximum of a nonneg submartingale \(Y_n\) |
| \(\overline{M}_n = \max\{\lvert M_0 \rvert, \ldots, \lvert M_n \rvert\}\) | Running maximum of \(\lvert M_n \rvert\) |
| \(P\{\bar{Y}_n \geq a\} \leq a^{-1} E[Y_n]\) | Doob’s maximal inequality for submartingales |
| \(P\{\overline{M}_n \geq a\} \leq a^{-2} E[M_n^2]\) | Doob’s \(L^2\) maximal inequality for square integrable martingales |
A martingale \(M_n\) is called square integrable if for each \(n\), \(E[M_n^2] < \infty\).
This is the condition that \(M_n \in L^2(\Omega, \mathcal{F}_n, P)\) at every time \(n\).
Why stronger than integrability: Square integrability \(E[M_n^2] < \infty\) implies integrability \(E[\lvert M_n \rvert] < \infty\) by Jensen’s inequality, but not vice versa.
Why weaker than uniform \(L^2\) boundedness: The definition requires \(E[M_n^2] < \infty\) for each fixed \(n\), but the bound may grow with \(n\). The stronger condition \(\sup_n E[M_n^2] \leq C < \infty\) (used in OST III and the MCT) is a separate, stricter requirement.
Random variables \(X, Y\) are orthogonal if \(E[XY] = E[X]\, E[Y]\).
For zero-mean random variables, orthogonality reduces to \(E[XY] = 0\). Independent random variables are orthogonal, but orthogonal random variables need not be independent.
Suppose that \(M_n\) is a square integrable martingale with respect to \(\{\mathcal{F}_n\}\). Then if \(m < n\),
\[E[(\Delta M_{n+1})(\Delta M_{m+1})] = 0,\]where \(\Delta M_k = M_k - M_{k-1}\). Moreover, for all \(n\),
\[E[M_n^2] = E[M_0^2] + \sum_{j=1}^n E\bigl[(\Delta M_j)^2\bigr].\]Proof of orthogonality:
For \(m < n\), the increment \(\Delta M_{m+1} = M_{m+1} - M_m\) is \(\mathcal{F}_n\)-measurable (since \(m+1 \leq n\)). Therefore:
\[E[(\Delta M_{n+1})(\Delta M_{m+1}) \mid \mathcal{F}_n] = (\Delta M_{m+1})\, E[\Delta M_{n+1} \mid \mathcal{F}_n] = (\Delta M_{m+1}) \cdot 0 = 0.\]The second equality uses the martingale property: \(E[\Delta M_{n+1} \mid \mathcal{F}_n] = E[M_{n+1} - M_n \mid \mathcal{F}_n] = 0\). Taking full expectations gives \(E[(\Delta M_{n+1})(\Delta M_{m+1})] = 0\).
Proof of the Pythagorean identity:
Write \(M_n = M_0 + \sum_{j=1}^n \Delta M_j\) and expand the square:
\[M_n^2 = M_0^2 + \sum_{j=1}^n (\Delta M_j)^2 + \sum_{j \neq k} (\Delta M_j)(\Delta M_k).\]Taking expectations and using orthogonality (all cross terms vanish):
\[E[M_n^2] = E[M_0^2] + \sum_{j=1}^n E[(\Delta M_j)^2].\]Suppose that \(X_1, X_2, \ldots\) are independent, identically distributed random variables with mean zero and variance \(\sigma^2\).
The two main examples are:
Let \(S_n = X_1 + \cdots + X_n\) and let \(\{\mathcal{F}_n\}\) be the filtration generated by \(X_1, \ldots, X_n\).
A sequence of random variables \(J_1, J_2, \ldots\) is called predictable (with respect to \(\{\mathcal{F}_n\}\)) if for each \(n\), \(J_n\) is \(\mathcal{F}_{n-1}\)-measurable.
This is the non-anticipating condition from §1.2: The integrand \(J_n\) is determined by observations strictly before time \(n\).
The discrete stochastic integral is defined by:
\[Z_n = \sum_{j=1}^n J_j X_j = \sum_{j=1}^n J_j \,\Delta S_j.\]Property 1 — Martingale property
\[Z_n \text{ is a martingale with respect to } \{\mathcal{F}_n\}.\]Proof:
\[E[Z_{n+1} \mid \mathcal{F}_n] = E[Z_n + J_{n+1} X_{n+1} \mid \mathcal{F}_n] = Z_n + J_{n+1}\, E[X_{n+1} \mid \mathcal{F}_n] = Z_n + J_{n+1} \cdot 0 = Z_n.\]Here: \(Z_n\) is \(\mathcal{F}_n\)-measurable (Property 1 of §1.1); \(J_{n+1}\) is \(\mathcal{F}_n\)-measurable and pulls out (Property 5); \(X_{n+1}\) is independent of \(\mathcal{F}_n\) with \(E[X_{n+1}] = 0\) (Property 3).
Property 2 — Linearity
If \(J_n, K_n\) are predictable sequences and \(a, b\) constants, then \(aJ_n + bK_n\) is predictable and:
\[\sum_{j=1}^n (aJ_j + bK_j) X_j = a \sum_{j=1}^n J_j X_j + b \sum_{j=1}^n K_j X_j.\]Property 3 — Variance rule
\[\mathrm{Var}[Z_n] = E[Z_n^2] = \sigma^2 \sum_{j=1}^n E[J_j^2].\]Proof:
Using orthogonality of martingale increments (§1.5), the cross terms \(E[J_j X_j \cdot J_k X_k]\) vanish for \(j \neq k\):
\[E[Z_n^2] = \sum_{j=1}^n E[J_j^2 X_j^2].\]Since \(J_j\) is \(\mathcal{F}_{j-1}\)-measurable and \(X_j\) is independent of \(\mathcal{F}_{j-1}\):
\[E[J_j^2 X_j^2] = E\bigl[E[J_j^2 X_j^2 \mid \mathcal{F}_{j-1}]\bigr] = E\bigl[J_j^2\, E[X_j^2 \mid \mathcal{F}_{j-1}]\bigr] = E[J_j^2]\, \sigma^2.\]Summing over \(j\) gives \(E[Z_n^2] = \sigma^2 \sum_{j=1}^n E[J_j^2]\).
| Feature | Discrete: \(Z_n = \sum J_j X_j\) | Continuous: \(\int_0^t A_s\, dB_s\) |
|---|---|---|
| Integrand condition | \(J_n\) is \(\mathcal{F}_{n-1}\)-measurable (predictable) | \(A_s\) is adapted, square-integrable |
| Martingale property | \(Z_n\) is a martingale | \(\int_0^t A_s\, dB_s\) is a martingale |
| Variance rule | \(E[Z_n^2] = \sigma^2 \sum E[J_j^2]\) | \(E\!\left[\left(\int_0^t A_s\, dB_s\right)^2\right] = \int_0^t E[A_s^2]\, ds\) |
| Linearity | ✓ direct from summation | ✓ by construction |
Suppose \(Y_n\) is a nonneg submartingale with respect to \(\{\mathcal{F}_n\}\), and \(\bar{Y}_n = \max\{Y_0, Y_1, \ldots, Y_n\}\). Then for every \(a > 0\),
\[P\{\bar{Y}_n \geq a\} \leq \frac{1}{a}\, E[Y_n].\]Proof:
Let \(T = \min\{k \leq n : Y_k \geq a\}\) (with \(T = n+1\) if no such \(k\) exists). Then:
\[\{\bar{Y}_n \geq a\} = \bigsqcup_{k=0}^n A_k, \quad A_k = \{T = k\}.\]Each \(A_k \in \mathcal{F}_k\). Since \(Y_n\) is a submartingale, \(E[Y_n \mid \mathcal{F}_k] \geq Y_k\) for \(k \leq n\), so:
\[E[Y_n \mathbf{1}_{A_k}] = E\bigl[E[Y_n \mid \mathcal{F}_k]\, \mathbf{1}_{A_k}\bigr] \geq E[Y_k\, \mathbf{1}_{A_k}] \geq a\, P(A_k).\]Summing over \(k = 0, 1, \ldots, n\):
\[E[Y_n] \geq E\!\left[Y_n\, \mathbf{1}_{\{\bar{Y}_n \geq a\}}\right] = \sum_{k=0}^n E[Y_n\, \mathbf{1}_{A_k}] \geq a\, P\{\bar{Y}_n \geq a\}.\]Dividing by \(a\) gives the result.
If \(M_n\) is a square integrable martingale with respect to \(\{\mathcal{F}_n\}\) and \(\overline{M}_n = \max\{\lvert M_0 \rvert, \ldots, \lvert M_n \rvert\}\), then for every \(a > 0\),
\[P\{\overline{M}_n \geq a\} \leq \frac{E[M_n^2]}{a^2}.\]Proof: Exercise 1.15 shows that if \(M_n\) is a martingale and \(\varphi\) is a convex function, then \(\varphi(M_n)\) is a submartingale. Taking \(\varphi(x) = x^2\) gives that \(M_n^2\) is a nonneg submartingale. Apply Theorem 1.7.1 to \(Y_n = M_n^2\) with threshold \(a^2\):
\[P\{\bar{Y}_n \geq a^2\} \leq \frac{E[M_n^2]}{a^2}.\]Since \(\{\bar{Y}_n \geq a^2\} = \{\max_k M_k^2 \geq a^2\} = \{\overline{M}_n \geq a\}\), the result follows.
Setup: \(X_1, X_2, \ldots\) i.i.d. with \(P\{X_j = \pm 1\} = \tfrac{1}{2}\), so \(\sigma^2 = 1\). Let \(S_n = X_1 + \cdots + X_n\).
Integrand: \(J_j = S_{j-1}\) (the running sum just before step \(j\), which is \(\mathcal{F}_{j-1}\)-measurable ✓).
Integral: \(Z_n = \sum_{j=1}^n S_{j-1} X_j.\)
Apply the variance rule:
\[E[Z_n^2] = \sigma^2 \sum_{j=1}^n E[J_j^2] = \sum_{j=1}^n E[S_{j-1}^2] = \sum_{j=1}^n (j-1) = \frac{n(n-1)}{2}.\]Here we used \(E[S_{j-1}^2] = j - 1\) (since \(\mathrm{Var}[S_{j-1}] = j-1\) for zero-mean i.i.d. increments with \(\sigma^2 = 1\)).
Direct check with Itô’s formula analogy: The integral \(Z_n = \sum S_{j-1} X_j\) is the discrete analogue of \(\int_0^t B_s\, dB_s\), which by Itô’s formula equals \(\tfrac{1}{2}(B_t^2 - t)\). The variance of \(\tfrac{1}{2}(B_t^2 - t)\) at time \(t\) is \(\tfrac{t^2}{2}\), consistent with \(n(n-1)/2 \approx n^2/2\) for large \(n\).
All three sections prepare the same three-property package that will recur throughout Chapters 3 and 4:
| Property | §1.6 Discrete version | Chapter 3 Continuous version |
|---|---|---|
| Martingale | \(Z_n = \sum J_j X_j\) is a martingale | \(\int_0^t A_s\, dB_s\) is a martingale |
| Linearity | \(\sum (aJ_j + bK_j) X_j = a Z_n^J + b Z_n^K\) | \(\int (aA + bC)\, dB = a\int A\, dB + b \int C\, dB\) |
| Variance rule (Itô isometry) | \(E[Z_n^2] = \sigma^2 \sum E[J_j^2]\) | \(E\!\left[\left(\int_0^t A_s\, dB_s\right)^2\right] = \int_0^t E[A_s^2]\, ds\) |
| Maximal inequality | \(P\{\overline{M}_n \geq a\} \leq E[M_n^2]/a^2\) | \(P\{\sup_{s \leq t} \lvert M_s \rvert \geq a\} \leq E[M_t^2]/a^2\) |