Harvard

Donsker's Theorem Proof

Ashley December 16, 2024

3 minutes read

Donsker's Theorem, also known as the Donsker-Varadhan theorem or the invariance principle, is a fundamental result in probability theory and statistics. It provides a foundation for the asymptotic theory of empirical processes and has far-reaching implications in various fields, including statistical inference, stochastic processes, and mathematical finance. The theorem states that the empirical process, which is a random process derived from a sequence of independent and identically distributed (i.i.d.) random variables, converges weakly to a Gaussian process, known as the Brownian bridge, as the sample size increases.

Table of Contents

Introduction to Donsker’s Theorem

Donsker’s Theorem was first proved by Monroe Donsker in 1951, and later generalized by S.R.S. Varadhan. The theorem has since become a cornerstone of modern probability theory and statistics. It establishes a deep connection between the empirical process and the Brownian bridge, which is a Gaussian process with zero mean and covariance function given by cov(t, s) = min(t, s) - ts. The Brownian bridge is a continuous stochastic process that arises as the limit of the empirical process, and it plays a crucial role in the asymptotic theory of statistical inference.

Statement of Donsker’s Theorem

Let X₁, X₂, …, X_n be a sequence of i.i.d. random variables with mean 0 and variance 1. The empirical process is defined as:

Empirical Process: G_n(t) = √n(F_n(t) - t), where F_n(t) is the empirical distribution function.

Donsker's Theorem states that the empirical process G_n(t) converges weakly to the Brownian bridge B(t) as n → ∞. The convergence is in the sense of the uniform topology on the space of continuous functions on [0, 1].

Empirical Process	Brownian Bridge
G_n(t) = √n(F_n(t) - t)	B(t) = W(t) - tW(1)
Mean: 0	Mean: 0
Variance: 1	Variance: t(1 - t)

💡 The key insight behind Donsker's Theorem is that the empirical process, which is a discrete stochastic process, can be approximated by a continuous Gaussian process, the Brownian bridge, as the sample size increases. This approximation enables the use of powerful tools from Gaussian process theory to analyze the asymptotic behavior of statistical estimators and tests.

Proof of Donsker’s Theorem

The proof of Donsker’s Theorem involves several technical steps, including the use of the Central Limit Theorem (CLT) and the Continuous Mapping Theorem (CMT). The basic idea is to show that the empirical process G_n(t) converges weakly to the Brownian bridge B(t) by establishing the convergence of the finite-dimensional distributions and the tightness of the empirical process.

Finite-Dimensional Convergence

Let t₁, t₂, …, t_k be a set of distinct points in [0, 1]. The finite-dimensional distribution of the empirical process is given by:

G_n(t₁), G_n(t₂), ..., G_n(t_k) → B(t₁), B(t₂), ..., B(t_k)

as n → ∞, where the convergence is in distribution. This result follows from the CLT and the fact that the empirical process is a linear combination of i.i.d. random variables.

Tightness of the Empirical Process

To establish the tightness of the empirical process, we need to show that the empirical process is uniformly bounded and that the oscillations of the empirical process are controlled. This can be achieved by using the boundedness of the empirical process and the equicontinuity of the empirical process.

What is the significance of Donsker's Theorem in statistical inference?

Donsker's Theorem provides a foundation for the asymptotic theory of statistical inference. It enables the use of Gaussian process theory to analyze the asymptotic behavior of statistical estimators and tests, and it has far-reaching implications in various fields, including hypothesis testing, confidence sets, and non-parametric statistics.

Applications of Donsker’s Theorem

Donsker’s Theorem has numerous applications in statistical inference, including:

Hypothesis Testing: Donsker's Theorem provides a framework for testing hypotheses about the distribution of a population. It enables the use of Gaussian process theory to analyze the asymptotic behavior of test statistics and to construct confidence sets.
Confidence Sets: Donsker's Theorem can be used to construct confidence sets for population parameters. The theorem provides a framework for analyzing the asymptotic behavior of confidence sets and for constructing confidence sets with guaranteed coverage probabilities.
Non-Parametric Statistics: Donsker's Theorem has far-reaching implications in non-parametric statistics. It enables the use of Gaussian process theory to analyze the asymptotic behavior of non-parametric estimators and tests, and it provides a framework for constructing confidence sets and testing hypotheses about the distribution of a population.

In conclusion, Donsker’s Theorem is a fundamental result in probability theory and statistics. It provides a foundation for the asymptotic theory of empirical processes and has far-reaching implications in various fields, including statistical inference, stochastic processes, and mathematical finance. The theorem has numerous applications in hypothesis testing, confidence sets, and non-parametric statistics, and it continues to be an active area of research in modern probability theory and statistics.

Ashley Today

755 3 minutes read

Donsker's Theorem Proof