Eigenvalues of random matrices and Wigner’s semicircle law

Random matrix theory is concerned with the study of properties of large matrices whose entries are sampled according to some known probability distribution. Usually we are interested in objects like their eigenvalues and eigenvectors.

What is interesting is that the asymptotic behavior of the random matrices is often universal, i.e. it does not depend on the probability distribution of the entries. Wigner’s semicircle law is one foundational example of that. Reference 1 goes so far to say that “the semicircle law is as important to random matrix theory as the central limit theorem is to scalar probability theory”. That’s a pretty big claim!

First, let’s define the sequence of matrices for which the semicircle law applies:

Definition (Wigner matrices): Fix an infinite family of real-valued random variables $\{ Y_{ij} \}_{j \geq i \geq 1}$ . Define a sequence of symmetric random matrices ${\bf Y}_n \in \mathbb{R}^{n \times n}$ by

$\begin{aligned} ({\bf Y}_n)_{ij} = \begin{cases} Y_{ij} &\text{if } i \leq j, \\ Y_{ji} &\text{if } i > j. \end{cases} \end{aligned}$

Assume that:

The $\{ Y_{ij} \}$ are independent.

The diagonal entries $\{ Y_{ii}\}_{i \geq 1}$ are identically distributed, and the off-diagonal entries $\{ Y_{ij} \}_{1\leq i < j}$ are identically distributed.

$\mathbb{E}[Y_{ij}^2] < \infty$ for all $i, j$ .

$\mathbb{E}[Y_{12}^2] > 0$ (so that the off-diagonal terms are not trivially all 0 almost surely).

Then the matrices ${\bf X}_n = n^{-1/2} {\bf Y}_n$ are Wigner matrices. (Sometimes, the entire sequence of matrices is referred to as a Wigner matrix.)

For $t > 0$ , define the distribution

$\sigma_t (dx) = \dfrac{1}{2\pi t} \sqrt{(4t - x^2)_+} dx.$

Wigner’s semicircle law states that if $\mathbb{E}[Y_{12}^2] = t$ , then the limiting distribution of the eigenvalues of ${\bf X}_n$ is $\sigma_t$ . The result is stated more formally below:

Theorem (Wigner’s semicircle law): Let ${\bf X}_n = n^{-1/2} {\bf Y}_n$ be a sequence of Wigner matrices such that $\mathbb{E}[Y_{ij}] = 0$ for all $i, j$ and $\mathbb{E}[Y_{12}^2] = t > 0$ . Let $I \subset \mathbb{R}$ be an interval. Define the random variables

$\begin{aligned} E_n(I) = \dfrac{\# \{ \{ \lambda_1 ({\bf X}_n), \dots, \lambda_n ({\bf X}_n) \} \cap I \}}{n}, \end{aligned}$

where $\lambda_1 ({\bf X}_n), \dots, \lambda_n ({\bf X}_n)$ are the eigenvalues of ${\bf X}_n$ . Then $E_n(I) \rightarrow \sigma_t(I)$ in probability as $n \rightarrow \infty$ .

(Note: Convergence in probability can be strengthened to almost sure convergence.)

See reference 2 for a proof of the semicircle law. Here, we demonstrate the semicircle law empirically. The code below generates a $1000 \times 1000$ Wigner matrix with $Y_{ij} \stackrel{iid}{\sim} \mathcal{N}(0, 1)$ and computes its eigenvalues:

# make Wigner matrix and compute eigenvalues
n <- 1000
set.seed(113)
Y <- matrix(rnorm(n * n), nrow = n)
Y[lower.tri(Y)] <- 0
Y <- Y + t(Y * upper.tri(Y))
X <- Y / sqrt(n)
eigval <- eigen(X)$values

In this setting, $t = \mathbb{E}[Y_{12}^2] = 1$ . The code below computes the true limiting distribution and plots a histogram of the eigenvalues along with the true limit:

# truth
t <- 1
xx <- seq(-2 * sqrt(t), 2 * sqrt(t), length.out = 201)
yy <- ifelse(4 * t - xx^2 > 0, sqrt(4 * t - xx^2), 0) / (2 * pi * t)

# make plot
hist(eigval, breaks = 100, freq = FALSE, main = "Histogram of eigenvalues\nNormal(0, 1)")
lines(xx, yy, col = "red")

We can amend the code simply to have the entries have Laplace distribution (this time, we have $t = 2$ ).

# make Wigner matrix and compute eigenvalues
n <- 1000
set.seed(113)
Y <- matrix(rexp(n * n), nrow = n) * matrix(ifelse(rnorm(n * n) > 0, 1, -1), nrow = n)
Y[lower.tri(Y)] <- 0
Y <- Y + t(Y * upper.tri(Y))
X <- Y / sqrt(n)
eigval <- eigen(X)$values

# truth
t <- 2
xx <- seq(-2 * sqrt(t), 2 * sqrt(t), length.out = 201)
yy <- ifelse(4 * t - xx^2 > 0, sqrt(4 * t - xx^2), 0) / (2 * pi * t)

# make plot
hist(eigval, breaks = 100, freq = FALSE, main = "Histogram of eigenvalues\nLaplace rate 1")
lines(xx, yy, col = "red")

What happens if the distribution of $Y_{ij}$ doesn’t have finite second moment? Then the theorem doesn’t hold. For example, the code below has the entries being i.i.d. Cauchy distributed (or $t$ distribution with 1 degree of freedom):

# make Wigner matrix and compute eigenvalues
n <- 1000
set.seed(113)
Y <- matrix(rt(n * n, df = 1), nrow = n)
Y[lower.tri(Y)] <- 0
Y <- Y + t(Y * upper.tri(Y))
X <- Y / sqrt(n)
eigval <- eigen(X)$values

# make plot
hist(eigval, breaks = 100, freq = FALSE, main = "Histogram of eigenvalues\nCauchy")

Notice how different the scale on the x axis is from the previous plots! Most of the eigenvalues are still within a narrow range close to 0, but there are a handful of eigenvalues that are way out. This is somewhat reminiscent of the Central Limit Theorem in scalar probability theory, where one also needs to have a finite second moment condition in order to have the desired limiting behavior.

References:

Feier, A. R. Methods of proof in random matrix theory.
Kemp, T. Math 247A: Introduction to random matrix theory.

2 thoughts on “Eigenvalues of random matrices and Wigner’s semicircle law”

Dan MacKinlay on October 10, 2019 at 3:00 am said:

Am I missing something? the conditions of the theorem as given do not seem to mention that each of the successive $\mathbf{Y}_n$ matrices eachhas dimension $n \times n$, which we need to make this work, right?

LikeLike

Reply ↓
- kjytay on October 10, 2019 at 4:55 am said:
  
  Yes you are right. I’ve amended the post to make this more explicit. Thanks!
  
  LikeLike
  
  Reply ↓

Statistical Odds & Ends

Eigenvalues of random matrices and Wigner’s semicircle law

2 thoughts on “Eigenvalues of random matrices and Wigner’s semicircle law”

Leave a comment Cancel reply

Share this:

Related

2 thoughts on “Eigenvalues of random matrices and Wigner’s semicircle law”

Leave a comment Cancel reply