random matrices in wireless communications …...random matrices in wireless communications course...

73
Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet ST-Ericsson, Sup´ elec, FRANCE [email protected] Sup ´ elec R. Couillet (Sup ´ elec) Random Matrix Theory Course 29/10/2009 1 / 46

Upload: others

Post on 11-Jul-2020

9 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Random Matrices in Wireless CommunicationsCourse 1: Introduction to random matrix theory and the Stieltjes

transform

Romain CouilletST-Ericsson, Supelec, FRANCE

[email protected]

Supelec

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 1 / 46

Page 2: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Outline

1 What is a random matrix? Generalities

2 History of mathematical advances

3 The moment approach and free probability

4 Introduction of the Stieltjes transform

5 Proof of the Marcenko-Pastur law

6 Summary of what we know, what is left to be done, which approach to consider to attack a large dimensional

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 2 / 46

Page 3: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

Outline

1 What is a random matrix? Generalities

2 History of mathematical advances

3 The moment approach and free probability

4 Introduction of the Stieltjes transform

5 Proof of the Marcenko-Pastur law

6 Summary of what we know, what is left to be done, which approach to consider to attack a large dimensional

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 3 / 46

Page 4: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

High-dimensional data

Let x1, x2 . . . ∈ CN be independently drawn from an N-variate process of mean zero andcovariance R = E[x1xH

1 ].

Law of large numbers

As n → ∞,1

n

n∑

i=1

xi xHi

a.s.−→ R

In reality, one cannot afford n → ∞.

if n ≫ N,

Rn =1

n

n∑

i=1

xi xHi

is a “good” estimator of R.

if N/n = O(1), and if both (n,N) are large, we can still say, for all (i, j),

(Rn)ija.s.−→ (R)ij

What about the global behaviour? What about the eigenvalue distribution?

Assume R = IN and draw the eigenvalues of Rn for n,N large.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 4 / 46

Page 5: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

High-dimensional data

Let x1, x2 . . . ∈ CN be independently drawn from an N-variate process of mean zero andcovariance R = E[x1xH

1 ].

Law of large numbers

As n → ∞,1

n

n∑

i=1

xi xHi

a.s.−→ R

In reality, one cannot afford n → ∞.

if n ≫ N,

Rn =1

n

n∑

i=1

xi xHi

is a “good” estimator of R.

if N/n = O(1), and if both (n,N) are large, we can still say, for all (i, j),

(Rn)ija.s.−→ (R)ij

What about the global behaviour? What about the eigenvalue distribution?

Assume R = IN and draw the eigenvalues of Rn for n,N large.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 4 / 46

Page 6: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

High-dimensional data

Let x1, x2 . . . ∈ CN be independently drawn from an N-variate process of mean zero andcovariance R = E[x1xH

1 ].

Law of large numbers

As n → ∞,1

n

n∑

i=1

xi xHi

a.s.−→ R

In reality, one cannot afford n → ∞.

if n ≫ N,

Rn =1

n

n∑

i=1

xi xHi

is a “good” estimator of R.

if N/n = O(1), and if both (n,N) are large, we can still say, for all (i, j),

(Rn)ija.s.−→ (R)ij

What about the global behaviour? What about the eigenvalue distribution?

Assume R = IN and draw the eigenvalues of Rn for n,N large.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 4 / 46

Page 7: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

High-dimensional data

Let x1, x2 . . . ∈ CN be independently drawn from an N-variate process of mean zero andcovariance R = E[x1xH

1 ].

Law of large numbers

As n → ∞,1

n

n∑

i=1

xi xHi

a.s.−→ R

In reality, one cannot afford n → ∞.

if n ≫ N,

Rn =1

n

n∑

i=1

xi xHi

is a “good” estimator of R.

if N/n = O(1), and if both (n,N) are large, we can still say, for all (i, j),

(Rn)ija.s.−→ (R)ij

What about the global behaviour? What about the eigenvalue distribution?

Assume R = IN and draw the eigenvalues of Rn for n,N large.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 4 / 46

Page 8: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

High-dimensional data

Let x1, x2 . . . ∈ CN be independently drawn from an N-variate process of mean zero andcovariance R = E[x1xH

1 ].

Law of large numbers

As n → ∞,1

n

n∑

i=1

xi xHi

a.s.−→ R

In reality, one cannot afford n → ∞.

if n ≫ N,

Rn =1

n

n∑

i=1

xi xHi

is a “good” estimator of R.

if N/n = O(1), and if both (n,N) are large, we can still say, for all (i, j),

(Rn)ija.s.−→ (R)ij

What about the global behaviour? What about the eigenvalue distribution?

Assume R = IN and draw the eigenvalues of Rn for n,N large.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 4 / 46

Page 9: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

Empirical and limit spectra of Wishart matrices

0 0.5 1 1.5 2 2.5 30

0.2

0.4

0.6

0.8

Eigenvalues of Rn

Den

sity

Empirical eigenvalue distribution

Marcenko-Pastur Law

Figure: Histogram of the eigenvalues of Rn for n = 2000, N = 500, R = IN

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 5 / 46

Page 10: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

Definition

Definition

Let Ω be some probability space, and let ω ∈ Ω. A random matrix X = X(ω) is a random variablewhose value lies in some matrix space.

Note:

the probability space Ω is often neglected; it is e.g. the propagation environment for MIMOchannel matrices.

for asymptotic considerations, ω ∈ Ω will be the realization of an infinite sequenceX1(ω),X2(ω), . . . of size 1, 2, . . . random matrices.

In practice, we are mostly interested into Hermitian matrices and especially in the distribution oftheir eigenvalues.

Definition

The distribution function FN of the eigenvalues of the N × N random Hermitian matrix XN = XN(ω)is called the empirical spectrum distribution (e.s.d.) of XN . If FN has a limit F when N → ∞, thislimit is called the limit spectral distribution of XN .

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 6 / 46

Page 11: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

Definition

Definition

Let Ω be some probability space, and let ω ∈ Ω. A random matrix X = X(ω) is a random variablewhose value lies in some matrix space.

Note:

the probability space Ω is often neglected; it is e.g. the propagation environment for MIMOchannel matrices.

for asymptotic considerations, ω ∈ Ω will be the realization of an infinite sequenceX1(ω),X2(ω), . . . of size 1, 2, . . . random matrices.

In practice, we are mostly interested into Hermitian matrices and especially in the distribution oftheir eigenvalues.

Definition

The distribution function FN of the eigenvalues of the N × N random Hermitian matrix XN = XN(ω)is called the empirical spectrum distribution (e.s.d.) of XN . If FN has a limit F when N → ∞, thislimit is called the limit spectral distribution of XN .

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 6 / 46

Page 12: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

Finite size and asymptotic considerations

The field of random matrices is often segmented intoFinite-size random matrices:

of interest are: joint entry distributions, ordered eigenvalue distributions, e.s.d., expectation offunctionalsparticularly suitable to small size matriceshowever, much problems arise for models more involved than i.i.d. Gaussian

Limiting results:of interest are: limit spectral distributions (l.s.d.), functionals of l.s.d., central limit theorems etc.suitable to large matrices, but often good approximation to smaller matricesmuch easier to work with than finite size, more flexible (i.i.d., Kronecker, variance profile models,structured matrices)possesses a variety of powerful tools: Stieltjes transform, free probability

Remark: This course will mainly focus on limiting results and almost no finite size considerations.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 7 / 46

Page 13: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

Finite size and asymptotic considerations

The field of random matrices is often segmented intoFinite-size random matrices:

of interest are: joint entry distributions, ordered eigenvalue distributions, e.s.d., expectation offunctionalsparticularly suitable to small size matriceshowever, much problems arise for models more involved than i.i.d. Gaussian

Limiting results:of interest are: limit spectral distributions (l.s.d.), functionals of l.s.d., central limit theorems etc.suitable to large matrices, but often good approximation to smaller matricesmuch easier to work with than finite size, more flexible (i.i.d., Kronecker, variance profile models,structured matrices)possesses a variety of powerful tools: Stieltjes transform, free probability

Remark: This course will mainly focus on limiting results and almost no finite size considerations.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 7 / 46

Page 14: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

Why is this useful to wireless communications?

increasing number of parameters: multi-user systems, multiple concurrent cells, multipleantennas

matrices with random entries are the basis for MIMO channels, CDMA codes

it is no longer possible to treat large dimensional problems with classical probabilityapproaches

random matrices answer a widening panel of problems: system performance, detection,estimation. . .

Example

MIMO channel capacity Call H ∈ Cn×N the realization of a MIMO channel matrix whose entriesand distributed according to some random process. We have the per-antenna mutual information

C(σ2) =1

Nlog det

[

IN +1

σ2HHH

]

Note that, with hi the i th column of H, HHH =∑N

i=1 hi hHi . If H has i.i.d. entries, then, as both

n,N → ∞, n/N → c,

C(σ2) →∫

log[

1 +t

σ2

]

dFc(t)

with Fc the Marcenko-Pastur law with parameter c.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 8 / 46

Page 15: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

What is a random matrix? Generalities

Why is this useful to wireless communications?

increasing number of parameters: multi-user systems, multiple concurrent cells, multipleantennas

matrices with random entries are the basis for MIMO channels, CDMA codes

it is no longer possible to treat large dimensional problems with classical probabilityapproaches

random matrices answer a widening panel of problems: system performance, detection,estimation. . .

Example

MIMO channel capacity Call H ∈ Cn×N the realization of a MIMO channel matrix whose entriesand distributed according to some random process. We have the per-antenna mutual information

C(σ2) =1

Nlog det

[

IN +1

σ2HHH

]

Note that, with hi the i th column of H, HHH =∑N

i=1 hi hHi . If H has i.i.d. entries, then, as both

n,N → ∞, n/N → c,

C(σ2) →∫

log[

1 +t

σ2

]

dFc(t)

with Fc the Marcenko-Pastur law with parameter c.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 8 / 46

Page 16: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

History of mathematical advances

Outline

1 What is a random matrix? Generalities

2 History of mathematical advances

3 The moment approach and free probability

4 Introduction of the Stieltjes transform

5 Proof of the Marcenko-Pastur law

6 Summary of what we know, what is left to be done, which approach to consider to attack a large dimensional

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 9 / 46

Page 17: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

History of mathematical advances

Wishart matrices

J. Wishart, “The generalized product moment distribution in samples from a normal multivariatepopulation”, Biometrika, vol. 20A, pp. 32-52, 1928.

First random matrix considerations date back to Wishart (1928) who studies the jointdistribution of Gaussian sample covariance matrices Rn = XXH =

∑ni=1 xi xH

i ,xi ∈ CN ∼ N (0,R),

PRn (B) =πN(N−1)/2

det Rn∏N

i=1(n − i)!e− tr(R−1B) det Bn−N

Subsequent work provide expressions of the joint and marginal eigenvalue distributions,

P(λi )(λ1, . . . , λN) =

det(e−r−1j λi N)

∆(R−1)∆(L)

N∏

j=1

λn−Nj

j!(n − j)!

with r1 ≥ . . . ≥ rN the eigenvalues of R and L = diag(λ1 ≥ . . . ≥ λN) and

pλ(λ) =1

M

N−1∑

k=0

k!

(k + n − N)![Ln−N

k ]2λn−Ne−λ

where Lkn are the Laguerre polynomials defined as

Lkn(λ) =

k!λn

dk

dλk(e−λλn+k )

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 10 / 46

Page 18: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

History of mathematical advances

Semi-circle law, Full circle law...

First asymptotic approach is due to Wigner for nuclear physics purposes

E. Wigner, “Characteristic vectors of bordered matrices with infinite dimensions,” The annals ofmathematics, vol. 62, pp. 546-564, 1955.

If XN ∈ CN×N is Hermitian with i.i.d. entries of mean 0, variance 1/N, then F XNa.s.−→ F where

F has density f the semi-circle law

f (x) =1

(4 − x2)+

If XN ∈ CN×N has with i.i.d. 0 mean, variance 1/N entries, then asymptotically its complexeigenvalues distribute uniformly on the complex unit circle.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 11 / 46

Page 19: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

History of mathematical advances

Semi-circle law

−3 −2 −1 0 1 2 30

0.1

0.2

0.3

0.4

Eigenvalues

Den

sity

Empirical eigenvalue distribution

Semi-circle Law

Figure: Histogram of the eigenvalues of Wigner matrices and the semi-circle law, for N = 500

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 12 / 46

Page 20: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

History of mathematical advances

Circular law

−1 −0.5 0 0.5 1

−1

−0.5

0

0.5

1

Eigenvalues (real part)

Eig

enva

lues

(imag

inar

ypa

rt)

Empirical eigenvalue distribution

Circular Law

Figure: Eigenvalues of XN with i.i.d. standard Gaussian entries, for N = 500.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 13 / 46

Page 21: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

History of mathematical advances

More involved matrix models

much study has surrounded the Marcenko-Pastur law, the Wigner semi-circle law etc.for practical purposes, we often need more general matrix models

products and sums of random matricesi.i.d. models with correlation/variance profiledistribution of inverses etc.

for these models, it is often impossible to have an expression of the limiting distribution.

sometimes we do not have a limiting convergence.

Tools for random matrix theory

To study these models, a consistent powerful mathematical framework is required.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 14 / 46

Page 22: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

History of mathematical advances

More involved matrix models

much study has surrounded the Marcenko-Pastur law, the Wigner semi-circle law etc.for practical purposes, we often need more general matrix models

products and sums of random matricesi.i.d. models with correlation/variance profiledistribution of inverses etc.

for these models, it is often impossible to have an expression of the limiting distribution.

sometimes we do not have a limiting convergence.

Tools for random matrix theory

To study these models, a consistent powerful mathematical framework is required.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 14 / 46

Page 23: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

History of mathematical advances

More involved matrix models

much study has surrounded the Marcenko-Pastur law, the Wigner semi-circle law etc.for practical purposes, we often need more general matrix models

products and sums of random matricesi.i.d. models with correlation/variance profiledistribution of inverses etc.

for these models, it is often impossible to have an expression of the limiting distribution.

sometimes we do not have a limiting convergence.

Tools for random matrix theory

To study these models, a consistent powerful mathematical framework is required.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 14 / 46

Page 24: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Outline

1 What is a random matrix? Generalities

2 History of mathematical advances

3 The moment approach and free probability

4 Introduction of the Stieltjes transform

5 Proof of the Marcenko-Pastur law

6 Summary of what we know, what is left to be done, which approach to consider to attack a large dimensional

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 15 / 46

Page 25: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Eigenvalue distribution and moments

Moments of eigenvalue distributions,The e.s.d. of an N × N Hermitian matrix XN (ω) has successive empirical moments Mk , k = 1, 2, . . .,

Mk =1

N

N∑i=1

λki

if FN denotes the e.s.d. of XN (ω), Mk is

Mk =

∫λ

k dF (λ)

In classical probability theory, if A and B are independent, the moments of A + B arefunctions of the moments of A and those of B. In particular, for A, B independent,

ck (A + B) = ck (A) + ck (B)

with ck (X) the cumulants of X (polynomial functions of the moments mk of X ).The cumulants cn are connected to the moments mn through formulas invoking partitions,

mn =∑

π∈P(n)

V∈π

c|V |

If A, B are Hermitian matrices, we feel that, if they have independent entries, there shouldexist a relationship between the eigenvalue distribution momentsMk (A + B) = Eω[Mk (A(ω) + B(ω))]

D. V. Voiculescu, K. J. Dykema, A. Nica, “Free random variables,” American Mathematical Society,1992.

Free probabilityR. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 16 / 46

Page 26: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Eigenvalue distribution and moments

Moments of eigenvalue distributions,The e.s.d. of an N × N Hermitian matrix XN (ω) has successive empirical moments Mk , k = 1, 2, . . .,

Mk =1

N

N∑i=1

λki

if FN denotes the e.s.d. of XN (ω), Mk is

Mk =

∫λ

k dF (λ)

In classical probability theory, if A and B are independent, the moments of A + B arefunctions of the moments of A and those of B. In particular, for A, B independent,

ck (A + B) = ck (A) + ck (B)

with ck (X) the cumulants of X (polynomial functions of the moments mk of X ).The cumulants cn are connected to the moments mn through formulas invoking partitions,

mn =∑

π∈P(n)

V∈π

c|V |

If A, B are Hermitian matrices, we feel that, if they have independent entries, there shouldexist a relationship between the eigenvalue distribution momentsMk (A + B) = Eω[Mk (A(ω) + B(ω))]

D. V. Voiculescu, K. J. Dykema, A. Nica, “Free random variables,” American Mathematical Society,1992.

Free probabilityR. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 16 / 46

Page 27: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Eigenvalue distribution and moments

Moments of eigenvalue distributions,The e.s.d. of an N × N Hermitian matrix XN (ω) has successive empirical moments Mk , k = 1, 2, . . .,

Mk =1

N

N∑i=1

λki

if FN denotes the e.s.d. of XN (ω), Mk is

Mk =

∫λ

k dF (λ)

In classical probability theory, if A and B are independent, the moments of A + B arefunctions of the moments of A and those of B. In particular, for A, B independent,

ck (A + B) = ck (A) + ck (B)

with ck (X) the cumulants of X (polynomial functions of the moments mk of X ).The cumulants cn are connected to the moments mn through formulas invoking partitions,

mn =∑

π∈P(n)

V∈π

c|V |

If A, B are Hermitian matrices, we feel that, if they have independent entries, there shouldexist a relationship between the eigenvalue distribution momentsMk (A + B) = Eω[Mk (A(ω) + B(ω))]

D. V. Voiculescu, K. J. Dykema, A. Nica, “Free random variables,” American Mathematical Society,1992.

Free probabilityR. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 16 / 46

Page 28: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Eigenvalue distribution and moments

Moments of eigenvalue distributions,The e.s.d. of an N × N Hermitian matrix XN (ω) has successive empirical moments Mk , k = 1, 2, . . .,

Mk =1

N

N∑i=1

λki

if FN denotes the e.s.d. of XN (ω), Mk is

Mk =

∫λ

k dF (λ)

In classical probability theory, if A and B are independent, the moments of A + B arefunctions of the moments of A and those of B. In particular, for A, B independent,

ck (A + B) = ck (A) + ck (B)

with ck (X) the cumulants of X (polynomial functions of the moments mk of X ).The cumulants cn are connected to the moments mn through formulas invoking partitions,

mn =∑

π∈P(n)

V∈π

c|V |

If A, B are Hermitian matrices, we feel that, if they have independent entries, there shouldexist a relationship between the eigenvalue distribution momentsMk (A + B) = Eω[Mk (A(ω) + B(ω))]

D. V. Voiculescu, K. J. Dykema, A. Nica, “Free random variables,” American Mathematical Society,1992.

Free probabilityR. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 16 / 46

Page 29: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Eigenvalue distribution and moments

Moments of eigenvalue distributions,The e.s.d. of an N × N Hermitian matrix XN (ω) has successive empirical moments Mk , k = 1, 2, . . .,

Mk =1

N

N∑i=1

λki

if FN denotes the e.s.d. of XN (ω), Mk is

Mk =

∫λ

k dF (λ)

In classical probability theory, if A and B are independent, the moments of A + B arefunctions of the moments of A and those of B. In particular, for A, B independent,

ck (A + B) = ck (A) + ck (B)

with ck (X) the cumulants of X (polynomial functions of the moments mk of X ).The cumulants cn are connected to the moments mn through formulas invoking partitions,

mn =∑

π∈P(n)

V∈π

c|V |

If A, B are Hermitian matrices, we feel that, if they have independent entries, there shouldexist a relationship between the eigenvalue distribution momentsMk (A + B) = Eω[Mk (A(ω) + B(ω))]

D. V. Voiculescu, K. J. Dykema, A. Nica, “Free random variables,” American Mathematical Society,1992.

Free probabilityR. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 16 / 46

Page 30: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Free probability

Free probability applies to asymptotically large random matrices. We assume here all matriceshave infinite size

To connect the moments of A + B to those of A and B, independence is not enough. Oneneeds for A = A(ω) and B(ω) to be realizations of free sub-algebras of random matrices.Roughly speaking, A and B need to be independent and to have “disconnectedeigen-directions”.

two Gaussian matrices are freea Gaussian matrix and any deterministic matrix are freeunitary (Haar distributed) matrices are freea Haar matrix and a Gaussian matrix are free etc.

Similarly as in classical probability, we define free cumulants Ck ,

C1 = M1

C2 = M2 − M21

C3 = M3 − 3M1M2 + 2M21

R. Speicher, “Combinatorial theory of the free product with amalgamation and operator-valuedfree probability theory,” Mem. A.M.S., vol. 627, 1998.

A combinatorial description of the relation moments-cumulants invokes non-crossingpartitions,

Mn =∑

π∈NC(n)

V∈π

C|V |

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 17 / 46

Page 31: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Free probability

Free probability applies to asymptotically large random matrices. We assume here all matriceshave infinite size

To connect the moments of A + B to those of A and B, independence is not enough. Oneneeds for A = A(ω) and B(ω) to be realizations of free sub-algebras of random matrices.Roughly speaking, A and B need to be independent and to have “disconnectedeigen-directions”.

two Gaussian matrices are freea Gaussian matrix and any deterministic matrix are freeunitary (Haar distributed) matrices are freea Haar matrix and a Gaussian matrix are free etc.

Similarly as in classical probability, we define free cumulants Ck ,

C1 = M1

C2 = M2 − M21

C3 = M3 − 3M1M2 + 2M21

R. Speicher, “Combinatorial theory of the free product with amalgamation and operator-valuedfree probability theory,” Mem. A.M.S., vol. 627, 1998.

A combinatorial description of the relation moments-cumulants invokes non-crossingpartitions,

Mn =∑

π∈NC(n)

V∈π

C|V |

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 17 / 46

Page 32: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Non-crossing partitions

1

2

3

4

5

6

7

8

Figure: Non-crossing partition π = 1, 3, 4, 2, 5, 6, 7, 8 of NC(8).

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 18 / 46

Page 33: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Moments of sums and products of random matrices

Combinatorial calculus of all moments

Theorem

For free random matrices A and B, we have the relationship,

Ck (A + B) = Ck (A) + Ck (B)

Mn(AB) =∑

(π1,π2)∈NC(n)

V1∈π1V2∈π2

C|V1|(A)C|V2|

(B)

in conjunction with free moment-cumulant formula, gives all moments of sum and product.Denote mF (z) the moment-generating function of the l.s.d. F of a random Hermitian matrix X,also called Stieltjes transform,

mF (z) = −∞∑

k=0

Mk z−k−1

Theorem

If F is a compactly supported distribution function, then mF above exists for all z ∈ C∗ and givesaccess to F through an inverse Stieltjes-transform formula (see Section 23).

In the absence of support compactness, it is impossible to retrieve the distribution functionfrom moments. This is in particular the case of Vandermonde matrices.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 19 / 46

Page 34: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Moments of sums and products of random matrices

Combinatorial calculus of all moments

Theorem

For free random matrices A and B, we have the relationship,

Ck (A + B) = Ck (A) + Ck (B)

Mn(AB) =∑

(π1,π2)∈NC(n)

V1∈π1V2∈π2

C|V1|(A)C|V2|

(B)

in conjunction with free moment-cumulant formula, gives all moments of sum and product.Denote mF (z) the moment-generating function of the l.s.d. F of a random Hermitian matrix X,also called Stieltjes transform,

mF (z) = −∞∑

k=0

Mk z−k−1

Theorem

If F is a compactly supported distribution function, then mF above exists for all z ∈ C∗ and givesaccess to F through an inverse Stieltjes-transform formula (see Section 23).

In the absence of support compactness, it is impossible to retrieve the distribution functionfrom moments. This is in particular the case of Vandermonde matrices.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 19 / 46

Page 35: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Moments of sums and products of random matrices

Combinatorial calculus of all moments

Theorem

For free random matrices A and B, we have the relationship,

Ck (A + B) = Ck (A) + Ck (B)

Mn(AB) =∑

(π1,π2)∈NC(n)

V1∈π1V2∈π2

C|V1|(A)C|V2|

(B)

in conjunction with free moment-cumulant formula, gives all moments of sum and product.Denote mF (z) the moment-generating function of the l.s.d. F of a random Hermitian matrix X,also called Stieltjes transform,

mF (z) = −∞∑

k=0

Mk z−k−1

Theorem

If F is a compactly supported distribution function, then mF above exists for all z ∈ C∗ and givesaccess to F through an inverse Stieltjes-transform formula (see Section 23).

In the absence of support compactness, it is impossible to retrieve the distribution functionfrom moments. This is in particular the case of Vandermonde matrices.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 19 / 46

Page 36: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Free convolution

In classical probability theory, for independent A, B,

fA+B(x) = fA(x) ∗ fB(x)∆=

fA(t)fB(x − t)dt

In free probability, for free A, B, we use the notations

µA+B = µA ⊞ µB, µA = µA+B ⊟ µB, µAB = µA ⊠ µB, µA = µA+B µB

Ø. Ryan, M. Debbah, “Multiplicative free convolution and information-plus-noise type matrices,”Arxiv preprint math.PR/0702342, 2007.

Theorem

Convolution of the information-plus-noise model Let XN ∈ CN×n have i.i.d. Gaussian entries ofmean 0 and variance 1, RN ∈ CN×n, such that µ 1

n RN RHN⇒ µΓ, as n/N → c. Then the e.s.d. of

BN =1

n(RN + σXN) (RN + σXN)

H

converges weakly and almost surely to µB such that

µB =(

(µΓ µc) ⊞ δσ2)

⊠ µc

with µc the Marcenko-Pastur law.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 20 / 46

Page 37: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Free convolution

In classical probability theory, for independent A, B,

fA+B(x) = fA(x) ∗ fB(x)∆=

fA(t)fB(x − t)dt

In free probability, for free A, B, we use the notations

µA+B = µA ⊞ µB, µA = µA+B ⊟ µB, µAB = µA ⊠ µB, µA = µA+B µB

Ø. Ryan, M. Debbah, “Multiplicative free convolution and information-plus-noise type matrices,”Arxiv preprint math.PR/0702342, 2007.

Theorem

Convolution of the information-plus-noise model Let XN ∈ CN×n have i.i.d. Gaussian entries ofmean 0 and variance 1, RN ∈ CN×n, such that µ 1

n RN RHN⇒ µΓ, as n/N → c. Then the e.s.d. of

BN =1

n(RN + σXN) (RN + σXN)

H

converges weakly and almost surely to µB such that

µB =(

(µΓ µc) ⊞ δσ2)

⊠ µc

with µc the Marcenko-Pastur law.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 20 / 46

Page 38: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Similarities between classical and free probability

Classical Probability Free probability

Moments mk =

xk dF (x) Mk =

xk dF (x)

Cumulants mn =∑

π∈P(n)

V∈π

c|V | Mn =∑

π∈NC(n)

V∈π

C|V |

Independence classical independence freenessAdditive convolution fA+B = fA ∗ fB µA+B = µA ⊞ µB

Multiplicative convolution fAB µAB = µA ⊠ µBSum Rule ck (A + B) = ck (A) + ck (B) Ck (A + B) = Ck (A) + Ck (B)

Central Limit1√n

n∑

i=1

xi → N (0, 1)1√n

n∑

i=1

Xi ⇒ semi-circle law

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 21 / 46

Page 39: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

The moment approach and free probability

Bibliography on Free Probability related work

D. Voiculescu, “Addition of certain non-commuting random variables,” Journal of functionalanalysis, vol. 66, no. 3, pp. 323-346, 1986.

R. Speicher, “Combinatorial theory of the free product with amalgamation andoperator-valued free probability theory,” Mem. A.M.S., vol. 627, 1998.

R. Seroul, D. O’Shea, “Programming for Mathematicians,” Springer, 2000.

H. Bercovici, V. Pata, “The law of large numbers for free identically distributed randomvariables,” The Annals of Probability, pp. 453-465, 1996.

A. Nica, R. Speicher, “On the multiplication of free N-tuples of noncommutative randomvariables,” American Journal of Mathematics, pp. 799-837, 1996.

Ø. Ryan, M. Debbah, “Multiplicative free convolution and information-plus-noise typematrices,” Arxiv preprint math.PR/0702342, 2007.

N. R. Rao, A. Edelman, “The polynomial method for random matrices,” Foundations ofComputational Mathematics, vol. 8, no. 6, pp. 649-702, 2008.

Ø. Ryan, M. Debbah, “Asymptotic Behavior of Random Vandermonde Matrices With Entrieson the Unit Circle,” IEEE Trans. on Information Theory, vol. 55, no. 7, pp. 3115-3147, 2009.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 22 / 46

Page 40: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Outline

1 What is a random matrix? Generalities

2 History of mathematical advances

3 The moment approach and free probability

4 Introduction of the Stieltjes transform

5 Proof of the Marcenko-Pastur law

6 Summary of what we know, what is left to be done, which approach to consider to attack a large dimensional

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 23 / 46

Page 41: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

The Stieltjes transform

Definition

Let F be a probability distribution function. The Stieltjes transform mF of F is the function defined,for z ∈ C+, as

mF (z) =∫

1

λ− zdF (λ)

For a < b real, denoting z = x + iy , we have the inverse formula

F ([a, b]) = limy→0

1

π

∫ b

aℑ[mF (x + iy)]

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 24 / 46

Page 42: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Remark on the Stieltjes transform

If F is the e.s.d. of a Hermitian matrix XN ∈ CN×N , we might denote mX∆=mF , and

mX(z) =∫

1

λ− zdF (λ) =

1

Ntr (XN − zIN)

−1

We already saw that, for compactly supported F ,

mF (z) = −∞∑

k=0

Mk z−k−1

The Stieltjes transform is doubly more powerful than the moment approach!conveys more information than any K -finite sequence M1, . . . , MK .

is not handicapped by the support compactness constraint.

however, Stieltjes transform methods, while stronger, are more painful to work with.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 25 / 46

Page 43: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Remark on the Stieltjes transform

If F is the e.s.d. of a Hermitian matrix XN ∈ CN×N , we might denote mX∆=mF , and

mX(z) =∫

1

λ− zdF (λ) =

1

Ntr (XN − zIN)

−1

We already saw that, for compactly supported F ,

mF (z) = −∞∑

k=0

Mk z−k−1

The Stieltjes transform is doubly more powerful than the moment approach!conveys more information than any K -finite sequence M1, . . . , MK .

is not handicapped by the support compactness constraint.

however, Stieltjes transform methods, while stronger, are more painful to work with.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 25 / 46

Page 44: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Properties of the Stieltjes transform

mF defined in general on C+ but exists everywhere outside the support of F .

if X ∈ CN×n, the spectral distribution of XXH and XHX only differ by a mass of |N − n| zeros.Say N ≥ n,

mXXH (z) =1

N

N∑

i=1

1

λi − z=

1

N

n∑

i=1

1

λi − z+

1

N(N − n)

−1

z

hence

mXXH (z) =n

NmXHX − N − n

N

1

z

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 26 / 46

Page 45: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Asymptotic results using the Stieltjes transform

J. W. Silverstein, Z. D. Bai, “On the empirical distribution of eigenvalues of a class of largedimensional random matrices,” Journal of Multivariate Analysis, vol. 54, no. 2, pp. 175-192, 1995.

Theorem

Let BN = XNTNXHN ∈ CN×N , where XN ∈ CN×n has i.i.d. entries of mean 0 and variance 1/N,

F TN ⇒ F T and n/N → c. Then, F BN converges weakly and almost surely to F with Stieltjestransform

mF (z) =

(

c∫

t

1 + tmF (z)dF T (t)− z

)−1

whose solution is unique in the set z ∈ C+,mF (z) ∈ C+.

The proof of a more general theorem will be given in Part 2 of this course.

in general, no explicit expression for F .

the theorem above characterizes also the Stieltjes transform of BN = T12N XH

NXNT12N with

asymptotic distribution F ,

mF = cmF + (c − 1)1

zThis gives access to the spectrum of the sample covariance matrix model, whenXN = [x1, . . . , xn], with i.i.d. columns TN = E[x1xH

1 ].

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 27 / 46

Page 46: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Getting F ′ from mF

Remember that, for a < b real,

F ′(x) = limy→0

1

πℑ[mF (x + iy)]

where mF is (up to now) only defined on C+.(we will show in Part 3 that it can be somehow extended to C∗)to plot the density F ′,

first approach: span z = x + iy on the line x ∈ R, y = ε parallel but close to the real axis, solvemF (z) for each z, and plot ℑ[mF (z)].refined approach: see Part 3.

Example (Sample covariance matrix)

For N multiple of 3, let F′TN (x) = 1

3 δ(x − 1) + 13 δ(x − 3) + 1

3 δ(x − K ) and let BN = T12N XH

NXNT12N

with F BN → F , then

mF = cmF + (c − 1)1

z

mF (z) =

(

c∫

t

1 + tmF (z)dF T (t)− z

)−1

We take c = 1/10 and alternatively K = 7 and K = 4.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 28 / 46

Page 47: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Getting F ′ from mF

Remember that, for a < b real,

F ′(x) = limy→0

1

πℑ[mF (x + iy)]

where mF is (up to now) only defined on C+.(we will show in Part 3 that it can be somehow extended to C∗)to plot the density F ′,

first approach: span z = x + iy on the line x ∈ R, y = ε parallel but close to the real axis, solvemF (z) for each z, and plot ℑ[mF (z)].refined approach: see Part 3.

Example (Sample covariance matrix)

For N multiple of 3, let F′TN (x) = 1

3 δ(x − 1) + 13 δ(x − 3) + 1

3 δ(x − K ) and let BN = T12N XH

NXNT12N

with F BN → F , then

mF = cmF + (c − 1)1

z

mF (z) =

(

c∫

t

1 + tmF (z)dF T (t)− z

)−1

We take c = 1/10 and alternatively K = 7 and K = 4.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 28 / 46

Page 48: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Getting F ′ from mF

Remember that, for a < b real,

F ′(x) = limy→0

1

πℑ[mF (x + iy)]

where mF is (up to now) only defined on C+.(we will show in Part 3 that it can be somehow extended to C∗)to plot the density F ′,

first approach: span z = x + iy on the line x ∈ R, y = ε parallel but close to the real axis, solvemF (z) for each z, and plot ℑ[mF (z)].refined approach: see Part 3.

Example (Sample covariance matrix)

For N multiple of 3, let F′TN (x) = 1

3 δ(x − 1) + 13 δ(x − 3) + 1

3 δ(x − K ) and let BN = T12N XH

NXNT12N

with F BN → F , then

mF = cmF + (c − 1)1

z

mF (z) =

(

c∫

t

1 + tmF (z)dF T (t)− z

)−1

We take c = 1/10 and alternatively K = 7 and K = 4.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 28 / 46

Page 49: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Getting F ′ from mF

Remember that, for a < b real,

F ′(x) = limy→0

1

πℑ[mF (x + iy)]

where mF is (up to now) only defined on C+.(we will show in Part 3 that it can be somehow extended to C∗)to plot the density F ′,

first approach: span z = x + iy on the line x ∈ R, y = ε parallel but close to the real axis, solvemF (z) for each z, and plot ℑ[mF (z)].refined approach: see Part 3.

Example (Sample covariance matrix)

For N multiple of 3, let F′TN (x) = 1

3 δ(x − 1) + 13 δ(x − 3) + 1

3 δ(x − K ) and let BN = T12N XH

NXNT12N

with F BN → F , then

mF = cmF + (c − 1)1

z

mF (z) =

(

c∫

t

1 + tmF (z)dF T (t)− z

)−1

We take c = 1/10 and alternatively K = 7 and K = 4.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 28 / 46

Page 50: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Spectrum of the sample covariance matrix

1 3 70

0.2

0.4

0.6

Eigenvalues

Den

sity

Empirical eigenvalue distribution

Limit law

1 3 40

0.2

0.4

0.6

EigenvaluesD

ensi

ty

Empirical eigenvalue distribution

Limit law

Figure: Histogram of the eigenvalues of BN = T12N XH

N XN T12N , N = 3000, n = 300, with TN diagonal composed of

three evenly weighted masses in (i) 1, 3 and 7 on top, (ii) 1, 3 and 4 at bottom.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 29 / 46

Page 51: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Other notorious result

V. L. Girko, “Theory of Random Determinants,” Kluwer, Dordrecht, 1990.

Theorem

Let XN ∈ CN×n with xij i.i.d. of zero mean and variance σ2ij /N where the σij ’s are uniformly

bounded. Assume the distribution of σij tends to pσ(x , y) as n,N → ∞, n/N → c. Then, almostsurely, the e.s.d. of BN = XNXH

N converges weakly to F with Stieltjes transform

mF (z) =∫ 1

0u(x , z)dx

and u(x , z) satisfies

u(x , z) =

[

−z +

∫ c

0

pσ(x , y)dy

1 +∫ 1

0 u(x ′, z)pσ(x ′, y)dx ′

]−1

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 30 / 46

Page 52: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Other transforms: the R-transform

All classically used transforms can be expressed as a function of the Stieltjes transform

Some transforms are more handy to treat specific problems.

Definition

Let F be a distribution function mF its Stieltjes transform. Then the R-transform of F is defined as

mF (RF (z) + z−1) = −z

or equivalently

mF (z) =1

RF (−mF (z))− z

The main property of the R-transform is that, for A, B free random matrices,

RA+B = RA + RB

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 31 / 46

Page 53: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Other transforms: the R-transform

All classically used transforms can be expressed as a function of the Stieltjes transform

Some transforms are more handy to treat specific problems.

Definition

Let F be a distribution function mF its Stieltjes transform. Then the R-transform of F is defined as

mF (RF (z) + z−1) = −z

or equivalently

mF (z) =1

RF (−mF (z))− z

The main property of the R-transform is that, for A, B free random matrices,

RA+B = RA + RB

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 31 / 46

Page 54: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Other transforms: the S-transform

Definition

Let F be a distribution function mF its Stieltjes transform. Then the S-transform of F is defined as

mF

(

z + 1

zSF (z)

)

= −zSF (z)

The S-transform is the product equivalent of the R-transform, i.e. for A, B free random matrices,

SAB = SA · SB

Remark: the R- and S-transforms are convenient to use when dealing with unitary matrices.Example of use is worked out in Part 2.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 32 / 46

Page 55: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Other transforms: Shannon and η-transforms

A. M. Tulino, S. Verdu, “Random matrix theory and wireless communications,” Now Publishers Inc.,2004.

Definition

Let F be a probability distribution, mF its Stieltjes transform, then the Shannon-transform VF of Fis defined as

VF (x)∆=

∫ ∞

0log(1 + xλ)dF (λ) =

∫ ∞

x

(

1

t− mF (−t)

)

dt

Note that this last relation is fundamental to wireless communication purposes!

Definition

Let F be a probability distribution, mF its Stieltjes transform, then the η-transform ηF of F isdefined as

ηF (x)∆=

∫ ∞

0

1

1 + xλdF (λ) =

1

xmF

(

− 1

x

)

The η-transform is only a convenient way to use the Stieltjes transform on the negative real-line.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 33 / 46

Page 56: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Introduction of the Stieltjes transform

Other transforms: Shannon and η-transforms

A. M. Tulino, S. Verdu, “Random matrix theory and wireless communications,” Now Publishers Inc.,2004.

Definition

Let F be a probability distribution, mF its Stieltjes transform, then the Shannon-transform VF of Fis defined as

VF (x)∆=

∫ ∞

0log(1 + xλ)dF (λ) =

∫ ∞

x

(

1

t− mF (−t)

)

dt

Note that this last relation is fundamental to wireless communication purposes!

Definition

Let F be a probability distribution, mF its Stieltjes transform, then the η-transform ηF of F isdefined as

ηF (x)∆=

∫ ∞

0

1

1 + xλdF (λ) =

1

xmF

(

− 1

x

)

The η-transform is only a convenient way to use the Stieltjes transform on the negative real-line.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 33 / 46

Page 57: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Proof of the Marcenko-Pastur law

Outline

1 What is a random matrix? Generalities

2 History of mathematical advances

3 The moment approach and free probability

4 Introduction of the Stieltjes transform

5 Proof of the Marcenko-Pastur law

6 Summary of what we know, what is left to be done, which approach to consider to attack a large dimensional

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 34 / 46

Page 58: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Proof of the Marcenko-Pastur law

The Marcenko-Pastur law

The theorem to be proven is the following

Theorem

Let XN ∈ CN×n have i.i.d. zero mean variance 1/n entries with finite eighth order moments. Asn,N → ∞ with N

n → c ∈ (0,∞), the e.s.d. of XNXHN converges almost surely to a nonrandom

distribution function Fc with density fc given by

fc(x) = (1 − c−1)+δ(x) +1

2πcx

(x − a)+(b − x)+

where a = (1 −√c)2, b = (1 +

√c)2 and δ(x) = I0(x).

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 35 / 46

Page 59: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Proof of the Marcenko-Pastur law

The Marcenko-Pastur density

0 0.5 1 1.5 2 2.5 30

0.2

0.4

0.6

0.8

1

1.2

x

Den

sity

f c(x

)

c = 0.1

c = 0.2

c = 0.5

Figure: Marcenko-Pastur law for different limit ratios c = limN→∞ N/n.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 36 / 46

Page 60: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Proof of the Marcenko-Pastur law

Diagonal entries of the resolvent

Since we want an expression of mF , we start by identifying the diagonal entries of the resolvent(XNXH

N − zIN)−1 of XNXHN . Denote

XN =

[

yH

Y

]

Now, for z ∈ C+, we have

(

XNXHN − zIN

)−1=

[

yHy − z yHYH

Yy YYH − zIN−1

]−1

Consider the first diagonal element of (RN − zIN)−1. From the matrix inversion lemma,

(

A BC D

)−1

=

(

(A − BD−1C)−1 −A−1B(D − CA−1B)−1

−(A − BD−1C)−1CA−1 (D − CA−1B)−1

)

which here gives[

(

XNXHN − zIN

)−1]

11=

1

−z − zyH(YHY − zIn)−1y

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 37 / 46

Page 61: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Proof of the Marcenko-Pastur law

Diagonal entries of the resolvent

Since we want an expression of mF , we start by identifying the diagonal entries of the resolvent(XNXH

N − zIN)−1 of XNXHN . Denote

XN =

[

yH

Y

]

Now, for z ∈ C+, we have

(

XNXHN − zIN

)−1=

[

yHy − z yHYH

Yy YYH − zIN−1

]−1

Consider the first diagonal element of (RN − zIN)−1. From the matrix inversion lemma,

(

A BC D

)−1

=

(

(A − BD−1C)−1 −A−1B(D − CA−1B)−1

−(A − BD−1C)−1CA−1 (D − CA−1B)−1

)

which here gives[

(

XNXHN − zIN

)−1]

11=

1

−z − zyH(YHY − zIn)−1y

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 37 / 46

Page 62: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Proof of the Marcenko-Pastur law

Trace Lemma

D. N. C. Tse, O. Zeitouni, “Linear multiuser receivers in random environments,” IEEE Trans. onInformation Theory, vol. 46, no. 1, pp. 171-188, 2000.

To go further, we need the following result,

Theorem

Let AN ∈ CN×N . Let xN ∈ CN , be a random vector of i.i.d. entries with zero mean, variance1/N and finite 8th order moment, independent of AN . Then

√N[

xHNANxN − 1

Ntr AN

]

→ CN (0, 1)

As a corollary, we have

xHNANxN − 1

Ntr AN → 0

almost surely.

For large N, we therefore have approximately[

(

XNXHN − zIN

)−1]

11≃ 1

−z − z 1N tr(YHY − zIn)−1

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 38 / 46

Page 63: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Proof of the Marcenko-Pastur law

Rank-1 perturbation lemma

J. W. Silverstein, Z. D. Bai, “On the empirical distribution of eigenvalues of a class of largedimensional random matrices,” Journal of Multivariate Analysis, vol. 54, no. 2, pp. 175-192, 1995.

It is somewhat intuitive that adding a single column to Y won’t affect the trace in the limit.

Theorem

Let z ∈ C+, A and B N × N with B Hermitian, and v ∈ CN . Then∣

1

Ntr(

(B − zIN)−1 − (B + vvH − zIN)

−1)

A∣

≤ 1

N

‖A‖ℑ[z]

with ‖A‖ the spectral norm of A.

Therefore, for large N, we have approximately,[

(

XNXHN − zIN

)−1]

11≃ 1

−z − z 1N tr(YHY − zIn)−1

≃ 1

−z − z 1N tr(XH

NXN − zIn)−1

=1

−z − z nN mF (z)

in which we recognize the Stieltjes transform mF of the l.s.d. of XHNXN .

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 39 / 46

Page 64: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Proof of the Marcenko-Pastur law

End of the proof

We have again the relationn

NmF (z) = mF (z) +

N − n

N

1

z

hence[

(

XNXHN − zIN

)−1]

11≃ 1

nN − 1 − z − zmF (z)

Note that the choice (1, 1) is irrelevant here, so the expression is valid for all pair (i, i). Summingover the N terms and averaging, we finally have

mF (z) =1

Ntr(

XNXHN − zIN

)−1≃ 1

c − 1 − z − zmF (z)

which solve a polynomial of second order. Finally

mF (z) =c − 1

2z− 1

2+

(c − 1 − z)2 − 4z

2z

from the inverse Stieltjes transform formula, we then verify that mF is the Stieltjes transform of theMarcenko-Pastur law.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 40 / 46

Page 65: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Summary of what we know, what is left to be done, which approach to consider toattack a large dimensional wireless communication problem?

Outline

1 What is a random matrix? Generalities

2 History of mathematical advances

3 The moment approach and free probability

4 Introduction of the Stieltjes transform

5 Proof of the Marcenko-Pastur law

6 Summary of what we know, what is left to be done, which approach to consider to attack a large dimensional

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 41 / 46

Page 66: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Summary of what we know, what is left to be done, which approach to consider toattack a large dimensional wireless communication problem?

Models studied with analytic tools

Stieltjes transform: models involving i.i.d. matrices

sample covariance matrix models, XTXH and T12 XHXT

12

doubly correlated models, R12 XTXHR

12 . With X Gaussian, Kronecker model.

doubly correlated models with external matrix, R12 XTXHR

12 + A.

variance profile, XXH, where X has i.i.d. entries with mean 0, variance σ2i,j .

Ricean channels, XXH + A, where X has a variance profile.

sum of doubly correlated i.i.d. matrices,∑K

k=1 R12k Xk Tk XH

k R12k .

information-plus-noise models (X + A)(X + A)H

frequency-selective doubly-correlated channels (∑K

k=1 R12k Xk Tk Xk R

12k )(

∑Kk=1 R

12k Xk Tk Xk R

12k )

sum of frequency-selective doubly-correlated channels∑K

k=1 R12k Hk Tk HH

k R12k , where

Hk =∑L

l=1 R′

kl

12 Xkl T

kl XHkl R

kl

12 .

R- and S-transforms: models involving a column subset W of unitary matrices

doubly correlated Haar matrix R12 WTWHR

12

sum of simply correlated Haar matrices∑K

k=1 Wk Tk WHk

In most cases, T and R can be taken random, but independent of X. More involved randommatrices, such as Vandermonde matrices, were not yet studied.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 42 / 46

Page 67: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Summary of what we know, what is left to be done, which approach to consider toattack a large dimensional wireless communication problem?

Models studied with analytic tools

Stieltjes transform: models involving i.i.d. matrices

sample covariance matrix models, XTXH and T12 XHXT

12

doubly correlated models, R12 XTXHR

12 . With X Gaussian, Kronecker model.

doubly correlated models with external matrix, R12 XTXHR

12 + A.

variance profile, XXH, where X has i.i.d. entries with mean 0, variance σ2i,j .

Ricean channels, XXH + A, where X has a variance profile.

sum of doubly correlated i.i.d. matrices,∑K

k=1 R12k Xk Tk XH

k R12k .

information-plus-noise models (X + A)(X + A)H

frequency-selective doubly-correlated channels (∑K

k=1 R12k Xk Tk Xk R

12k )(

∑Kk=1 R

12k Xk Tk Xk R

12k )

sum of frequency-selective doubly-correlated channels∑K

k=1 R12k Hk Tk HH

k R12k , where

Hk =∑L

l=1 R′

kl

12 Xkl T

kl XHkl R

kl

12 .

R- and S-transforms: models involving a column subset W of unitary matrices

doubly correlated Haar matrix R12 WTWHR

12

sum of simply correlated Haar matrices∑K

k=1 Wk Tk WHk

In most cases, T and R can be taken random, but independent of X. More involved randommatrices, such as Vandermonde matrices, were not yet studied.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 42 / 46

Page 68: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Summary of what we know, what is left to be done, which approach to consider toattack a large dimensional wireless communication problem?

Models studied with analytic tools

Stieltjes transform: models involving i.i.d. matrices

sample covariance matrix models, XTXH and T12 XHXT

12

doubly correlated models, R12 XTXHR

12 . With X Gaussian, Kronecker model.

doubly correlated models with external matrix, R12 XTXHR

12 + A.

variance profile, XXH, where X has i.i.d. entries with mean 0, variance σ2i,j .

Ricean channels, XXH + A, where X has a variance profile.

sum of doubly correlated i.i.d. matrices,∑K

k=1 R12k Xk Tk XH

k R12k .

information-plus-noise models (X + A)(X + A)H

frequency-selective doubly-correlated channels (∑K

k=1 R12k Xk Tk Xk R

12k )(

∑Kk=1 R

12k Xk Tk Xk R

12k )

sum of frequency-selective doubly-correlated channels∑K

k=1 R12k Hk Tk HH

k R12k , where

Hk =∑L

l=1 R′

kl

12 Xkl T

kl XHkl R

kl

12 .

R- and S-transforms: models involving a column subset W of unitary matrices

doubly correlated Haar matrix R12 WTWHR

12

sum of simply correlated Haar matrices∑K

k=1 Wk Tk WHk

In most cases, T and R can be taken random, but independent of X. More involved randommatrices, such as Vandermonde matrices, were not yet studied.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 42 / 46

Page 69: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Summary of what we know, what is left to be done, which approach to consider toattack a large dimensional wireless communication problem?

Models studied with moments/free probability

asymptotic resultsmost of the above models with Gaussian X.products V1VH

1 T1V2VH2 T2... of Vandermonde and deterministic matrices

conjecture: any probability space of matrices invariant to row or column permutations.

marginal studies, not yet fully exploredrectangular free convolution: singular values of rectangular matricesfinite size models. Instead of almost sure convergence of mXN

as N → ∞, we can study finite sizebehaviour of E[mXN

].

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 43 / 46

Page 70: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Summary of what we know, what is left to be done, which approach to consider toattack a large dimensional wireless communication problem?

Models studied with moments/free probability

asymptotic resultsmost of the above models with Gaussian X.products V1VH

1 T1V2VH2 T2... of Vandermonde and deterministic matrices

conjecture: any probability space of matrices invariant to row or column permutations.

marginal studies, not yet fully exploredrectangular free convolution: singular values of rectangular matricesfinite size models. Instead of almost sure convergence of mXN

as N → ∞, we can study finite sizebehaviour of E[mXN

].

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 43 / 46

Page 71: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Summary of what we know, what is left to be done, which approach to consider toattack a large dimensional wireless communication problem?

Open problems, to be explored

Stieltjes transform methods for more structured matrices: e.g. Vandermonde matrices

clean framework for band matrix models

finite dimensional methods for Ricean matrices

other ?

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 44 / 46

Page 72: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Summary of what we know, what is left to be done, which approach to consider toattack a large dimensional wireless communication problem?

Related bibliography

R. B. Dozier, J. W. Silverstein, “On the empirical distribution of eigenvalues of large dimensionalinformation-plus-noise-type matrices,” Journal of Multivariate Analysis, vol. 98, no. 4, pp. 678-694, 2007.

J. W. Silverstein, Z. D. Bai, “On the empirical distribution of eigenvalues of a class of large dimensionalrandom matrices,” Journal of Multivariate Analysis, vol. 54, no. 2, pp. 175-192, 1995.

J. W. Silverstein, S. Choi “Analysis of the limiting spectral distribution of large dimensional randommatrices” Journal of Multivariate Analysis, vol. 54, no. 2, pp. 295-309, 1995.

F. Benaych-Georges, “Rectangular random matrices, related free entropy and free Fisher’s information,”Arxiv preprint math/0512081, 2005.

Ø. Ryan, M. Debbah, “Multiplicative free convolution and information-plus-noise type matrices,” Arxivpreprint math.PR/0702342, 2007.

V. L. Girko, “Theory of Random Determinants,” Kluwer, Dordrecht, 1990.

R. Couillet, M. Debbah, J. W. Silverstein, “A deterministic equivalent for the capacity analysis of correlatedmulti-user MIMO channels,” submitted to IEEE Trans. on Information Theory.

V. L. Girko, “Theory of Random Determinants,” Kluwer, Dordrecht, 1990.

W. Hachem, Ph. Loubaton, J. Najim, “Deterministic Equivalents for Certain Functionals of Large RandomMatrices”, Annals of Applied Probability, vol. 17, no. 3, 2007.

M. J. M. Peacock, I. B. Collings, M. L. Honig, “Eigenvalue distributions of sums and products of largerandom matrices via incremental matrix expansions,” IEEE Trans. on Information Theory, vol. 54, no. 5, pp.2123, 2008.

D. Petz, J. Reffy, “On Asymptotics of large Haar distributed unitary matrices,” Periodica Math. Hungar., vol.49, pp. 103-117, 2004.

Ø. Ryan, A. Masucci, S. Yang, M. Debbah, “Finite dimensional statistical inference,” submitted to IEEETrans. on Information Theory, Dec. 2009.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 45 / 46

Page 73: Random Matrices in Wireless Communications …...Random Matrices in Wireless Communications Course 1: Introduction to random matrix theory and the Stieltjes transform Romain Couillet

Summary of what we know, what is left to be done, which approach to consider toattack a large dimensional wireless communication problem?

Technical Bibliography

W. Rudin, “Real and complex analysis,” New York, 1966.

P. Billingsley, “Probability and measure,” Wiley New York, 2008.

P. Billingsley, “Convergence of probability measures,” Wiley New York, 1968.

R. Couillet (Supelec) Random Matrix Theory Course 29/10/2009 46 / 46