
Dirac Delta Function of Matrix Argument

Lin Zhang∗
Institute of Mathematics, Hangzhou Dianzi University, Hangzhou 310018, PR China
arXiv:1607.02871v2 [quant-ph] 31 Oct 2018

Abstract
The Dirac delta function of matrix argument is employed frequently in the development of diverse
fields such as Random Matrix Theory and Quantum Information Theory. The purpose of this
article is pedagogical: it begins by recalling detailed knowledge about the Heaviside unit
step function and the Dirac delta function. Then the extensions of the Dirac delta function to vector
spaces and matrix spaces are discussed systematically. Detailed and elementary
proofs of these results are provided. Though we have not seen these results formulated
in the literature, there certainly are predecessors. Applications are also mentioned.

1 Heaviside unit step function H and Dirac delta function δ

The material in this section is essentially from Hoskins' book [4]; it contains no new results
and is reproduced here to keep the presentation systematic.
The Heaviside unit step function H is defined as

    H(x) := 1, x > 0;  0, x < 0.    (1.1)

That is, this function is equal to 1 over (0, +∞) and equal to 0 over (−∞, 0). This function can
equally well have been defined in terms of a specific expression, for instance
 
    H(x) = (1/2) (1 + x/|x|).    (1.2)

The value H(0) is left undefined here. For all x ≠ 0,

    H′(x) = dH(x)/dx = 0,
∗ E-mail: godyalin@163.com; linyz@zju.edu.cn

corresponding to the obvious fact that the graph of the function y = H(x) has zero slope for
all x ≠ 0. Naturally we describe the slope as "infinite" at the origin. We shall denote by δ(x) the
derivative of H(x):

    δ(x) = H′(x) = 0,  ∀ x ≠ 0,

and
δ(0) = +∞.

We recall the definition of the Dirac delta function:

Definition 1.1 (Dirac delta function). The Dirac delta function δ(x) is defined by

    δ(x) = +∞, if x = 0;  0, if x ≠ 0.    (1.3)

Proposition 1.2 (Sampling property of the Dirac delta function). If f is any function which is
continuous on a neighborhood of 0, then
    ∫_{−∞}^{+∞} f(x) δ(x) dx = f(0).    (1.4)

In fact, we have, for a ≠ 0,

    ∫_{−a}^{+a} f(x) δ(x) dx = f(0)

and

    ∫_{−∞}^{a} δ(x) dx = ∫_{−∞}^{+∞} H(a − x) δ(x) dx = H(a).
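The sampling property can be checked numerically by replacing δ with a narrow Gaussian (a
"nascent" delta). The following Python sketch is illustrative only and assumes NumPy; the
helper name delta_eps and the test function are hypothetical choices, not part of the original text:

    import numpy as np

    # Nascent delta: a Gaussian of width eps, which tends to delta as eps -> 0.
    def delta_eps(x, eps=1e-3):
        return np.exp(-x**2 / (2 * eps**2)) / (eps * np.sqrt(2 * np.pi))

    x = np.linspace(-1.0, 1.0, 2_000_001)
    dx = x[1] - x[0]
    f = np.cos(3 * x) + x**2            # any function continuous near 0; f(0) = 1

    approx = np.sum(f * delta_eps(x)) * dx
    print(approx)                       # close to f(0) = 1

As eps decreases (with the grid refined accordingly), the Riemann sum converges to f(0).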

Definition 1.3. Assume that f is a continuous function which vanishes outside some finite interval.
There corresponds a certain number which we write as ⟨H, f⟩, given by

    ⟨H, f⟩ := ∫_{−∞}^{+∞} f(x) H(x) dx = ∫_0^{+∞} f(x) dx.    (1.5)

Similarly, ⟨δ, f⟩ is given by

    ⟨δ, f⟩ := ∫_{−∞}^{+∞} f(x) δ(x) dx = f(0).    (1.6)

For an ordinary function f and a fixed a ∈ R, the symbol f_a denotes the translation of f with
respect to a:

    f_a(x) := f(x − a).    (1.7)

Thus

    ⟨δ_a, f⟩ = ∫_{−∞}^{+∞} f(x) δ_a(x) dx = ∫_{−∞}^{+∞} f(x) δ(x − a) dx = f(a).    (1.8)

From the above discussion, it is easy to see that

    f(x) δ(x − a) = f(x) δ_a(x) = f(a) δ_a(x).    (1.9)

This fact will be used later.

Proposition 1.4. If f is any function which has a continuous derivative f′, at least in some neighborhood
of 0, then

    ⟨δ′, f⟩ := ∫_{−∞}^{+∞} f(x) δ′(x) dx = −⟨δ, f′⟩ = −f′(0).    (1.10)

Proof. Since

    ∫_{−∞}^{+∞} f(x) (δ(x) − δ(x − ε))/ε dx = (1/ε) (∫_{−∞}^{+∞} f(x) δ(x) dx − ∫_{−∞}^{+∞} f(x) δ(x − ε) dx)
        = (f(0) − f(ε))/ε = −(f(ε) − f(0))/ε,

it follows that

    ⟨δ′, f⟩ = ∫_{−∞}^{+∞} f(x) δ′(x) dx = ∫_{−∞}^{+∞} f(x) lim_{ε→0} (δ(x) − δ(x − ε))/ε dx    (1.11)
            = lim_{ε→0} ∫_{−∞}^{+∞} f(x) (δ(x) − δ(x − ε))/ε dx    (1.12)
            = lim_{ε→0} (−(f(ε) − f(0))/ε) = −f′(0) = −⟨δ, f′⟩.    (1.13)

This completes the proof.
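Proposition 1.4 can be illustrated in the same way, using the derivative of the Gaussian nascent
delta. The sketch below assumes NumPy; the test function and the width eps are arbitrary
illustrative choices:

    import numpy as np

    eps = 1e-2
    x = np.linspace(-1.0, 1.0, 2_000_001)
    dx = x[1] - x[0]
    delta_eps = np.exp(-x**2 / (2 * eps**2)) / (eps * np.sqrt(2 * np.pi))
    ddelta_eps = -(x / eps**2) * delta_eps   # derivative of the nascent delta

    f = np.sin(2 * x) + 1.0                  # f'(0) = 2
    print(np.sum(f * ddelta_eps) * dx)       # approximately -f'(0) = -2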

Proposition 1.5. The n-th derivative of the Dirac delta function, denoted by δ^(n), is defined by the following:

    ⟨δ^(n), f⟩ = (−1)^n f^(n)(0),    (1.14)

where n ∈ N_+ and f is any function with continuous derivatives at least up to the n-th order in some
neighborhood of 0.

From these, we see that

    ⟨δ′_a, f⟩ = −⟨δ_a, f′⟩ = −f′(a),    (1.15)
    ⟨δ^(n)_a, f⟩ = (−1)^n ⟨δ_a, f^(n)⟩ = (−1)^n f^(n)(a).    (1.16)

Suppose that g(t) increases monotonically over the closed interval [a, b], and suppose there is t_0 ∈ (a, b)
such that g(t_0) = 0. Recall that

    dg^{−1}(x)/dx = 1/(dg(t)/dt).

From this, we see that, via x = g(t), t ∈ [a, b],

    ∫_a^b f(t) δ(g(t)) dt = ∫_{g(a)}^{g(b)} f(g^{−1}(x)) δ(x) d(g^{−1}(x))    (1.17)
                          = ∫_{g(a)}^{g(b)} f(g^{−1}(x)) δ(x) (dg^{−1}(x)/dx) dx    (1.18)
                          = f(g^{−1}(0)) (dg^{−1}(x)/dx)|_{x=g(t_0)} = f(t_0)/g′(t_0).    (1.19)

Proposition 1.6. If g(t) is monotone, with g(a) = 0 and g′(a) ≠ 0, then

    δ(g(t)) = δ_a(t)/|g′(a)|.    (1.20)

From this, we see that

    δ(kx + b) = (1/|k|) δ(x + b/k),  k ≠ 0.
More generally, the delta distribution may be composed with a smooth function g(x) in such a
way that the familiar change-of-variables formula holds:

    ∫_R f(g(x)) δ(g(x)) |g′(x)| dx = ∫_{g(R)} f(t) δ(t) dt,    (1.21)

provided that g is a continuously differentiable function with g′ nowhere zero. That is, there is a
unique way to assign meaning to the distribution δ ∘ g so that this identity holds for all compactly
supported test functions f. Therefore, the domain must be broken up to exclude the points where
g′(x) = 0. This distribution satisfies δ(g(x)) = 0 if g is nowhere zero, and otherwise, if g has a real
root at x_0, then

    δ(g(x)) = δ(x − x_0)/|g′(x_0)|.    (1.22)

It is natural to define the composition δ(g(x)) for continuously differentiable functions g by

    δ(g(x)) = Σ_j δ(x − x_j)/|g′(x_j)|,    (1.23)

where the sum extends over all roots of g(x), which are assumed to be simple. Thus, for example,

    δ(x² − a²) = (1/(2|a|)) (δ(x + a) + δ(x − a)).    (1.24)

In the integral form the generalized scaling property may be written as

    ∫_{−∞}^{+∞} f(x) δ(g(x)) dx = Σ_j f(x_j)/|g′(x_j)|.    (1.25)
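A quick numerical check of (1.25) for g(x) = x² − a², whose simple roots x = ±a have
|g′(±a)| = 2|a|, can be carried out with the same nascent-delta device. This is an illustrative
sketch assuming NumPy, not part of the proof:

    import numpy as np

    def delta_eps(u, eps=1e-4):
        return np.exp(-u**2 / (2 * eps**2)) / (eps * np.sqrt(2 * np.pi))

    a = 1.5
    x = np.linspace(-3.0, 3.0, 6_000_001)
    dx = x[1] - x[0]
    f = np.exp(-x) + x

    lhs = np.sum(f * delta_eps(x**2 - a**2)) * dx
    rhs = (np.exp(-a) + np.exp(a)) / (2 * a)   # (f(a) + f(-a)) / (2|a|)
    print(lhs, rhs)                            # the two agree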

Example 1.7 ([1]). If T is an n × n positive definite matrix and r ∈ R_+, α > 0, then

    ∫_{B_n(T,r)} (r − ⟨u|T|u⟩)^α [du] = π^n Γ(α+1) r^{n+α} / (Γ(n+α+1) det(T)),    (1.26)

where B_n(T, r) := {u ∈ C^n : ⟨u|T|u⟩ < r} and [du] = ∏_{j=1}^n du_j with du_j = d(Re(u_j)) d(Im(u_j)).


Indeed, let

    v = r^{−1/2} T^{1/2} u.

Then [dv] = det(r^{−1}T) [du], i.e. [du] = det(rT^{−1}) [dv]. Thus

    ∫_{B_n(T,r)} (r − ⟨u|T|u⟩)^α [du] = (r^{n+α}/det(T)) ∫_{B_n(1_n,1)} (1 − ⟨v|v⟩)^α [dv],    (1.27)

where B_n(1_n, 1) = {v ∈ C^n : ⟨v|v⟩ < 1}. Now

    B_n(1_n, 1) = ∪_{γ∈[0,1)} S_n(γ),

where S_n(γ) = {v ∈ C^n : ‖v‖_2 = γ}.


    ∫_{B_n(1_n,1)} (1 − ⟨v|v⟩)^α [dv] = ∫_0^1 dγ ∫ δ(γ − ‖v‖_2) (1 − ⟨v|v⟩)^α [dv]    (1.28)
                                      = ∫_0^1 dγ (1 − γ²)^α ∫ δ(γ − ‖v‖_2) [dv]    (1.29)
                                      = ∫_0^1 dγ (1 − γ²)^α vol(S_n(γ)),    (1.30)

where

    vol(S_n(γ)) = ∫ δ(γ − ‖v‖_2) [dv] = (2π^n/Γ(n)) γ^{2n−1}.
Therefore, we obtain that

    ∫_{B_n(T,r)} (r − ⟨u|T|u⟩)^α [du] = (2π^n r^{n+α}/(Γ(n) det(T))) ∫_0^1 (1 − γ²)^α γ^{2n−1} dγ    (1.31)
                                      = (π^n r^{n+α}/(Γ(n) det(T))) ∫_0^1 (1 − γ²)^α γ^{2n−2} d(γ²),    (1.32)

where

    ∫_0^1 (1 − γ²)^α γ^{2n−2} d(γ²) = ∫_0^1 (1 − x)^α x^{n−1} dx = B(α+1, n) = Γ(α+1)Γ(n)/Γ(n+α+1).

This completes the proof.
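Equation (1.26) can be spot-checked by Monte Carlo in its reduced form (1.27) with T = 1_n and
r = 1. The following sketch is illustrative (the sample size, seed, and parameters are arbitrary)
and assumes NumPy and SciPy; it samples the box [−1, 1]^{2n} ⊃ B_n(1_n, 1) in the realification of C^n:

    import numpy as np
    from scipy.special import gamma

    rng = np.random.default_rng(0)
    n, alpha, N = 2, 1.3, 2_000_000

    # C^n is identified with R^{2n}; sample uniformly in the box [-1, 1]^{2n}.
    pts = rng.uniform(-1.0, 1.0, size=(N, 2 * n))
    r2 = np.sum(pts**2, axis=1)                  # <v|v>
    mask = r2 < 1.0
    vals = np.zeros(N)
    vals[mask] = (1.0 - r2[mask])**alpha

    estimate = vals.mean() * 2.0**(2 * n)        # box volume = 2^{2n}
    exact = np.pi**n * gamma(alpha + 1) / gamma(n + alpha + 1)
    print(estimate, exact)                       # agree to Monte Carlo accuracy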

Definition 1.8 (Convolution). An operation on functions, called convolution and denoted by the
symbol ∗, is defined by:

    (f ∗ g)(x) = ∫_R f(x − t) g(t) dt = ∫_R f(t) g(x − t) dt.    (1.33)

Some properties of convolution are easily seen:

(i) δ_a ∗ δ_b = δ_{a+b}.

(ii) The delta function as a convolution unit: δ ∗ f = f ∗ δ = f.

(iii) Convolution as translation: δ_a ∗ f = f ∗ δ_a = f_a, where f_a(x) := f(x − a); see the sketch below.

(iv) δ^(n) ∗ f = f ∗ δ^(n) = f^(n).
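Property (iii) can be visualized numerically: convolving a smooth f with a narrow Gaussian
standing in for δ_a shifts the graph of f by a. A minimal sketch assuming NumPy, with grid
and widths chosen purely for illustration:

    import numpy as np

    x = np.linspace(-10.0, 10.0, 20001)
    dx = x[1] - x[0]
    a, eps = 2.0, 0.01
    f = np.exp(-x**2)
    delta_a = np.exp(-(x - a)**2 / (2 * eps**2)) / (eps * np.sqrt(2 * np.pi))

    # Discrete convolution on a symmetric grid approximates (f * delta_a)(x).
    conv = np.convolve(f, delta_a, mode="same") * dx
    print(np.max(np.abs(conv - np.exp(-(x - a)**2))))   # small: conv = f(x - a)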

Definition 1.9 (Fourier transform). Let f be a complex-valued function of the real variable t
which is absolutely integrable over the whole real axis R. That is,

    ∫_R |f_1(x)| dx < +∞  and  ∫_R |f_2(x)| dx < +∞,

where f = f_1 + √(−1) f_2. We define the Fourier transform of f to be a new function

    f̂(ω) := F(f)(ω) = ∫_R e^{−iωt} f(t) dt.    (1.34)

Next, we consider the Fourier integral representation of the Dirac delta function, which is very
powerful in applications. We use the following standard result:

    PV ∫_{−∞}^{+∞} (e^{iωx}/x) dx = iπ,    (1.35)

where the symbol PV denotes the Cauchy principal value of the integral and ω > 0 is a constant.
That is,

    ∫_{−∞}^{+∞} (sin(ωx)/x) dx = π  and  PV ∫_{−∞}^{+∞} (cos(ωx)/x) dx = 0.    (1.36)
Replacing ω by −ω simply changes the sign of the first of these two real integrals and leaves the
other unaltered. That is, if ω > 0,

    PV ∫_{−∞}^{+∞} (e^{−iωx}/x) dx = −iπ.    (1.37)

Hence, if we replace ω by the usual symbol t for the independent real variable, we can write

    PV ∫_{−∞}^{+∞} (e^{itx}/x) dx = iπ sign(t),    (1.38)

i.e.,

    (1/2π) ∫_{−∞}^{+∞} (e^{itx}/(ix)) dx = (1/2) sign(t) = 1/2, t > 0;  −1/2, t < 0.    (1.39)

A formal differentiation of this with respect to t then yields the following result:

Proposition 1.10. It holds that

    δ(t) = (1/2π) ∫_{−∞}^{+∞} e^{itx} dx.    (1.40)

This amounts to saying F^{−1}(1)(t) = δ(t). Replacing t by t − a, we have

    (1/2π) ∫_{−∞}^{+∞} e^{i(t−a)x} dx = δ(t − a) = δ_a(t).    (1.41)

This amounts to saying F^{−1}(e^{−iax})(t) = δ_a(t), or F(δ_a(t))(x) = e^{−iax}. The integral on the left-
hand side of (1.40) is, of course, divergent, and it is clear that this equation must be understood
symbolically. That is to say, for all sufficiently well-behaved functions f, we should interpret
(1.40) to mean that

    ∫_{−∞}^{+∞} f(t) ((1/2π) ∫_{−∞}^{+∞} e^{itω} dω) dt = ∫_{−∞}^{+∞} f(t) δ(t) dt = f(0)    (1.42)

or, more generally, that

    ∫_{−∞}^{+∞} f(x) ((1/2π) ∫_{−∞}^{+∞} e^{i(t−x)ω} dω) dx = ∫_{−∞}^{+∞} f(x) δ(t − x) dx = f(t).    (1.43)

We can rewrite this result in the form

    f(t) = ∫_{−∞}^{+∞} f(x) ((1/2π) ∫_{−∞}^{+∞} e^{i(t−x)ω} dω) dx    (1.44)
         = (1/2π) ∫_{−∞}^{+∞} e^{itω} (∫_{−∞}^{+∞} f(x) e^{−ixω} dx) dω = (1/2π) ∫_{−∞}^{+∞} e^{itω} f̂(ω) dω.    (1.45)

Proposition 1.11 (Fourier Inversion). Let f be a (real or complex valued) function of a single real
variable which is absolutely integrable over the interval (−∞, +∞) and which also satisfies the Dirichlet
conditions over every finite interval. If f̂(ω) denotes the Fourier transform of f, then at each point t we
have

    (1/2π) ∫_{−∞}^{+∞} e^{itω} f̂(ω) dω = (1/2)[f(t+) + f(t−)],    (1.46)

where f(t±) := lim_{s→t±} f(s).

There are several important properties of the Fourier transform which merit explicit mention.

(i) The Fourier transform of the convolution of two functions is equal to the product of their
individual transforms: F ( f ∗ g) = F ( f )F ( g).

(ii) F(fg) = (1/2π) F(f) ∗ F(g).

(iii) The Fourier transformation is linear: F(λ_1 f_1 + λ_2 f_2) = λ_1 F(f_1) + λ_2 F(f_2).

(iv) F(f(x − a))(ω) = e^{−iωa} F(f(x))(ω).

(v) F(f(x)e^{−ax})(ω) = F(f(x))(a + iω).

(vi) F(f′)(ω) = iω F(f)(ω).

(vii) F(f(ax))(ω) = (1/|a|) F(f(x))(ω/a).

For example, the proofs of (iv) and (vi) are given here. Indeed, f(x − a) = f_a(x) = (f ∗ δ_a)(x), thus
F(f(x − a))(ω) = F(f ∗ δ_a)(ω) = F(f)(ω)F(δ_a)(ω), that is, F(f(x − a))(ω) = e^{−iωa} F(f)(ω),
hence (iv). Since f′ = f ∗ δ′, it follows that F(f′) = F(f ∗ δ′) = F(f)F(δ′). In what follows, we
calculate F(δ′). By definition of the Fourier transform,

    F(δ′)(ω) = ∫_R e^{−iωt} δ′(t) dt = −(d e^{−iωt}/dt)|_{t=0} = iω.    (1.47)

Thus F(f′)(ω) = iω F(f)(ω), hence (vi). This property can be generalized: F(f^(n))(ω) = (iω)^n F(f)(ω). Indeed,

    F(f^(n))(ω) = F(f ∗ δ^(n))(ω) = F(f)(ω) F(δ^(n))(ω) = (iω)^n F(f)(ω).    (1.48)
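Property (vi) is also easy to test by direct quadrature of the transform integral (1.34) on a
truncated grid, e.g. for the Gaussian f(t) = e^{−t²/2}. An illustrative sketch assuming NumPy
(grid and frequencies are arbitrary choices):

    import numpy as np

    t = np.linspace(-20.0, 20.0, 400001)
    dt = t[1] - t[0]
    f = np.exp(-t**2 / 2)
    fp = -t * f                                      # f'(t)

    for w in (0.5, 1.0, 2.0):
        Ff = np.sum(np.exp(-1j * w * t) * f) * dt    # F(f)(w)
        Ffp = np.sum(np.exp(-1j * w * t) * fp) * dt  # F(f')(w)
        print(w, Ffp, 1j * w * Ff)                   # the two columns agree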

We can apply the sampling property of the delta function to the Fourier inversion integral:

    (1/2π) ∫_{−∞}^{+∞} e^{ixω} δ(ω − α) dω = (1/2π) e^{ixα}    (1.49)

and similarly

    (1/2π) ∫_{−∞}^{+∞} e^{ixω} δ(ω + α) dω = (1/2π) e^{−ixα}.    (1.50)

Thus, recalling that the Fourier transform is defined in general for complex-valued functions,
these results suggest that we can give the following definitions for the Fourier transforms of
complex exponentials:

    F(e^{iαx})(ω) = 2πδ(ω − α);  F(e^{−iαx})(ω) = 2πδ(ω + α).

Both equations immediately yield the following definitions for the Fourier transforms of the real
functions cos(αx) and sin(αx):

    F(cos(αx))(ω) = π (δ(ω − α) + δ(ω + α)),    (1.51)
    F(sin(αx))(ω) = −iπ (δ(ω − α) − δ(ω + α)).    (1.52)

In particular, taking α = 0, we find that the generalized Fourier transform of the constant function
f (t) ≡ 1 is simply 2πδ(ω ). This in turn allows us to offer a definition of the Fourier transform
of the unit step function.

Proposition 1.12. The Fourier transform of the Heaviside step function H(x) = 1/2 + (1/2) sign(x),
where sign(x) = x/|x|, is given by

    Ĥ(ω) = πδ(ω) + 1/(iω).    (1.53)

Proof. Now

    (1/2π) ∫_{−∞}^{+∞} (e^{ixω}/(iω)) dω = (1/2) sign(x) = 1/2, x > 0;  −1/2, x < 0.    (1.54)

Then we know that 2/(iω) is a suitable choice for the Fourier transform of the function sign(x), in the
sense that

    sign(x) = (1/2π) ∫_{−∞}^{+∞} e^{ixω} (2/(iω)) dω = F^{−1}(2/(iω))(x).    (1.55)

This amounts to saying that F(sign(x))(ω) = 2/(iω). Hence for the Heaviside step function

    H(x) = 1/2 + (1/2) sign(x),    (1.56)

the Fourier transform is given by

    F(H(x))(ω) = F(1/2 + (1/2) sign(x))(ω) = (1/2) F(1)(ω) + (1/2) F(sign(x))(ω),    (1.57)

i.e.,

    F(H(x))(ω) = πδ(ω) + 1/(iω)  ⟺  Ĥ(ω) = πδ(ω) + 1/(iω).    (1.58)
This completes the proof.

Some important properties of the Dirac delta function are listed below:

(i) The delta function is an even distribution: δ(x) = δ(−x).

(ii) The delta function satisfies the following scaling property for a non-zero scalar: δ(ax) = (1/|a|) δ(x) for a ∈ R∖{0}.

(iii) The distributional product of δ(x) and x is equal to zero: xδ(x) = 0.

(iv) If x f(x) = x g(x), where f and g are distributions, then f(x) = g(x) + cδ(x) for some
constant c.

The first two facts can be checked as follows. Note that, via the substitution s = −t,

    δ(−x) = (1/2π) ∫_{−∞}^{∞} e^{−itx} dt = −(1/2π) ∫_{+∞}^{−∞} e^{isx} ds = (1/2π) ∫_{−∞}^{∞} e^{isx} ds = δ(x).

This is (i). For the proof of (ii), since a ≠ 0, we observe that δ(ax) = δ(−ax) by (i), hence
δ(ax) = δ(|a|x), and it follows that

    δ(ax) = δ(|a|x) = (1/2π) ∫_{−∞}^{+∞} e^{it·|a|x} dt = (1/2π) ∫_{−∞}^{+∞} e^{i|a|t·x} dt.

Let s = |a| t. Then ds = |a| dt, thus

    δ(ax) = (1/|a|) (1/2π) ∫_{−∞}^{+∞} e^{is·x} ds = (1/|a|) δ(x).    (1.59)

That is, in the sense of distributions,

    δ(ax) = (1/|a|) δ(x),  a ∈ R∖{0}.    (1.60)

2 Dirac delta function of vector argument

Definition 2.1 (Dirac delta function of real-vector argument). The real-vector delta function can
be defined on the n-dimensional Euclidean space R^n as the measure such that

    ∫_{R^n} f(x) δ(x) [dx] = f(0)    (2.1)

for every compactly supported continuous function f. As a measure, the n-dimensional delta
function is the product measure of the 1-dimensional delta functions in each variable separately.
Thus, formally,

    δ(x) = ∏_{j=1}^n δ(x_j),    (2.2)

where x = [x_1, . . . , x_n]^T ∈ R^n.

The delta function in an n-dimensional space satisfies the following scaling property instead:

    δ(ax) = |a|^{−n} δ(x),  a ∈ R∖{0}.    (2.3)

Indeed, ax = [ax_1, . . . , ax_n]^T for x = [x_1, . . . , x_n]^T, thus

    δ(ax) = ∏_{j=1}^n δ(ax_j) = ∏_{j=1}^n |a|^{−1} δ(x_j) = |a|^{−n} ∏_{j=1}^n δ(x_j) = |a|^{−n} δ(x).

This indicates that δ is a homogeneous distribution of degree −n. As in the one-variable case, it
is possible to define the composition of δ with a bi-Lipschitz function g : R^n → R^n uniquely so
that the identity

    ∫_{R^n} f(g(x)) δ(g(x)) |det g′(x)| [dx] = ∫_{g(R^n)} f(u) δ(u) [du]    (2.4)

holds for all compactly supported functions f.


Using the coarea formula from geometric measure theory, one can also define the composition
of the delta function with a submersion from one Euclidean space to another one of different
dimension; the result is a type of current. In the special case of a continuously differentiable
function g : R^n → R such that the gradient of g is nowhere zero, the following identity holds¹:

    ∫_{R^n} f(x) δ(g(x)) [dx] = ∫_{g^{−1}(0)} (f(x)/|∇g(x)|) dσ(x),    (2.5)

where the integral on the right is over g^{−1}(0), the (n − 1)-dimensional surface defined by g(x) = 0,
with respect to the Minkowski content measure. That is known as a simple layer integral.

Proposition 2.2. It holds that

    δ(x) = (1/(2π)^n) ∫_{R^n} e^{i⟨t,x⟩} [dt]  (x ∈ R^n).    (2.6)

Proof. Let x = [x_1, . . . , x_n]^T ∈ R^n. Then by the Fourier transform of the Dirac delta function:

    δ(x) = ∏_{j=1}^n δ(x_j) = ∏_{j=1}^n (1/2π) ∫_R e^{it_j x_j} dt_j    (2.7)
         = (1/(2π)^n) ∫_{R^n} e^{i Σ_{j=1}^n t_j x_j} ∏_{j=1}^n dt_j    (2.8)
         = (1/(2π)^n) ∫_{R^n} e^{i⟨t,x⟩} [dt]  (x ∈ R^n),    (2.9)

where [dt] := ∏_{j=1}^n dt_j.

Proposition 2.3. For a full-rank real matrix A ∈ R^{n×n}, it holds that

    δ(Ax) = (1/|det(A)|) δ(x),  x ∈ R^n.    (2.10)

In particular, under any reflection or rotation R, the delta function is invariant:

    δ(Rx) = δ(x).    (2.11)


¹ See https://en.wikipedia.org/wiki/Dirac_delta_function

The first proof. By using the Fourier transform of the Dirac delta function, it follows that

    δ(Ax) = (1/(2π)^n) ∫_{R^n} e^{i⟨t,Ax⟩} [dt] = (1/(2π)^n) ∫_{R^n} e^{i⟨A^T t,x⟩} [dt]  (x ∈ R^n).    (2.12)

Let s = A^T t. Then [ds] = |det(A^T)| [dt] = |det(A)| [dt]. From this, we see that

    δ(Ax) = |det(A)|^{−1} (1/(2π)^n) ∫_{R^n} e^{i⟨s,x⟩} [ds] = |det(A)|^{−1} δ(x)  (x ∈ R^n).    (2.13)

Thus δ(Rx) = δ(x), since det(R) = ±1 for any reflection or rotation R. We are done.

The second proof. By SVD, we have two orthogonal matrices L, R and a diagonal matrix

    Λ = diag(λ_1, . . . , λ_n)

with positive diagonal entries such that A = LΛR^T. Then, via y := R^T x (hence δ(y) = δ(x)),

    δ(Ax) = δ(LΛR^T x) = δ(Λy) = ∏_{j=1}^n δ(λ_j y_j) = ∏_{j=1}^n λ_j^{−1} δ(y_j),

that is,

    δ(Ax) = (1/∏_{j=1}^n λ_j) ∏_{j=1}^n δ(y_j) = (1/∏_{j=1}^n λ_j) δ(y) = (1/∏_{j=1}^n λ_j) δ(x).

Now det(A) = det(LΛR^T) = det(L) det(Λ) det(R^T), and it follows that

    |det(A)| = |det(L)| |det(Λ)| |det(R^T)| = ∏_{j=1}^n λ_j.

Therefore we get the desired identity: δ(Ax) = |det(A)|^{−1} δ(x). This completes the proof.

Clearly, letting A = a1_n in the above gives Eq. (2.3).
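Proposition 2.3 can be checked numerically in R², again with a (two-dimensional) Gaussian
nascent delta. The sketch below is illustrative only, with an arbitrary test matrix A and test
function; it approximates ∫ f(x) δ(Ax) [dx] ≈ f(0)/|det(A)|:

    import numpy as np

    eps = 2e-2
    g = np.linspace(-1.0, 1.0, 2001)
    dx = g[1] - g[0]
    X1, X2 = np.meshgrid(g, g, indexing="ij")

    A = np.array([[2.0, 1.0], [0.5, 3.0]])     # det(A) = 5.5
    Y1 = A[0, 0] * X1 + A[0, 1] * X2           # the two components of Ax
    Y2 = A[1, 0] * X1 + A[1, 1] * X2
    delta_eps = np.exp(-(Y1**2 + Y2**2) / (2 * eps**2)) / (2 * np.pi * eps**2)

    f = np.cos(X1) * np.exp(X2)                # f(0) = 1
    print(np.sum(f * delta_eps) * dx**2)       # approx 1/|det(A)| = 0.1818...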

3 Dirac delta function of matrix argument

Definition 3.1 (Dirac delta function of real-matrix argument). (i) For an m × n real matrix X =
[x_ij] ∈ R^{m×n}, the matrix delta function δ(X) is defined as

    δ(X) := ∏_{i=1}^m ∏_{j=1}^n δ(x_ij).    (3.1)

In particular, the vector delta function is just the special case n = 1 of the matrix case.
(ii) For an m × m symmetric real matrix X = [x_ij], the matrix delta function δ(X) is defined as

    δ(X) := ∏_{i⩽j} δ(x_ij).    (3.2)

From the above definition, we see that the matrix delta function is the product of one-dimensional
delta functions over the independent entries of the matrix (for a complex matrix, over the
independent real and imaginary parts; see Definition 3.6 below). In view of this observation, we
see that δ(X) = δ(vec(X)), where vec(X) is the vectorization of the matrix X. This is easily checked
for a rectangular matrix. For the symmetric case, take for example the 2 × 2 symmetric real matrix
X = [x_11, x_12; x_21, x_22] with x_12 = x_21; then vec(X) = [x_11, x_21, x_12, x_22]^T =
[x_11, x_12, x_12, x_22]^T, thus vec(X) = x_11 [1, 0, 0, 0]^T + x_12 [0, 1, 1, 0]^T + x_22 [0, 0, 0, 1]^T,
i.e., there are three independent variables {x_11, x_12, x_22} in the vector vec(X), just as in the
matrix X, thus

    δ(X) = δ(x_11) δ(x_12) δ(x_22) = δ(vec(X)).

Proposition 3.2. For an m × m symmetric real matrix X, we have

    δ(X) = 2^{−m} π^{−m(m+1)/2} ∫ e^{i Tr(TX)} [dT],    (3.3)

where T = [t_ij] is also an m × m real symmetric matrix, and [dT] := ∏_{i⩽j} dt_ij.

Proof. Since

    Tr(TX) = Σ_{j=1}^m t_jj x_jj + Σ_{i≠j} t_ij x_ij = Σ_{j=1}^m t_jj x_jj + 2 Σ_{i<j} t_ij x_ij,

it follows that

    ∫ e^{i Tr(TX)} [dT] = ∏_{j=1}^m ∫ exp(i t_jj x_jj) dt_jj × ∏_{1⩽i<j⩽m} ∫ exp(i t_ij 2x_ij) dt_ij
                        = ∏_{j=1}^m 2πδ(x_jj) × ∏_{1⩽i<j⩽m} 2πδ(2x_ij)
                        = ∏_{j=1}^m 2πδ(x_jj) × ∏_{1⩽i<j⩽m} πδ(x_ij) = 2^m π^{m(m+1)/2} δ(X).

Therefore we get the desired identity.

Proposition 3.3. For nonsingular A ∈ R^{m×m}, B ∈ R^{n×n} and X ∈ R^{m×n}, we have

    δ(AXB) = |det(A)|^{−n} |det(B)|^{−m} δ(X).    (3.4)

Proof. We already know that δ(AXB) = δ(vec(AXB)). Since vec(AXB) = (A ⊗ B^T) vec(X)
[11], it follows that

    δ(AXB) = δ(vec(AXB)) = δ((A ⊗ B^T) vec(X))    (3.5)
           = |det(A ⊗ B^T)|^{−1} δ(vec(X))    (3.6)
           = |det(A)|^{−n} |det(B)|^{−m} δ(X).    (3.7)

This completes the proof.

Proposition 3.4. For nonsingular A ∈ R^{n×n} and X = X^T ∈ R^{n×n}, we have

    δ(AXA^T) = |det(A)|^{−(n+1)} δ(X).    (3.8)

Proof. By using Eq. (3.3), it follows that

    δ(AXA^T) = 2^{−n} π^{−n(n+1)/2} ∫ e^{i Tr(TAXA^T)} [dT]
             = 2^{−n} π^{−n(n+1)/2} ∫ e^{i Tr(A^T TAX)} [dT].

Let S = A^T TA. Then we have [dS] = |det(A)|^{n+1} [dT] (see Proposition 2.8 in [11]). From this, we
see that

    δ(AXA^T) = |det(A)|^{−(n+1)} 2^{−n} π^{−n(n+1)/2} ∫ e^{i Tr(SX)} [dS] = |det(A)|^{−(n+1)} δ(X).    (3.9)

This completes the proof.
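The Jacobian fact [dS] = |det(A)|^{n+1} [dT] used above is itself easy to verify numerically: the
congruence T ↦ A^T T A is linear on the n(n+1)/2-dimensional space of real symmetric matrices,
and the determinant of its matrix representation can be computed directly. An illustrative
sketch assuming NumPy (random A; the coordinates are the entries t_ij with i ⩽ j):

    import numpy as np

    rng = np.random.default_rng(1)
    n = 4
    A = rng.standard_normal((n, n))

    idx = [(i, j) for i in range(n) for j in range(i, n)]   # independent entries
    def to_matrix(v):
        T = np.zeros((n, n))
        for k, (i, j) in enumerate(idx):
            T[i, j] = T[j, i] = v[k]
        return T

    d = len(idx)                                  # d = n(n+1)/2
    M = np.zeros((d, d))                          # matrix of T -> A^T T A
    for k in range(d):
        e = np.zeros(d); e[k] = 1.0
        S = A.T @ to_matrix(e) @ A
        M[:, k] = [S[i, j] for (i, j) in idx]

    print(np.abs(np.linalg.det(M)), np.abs(np.linalg.det(A))**(n + 1))  # equal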

Note that the Dirac delta function for a complex number is defined by δ(z) := δ(Re(z))δ(Im(z)),
where z = Re(z) + √(−1) Im(z) for Re(z), Im(z) ∈ R. The complex number z can be realized as a
2-dimensional real vector

    z ↦ ẑ := [Re(z), Im(z)]^T.

Thus

    δ(z) = δ(ẑ) = δ([Re(z), Im(z)]^T).    (3.10)

Then for c ∈ C, cz is represented as

    cz ↦ [Re(c)Re(z) − Im(c)Im(z), Im(c)Re(z) + Re(c)Im(z)]^T = [Re(c), −Im(c); Im(c), Re(c)] [Re(z), Im(z)]^T,

and we have

    δ(cz) = δ([Re(c), −Im(c); Im(c), Re(c)] [Re(z), Im(z)]^T)
          = |det([Re(c), −Im(c); Im(c), Re(c)])|^{−1} δ([Re(z), Im(z)]^T) = |c|^{−2} δ(z).

Therefore we have

    δ(cz) = |c|^{−2} δ(z).    (3.11)

Furthermore, if z ∈ C^n, then

    δ(cz) = |c|^{−2n} δ(z).    (3.12)

Proposition 3.5. For a full-rank complex matrix A ∈ C^{n×n}, it holds that

    δ(Az) = (1/|det(AA*)|) δ(z) = |det(A)|^{−2} δ(z),  z ∈ C^n.    (3.13)

In particular, for any unitary matrix U ∈ U(n), we have

    δ(Uz) = δ(z).    (3.14)

Proof. Since Az can be represented as

    Âẑ = [Re(A), −Im(A); Im(A), Re(A)] [Re(z), Im(z)]^T,

it follows that

    δ(Az) = δ([Re(A), −Im(A); Im(A), Re(A)] [Re(z), Im(z)]^T)
          = |det([Re(A), −Im(A); Im(A), Re(A)])|^{−1} δ([Re(z), Im(z)]^T)
          = |det(AA*)|^{−1} δ(z),

where det([Re(A), −Im(A); Im(A), Re(A)]) = |det(AA*)| can be found in [8, 11]. Therefore we have

    δ(Az) = |det(A)|^{−2} δ(z),  z ∈ C^n.    (3.15)

If A = U is a unitary matrix, then |det(U)| = 1. The desired result is obtained.

The second proof. Let Â = [Re(A), −Im(A); Im(A), Re(A)] and ẑ = [Re(z), Im(z)]^T. Then by
Proposition 2.3,

    δ(Az) = δ(Âẑ) = |det(Â)|^{−1} δ(ẑ) = |det(AA*)|^{−1} δ(z).    (3.16)

This completes the proof.

Definition 3.6 (Dirac delta function of complex-matrix argument). (i) For an m × n complex
matrix Z = [z_ij] ∈ C^{m×n}, the matrix delta function δ(Z) is defined as

    δ(Z) := ∏_{i=1}^m ∏_{j=1}^n δ(Re(z_ij)) δ(Im(z_ij)).    (3.17)

In particular, the vector delta function is just the special case n = 1 of the matrix case.
(ii) For an m × m Hermitian complex matrix X = [x_ij] ∈ C^{m×m}, the matrix delta function δ(X) is
defined as

    δ(X) := ∏_j δ(x_jj) ∏_{i<j} δ(Re(x_ij)) δ(Im(x_ij)).    (3.18)

The Fourier integral representation of the Dirac delta function can be extended to the matrix case.
The following proposition is very important in this paper.

Proposition 3.7. For an m × m Hermitian complex matrix X ∈ C^{m×m}, we have

    δ(X) = (1/(2^m π^{m²})) ∫ e^{i Tr(TX)} [dT],    (3.19)

where T = [t_ij] is also an m × m Hermitian complex matrix, and [dT] := ∏_j dt_jj ∏_{i<j} dRe(t_ij) dIm(t_ij).

Proof. Indeed, we know that

    Tr(TX) = Σ_{j=1}^m t_jj x_jj + Σ_{i≠j} t̄_ij x_ij = Σ_{j=1}^m t_jj x_jj + Σ_{1⩽i<j⩽m} (t̄_ij x_ij + t_ij x̄_ij)
           = Σ_{j=1}^m t_jj x_jj + Σ_{1⩽i<j⩽m} 2 (Re(t_ij)Re(x_ij) + Im(t_ij)Im(x_ij)),

implying that

    ∫ e^{i Tr(TX)} [dT] = ∏_{j=1}^m ∫ exp(i t_jj x_jj) dt_jj
                        × ∏_{1⩽i<j⩽m} ∫ exp(i Re(t_ij) 2Re(x_ij)) dRe(t_ij)
                        × ∏_{1⩽i<j⩽m} ∫ exp(i Im(t_ij) 2Im(x_ij)) dIm(t_ij)
                        = ∏_{j=1}^m 2πδ(x_jj) × ∏_{1⩽i<j⩽m} 2πδ(2Re(x_ij)) 2πδ(2Im(x_ij))
                        = ∏_{j=1}^m 2πδ(x_jj) × ∏_{1⩽i<j⩽m} πδ(Re(x_ij)) πδ(Im(x_ij))
                        = (2π)^m π^{m(m−1)} ∏_j δ(x_jj) ∏_{i<j} δ(Re(x_ij)) δ(Im(x_ij))
                        = 2^m π^{m²} δ(X).

Therefore we get the desired identity.

Remark 3.8. Indeed, since

    Tr(T^off X^off) = Σ_{i<j} 2 (Re(t_ij)Re(x_ij) + Im(t_ij)Im(x_ij))

and

    [dT^off] = ∏_{i<j} dRe(t_ij) dIm(t_ij),

it follows that

    ∫ [dT^off] exp(i Tr(T^off X^off))
        = ∏_{i<j} ∫ dRe(t_ij) exp(i Re(t_ij)(2Re(x_ij))) ∫ dIm(t_ij) exp(i Im(t_ij)(2Im(x_ij)))
        = ∏_{i<j} 2πδ(2Re(x_ij)) · 2πδ(2Im(x_ij)) = ∏_{i<j} πδ(Re(x_ij)) πδ(Im(x_ij))
        = π^{m(m−1)} ∏_{i<j} δ(Re(x_ij)) δ(Im(x_ij)) = π^{m(m−1)} δ(X^off).

From the above discussion, we see that (3.19) can be separated into the two identities below:

    δ(X^diag) = (1/(2π)^m) ∫ [dT^diag] e^{i Tr(T^diag X^diag)},    (3.20)
    δ(X^off) = (1/π^{m(m−1)}) ∫ [dT^off] e^{i Tr(T^off X^off)}.    (3.21)

Note that the identity in Proposition 3.7 is used in deriving the joint distribution of the diagonal
part of the Wishart matrix ensemble [12], and it is also used in obtaining the derivative principle for
unitarily invariant random matrix ensembles in [9]. More generally, it can be found in [2]
that the derivative principle for invariant measures is used to investigate the joint distribution of
eigenvalues of local states obtained from the same multipartite pure state.

Proposition 3.9. For nonsingular A ∈ C^{m×m} and B ∈ C^{n×n}, and Z ∈ C^{m×n}, we have

    δ(AZB) = det^{−n}(AA*) det^{−m}(BB*) δ(Z).    (3.22)

Proof. We already know that δ(AZB) = δ(vec(AZB)), and vec(AZB) = (A ⊗ B^T) vec(Z) [11].
Applying Proposition 3.5 to the full-rank matrix A ⊗ B^T ∈ C^{mn×mn} gives

    δ(AZB) = δ(vec(AZB)) = δ((A ⊗ B^T) vec(Z))    (3.23)
           = |det(A ⊗ B^T)|^{−2} δ(vec(Z))    (3.24)
           = |det(A)|^{−2n} |det(B)|^{−2m} δ(Z) = det^{−n}(AA*) det^{−m}(BB*) δ(Z).    (3.25)

The result is proven.

Proposition 3.10. For nonsingular A ∈ C^{m×m} and an m × m Hermitian complex matrix X ∈ C^{m×m}, we have

    δ(AXA*) = |det(AA*)|^{−m} δ(X).    (3.26)

Proof. By using the Fourier transform of the matrix delta function (see Eq. (3.19)),

    δ(AXA*) = (1/(2^m π^{m²})) ∫ e^{i Tr(TAXA*)} [dT] = (1/(2^m π^{m²})) ∫ e^{i Tr(A*TAX)} [dT],    (3.27)

where T = [t_ij] is also an m × m Hermitian complex matrix and [dT] := ∏_j dt_jj ∏_{i<j} dRe(t_ij) dIm(t_ij).
Let H = A*TA. Then we have (see Proposition 3.4 in [11]):

    [dH] = |det(AA*)|^m [dT].    (3.28)

Thus

    δ(AXA*) = |det(AA*)|^{−m} (1/(2^m π^{m²})) ∫ e^{i Tr(HX)} [dH] = |det(AA*)|^{−m} δ(X).    (3.29)

We are done.

4 Applications

4.1 Joint distribution of eigenvalues of Wishart matrix ensemble

The first application is to calculate the joint distribution of the Wishart matrix ensemble [6]. Note
that the so-called Wishart matrix ensemble is the set of all complex matrices of the form W = ZZ†
with Z an m × n (m ⩽ n) complex Gaussian matrix, i.e., a matrix with all entries being standard
complex Gaussian random variables,

    ϕ(Z) = (1/π^{mn}) exp(−Tr(ZZ†)).

The distribution density of W is given by

    P(W) = ∫ δ(W − ZZ†) ϕ(Z) [dZ]    (4.1)
         = (1/π^{mn}) ∫ δ(W − ZZ†) exp(−Tr(ZZ†)) [dZ].    (4.2)

That is,

    P(W) = (1/π^{mn}) e^{−Tr(W)} ∫ δ(W − ZZ†) [dZ].    (4.3)
Let Z = √W Y. Then ZZ† = √W YY† √W and [dZ] = det(W)^n [dY]. By Proposition 3.10, we get

    δ(W − ZZ†) = δ(√W (1 − YY†) √W) = det(W)^{−m} δ(1 − YY†).

Therefore

    P(W) = (1/π^{mn}) det^{n−m}(W) e^{−Tr(W)} ∫ δ(1_m − YY†) [dY] ∝ det^{n−m}(W) e^{−Tr(W)},    (4.4)

where ∫ δ(1 − YY†) [dY] := C is a constant, independent of W.
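For m = 1 the density P(W) ∝ det^{n−m}(W) e^{−Tr(W)} reduces to the Gamma density
w^{n−1} e^{−w}/Γ(n), which is easy to confirm by sampling. An illustrative sketch assuming NumPy
and SciPy (here a standard complex Gaussian entry has density π^{−1}e^{−|z|²}, i.e. real and
imaginary parts of variance 1/2; sample size and seed are arbitrary):

    import numpy as np
    from scipy.special import gamma

    rng = np.random.default_rng(3)
    n, N = 5, 200_000
    Z = (rng.standard_normal((N, n)) + 1j * rng.standard_normal((N, n))) / np.sqrt(2)
    W = np.sum(np.abs(Z)**2, axis=1)             # W = Z Z^dagger for m = 1

    hist, edges = np.histogram(W, bins=60, range=(0, 15), density=True)
    mid = 0.5 * (edges[:-1] + edges[1:])
    exact = mid**(n - 1) * np.exp(-mid) / gamma(n)
    print(np.max(np.abs(hist - exact)))          # small sampling error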

Proposition 4.1. It holds that

    ∫ δ(1_m − YY†) [dY] = π^{m(2n−m+1)/2} / ∏_{k=1}^m (n − k)!.    (4.5)

In particular, (i) for m = 1,

    ∫ δ(1 − ⟨y, y⟩) [dy] = π^n/(n − 1)! = π^n/Γ(n);    (4.6)

(ii) for m = n, we have

    ∫ δ(1_n − YY†) [dY] = π^{n(n+1)/2} / ∏_{k=1}^n Γ(k) = 2^{−n} · vol(U(n)).    (4.7)
Proof. Since

    1 = ∫ P(W) [dW] = (C/π^{mn}) ∫_{W>0} det^{n−m}(W) e^{−Tr(W)} [dW],    (4.8)

where, via W = UwU† for w = diag(w_1, . . . , w_m) with w_1 > · · · > w_m,

    [dW] = Δ(w)² [dw][U†dU],

it follows that

    1 = (C/π^{mn}) vol(U(m)/T^m) ∫_{w_1>···>w_m} Δ(w)² det^{n−m}(w) e^{−Tr(w)} [dw].    (4.9)

By using the Selberg integral formula, we see that

    ∫ Δ(w)² det^{n−m}(w) e^{−Tr(w)} [dw] = ∏_{k=1}^m k!(n − k)!.    (4.10)

Thus

    ∫_{w_1>···>w_m} Δ(w)² det^{n−m}(w) e^{−Tr(w)} [dw] = (1/m!) ∏_{k=1}^m k!(n − k)!.    (4.11)

We conclude that

    ∫ δ(1 − YY†) [dY] = vol(T^m) m! π^{mn} / (vol(U(m)) ∏_{k=1}^m k!(n − k)!).    (4.12)

Now

    vol(U(m)) = 2^m π^{m(m+1)/2} / ∏_{k=1}^m Γ(k)  and  vol(T^m) = (2π)^m.

We obtain that

    ∫ δ(1 − YY†) [dY] = π^{m(2n−m+1)/2} / ∏_{k=1}^m (n − k)!.    (4.13)

This completes the proof.

Corollary 4.2. It holds that

    ∫ δ(W − ZZ†) [dZ] = (π^{m(2n−m+1)/2} / ∏_{k=1}^m (n − k)!) det(W)^{n−m}.    (4.14)

Proof. It is easily seen that

    ∫ δ(W − ZZ†) [dZ] = det(W)^{n−m} ∫ δ(1_m − YY†) [dY]    (4.15)
                      = (π^{m(2n−m+1)/2} / ∏_{k=1}^m (n − k)!) det(W)^{n−m}.    (4.16)

Here we used the result in Proposition 4.1.

Remark 4.3. Denote U(m, n) := {Z ∈ C^{m×n} : ZZ† = 1_m} (m ⩽ n). Note that

    vol(U(m, n)) = ∫ δ(1_m − ZZ†) [Z†dZ] = 2^m π^{m(2n−m+1)/2} / ∏_{k=1}^m (n − k)!.

This indicates that

    ∫ δ(1_m − ZZ†) [Z†dZ] = 2^m ∫ δ(1_m − ZZ†) [dZ].

In addition, the delta integral can be reformulated in another form:

    ∫ δ(W − ZZ†) [dZ] = ∫_{ZZ†=W} [dZ].

Finally, we have seen that

    P(W) = det(W)^{n−m} e^{−Tr(W)} / (π^{m(m−1)/2} ∏_{k=1}^m (n − k)!).

4.2 Joint distribution of eigenvalues of induced random quantum state ensemble

Any mixed state ρ (i.e., a nonnegative complex matrix of unit trace) acting on H_m may be purified
by finding a pure state |X⟩ in the composite Hilbert space H_m ⊗ H_m such that ρ is given by
partial tracing over the auxiliary subsystem:

    |X⟩ ⟶ ρ = Tr_2(|X⟩⟨X|).    (4.17)

In a loose sense, the purification corresponds to treating any density matrix of size m as a vector
of size m².

Consider a bipartite m ⊗ n composite quantum system. Pure states |X⟩ of this system may
be represented by a rectangular complex matrix X. The partial trace with respect to the n-
dimensional subspace gives a reduced density matrix of size m: ρ = Tr_n(|X⟩⟨X|). The natural
measure in the space of mn-dimensional pure states induces the measure P_{m,n}(ρ) in the space of
the reduced density matrices of size m.
Without loss of generality, we assume that m ⩽ n; then ρ is generically positive definite (here
"generic" means with probability one). In any case, we are only interested in the distribution
of the positive eigenvalues. Let us call the corresponding positive reduced density matrix again
ρ = XX†, where X is an m × n matrix. First we calculate the distribution of matrix elements:

    P(ρ) ∝ ∫ [dX] δ(ρ − XX†) δ(1 − Tr(XX†)),    (4.18)

where the first delta function is a delta function of Hermitian-matrix argument, and in the second
delta function Tr(XX†) may be substituted by Tr(ρ). Since ρ is positive definite, we can make the
transformation

    X = √ρ X̃,    (4.19)

and it follows that [dX] = (det ρ)^n [dX̃] [11]. The matrix delta function may now be written as

    δ(√ρ (1 − X̃X̃†) √ρ) = (det ρ)^{−m} δ(1 − X̃X̃†).    (4.20)

As a result, the distribution of matrix elements is given by

    P(ρ) ∝ θ(ρ) δ(1 − Tr(ρ)) (det ρ)^{n−m},    (4.21)

where the theta function ensures that ρ is positive definite. It is then easy to show by the methods
of random matrix theory that the joint density of eigenvalues Λ = {λ_1, . . . , λ_m} of ρ is given by

    P_{m,n}(λ_1, . . . , λ_m) ∝ δ(1 − Σ_{j=1}^m λ_j) ∏_{j=1}^m λ_j^{n−m} θ(λ_j) ∏_{1⩽i<j⩽m} (λ_i − λ_j)².    (4.22)

This result was first obtained by Życzkowski and Słomczyński [13] in 2001.
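For m = 2, Eq. (4.22) says that the larger eigenvalue λ ∈ (1/2, 1) of ρ has density proportional to
(λ(1 − λ))^{n−2} (2λ − 1)². The following sketch (illustrative; sample size and seed are arbitrary)
realizes the induced measure by normalizing complex Gaussian matrices, and compares the histogram
of the larger eigenvalue with this density:

    import numpy as np

    rng = np.random.default_rng(4)
    m, n, N = 2, 4, 100_000
    X = rng.standard_normal((N, m, n)) + 1j * rng.standard_normal((N, m, n))
    rho = X @ np.conj(np.swapaxes(X, -1, -2))
    rho /= np.trace(rho, axis1=-2, axis2=-1).real[:, None, None]
    lam = np.linalg.eigvalsh(rho)[:, -1]          # larger eigenvalue of each rho

    grid = np.linspace(0.5, 1.0, 501)
    p = (grid * (1 - grid))**(n - 2) * (2 * grid - 1)**2
    p /= np.sum(p) * (grid[1] - grid[0])          # normalize numerically
    hist, edges = np.histogram(lam, bins=50, range=(0.5, 1.0), density=True)
    mid = 0.5 * (edges[:-1] + edges[1:])
    print(np.max(np.abs(hist - np.interp(mid, grid, p))))   # small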

4.3 Joint distribution of eigenvalues of two Wishart matrices

The second application is to calculate the distribution of the sum of a finite number of complex
Wishart matrices taken from the same Wishart ensemble [6]. The distribution of the sum of two
Wishart matrices is considered in [7]. Let us consider two independent complex matrices A and
B of dimensions m × n_A and m × n_B taken, respectively, from the distributions

    P_A(A) = (1/(π^{mn_A} det^{n_A}(Σ_A))) exp(−Tr(Σ_A^{−1} AA†)),    (4.23)
    P_B(B) = (1/(π^{mn_B} det^{n_B}(Σ_B))) exp(−Tr(Σ_B^{−1} BB†)).    (4.24)
Here Σ_A, Σ_B are the covariance matrices. Since the domains of A and B remain invariant under
unitary rotations, without loss of generality we may take Σ_A = diag(σ_{A1}, . . . , σ_{Am}) and Σ_B =
diag(σ_{B1}, . . . , σ_{Bm}). We assume that m ⩽ n_A, n_B. We also have

    ∫ [dA] P_A(A) = 1  and  ∫ [dB] P_B(B) = 1.

The matrices AA† and BB† are then complex-Wishart-distributed, i.e., AA† ∼ W_m^C(n_A, Σ_A)
and BB† ∼ W_m^C(n_B, Σ_B). One is interested in the statistics of the ensemble of m × m Hermitian matrices

    W = AA† + BB†.

The distribution of W can be obtained as

    P_W(W) = ∫ [dA] ∫ [dB] δ(W − AA† − BB†) P_A(A) P_B(B).    (4.25)

In what follows, our method will be different from Kumar's in [7]. Let A = √W Ã and B = √W B̃.
Then [dA] = det(W)^{n_A} [dÃ] and [dB] = det(W)^{n_B} [dB̃]. We also have

    δ(W − AA† − BB†) = det(W)^{−m} δ(1 − ÃÃ† − B̃B̃†).

Then

    P_W(W) = (det^{n_A+n_B−m}(W) / (π^{m(n_A+n_B)} det^{n_A}(Σ_A) det^{n_B}(Σ_B))) ∫ [dÃ] ∫ [dB̃]
             × δ(1 − ÃÃ† − B̃B̃†) e^{−Tr(√W Σ_A^{−1} √W ÃÃ†) − Tr(√W Σ_B^{−1} √W B̃B̃†)}.

That is,

    P_W(W) ∝ det^{n_A+n_B−m}(W)
             × ∫ [dÃ] ∫ [dB̃] δ(1 − ÃÃ† − B̃B̃†) e^{−Tr(√W Σ_A^{−1} √W ÃÃ† + √W Σ_B^{−1} √W B̃B̃†)}.    (4.26)

Now consider the reduction of the sum Tr(√W Σ_A^{−1} √W ÃÃ†) + Tr(√W Σ_B^{−1} √W B̃B̃†).
Since 1 = ÃÃ† + B̃B̃†, it follows that

    Tr(√W Σ_A^{−1} √W ÃÃ†) + Tr(√W Σ_B^{−1} √W B̃B̃†)    (4.27)
        = Tr(W Σ_A^{−1}) + Tr(√W (Σ_B^{−1} − Σ_A^{−1}) √W B̃B̃†)    (4.28)
        = Tr(W Σ_B^{−1}) + Tr(√W (Σ_A^{−1} − Σ_B^{−1}) √W ÃÃ†).    (4.29)

Hence

    P_W(W) ∝ (det^{n_A+n_B−m}(W) / e^{Tr(W Σ_A^{−1})}) Q(W),    (4.30)
where

    Q(W) := ∫ [dÃ] ∫ [dB̃] δ(1 − ÃÃ† − B̃B̃†) e^{Tr(√W (Σ_A^{−1} − Σ_B^{−1}) √W B̃B̃†)}
          = ∫_{0<ÃÃ†<1_m} [dÃ] e^{Tr(√W (Σ_A^{−1} − Σ_B^{−1}) √W (1 − ÃÃ†))}
          = e^{Tr(W (Σ_A^{−1} − Σ_B^{−1}))} ∫_{0<ÃÃ†<1_m} [dÃ] e^{Tr(√W (Σ_B^{−1} − Σ_A^{−1}) √W ÃÃ†)}.

That is,

    P_W(W) ∝ (det^{n_A+n_B−m}(W) / e^{Tr(W Σ_B^{−1})}) ∫_{0<ÃÃ†<1_m} [dÃ] e^{Tr(√W (Σ_B^{−1} − Σ_A^{−1}) √W ÃÃ†)}.    (4.31)

Similarly, we have

    P_W(W) ∝ (det^{n_A+n_B−m}(W) / e^{Tr(W Σ_A^{−1})}) ∫_{0<B̃B̃†<1_m} [dB̃] e^{Tr(√W (Σ_A^{−1} − Σ_B^{−1}) √W B̃B̃†)}.    (4.32)

In fact, we have

    (1/e^{Tr(W Σ_B^{−1})}) ∫_{0<ÃÃ†<1_m} [dÃ] e^{Tr(√W (Σ_B^{−1} − Σ_A^{−1}) √W ÃÃ†)}
        = (1/e^{Tr(W Σ_A^{−1})}) ∫_{0<B̃B̃†<1_m} [dB̃] e^{Tr(√W (Σ_A^{−1} − Σ_B^{−1}) √W B̃B̃†)}.
Let Λ = √W (Σ_A^{−1} − Σ_B^{−1}) √W. By the Gram-Schmidt orthogonalization procedure we can
write Ã = TU_1, where U_1 is an m × n_A matrix such that U_1 U_1† = 1_m and T is an m × m lower
triangular matrix with real positive diagonal entries. Then

    [dÃ] = ∏_{j=1}^m t_jj^{2(n_A−j)+1} [dT][dU_1 U†],

where U is the enlarged n_A × n_A unitary matrix of U_1. Let X = ÃÃ† = TT†. We then have [11]:

    [dX] = 2^m ∏_{j=1}^m t_jj^{2(m−j)+1} [dT].

Combining the above results gives [11]:

    [dÃ] = 2^{−m} det^{n_A−m}(X) [dX][dU_1 U†].    (4.33)

This means that

    ∫_{0<ÃÃ†<1_m} [dÃ] e^{Tr(Λ ÃÃ†)} = 2^{−m} ∫_0^{1_m} [dX] det^{n_A−m}(X) e^{Tr(ΛX)} ∫ [dU_1 U†]    (4.34)
                                     ∝ ∫_0^{1_m} [dX] det^{n_A−m}(X) e^{Tr(ΛX)}.    (4.35)

Similarly,

    ∫_{0<B̃B̃†<1_m} [dB̃] e^{−Tr(Λ B̃B̃†)} ∝ ∫_0^{1_m} [dX] det^{n_B−m}(X) e^{−Tr(ΛX)}.    (4.36)

Theorem 4.4. It holds that

    P_W(W) ∝ (det^{n_A+n_B−m}(W) / e^{Tr(W Σ_B^{−1})}) ∫_0^{1_m} [dX] det^{n_A−m}(X) e^{Tr(ΛX)}    (4.37)

and

    P_W(W) ∝ (det^{n_A+n_B−m}(W) / e^{Tr(W Σ_A^{−1})}) ∫_0^{1_m} [dX] det^{n_B−m}(X) e^{−Tr(ΛX)}.    (4.38)

Moreover,

    (1/e^{Tr(W Σ_B^{−1})}) ∫_0^{1_m} [dX] det^{n_A−m}(X) e^{Tr(ΛX)} = (1/e^{Tr(W Σ_A^{−1})}) ∫_0^{1_m} [dX] det^{n_B−m}(X) e^{−Tr(ΛX)}.    (4.39)

In particular, if Λ = 0 (i.e., Σ_A = Σ_B := Σ), then

    P_W(W) ∝ det^{n_A+n_B−m}(W) / e^{Tr(W Σ^{−1})}.    (4.40)
In the following, we calculate, via X = U diag(x) U† where diag(x) = diag(x_1, . . . , x_m),

    ∫_0^{1_m} [dX] det^{n_A−m}(X) e^{Tr(ΛX)} ∝ ∫_0^1 · · · ∫_0^1 Δ(x)² ∏_{j=1}^m x_j^{n_A−m} dx_j ∫ dμ_Haar(U) e^{Tr(ΛU diag(x) U†)}.    (4.41)

Without loss of generality, we assume that Λ is in diagonal form, i.e. Λ = diag(λ_1, . . . , λ_m) where
each λ_j ∈ R. By the Harish-Chandra-Itzykson-Zuber integral formula [3, 5],

    ∫_{U(m)} dμ_Haar(U) e^{Tr(AUBU†)} = (∏_{j=1}^m Γ(j)) det(exp(a_i b_j)) / (Δ(a) Δ(b)),    (4.42)

it follows that

    ∫ dμ_Haar(U) e^{Tr(ΛU diag(x) U†)} = (∏_{j=1}^m Γ(j)) det(exp(λ_i x_j)) / (Δ(λ) Δ(x)).    (4.43)

Therefore

    ∫_0^{1_m} [dX] det^{n_A−m}(X) e^{Tr(ΛX)} ∝ (1/Δ(λ)) ∫_0^1 · · · ∫_0^1 Δ(x) det(exp(λ_i x_j)) ∏_{j=1}^m x_j^{n_A−m} dx_j    (4.44)
                                             ∝ (1/Δ(λ)) ∫_0^1 · · · ∫_0^1 |Δ(x)| ∏_{j=1}^m e^{λ_j x_j} x_j^{n_A−m} dx_j,    (4.45)

implying that

    ∫_0^{1_m} [dX] det^{n_A−m}(X) e^{Tr(ΛX)} ∝ (1/Δ(λ)) (∏_{j=1}^m ∂_{λ_j}^{n_A−m}) ∫_0^1 · · · ∫_0^1 |Δ(x)| ∏_{j=1}^m e^{λ_j x_j} dx_j.    (4.46)

Remark 4.5. Kumar in [7] has presented an analytical formula for the distribution of the sum of two
Wishart matrices in terms of the confluent hypergeometric function of matrix argument [8], which
is defined by

    ₁F₁(a; c; −Λ) = (Γ_p(c)/(Γ_p(a) Γ_p(c − a))) ∫_0^{1_p} det(X)^{a−p} det(1 − X)^{c−a−p} e^{−Tr(ΛX)} [dX],    (4.47)

where Γ_p(a) := π^{p(p−1)/2} Γ(a)Γ(a − 1) · · · Γ(a − p + 1). Kummer's formula for the confluent hyper-
geometric function ₁F₁ is given by

    ₁F₁(a; c; −Λ) = e^{−Tr(Λ)} · ₁F₁(c − a; c; Λ).    (4.48)

Remark 4.6. We can present a simple approach to a similar result. Denote by W_m^C(n, Σ) the Wishart
matrix ensemble in which each matrix is of the form W = AA*, where A ∈ C^{m×n} (m ⩽ n). Now
W = W_1 + W_2, where W_1, W_2 ∈ W_m^C(n, Σ), can be rewritten as

    W = ZZ†,  Z = [A, B] ∈ C^{m×2n}.

Then W ∼ W_m^C(2n, Σ). Thus

    P_W(W) ∝ det^{2n−m}(W) e^{−Tr(Σ^{−1}W)}.

The distribution of the sum of an arbitrary finite number of Wishart matrices can be derived in the same way:

    P_W(W) ∝ det^{kn−m}(W) e^{−Tr(Σ^{−1}W)},

where W = Σ_{j=1}^k W_j for W_j ∼ W_m^C(n, Σ).
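This closure property is easy to confirm by simulation: with Σ = 1_m (an assumption made here
for simplicity), the largest-eigenvalue statistics of W_1 + W_2 for two independent W_m^C(n, 1_m)
matrices match those of a single W_m^C(2n, 1_m) matrix. An illustrative sketch assuming NumPy
(the helper wishart and all parameters are arbitrary choices):

    import numpy as np

    rng = np.random.default_rng(5)
    m, n, N = 3, 5, 50_000

    def wishart(rows, cols, count):
        # count independent samples of G G^dagger with G complex Gaussian
        G = rng.standard_normal((count, rows, cols)) \
            + 1j * rng.standard_normal((count, rows, cols))
        return G @ np.conj(np.swapaxes(G, -1, -2))

    lam_sum = np.linalg.eigvalsh(wishart(m, n, N) + wishart(m, n, N))[:, -1]
    lam_2n = np.linalg.eigvalsh(wishart(m, 2 * n, N))[:, -1]

    q = np.linspace(0.05, 0.95, 10)
    print(np.quantile(lam_sum, q))
    print(np.quantile(lam_2n, q))     # the two quantile vectors agree closely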

References

[1] A. Lovas, A. Andai, Volume of the space of qubit-qubit channels and state transformations under
random quantum channels, arXiv:1708.07387.

[2] M. Christandl, B. Doran, S. Kousidis, M. Walter, Eigenvalue distributions of reduced density
matrices, Comm. Math. Phys. 332, 1-52 (2014).

[3] Harish-Chandra, Spherical Functions on a Semisimple Lie Group, I, Amer. J. Math. 80, 241-310 (1958).

[4] R.F. Hoskins, Delta Functions: An Introduction to Generalised Functions, 2nd Edition (2009),
Woodhead Publishing Limited, Oxford (2011).

[5] C. Itzykson and J.-B. Zuber, The planar approximation. II, J. Math. Phys. 21, 411-421 (1980).

[6] A.T. James, Distributions of Matrix Variates and Latent Roots Derived from Normal Samples, Ann.
Math. Stat. 35(2), 475-501 (1964).

[7] S. Kumar, Eigenvalue statistics for the sum of two complex Wishart matrices, Europhys. Lett.
107, 60002 (2014).

[8] A.M. Mathai, Jacobians of Matrix Transformations and Functions of Matrix Arguments, World
Scientific (1997).

[9] J. Mejía, C. Zapata, A. Botero, The difference between two random mixed quantum states: exact
and asymptotic spectral analysis, J. Phys. A: Math. Theor. 50, 025301 (2017).

[10] S.R. Moghadasi, Polar decomposition of the k-fold product of Lebesgue measure on R^n, Bull. Aust.
Math. Soc. 85, 315-324 (2012).

[11] L. Zhang, Volumes of orthogonal groups and unitary groups, arXiv:1509.00537.

[12] L. Zhang, Average coherence and its typicality for random mixed quantum states, J. Phys. A: Math.
Theor. 50, 155303 (2017).

[13] K. Życzkowski, W. Słomczyński, Induced measures in the space of mixed quantum states, J. Phys.
A: Math. Gen. 34, 7111 (2001).
