Professional Documents
Culture Documents
Lecture Notes For Math 623 Matrix Analysis: 1 Normal Matrices
Lecture Notes For Math 623 Matrix Analysis: 1 Normal Matrices
Matrix Analysis
Michael E. O’Sullivan
mosulliv@math.sdsu.edu
www-rohan.sdsu.edu/ ˜ mosulliv
1 Normal Matrices
All matrices are in Mn (C). A∗ is the conjugate transpose.
We recall several definitions and a new term “normal.”
Definition 1.1. A ∈ Mn (C) is
• normal when AA∗ = A∗ A.
• Hermitian when A∗ = A.
1
(d) The set of normal matrices is closed under scalar multiplication, but
is not closed under addition and is not closed under multiplication.
(e) The set of normal matrices is closed under unitary conjugation.
(f ) A ∈ Mm C and B ∈ Mn C are normal iff A ⊕ B is.
(g) Suppose A and B are real normal matrices satisfying AB ∗ = B ∗ A.
Show that A + B and AB are both normal.
Lemma 1.3. Suppose that T is upper (or lower) triangular and normal.
Then T is a diagonal matrix.
Then (T ∗ T )1,1 = t1,1 t1,1 , which is real and nonnegative. On the other hand,
n
X
(T T ∗ )1,1 = t1,1 t1,1 + t1,2 t1,2 + · · · + t1,n t1,n = |t1,i |2
i=1
Exercises 1.4.
(a) Show that for any matrix A, Tr(A∗ A) = ni=1 nj=1 |ai,j |2 . We will
P P
Pn
write this more concisely as i,j=1 |ai,j |2 .
Lemma 1.5. If A, B are unitarily equivalent then ni,j=1 |ai,j |2 = ni,j=1 |bi,j |2 .
P P
2
Proof. Let A = U ∗ BU .
n
X
|ai,j |2 = Tr(A∗ A) = Tr(U ∗ B ∗ U U ∗ BU )
i,j=1
n
X
= Tr(U ∗ B ∗ BU ) = Tr(B ∗ B) = |bi,j |2
i,j=1
(2) A is normal.
Pn 2
Pn 2
(3) i,j=1 |ai,j | = i=1 |λi | .
A∗ A = U ∗ Λ∗ U U ∗ ΛU = U ∗ Λ∗ ΛU = U ∗ ΛΛ∗ U = AA∗
Now we show that either (2) or (3) imply that A is unitarily diagonal-
izable. Any matrix is unitarily triangularizable, so let A = U ∗ T U with U
unitary and T upper triangular.
Suppose A is normal. Since T is similar to A it is also normal. By the
Lemma 1.3, it is diagonal. Thus PA is unitarily diagonalizable.
Now suppose i,j=1 |ai,j | = ni=1 |λi |2 . By the Lemma 1.5
Pn 2
n
X n
X
|λi |2 = |ai,j |2
i=1 i,j=1
X n
= |ti,j |2
i,j=1
3
and since the diagonal entries of T are the eigenvalues of A
n
X n X
X n
= |λi |2 + |ti,j |2
i=1 i=1 j=i+1
From the first and last lines and the nonnegativity of |ti,j |2 we can conclude
that ti,j = 0. So T is diagonal.
Proof.
(1) A is unitary.
4
Proof. We have already shown the equivalence of (1) and (3). If A is unitary
it is normal, and is therefore unitarily diagonalizable, say U ∗ AU = Λ. Then
Λ∗ Λ = U ∗ A∗ U U ∗ AU = U ∗ A∗ AU = U ∗ U = I
A∗ A = U ∗ Λ∗ U U ∗ ΛU = U ∗ Λ∗ ΛU = U ∗ U = I
(1) A is Hermitian.
Proof. We have already proven that A Hermitian implies (2) and (3). If
A is normal and has real eigenvalues then it is unitarily diagonalizable,
A = U ΛU ∗ with Λ ∈ Mn (R) . Then A is unitarily similar to a Hermitian
matrix, so it is also Hermitian.
Suppose that v ∗ Av is real for all v ∈ Cn . Then (u + v)A (u + v) − u∗ Au −
v Av is real for any u, v ∈ Cn . This is equal to u∗ Av + v ∗ Au. Set u = ei
∗
and v = ej to obtain ai,j + aj,i is real. This shows Im(ai,j ) = − Im(aj,i ). Set
u = iei and v = ej to obtain −iai,j + iaj,i is real. This shows Re(ai,j ) =
Re(aj,i ). Thus aj,i = ai,j and A is Hermitian.
For z ∈ C, writing z > 0 means that z is real and positive (and similarly
z ≥ 0 means nonnegative).
(2) v ∗ Av > 0 for all nonzero v ∈ Cn , iff the eigenvalues of A are positive.
5
Proof. If v ∗ Av ≥ 0 for all v ∈ Cn , then in particular for an eigenpair λ, u we
have 0 ≤ u∗ Au = λu∗ u. Since u∗ u is positive for nonzero u, we must have
λ ≥ 0. Furthermore if the inequality is strict, as assumed in (2), then λ > 0.
If A is Hermitian, then by the spectral theorem it is unitarily diagonal-
izable, so A = U ∗ ΛU . Suppose the eigenvalues of A are all nonnegative.
Then
n
X
v ∗ Av = v ∗ U ∗ ΛU v = (U v)∗ Λ(U v) = λ i xi x
i=1
(1) A is symmetric.
3 Unitary Similarity
Definition 3.1. We say A is unitarily similar to B when there exists a
unitary matrix U such that A = U ∗ BU . We write A ∼U B.
Exercises 3.2.
(a) Unitary similarity is an equivalence relation.
(b) If A ∼U B then A ∼ B, but matrices can be similar without being
unitarily similar. (Find an example).
(c) If A ∼U B then A∗ ∼U B ∗ and AA∗ ∼U BB ∗ .
6
We want to generalize the last result.
Theorem 3.5 (Sprecht). If Tr(W (A, A∗ ) = Tr(W (B, B ∗ ) for all words
W (x, y), then A ∼U B.
[Peary] If Tr(W (A, A∗ ) = Tr(W (B, B ∗ ) for all words W (x, y) of degree
at most 2n2 , then A ∼U B.
Horn does not prove Sprecht’s (nore Peary’s) result. I don’t know if
there is a canonical for unitary similarity classes, analogous to Jordan form
for similarity classes.
4 Miscellaneous
At the end of Section 2.1 Horn brings up a generalization of unitary matrices:
instead of requiring that U −1 = U ∗ , consider A similar to A∗ . I’m not sure
if there is some reason for interest in this idea, or it is just a curiosity. I
found Theorem 2.1.9 a bit hard to follow, so I broke it into pieces.
7
Proof. Let x be a nonzero vector in Cn .
α
(α + αA)x = 0 ⇐⇒ Ax = − x
α
α
⇐⇒ − is an eigenvalue of A
α
Choosing α as in the statement of the theorem guarantees αI + αA is trivial
nullspace and is therefore invertible.
Corollary 4.2. Let S be invertible. There is a θ ∈ [0, 2π) such that (Sθ +
Sθ )∗ is invertible. Here Sθ = eiθ S.
Proof. Apply the previous Lemma, let α = eiθ be such that −α/α = −e−2iθ
is not be an eigenvalue of S −1 S ∗ . Then eiθ I + e−iθ S −1 S ∗ is invertible.
Multiply by S to get Sθ + Sθ∗ is invertible.
B −1 (B −1 B ∗ )B = B −1 (BB −1 )∗ B = (B −1 )∗ B = (B −1 B ∗ )−1
Therefore A = B −1 B ∗ .
To get H we know A∗ = SA−1 S −1 for some S. Choose θ so that Sθ + Sθ∗
is invertible. Then A∗ = Sθ A−1 Sθ−1 so
S θ = A∗ S θ A
Sθ∗ = A∗ Sθ∗ A
8
5 Positive Definite Matrices
Definition 5.1. A Hermitian matrix A ∈ Mn is positive definite when
v ∗ Av > 0 for all nonzero v ∈ Cn . It is positive semi-definite when v ∗ Av ≥ 0
for all nonzero v ∈ Cn .
Here we follow the convention that z > 0 for z ∈ C means that z is real
and positive.
Exercises 5.2.
(a) Any principal sub matrix of a positive definite matrix is positive defi-
nite (and similarly for positive semi-definite).
(b) The trace and determinant of all principal minors are positive.
(c) For any i 6= j, aii ajj > |aij |2 .
(d) The set of positive definite matrices is closed under addition and scalar
multiplication by positive real numbers. It doesn’t form a vector space,
but is a union of rays from the origin (the 0 matrix) and is called a
cone.
(e) Applying our results on normal matrices, a positive definite matrix A
has positive eigenvalues and is unitarily diagonalizable, while a semi-
definite matrix may have 0 eigenvalues.
(f ) The definition could be rephrased to say A is normal and v ∗ Av > 0,
since these imply that A is Hermitian.
(g) Let A ∈ Mn be positive definite. For any C ∈ Mn,m , C ∗ AC is positive
definite and rk(C ∗ AC) = rk(C).
For a nonsquare matrix D ∈ Mn,m I will say that D is diagonal when
for i 6= j, di,j = 0.
A = U ΣW
The σi are called the singular values of A. Exactly r = rk(A) of them are
nonzero. They are uniquely defined, since they are the nonzero eigenvalues
of AA∗ . It is easy to check that AA∗ is positive semi-definite.