Professional Documents
Culture Documents
1 s2.0 S0024379524000661 Main
1 s2.0 S0024379524000661 Main
a r t i c l e i n f o a b s t r a c t
✩
Research of HJW is supported by National Science Foundation grant DMS 2000037.
* Corresponding author.
E-mail address: hugo@math.drexel.edu (H.J. Woerdeman).
https://doi.org/10.1016/j.laa.2024.02.021
0024-3795/© 2024 Elsevier Inc. All rights reserved.
248 A. van der Merwe et al. / Linear Algebra and its Applications 689 (2024) 247–259
1. Introduction
It is well-known (see, e.g., [13]) that any symmetric polynomial in n variables can be
expressed as a polynomial with rational coefficients in the first n power sum polynomials.
For instance, the two variable symmetric polynomial r(x1 , x2 ) = x21 x2 + x1 x22 can be
written as r = 12 p31 − 12 p1 p2 .
In order to state our main result, we consider words w in two letters. For instance,
P P Q, P QP QP P are words in the letters P and Q of lengths 3 and 6, respectively.
The length of a word w is denoted by |w|. The empty word ∅ has length 0. When we
write na(w, P ) = l we mean that P appears l times in the word w (na=number of
appearances). We also need the notion of trace polynomials (introduced in [16]; see also
[7,8]), which in the scalar valued case are linear combinations of products of traces of
matrices. An example of such a scalar valued polynomial is
5 2
q(P, Q) = Tr(P QP Q)Tr P 2 Q3 − Tr(P Q5 P Q) ,
3
where the trace of a square matrix M is denoted by Tr M . We will use trace polynomials
with matrices A and A∗ .
rj = hj (p1 , . . . , pn ), j = 1, . . . , k,
n
k e−it A + eit A∗
k
pk (λ1 (t), . . . , λn (t)) = λj (t)k = Tr Re e−it A = Tr ,
j=1
2
which is a trigonometric polynomial (in eit ) where the coefficients are trace polynomials
(with matrices A and A∗ ). Therefore, rj (λ1 (t), . . . , λn (t)), j = 1, . . . , k, are trigonometric
250 A. van der Merwe et al. / Linear Algebra and its Applications 689 (2024) 247–259
polynomials (in eit ) where the coefficients are trace polynomials (with matrices A and
A∗ ). If we now take the coefficients appearing in rj (λ1 (t), . . . , λn (t)), j = 1, . . . , k, that
correspond to non-zero powers of eit , and call these qj , j = 1, . . . , K, then these give the
desired trace polynomials. Moreover, if rj , j = 1, . . . , k, have rational coefficients, then
so will qj , j = 1, . . . , K.
Remark 2.2. In case each of the first n power sum polynomials can be expressed as a
polynomial in the polynomials r1 , . . . , rk , the trace conditions (3) in Theorem 2.1 will
reduce to the condition [15, Theorem 1.1(ii)].
We let
ek (x1 , . . . , xn ) = xl1 · · · xlk ,
1≤l1 <···<lk ≤n
denote the elementary symmetric polynomials. The well-known Newton identities relating
the elementary symmetric polynomials with the power sum polynomials are
k
kek (x1 , . . . , xn ) = (−1)i−1 ek−i (x1 , . . . , xn )pi (x1 , . . . , xn ), 1 ≤ k ≤ n,
i=1
p31 p1 p2 p3
e3 = − + .
6 2 3
Thus
1 1 −it 3
Tr e A + eit A∗
λ1 (t)λ2 (t)λ3 (t) =
3
2 6
1 −it it ∗
−it
it ∗ 2 1 −it
it ∗ 3
− Tr e A + e A Tr e A + e A + Tr e A + e A .
2 3
Extracting the coefficient of e−3it , we get up to a factor 1
23
1 1 1
Tr(A)3 − Tr(A)Tr(A2 ) + Tr(A3 ). (4)
6 2 3
0. (5)
A. van der Merwe et al. / Linear Algebra and its Applications 689 (2024) 247–259 251
Extracting the coefficient of e−it , we get up to a factor 1
23
1 1
Tr(A)2 Tr(A∗ ) − Tr(A)Tr (AA∗ ) − Tr(A∗ )Tr A2 + Tr A2 A∗ , (6)
2 2
where we used the rule Tr(CD) = Tr(DC). The coefficient of eikt is just the complex
conjugate of the coefficient of e−ikt . Thus we arrive at the following corollary.
Corollary 2.3. For a 3 × 3 matrix A, the expression det(Re(e−it A)) is constant if and
only if the quantities in (4) and (6) are equal to zero.
n
(−pi )mi
en = (−1)n .
m1 +2m2 +···+nmn =n i=1
mi ! imi
m1 ≥0,...,mn ≥0
In a similar way as above, one may derive for an n × n matrix A necessary and sufficient
conditions for det Re e−it A to be constant in terms of trace conditions.
In this section we consider the problem of characterizing when the c-numerical range
of a matrix is a disk. Note that in the special case when c1 = · · · = ck = 1 and
ck+1 = · · · = cn = 0, it is also called the k-numerical range.
We recall from [9, Result 5.1] the following proposition.
α = c1 λn + · · · + cn λ1 , β = c1 λ1 + · · · + cn λn
c1 λ1 (t) + · · · + cn λn (t)
Proof. Without loss of generalization, let us assume that the disk in question is the
unit disk. Clearly, Wc (A) is the unit disk if and only if for each t ∈ R we have
that e−it Wc (A) = Wc e−it A is the unit disk. The latter happens if and only if
Re(Wc (e−it A)) = [−1, 1] for all t ∈ R. Notice that Wc e−i(t+π) A = −Wc e−it A ,
from which it follows that Wc (e−it A) is the unit disk for all t ∈ R if and only if
max x=1
x∈Re(Wc (e−it A))
for all t ∈ R. Using Proposition 3.1 the latter happens if and only if c1 λ1 (t) + · · · +
cn λn (t) = 1 for all t ∈ R.
Clearly, when the disk with center 0 has radius r, we obtain that c1 λ1 (t) + · · · +
cn λn (t) = r for all t ∈ R. This finishes the proof.
Note that when c = (1, 0, . . . , 0), the circularity of Wc (A) = W (A) = {x∗ Ax : x∗ x =
1}, the classical numerical range of A, is characterized in [14, Theorem 4.5].
We need the following notation that was introduced in the proof of [2, Theorem 2]. Let
m1 , . . . , mk be positive integers with m1 + · · · + mk = n. We let P (m1 , . . . , mk ) denote
the collection of tuples (P1 , . . . , Pk ) such that P1 , . . . , Pk form a partition of {1, 2, . . . , n}
where Pj has mj elements, j = 1, . . . , k.
c = (c1 , . . . , c1 , c2 , . . . , c2 , . . . . . . , ck , . . . , ck ),
k
where c1 > c2 > . . . > ck , cj appears mj times, mj ≥ 1, j=1 mj = n, and let A ∈ C n×n .
Introduce the polynomial
⎛ ⎞
k
pt (x) = ⎝x − cj λs (t)⎠ ,
(P1 ,...,Pk )∈P (m1 ,...,mk ) j=1 s∈Pj
Proof. It is clear from the definition of pt (x) that the coefficients are symmetric polyno-
mials in λ1 (t), . . . , λn (t). As is well known (see, e.g., [13]) every symmetric polynomial can
n
be expressed as a polynomial in the power functions pk (λ1 (t), . . . , λn (t)) = i=1 λi (t)k =
k
Tr Re e−it A , k = 1, . . . , n. But then it follows that the coefficients are trigonometric
polynomials in t of degree at most n. This proves the first part.
A. van der Merwe et al. / Linear Algebra and its Applications 689 (2024) 247–259 253
k
For the second part, notice that the roots of pt (x) are j=1 cj s∈Pj λs (t), which
are indexed by (P1 , . . . , Pk ) ∈ P (m1 , . . . , mk ). Among all the roots, the largest is when
the tuple (P1 , . . . , Pk ) is the partition where P1 = {1, . . . , m1 }, P2 = {m1 + 1, . . . , m1 +
k−1 k
m2 }, . . . , Pk = { l=1 ml + 1, . . . , l=1 ml }. This largest root being equal to 1 for all t
corresponds exactly to pt (1) = 0 for all t ∈ R, and pt (x) > 0 for all x > 1 and all t ∈ R.
Here we used that pt (x) → ∞ when x → ∞.
4
4
4
(x − λi (t)) = (−1)4−i xi e4−i (λ1 (t), λ2 (t), λ3 (t), λ4 (t)) = (−1)4−i xi e4−i .
i=1 i=0 i=0
Using the calculations from [2, Equation (16)] (or, alternatively, [1, Theorem 4.2]), we
have that
6
pt (x) = (x − λr (t) − λs (t)) =: ki x6−i ,
1≤r<s≤4 i=0
where k0 = 1,
Using the Newton identities, we may express the coefficients of pt (x) in terms of the
power sum polynomials pj = pj (λ1 (t), λ2 (t), λ3 (t), λ4 (t)) as follows
3 1
k1 = − Tr(A) + zTr(A∗ ) ,
2 z
1
2
1 1 z2
k2 = Tr(A) + zTr(A∗ ) − 2
Tr A2 − Tr(AA∗ ) − Tr A∗2 ,
z 4z 2 4
1 1 1 z 2
k3 = zTr(A∗ ) + Tr(A) Tr(AA∗ ) + 2 Tr A2 + Tr A∗ 2
z 2 4z 4
3
z 1
−3 Tr(A∗ ) + Tr(A) .
2 2z
We omit the expressions for k4 , k5 and k6 in terms of the traces as these are very
long, but can be determined in a similar way. Furthermore, from pt (1) = 0 for all t ∈ R
follows
6
0 = 1 + k1 + k2 + k3 + k4 + k5 + k6 =: ai z i ,
i=−6
1
a−6 = Tr(A)6 + 6Tr(A)4 Tr(A2 ) − 20Tr(A)3 Tr(A3 ) − 9Tr(A)2 Tr(A2 )2
4608
+18Tr(A4 )Tr(A)2 + 12Tr(A)Tr(A2 )Tr(A3 ) − 8Tr(A3 )2 = 0,
1
a−5 = 144Tr(A3 )Tr(A)2 − 36Tr(A)5 + 36Tr(A)Tr(A2 )2 − 144Tr(A4 )Tr(A) = 0,
4608
1
a−3 = 36Tr(A∗ )Tr(A2 )2 − 180Tr(A∗ )Tr(A)4 + 432Tr(A)2 Tr(A∗ A2 )
4608
− 1728Tr(A)3 + 1152Tr(A)Tr(A2 ) − 576Tr(A)Tr(A3 A∗ ) − 144Tr(A∗ )Tr(A4 )
+ 144Tr(A)Tr(AA∗ )Tr(A2 ) + 288Tr(A∗ )Tr(A)Tr(A3 ) = 0,
1
a−1 = 144Tr(A)Tr(AA∗ )2 − 5184Tr(A∗ )Tr(A)2 − 6912Tr(A)
4608
+ 432Tr(A)2 Tr(AA∗ 2 ) + 144Tr(A∗ )2 Tr(A3 ) − 360Tr(A∗ )2 Tr(A)3
+ 1152Tr(A∗ )Tr(A2 ) + 2304Tr(A)Tr(AA∗ )
− 567Tr(A∗ )Tr(A3 A∗ ) − 576Tr(A)Tr(A∗ 2 A2 ) − 288Tr(A)Tr(A∗ AA∗ A)
+ 72Tr(A)Tr(A∗ 2 )Tr(A2 ) + 144Tr(A∗ )Tr(AA∗ )Tr(A2 )
+ 864Tr(A∗ )Tr(A)Tr(A2 A∗ )
= 0.
We will not write down full expressions for a−4 , a−2 and a0 , as these are very long.
However, in order to follow our computations for the remainder of this example, it is
A. van der Merwe et al. / Linear Algebra and its Applications 689 (2024) 247–259 255
important to note that in every term of a−4 there is a factor Tr Ak or Tr A∗k , and
that
1 3 ∗ 1 2 ∗ 2
a−2 = Tr A A − Tr A A + terms with a factor Tr Ak or Tr A∗k ,
4 64
1 1 1 1
a0 = 1 − Tr A2 A∗ Tr AA∗2 − (Tr (AA∗ )) + Tr (AA∗ AA∗ ) − Tr (AA∗ )
2
32 16 8 2
1
+ Tr A2 A∗2 + terms with a factor Tr Ak or Tr A∗k .
4
Let us apply the above techniques to
⎛ ⎞
α β γ δ
⎜0 α β γ⎟
A=⎝
β⎠
.
0 0 α
0 0 0 α
For this particular A we have that a−6 = α6 . Setting this equal to 0, gives α = 0. Now
we find
1 3 1
a−2 = β δ̄ − (2γ̄β 2 + 2γβ δ̄)2 ,
4 64
1 1 1 1 5 5 1
a0 = − |β|4 |γ|2 − β 2 β̄ γ̄ 2 δ − β β̄ 2 γ 2 δ̄ − |β|2 |γ|2 |δ|2 + |β|4 + |β|2 |γ|2 + |β|2 |δ|2 +
8 8 8 8 16 4 8
1 2 1 2 1 2 2 1 4 3 2 1 2
β γ̄ δ + β̄ γ δ̄ + |δ| |γ| + |δ| − |β| − |γ| − |δ| + 1, 2
4 4 4 16 2 2
and a−6 = a−5 = a−4 = a−3 = a−1 = 0. Letting β, γ, δ be real, and solving for
a−2 = a0 = 0, we find for example the solution β = δ, |γ| = 1. Letting β = δ and γ = 1,
we find that
1 2 1 1
pt (x) = x6 + x4 −2β 2 − 1 + 2x2 β 2 β cos(2t) + β 2 + 1 − β 4 (cos(2t) + 1)
4 4 2
1 4 1
= x2 − 1 β cos(2t) + β 4 − 2β 2 x2 + x4 .
2 2
Clearly, pt (1) ≡ 0. Next, if we let x > 1, then using that cos(2t) ≥ −1, we obtain that
pt (x) ≥ x2 − 1 x2 x2 − 2β 2 .
Thus, the requirement that pt (x) > 0 for x > 1 gives that |β| ≤ √1 .
2
In conclusion, for c
as in (7) we have that
⎛ ⎞
0 β 1 β
⎜0 0 β 1⎟
A=⎝
β⎠
,
0 0 0
0 0 0 0
with |β| ≤ √1 ,
2
has the property that Wc (A) is the unit disk.
256 A. van der Merwe et al. / Linear Algebra and its Applications 689 (2024) 247–259
Making use of [2, Theorem 4], we obtain the following case where we characterize
when Wc (A) equals the unit disk.
Proof. We use [2, Theorem 4] to rephrase the problem to the direct sum of three matrices
having numerical range equal to the unit disk. Analyzing the latter we get the above
α β
result. Note that B := can only have a circular numerical range if and only
γ −α
if it is of rank 1 (i.e., α2 = −βγ), and in that case the radius is f /2 (as B is unitarily
0 f
similar to ; note that the Frobenius norm needs to be the same).
0 0
It was derived in [14, Theorem 4.5] that for a complex matrix B ∈ C n×n , its numerical
range is the closed unit disk if and only if there exist P0 , P1 ∈ C (n−1)×n so that B =
2P0∗ P1 and P0∗ P0 + P1∗ P1 = In . Of course, we can also apply the results in the previous
section to the standard numerical range by choosing
Proposition 4.1. Let A ∈ C 3×3 . If W (A) equals the closed unit disk, then
1 1
1 + Tr(A)Tr(A∗ ) − Tr(AA∗ ) = 0,
4 4
1 1 1 1 1
− Tr(A) − Tr(A) Tr(A ) + Tr(A) Tr(AA∗ ) + Tr(A∗ )Tr(A2 ) − Tr(A2 A∗ ) = 0,
2 ∗
2 16 8 16 8
Tr(A) − Tr(A ) = 0,
2 2
and
1 1 1
− Tr(A)3 + Tr(A)Tr(A2 ) − Tr(A3 ) = 0.
6 2 3
A. van der Merwe et al. / Linear Algebra and its Applications 689 (2024) 247–259 257
Proof. If we apply Theorem 3.4 with a c = (1, 0, 0), then we have that pt (x) is just the
characteristic polynomial of Re e−it A , which equals
1 2 p31 p1 p2 p3
pt (x) = x3 − e1 x2 + e2 x − e3 = x3 − p1 x2 + p1 − p2 x − − + .
2 6 2 3
1 −it 1 2
p1 = Tr e A + eit A∗ , p2 = Tr e−it A + eit A∗ ,
2 4
1 −it 3
p3 = Tr e A + eit A∗ .
8
1 1
1 + Tr(A)Tr(A∗ ) − Tr(AA∗ ). (9)
4 4
1 1 1 1 1
− Tr(A)− Tr(A)2 Tr(A∗ )+ Tr(A) Tr(AA∗ )+ Tr(A∗ ) Tr(A2 )− Tr(A2 A∗ ). (10)
2 16 8 16 8
1 1 1
− Tr(A)3 + Tr(A)Tr(A2 ) − Tr(A3 ). (12)
6 2 3
It should be noticed that by using the techniques developed in this paper one can also
find necessary conditions in terms of traces for when the first k higher rank numerical
ranges are disks with center 0. Recall that the rank-k numerical range of a square matrix
B is defined by
This notion, which generalizes the classical numerical range when k = 1 and is motivated
by the study of quantum error correction, was introduced in [4]. In [5,17] it was shown
that Λk (B) is convex. Subsequently, in [10] a different proof of convexity was given. In
[14] the situation was analyzed when one of the eigenvalues λk (Re(e−it A)) is a constant
258 A. van der Merwe et al. / Linear Algebra and its Applications 689 (2024) 247–259
function of t, which corresponds to the rank-k numerical range of A being a circle with
center 0.
Now, if for instance one wants the numerical range and the 2-rank numerical range to
be disks with center 0 then one has that λ1 (t) and λ2 (t) are constant, which is equivalent
to λ1 (t) and λ1 (t) + λ2 (t) being constant. For the latter, one applies Theorem 3.4 to both
The authors declare the following financial interests/personal relationships which may
be considered as potential competing interests: Hugo Woerdeman reports financial sup-
port was provided by National Science Foundation grant DMS 2000037. Madelein van
Straaten reports financial support was provided by National Graduate Academy for
Mathematical and Statistical Sciences (NGA(MaSS)). If there are other authors, they
declare that they have no known competing financial interests or personal relationships
that could have appeared to influence the work reported in this paper.
Data availability
Acknowledgements
We thank Professors Jonah Blasiak and Darij Grinberg for discussions on symmetric
polynomials. We gratefully acknowledge funding from the National Graduate Academy
for Mathematical and Statistical Sciences (NGA(MaSS)).
References
[1] Sara C. Billey, Brendon Rhoades, Vasu Tewari, Boolean product polynomials, Schur positivity, and
Chern plethysm, J. Int. Math. Res. Not. 21 (2021) 16636–16670.
[2] Mao Ting Chien, Hiroshi Nakazato, Reduction of the c-numerical range to the classical numerical
range, Linear Algebra Appl. 434 (3) (2011) 615–624.
[3] Mao Ting Chien, Bit Shun Tam, Circularity of the numerical range, Linear Algebra Appl. 201 (1994)
113–133.
[4] M.D. Choi, D.W. Kribs, K. Życzkowski, Higher-rank numerical ranges and compression problems,
Linear Algebra Appl. 418 (2006) 828–839.
[5] M.D. Choi, M. Giesinger, J.A. Holbrook, D.W. Kribs, Geometry of higher-rank numerical ranges,
Linear Multilinear Algebra 56 (2008) 53–64.
[6] G. Dirr, U. Helmke, M. Kleinsteuber, T. Schulte-Herbrüggen, Relative C-numerical ranges for appli-
cations in quantum control and quantum information, Linear Multilinear Algebra 56 (2008) 27–51.
[7] Igor Klep, Victor Magron, Jurij Volčič, Optimization over trace polynomials, Ann. Henri Poincaré
23 (1) (2022) 67–100.
A. van der Merwe et al. / Linear Algebra and its Applications 689 (2024) 247–259 259
[8] Igor Klep, James Eldred Pascoe, Jurij Volčič, Positive univariate trace polynomials, J. Algebra 579
(2021) 303–317.
[9] Chi-Kwong Li, C-numerical ranges and C-numerical radii, Linear Multilinear Algebra 37 (1–3)
(1994) 51–82.
[10] Chi-Kwong Li, Nung-Sing Sze, Canonical forms, higher rank numerical ranges, totally isotropic
subspaces, and matrix equations, Proc. Am. Math. Soc. 136 (9) (2008) 3013–3023.
[11] Chi-Kwong Li, Nam-Kiu Tsing, Matrices with circular symmetry on their unitary orbits and C-
numerical ranges, Proc. Am. Math. Soc. 111 (1) (1991) 19–28.
[12] Valentin Matache, Mihaela T. Matache, When is the numerical range of a nilpotent matrix circular?,
Appl. Math. Comput. 216 (2010) 269–275.
[13] I.G. Macdonald, Symmetric Functions and Hall Polynomials, second ed., Clarendon Press, Oxford,
1995.
[14] Edward Poon, Ilya M. Spitkovsky, Hugo J. Woerdeman, Factorization of singular matrix polynomials
and matrices with circular higher rank numerical ranges, SIAM J. Matrix Anal. Appl. 43 (3) (2022)
1423–1439.
[15] Edward Poon, Hugo J. Woerdeman, Isospectrality and matrices with concentric circular higher rank
numerical ranges, Linear Algebra Appl. 631 (2021) 174–180.
[16] C. Procesi, The invariant theory of n × n matrices, Adv. Math. 19 (3) (1976) 306–381.
[17] Hugo J. Woerdeman, The higher rank numerical range is convex, Linear Multilinear Algebra 56
(2008) 65–67.