Professional Documents
Culture Documents
F Casas A Murua and M Nadinic (Efficient Computation of The Zassenhaus Formula)
F Casas A Murua and M Nadinic (Efficient Computation of The Zassenhaus Formula)
F Casas A Murua and M Nadinic (Efficient Computation of The Zassenhaus Formula)
0010-4655/$ – see front matter © 2012 Elsevier B.V. All rights reserved.
doi:10.1016/j.cpc.2012.06.006
F. Casas et al. / Computer Physics Communications 183 (2012) 2386–2391 2387
Theorem 1.1 (Zassenhaus Formula). Let L(X , Y ) be the free Lie All of these proposals give results for Cn as a linear combination
algebra generated by X and Y . Then, eX +Y can be uniquely decomposed of nested commutators. In contrast, Scholz and Weyrauch [19]
as proposed a recursive procedure based on upper triangular matrices
∞
that can be easily implemented in a symbolic algebra package.
In this case, however, the expressions for Cn are not explicitly
eCn (X ,Y )
eX + Y = eX eY
written down in terms of homogeneous Lie polynomials. More
n =2
recently [20], the same authors have applied a technique related
X Y C2 (X ,Y ) C3 (X ,Y )
=e e e e . . . eCn (X ,Y ) . . . , (1.2) to one previously used by Witschel [21] to get Cn up to n = 15 in
less than 2 min of CPU time. Here also the Zassenhaus terms are
where Cn (X , Y ) ∈ L(X , Y ) is a homogeneous Lie polynomial in X and written as
Y of degree n.
Cn = gw w, (1.5)
The existence of such a formula is an immediate consequence w,|w|=n
of the BCH theorem. In fact, it is clear that e−X eX +Y = eY +D , where
where gw is a rational coefficient and the sum is taken over all
D involves Lie polynomials of degree > 1. Now e−Y eY +D = eC2 +D̃ ,
words w with length |w| = n in the symbols X and Y , i.e., w =
where D̃ involves Lie polynomials of degree > 2 and the process is a1 a2 · · · an , each ai being X or Y . Of course, one may always apply
repeated again. Induction allows one to get the general result. the Dynkin–Specht–Wever theorem [22], and express Cn as
By comparing with the BCH formula it is possible to obtain the
first terms of the formula (1.2) as 1
Cn = gw [w], (1.6)
1 n w,|w|=n
C2 (X , Y ) = − [X , Y ],
2 that is, the individual terms are the same as in the associative series
1 1 (1.5) except that the word w = a1 a2 . . . an is replaced with the
C3 (X , Y ) = [Y , [X , Y ]] + [X , [X , Y ]]. right nested commutator [w] = [a1 , [a2 , . . . [an−1 , an ] . . .]] and
3 6
the coefficient gw is divided by the word length n. In this way Cn
Although less familiar than the BCH formula, the Zassenhaus
is constructed as a linear combination of nested commutators of
formula constitutes nevertheless a standard tool in several fields,
homogeneous degree n, that is, as a linear combination of elements
since the disentangling of the exponential of the sum of two of the homogeneous subspace L(X , Y )n of degree n of the free
non-commuting operators into an (in general infinite) product of Lie algebra L(X , Y ). As a matter of fact, another representation of
exponential operators arises for instance in statistical mechanics, (1.5) in terms of nested commutators is proposed in [20] which,
many-body theories, quantum optics, path integrals, q-analysis in it is claimed, contains less terms than the Dynkin–Specht–Wever
quantum groups, etc. [12]. Also in particle accelerators physics, representation. In any case, it should be stressed that the set
Dragt and his collaborators have used the Zassenhaus formula to of nested commutators [w] for words w of length n in either
compute the relevant maps both in Taylor and factored product representation is not a basis of the homogeneous subspace
form [13]. In yet another context, very recently Iserles and L(X , Y )n .
Kropielnicka [14] have proposed a new family of high-order The purpose of this paper is twofold. First, to present a new
splitting methods for the numerical integration of the time- recurrence that allows one to express the Zassenhaus terms Cn
dependent Schrödinger equation based on a symmetric version of directly as a linear combination of independent elements of the
the Zassenhaus formula. homogeneous subspace L(X , Y )n . In other words, the procedure,
Several systematic computations of the terms Cn for n > 3 in which can be easily implemented in a symbolic algebra package,
the Zassenhaus formula have been carried out in the literature, gives Cn up to a prescribed degree directly in terms of independent
starting with the work of Wilcox [7], where a recursive procedure commutators involving n operators X and Y . In this way, no
is presented that has been subsequently used to get explicit rewriting process in a basis of L(X , Y ) is necessary, thus saving
expressions up to C6 in terms of nested commutators [12]. On considerable computing time and memory resources. Moreover,
the other hand, Volkin [15] proposed a general technique to we are able to express directly Cn with the minimum number of
express a function of non-commuting operators as an expansion commutators required at each degree n.
in successively higher commutators of the operators involved. In The second aspect we are dealing with concerns the conver-
particular, he was able to get recursive formulae up to C9 . By gence of the Zassenhaus formula when it is formulated in a Banach
following an idea already suggested by Wilcox in [7], Suzuki [16] algebra. As far as we know, there are only two previous results in
obtained the successive terms Cn (X , Y ) in the literature. The first one was obtained by Suzuki [16] starting
with the recurrence (1.4). Specifically, he was able to prove that
eλ(X +Y ) = eλX eλY eλ C2 eλ C3 · · · if |λ| (∥X ∥ + ∥Y ∥) ≤ log 2 − 1/2 the infinite product (1.3) con-
2 3
(1.3)
verges. Subsequently, Bayen [23] generalized the analysis, show-
by differentiating both sides with respect to λ and setting λ = 0 ing that the product (1.3) converges if |λ| (∥X ∥ + ∥Y ∥) ≤ r, where
after each differentiation. In this way r ≈ 0.596705 is a root of a certain transcendental equation. In
1
d2
1 the present work, we obtain sharper bounds for the terms of the
−λY −λX λ(X +Y )
C2 = (e e e ) = [Y , X ] Zassenhaus formula which show that the product (1.2) converges
2 dλ2 λ=0 2 in an enlarged domain.
d3
1 1 A simple but important remark is in order here. In some
e−λY e−λX eλ(X +Y ) )
2C
C3 = (e−λ 2
= [C2 , X + 2Y ] applications, the ‘‘left-oriented’’ Zassenhaus formula
3! dλ3 λ=0 3
and in general, for n ≥ 3, eX +Y = · · · eĈ4 (X ,Y ) eĈ3 (X ,Y ) eĈ2 (X ,Y ) eY eX (1.7)
1
dn −λn−1 Cn−1
is used instead of (1.2). A simple observation shows that the
−λ2 C2 −λY −λX λ(X +Y )
Cn = ( e · · · e e e e ) . (1.4) exponents Ĉi and Ci are related through
n! dλn λ=0
Ĉi (X , Y ) = (−1)i+1 Ci (X , Y ), i≥2
Finally, Baues [17] gave explicit formulae for the Zassenhaus terms
via homotopy theory and more recently Kurlin [18] obtained a and so we may restrict ourselves to analyzing the ‘‘right-oriented’’
closed expression for n≥2 Cn in the metabelian case. formula (1.2).
2388 F. Casas et al. / Computer Physics Communications 183 (2012) 2386–2391
2. Constructing the Zassenhaus terms Let us analyze in more detail this procedure, with the goal of
providing an algorithm well adapted from a computational point
2.1. A new recurrence of view.
For n = 1, and taking into account (2.2), we get
To derive our recursive procedure, it is convenient to introduce F1 (λ) = −Y − e−λY X eλY + e−λY e−λX (X + Y )eλX eλY
a parameter λ as in (1.3), = −Y − e−λadY X + e−λadY e−λadX (X + Y )
eλ(X +Y ) = eλX eλY eλ C2 eλ C3 eλ C4 · · ·
2 3 4
(2.1) = e−λadY (e−λadX − I )Y ,
that is,
so that the original Zassenhaus formula (1.2) is recovered when
λ = 1. Moreover, we consider the compositions ∞
∞
(−λ)i+j j
F1 (λ) = adiY adX Y (2.11)
−λY −λX λ(X +Y ) i!j!
R1 (λ) = e e e (2.2) i=0 j=1
Rn (λ) = e −λn C n
···e −λ2 C 2
e−λY e−λX eλ(X +Y )
∞
k
(−1)k k−j j
F1 (λ) = f1,k λk , with f1,k = adY adX Y . (2.12)
nC j!(k − j)!
= e−λ n
Rn−1 (λ). (2.3) k=1 j =1
It is then clear that In general, from (2.6) a straightforward calculation shows that for
n ≥ 2,
Rn (λ) = eλ Cn+1 eλ Cn+2 · · · .
n+1 n+2
(2.4) ∞
Fn (λ) = fn,k λk ,
Finally, we introduce k=n
/n]−1
d [k
(−1)j j
Fn (λ) ≡ Rn (λ) Rn (λ)−1 , n ≥ 1. (2.5) with fn,k = adCn fn−1,k−nj , k ≥ n. (2.13)
dλ j!
j =0
To determine the series Fn (λ) we proceed as follows. On the one
Here [k/n] denotes the integer part of k/n. Moreover, a closer
hand, a simple calculation starting from (2.3), leads for n ≥ 2 to
examination of (2.7) reveals that
d
Rn−1 (λ) Rn−1 (λ)−1 eλ
nC nC
Fn (λ) = −nCn λn−1 + e−λ λn+1 adCn+1
Fn (λ) = (n + 1)Cn+1 λn + (n + 2)e Cn+2 λn+1 + · · ·
n n
dλ
= (n + 1)Cn+1 λ + (n + 2)Cn+2 λ
n
+ ··· n +1
Fn−1 (λ)eλ
nC nC
= −nCn λn−1 + e−λ n n
n ad
+ (2n + 2)C2n+2 λ2n+1 + λ2n+2 [Cn+1 , Cn+2 ] + · · ·
= −nCn λn−1 + e−λ Cn
Fn−1 (λ) 2n+2
=e −λn adCn
(Fn−1 (λ) − nCn λn−1 ), (2.6) = kCk λk−1 + λ2n+2 Hn (λ), (2.14)
k=n+1
where we have used the well known formula
where Hn (λ) involves commutators of Cj , j ≥ n + 1. Notice that
1
eA Be−A = eadA B = adnA B the terms Cn+1 , . . . , C2n+2 of the Zassenhaus formula can be then
n≥0
n ! directly obtained from Fn (λ). In particular, one directly gets from
(2.11)
with
1 1
n−1
(−1)n n−j
adA B = [A, B],
j
adA B = [A, adA B],
j−1
ad0A B = B. Cn+1 = f1,n = adiY adX Y ,
n+1 n + 1 i=0 i!(n − j)!
On the other hand, differentiating expression (2.4) with respect
to λ and taking into account (2.5) we arrive at for n = 1, 2, 3. (2.15)
Explicitly,
Fn (λ) = (n + 1)Cn+1 λn
1
∞
λj−1 adCj−1 C2 = − [X , Y ],
j −1 λ
n+1 ad
+ jλ e Cn+1
···e Cj . (2.7) 2
j =n +2 1 1
C3 = [Y , [X , Y ]] + [X , [X , Y ]],
In other words, 3 6
1
Fn (λ) = (n + 1)Cn+1 λn + Gn+1 (λ), n ≥ 1, C4 = − ([Y , [Y , [X , Y ]]] + [Y , [X , [X , Y ]]])
8
(1) (n) 1
where Gn+1 (0) = Gn+1 (0) = ··· = Gn+1 (0) = 0. In consequence, − [X , [X , [X , Y ]]].
we have, for n ≥ 1, 24
Taking into account (2.13) and (2.14) we have in general
−λn+1 adCn+1
Fn+1 (λ) = e Gn+1 (λ), (2.8) 1
C n +1 = f[n/2],n n ≥ 5, (2.16)
1 (n) n+1
Cn+1 = Fn (0), (2.9)
(n + 1)! where the expressions of fn,k are given recursively by (2.13).
λn (n) In summary, the algorithm we propose for computing the
Gn+1 (λ) = Fn (λ) − Fn (0). (2.10) Zassenhaus terms is the following:
n!
Expressions (2.8)–(2.10) allow one to compute recursively the Define f1,k by Eq. (2.12)
Zassenhaus terms Cn starting from F1 (λ). The sequence is Cn = (1/n)f1,n−1 , n = 2, 3, 4
(2.17)
Define fn,k n ≥ 2, k ≥ n by Eq. (2.13)
Fn (λ) −→ Cn+1 −→ Gn+1 (λ) −→ Fn+1 (λ) −→ · · · , n ≥ 1. Cn = (1/n)f[(n−1)/2],n−1 n ≥ 5.
F. Casas et al. / Computer Physics Communications 183 (2012) 2386–2391 2389
This constitutes a new recursive way for obtaining directly the (i0 , . . . , in ) ∈ Jn \ In })
term Cn as a homogeneous Lie polynomial in X , Y of arbitrarily large ⊂ span({X , Y , C2 , . . . , Cn+1 })
degree n which can be easily implemented with a symbolic algebra i i i i
package. ⊕ L({adCnn++11 · · · adC22 adY1 adX0 Y :
(i0 , . . . , in+1 ) ∈ Jn+1 }).
2.2. Linear independence i i i
In consequence, the terms {adCmm · · · adC22 adY1 adX0 Y : (i0 , i1 , . . . ,
i
Algorithm (2.17), or equivalently the procedure given by im ) ∈ Jn } are linearly independent in the free Lie algebra L(X , Y )
the identities (2.8)–(2.10), provides expressions for Cn that, by and the same is true for the representation (2.19) of the Zassenhaus
construction, involve only independent commutators. In other terms.
words, they cannot be simplified further by using the Jacobi
identity and the antisymmetry property of the commutator. 2.3. Computational aspects
In order to prove this assertion, it is convenient to get a more
explicit expression of Fn (λ) and Cn+1 from (2.8)–(2.10). To this end, We have implemented the recursive procedure (2.17) in
consider for n ≥ 1 the sets Jn and In of (n + 1)-tuples of non- MathematicaTM as the following algorithm.
negative integers recursively defined as follows:
Clear[Cmt, ad, ff, cc];
J1 = {(i0 , i1 ) ∈ N2 : i0 ≥ 1}, $ RecursionLimit= 1024;
In = {(i0 , i1 , . . . , in ) ∈ Jn : i0 + i1 + 2i2 + · · · + nin = n}, Cmt[a_, a_]:= 0;
Cmt[a___, 0, b___]:= 0;
Jn+1 = (Jn \ In ) × N. Cmt[a___, c_ + d_, b___] := Cmt[a, c, b]
The set In can be directly defined as the set of (n + 1)-tuples of + Cmt[a, d, b];
non-negative integers satisfying that i0 + i1 + 2i2 + · · · + nin = n Cmt[a___, n_ c_Cmt, b___]:= n Cmt[a, c, b];
and Cmt[a___, n_ X, b___]:= n Cmt[a, X, b];
Cmt[a___, n_ Y, b___]:= n Cmt[a, Y, b];
j + 1 ≤ i0 + i1 + 2i2 · · · + jij for j = 0, . . . , n − 1. (2.18)
Cmt /: Format[Cmt[a_, b_]]:= SequenceForm
Thus, in particular, I1 = {(1, 0)}, I2 = {(1, 1, 0), (2, 0, 0)}, etc. ["[", a, ",", b, "]"];
Observe that, by construction, each (i0 , i1 , . . . , in ) ∈ In is such that
im = 0 if m > n/2. ad[a_, 0, b_]:= b;
From (2.8)–(2.10), one can then prove by induction on n that, ad[a_, j_Integer, b_]:= Cmt[a, ad[a, j-1, b]];
for n ≥ 1, ff[1, k_]:= ff[1, k] =
Sum[((-1)^k/(j! (k-j)!)) ad[Y, k-j,
(−1)i0 +···+in λi0 +i1 +2i2 +···+nin
Fn (λ) = ad[X, j, Y]], {j, 1, k}];
(i0 ,i1 ,...,in )∈Jn
i0 !i1 ! · · · in ! cc[2] = (\chem{1/2}) ff[1, 1];
i
× adiCnn · · · adC22 adY1 adX0 Y ,
i i ff[p_, k_]:= ff[p, k] =
Sum[((-1)^j/j!) ad[cc[p], j, ff[p-1, k - p j]],
1 (−1)i0 +···+in {j, 0,
Cn+1 = IntegerPart[k/p] - 1}];
n + 1 (i ,i ,...,i )∈I i0 !i1 ! · · · in !
0 1 n n cc[p_Integer]:= cc[p] =
i i i
× adiCnn · · · adC22 adY1 adX0 Y . (2.19) Expand[(1/p) ff[IntegerPart[(p-1)/2], p-1]];
In fact, this is clearly true for n = 1 (Eqs. (2.11) and (2.15), The object Cmt [x1 , x2 , . . . , xn−1 , xn ] refers to the nested
respectively), whereas successive application of (2.8)–(2.10) leads commutator [x1 , [x2 , . . . [xn−1 , xn ] · · ·]]. It just has attached the
to the general result. linearity property (there is no need to attach to it the antisymmetry
Now, repeated application of the Lazard elimination principle property and the Jacobi identity). The symbol ad represents
i
[24], together with I1 = {(1, 0)}, {C2 } = {− 21 [X , Y ]} = {− 12 adY1 the adjoint operator and its powers adja b, whereas ff[1,k],
i
adX0 Y : (i0 , i1 ) ∈ I1 }, shows that, as a vector space,
ff[p,k] and cc[p] correspond to expressions (2.12), (2.13) and
(2.16), respectively. Proceeding in this way, we have obtained the
j
L(X , Y ) = span({X }) ⊕ L({adX Y : j ≥ 0}) expressions of Cn up to n = 20 with a reasonable computational
time and memory requirements. Thus, for instance, constructing
= span({X }) ⊕ span({Y })
the terms up to degree n = 20 with a personal computer (2.4 GHz
⊕ L({adiY adiX Y : i ≥ 0, j ≥ 1}) Intel Core 2 Duo processor with 2 GB of RAM) takes less than 20 s of
i i
= span({X , Y }) ⊕ L({adY1 adX0 Y : (i0 , i1 ) ∈ J1 }) CPU time and 35 MB of memory. The expression for C20 has 48,528
terms, all of them independent. The resulting expressions up to C8
= span({X , Y })
are identical to those expressed in the classical Hall basis.
i i
⊕ L({C2 } ∪ {adY1 adX0 Y : (i0 , i1 ) ∈ J1 \ I1 }) In Table 1 we collect the CPU time (in seconds) and memory
= span({X , Y , C2 }) (in MB) needed to construct the terms C2 , C3 , . . . , Cn up to a
i i i given value of n both with the recurrence (2.17) (New) and
⊕ L({adC22 adY1 adX0 Y : (i0 , i1 , i2 ) ∈ J2 }). the implementation provided in [20] using a variant of the so-
More generally, application of Lazard elimination together with called comparison method previously introduced in [21] (W-S).
(2.19) gives Notice that with this method, which is the most efficient of
all the procedures analyzed in [20], the Zassenhaus exponents
L(X , Y ) ⊂ span({X , Y , C2 , . . . , Cn }) Cj are expressed as linear combinations of words of length j
i i i
⊕ L({adiCnn · · · adC22 adY1 adX0 Y : (i0 , . . . , in ) ∈ Jn }) and not directly in terms of independent commutators (although
this is always possible by applying the Dynkin–Specht–Wever
⊂ span({X , Y , C2 , . . . , Cn }) theorem or Theorem 2 in [20], and then simplifying the resulting
i i i
⊕ L({Cn+1 } ∪ {adiCnn · · · adC22 adY1 adX0 Y : expressions by taking into account the Jacobi identity and the
2390 F. Casas et al. / Computer Physics Communications 183 (2012) 2386–2391
∥XY ∥ ≤ ∥X ∥ ∥Y ∥ (3.1) 2k
= y((x + y)k − yk )
k!
for any two elements of A. Notice that for the commutator one
/n]−1
has ∥[X , Y ]∥ ≤ 2∥X ∥ ∥Y ∥. Then it makes sense to analyze the [k
2j
convergence of the Zassenhaus formula (1.2). ∥ f n ,k ∥ ≤ d n ,k = δnj dn−1,k−nj (3.6)
j =0
j!
As stated in the introduction, we are aware of only two previous
results establishing sufficient conditions for convergence of the whence
form ∥X ∥ + ∥Y ∥ < r with a given r > 0. Specifically, Suzuki [16]
obtained rs = log 2 − 12 ≈ 0.1931, whereas Bayen [23] proved 1
∥C n ∥ ≤ δ n = d[(n−1)/2],n−1 , n ≥ 3. (3.7)
that the domain of convergence can be enlarged up to a value of rb n
given by the unique positive solution of the equation A sufficient condition for convergence is obtained by imposing
z
e2 w − 1
z2 1+2 dw = 4(2 log 2 − 1). δn+1
w lim < 1. (3.8)
0 n→∞ δn
A numerical computation shows that rb = 0.59670569 . . .. Thus At this point it is worth remarking that, although not reflected by
for ∥X ∥ + ∥Y ∥ < rb one has the notation, both dn,k and δn depend on (x, y) = (∥X ∥, ∥Y ∥),
so condition (3.8) implies in fact a constraint on the convergence
lim eX eY eC2 · · · eCn = eX +Y . (3.2)
n→∞ domain (x, y) ∈ R2 of the Zassenhaus formula. In Fig. 1, we show
In the following, we use recursion (2.8)–(2.10) to show that (3.2) graphically the (numerically computed) domain D of such points
holds indeed for (x, y) ≡ (∥X ∥, ∥Y ∥) ∈ R2 in a domain that is (x, y). This has been obtained by computing for each point the
larger than {(x, y) ∈ R2 : 0 ≤ x + y < rb }. coefficients dn,k and δn up to n = 1000 (in fact, considering a
Clearly, (3.2) holds if smaller value of n the figure does not change significantly). We
have also included for comparison the previous results x + y <
lim ∥Rn (1)∥ = 1, (3.3) 0.1931 and x + y < 0.5967 of Suzuki and Bayen, respectively.
n→∞
Observe that the new convergence domain is considerably larger.
where Rn (λ) is given by (2.4), and thus is the solution of the initial In particular, it contains the region x + y < 1.054, but it is not
value problem restricted to that. For instance, the convergence domain contains
d the sets {(x, 2.9216) : x < 0.00292} and {(2.893, y) : y <
Rn (λ) = Fn (λ)Rn (λ), Rn (0) = I . (3.4) 0.0145}, and also the points (x, 0) and (y, 0) with arbitrarily large
dλ
1 values of x or y.
It is well known that, if 0 ∥Fn (λ)∥dλ < ∞, then there exists In summary, we have presented a new recursive procedure
a unique solution Rn (λ) of (3.4) for 0 ≤ λ ≤ 1, and that that not only allows us to get the expressions of the Zassenhaus
1
∥Rn (1)∥ ≤ exp( 0 ∥Fn (λ)∥dλ). In consequence, convergence (3.3) exponents Cn directly in terms of independent commutators in
will be guaranteed whenever (x, y) = (∥X ∥, ∥Y ∥) ∈ R2 is such an efficient way and can be easily implemented in any symbolic
that algebra package, but also shows, by bounding appropriately
each term in the recursion, that the convergence domain of
1
the Zassenhaus formula is considerably larger than the domain
lim ∥Fn (λ)∥dλ = 0.
n→∞ 0 guaranteed by previously known results.
From (2.19) we have that ∥Cn+1 ∥ ≤ δn+1 , where δ2 = x y and for
n ≥ 2, Acknowledgments
i0 +···+in
1 2 i
δn+1 = δnin · · · δ22 yi1 xi0 y. This work has been partially supported by Ministerio de
n + 1 (i ,i ,...,i )∈I i0 !i1 ! · · · in ! Ciencia e Innovación (Spain) under project MTM2010-18246-C03
0 1 n n
F. Casas et al. / Computer Physics Communications 183 (2012) 2386–2391 2391