1 s2.0 S0045794909003022 Main

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 12

Computers and Structures 88 (2010) 446–457

Contents lists available at ScienceDirect

Computers and Structures


journal homepage: www.elsevier.com/locate/compstruc

Performance of partitioned procedures in fluid–structure interaction


Joris Degroote a, Robby Haelterman b, Sebastiaan Annerel a, Peter Bruggeman c, Jan Vierendeels a,*
a
Department of Flow, Heat and Combustion Mechanics, Ghent University, Sint-Pietersnieuwstraat 41, B-9000 Ghent, Belgium
b
Department of Mathematics (MWMW), Royal Military Academy, Renaissancelaan 30, B-1000 Brussels, Belgium
c
Department of Applied Physics, Ghent University, Jozef Plateaustraat 22, B-9000 Ghent, Belgium

a r t i c l e i n f o a b s t r a c t

Article history: Partitioned simulations of fluid–structure interaction can be solved for the interface’s position with New-
Received 14 August 2009 ton–Raphson iterations but obtaining the exact Jacobian is impossible if the solvers are ‘‘black boxes”. It is
Accepted 22 December 2009 demonstrated that only an approximate Jacobian is needed, as long as it describes the reaction to certain
Available online 1 February 2010
components of the error on the interface’s position. Based on this insight, a quasi-Newton coupling algo-
rithm with an approximation for the inverse of the Jacobian (IQN-ILS) has been developed and compared
Keywords: with a monolithic solver in previous work. Here, IQN-ILS is compared with other partitioned schemes
Fluid–structure interaction
such as IBQN-LS, Aitken relaxation and Interface-GMRES(R).
Partitioned
Quasi-Newton
Ó 2009 Elsevier Ltd. All rights reserved.
Least-squares model
Aitken relaxation
Interface-GMRES(R)

1. Introduction equations or structural equations. On the other hand, a coupling


algorithm is required to take into account the interaction between
Recent research on fluid–structure interaction (FSI) has yielded flow and motion of the structure in partitioned simulations.
both more complex applications and algorithmic improvements. Both Newton–Raphson methods and fixed-point methods can
Interesting applications of FSI can be found in aeronautics [1,2] be used to solve FSI problems and other multi-physics problems
where FSI is used to assess flutter. Other challenging problems with adjacent, non-overlapping domains like soil–structure inter-
are the dynamic analysis of parachutes [3] due to the thin, highly action [14,15], but also the particle finite element method [16],
flexible structure and the interaction between a structure and smooth particle hydrodynamics [17,5], immersed boundary meth-
free-surface flow [4,5]. Also of great interest are biomedical appli- ods [18] and fictitious domain methods [19] have been used to
cations like blood flow in arteries [6–8] and in artificial heart solve FSI problems. The prefix interface is added to the classifica-
valves [9–11]. tion if the method only operates on variables related to the
An FSI problem consists of a fluid domain and a structural do- fluid–structure interface and the prefix block before Newton–Raph-
main with in between the fluid–structure interface. This multi- son denotes that the Jacobian consists of distinct blocks. In the fol-
physics problem with adjacent domains can be simulated in a lowing paragraphs, an overview of some existing methods is given.
monolithic or in a partitioned way. The former signifies that both Block Newton–Raphson methods can be used in both the mono-
the flow equations and structural equations are solved simulta- lithic [12,13,20] and the partitioned [21,22] approach. These meth-
neously [12,13] while the latter means that they are solved sepa- ods solve the nonlinear flow equations and the structural
rately. The monolithic approach requires a code developed for equations for the variables in the entire fluid and solid domain
this particular combination of physical problems, whereas the par- with the Newton–Raphson method. As the flow problem is solved
titioned approach preserves software modularity. Moreover, the on a moving grid, the Jacobian of the flow equations with respect to
partitioned approach allows to solve the flow equations and the the interface’s position, the so-called shape derivative, is difficult
structural equations with different, possibly more efficient tech- to calculate. However, it is possible to calculate the complete Jaco-
niques which have been developed specifically for either flow bian exactly [23]. The linear system within the Newton–Raphson
iteration can also be solved without knowledge of the Jacobian
with a matrix-free Krylov solver, using a finite-difference approxi-
* Corresponding author.
mation of the Jacobian-vector product [20,21].
E-mail addresses: Joris.Degroote@UGent.be (J. Degroote), Jan.Vierendeels@
UGent.be (J. Vierendeels). Whereas block-Newton–Raphson methods solve the flow and
URL: http://www.FSI.UGent.be/ (J. Degroote). structural problem for the state in the entire fluid and solid

0045-7949/$ - see front matter Ó 2009 Elsevier Ltd. All rights reserved.
doi:10.1016/j.compstruc.2009.12.006
J. Degroote et al. / Computers and Structures 88 (2010) 446–457 447

domain, Fernandez and Moubachir [24] rewrote the fluid–struc- Based on this insight, a quasi-Newton coupling algorithm with
ture problem as a nonlinear problem in the state of the structure, an approximation for the inverse of the Jacobian has been devel-
with the flow state as internal variables of the problem. This sys- oped [39]. Because the inverse of the Jacobian is approximated,
tem is subsequently solved with the Newton–Raphson method one avoids that a linear system with as dimension the number of
using the exact Jacobian. degrees-of-freedom in the interface’s position has to be solved in
It is also possible to reformulate an FSI problem as a system every quasi-Newton iteration. The approximation is constructed
with only the degrees-of-freedom in the interface’s position as un- with the least-squares technique developed for nonlinear systems
knowns and with all remaining variables in the fluid and solid do- by Vierendeels et al. [8] and applied to linear systems by Hael-
main as internal variables. This domain decomposition condenses terman et al. [40]. A matrix-free implementation of the least-
the error of the FSI problem into a subspace related to the interface squares technique is described in this work. The term ‘‘matrix-free”
[25]. The FSI problem can hence be written as either a root-finding denotes that the approximate Jacobian is not constructed explicitly
problem or a fixed-point problem, with the interface’s position as but that a procedure to calculate the product of the Jacobian with a
unknowns. vector is given such that less memory and CPU time is required.
Interface Newton–Raphson methods solve this root-finding Moreover, information from previous time steps can be reused in
problem with Newton–Raphson iterations, e.g. with an approxima- the least-squares approximation and the results demonstrate that
tion of the Jacobian from a linear reduced-physics model [26,27]. this extension greatly accelerates the convergence of the quasi-
The linear system within the Newton–Raphson iteration can also Newton iterations. The quasi-Newton algorithm presented in [39]
be solved with a matrix-free Krylov solver, for example the gener- is called IQN-ILS because it calculates the Interface position using
alized minimal residual method (GMRES), using an approximation Quasi-Newton iterations with an approximation for the Inverse
of the Jacobian-vector product based on finite-differences or with a of the Jacobian from a Least-Squares model.
linear combination of the previous residual vectors in Interface- Other techniques that are able to couple black-box solvers are
GMRES(R) [25,28,29]. Aitken relaxation, Interface-GMRES and IBQN-LS, all of which have
The fixed-point problem can be solved with fixed-point itera- been introduced above. By comparing the algorithms of these tech-
tions, also called (block) Gauss–Seidel iterations [22], which means niques with the IQN-ILS algorithm, it is shown that the IQN-ILS
that the flow problem and structural problem are solved succes- method can easily be implemented in an FSI framework that cur-
sively until the change is smaller than the convergence criterion. rently uses (Aitken) relaxation or Interface-GMRES(R) and that
However, the iterations converge slowly if at all, especially when IQN-ILS is more straightforward to implement than IBQN-LS.
the interaction between the fluid and the structure is strong due In the results section, the performance of all these techniques is
to a high fluid/structure density ratio or the incompressibility of compared in two cases, both in terms of how often the flow prob-
the fluid [7]. The convergence of the fixed-point iterations can be lem and structural problem have to be solved within one time step
stabilized and accelerated by Aitken relaxation and steepest des- and in terms of the total simulation time. The first case is the stea-
cent relaxation, which adapt the relaxation factor in every iteration dy and unsteady two-dimensional (2D) simulation of a flexible
based on the previous iterations [30]. If the interaction between beam behind a cylinder in a laminar flow, which is validated with
the fluid and the structure is weak, only one fixed-point iteration a well-known benchmark [41]. The second case is the propagation
is required within each time step [1,31–35]. These so-called stag- of a pressure pulse in a three-dimensional (3D) flexible tube. These
gered or loosely coupled methods do not enforce the equilibrium simulations indicate that the IQN-ILS is faster than Aitken relaxa-
on the fluid–structure interface within one time step but they tion and Interface-GMRES(R) and that the performance of IQN-ILS
are suitable for aeroelastic simulations with a heavy and rather and IBQN-LS is similar.
stiff structure. In previous work [39], the performance of the IQN-ILS algorithm
Vierendeels et al. [8] rewrite the FSI problem as a system of was compared with a monolithic Newton solver in five different
equations with both the interface’s position and the stress distribu- cases considering various structural configurations and incom-
tion on the interface as unknowns, and this system is solved with pressible fluids. For each case when convergence was reached,
block quasi-Newton iterations of the Gauss–Seidel type. The Jaco- the ratio of the time for the IQN-ILS simulation to the time for
bians of the flow solver and structural solver are approximated the monolithic Newton simulation was between 1/2 and 4, but
by means of least-squares models, constructed with the position there was a case for which the partitioned simulation did not con-
of the fluid–structure interface and the stress distribution on the verge. While problems of various characteristics were solved, still,
interface in all previous quasi-Newton iterations within one time only specific problems were considered and in general rather small
step [8,36]. This method will be referred to as IBQN-LS, meaning problems in number of equations. Moreover, the solutions of the
Interface Block Quasi-Newton with an approximation for the Jaco- structural equations and the flow equations were calculated using
bians from Least-Squares models. a direct sparse solver with full Newton–Raphson iterations
In this work, the focus lies on partitioned methods and more although different solver schemes, in particular much more effi-
specifically on methods that couple a black-box flow solver and cient for the fluid equations when the number of elements be-
structural solver. When the solvers are black boxes, it is difficult comes very large [42], are frequently used in the partitioned
or even impossible to obtain the Jacobian matrices which are re- approach. The performance comparisons may consequently look
quired in Newton–Raphson methods. However, a recent stability different when different problems are solved and other solver
analysis [37,38] on the unsteady, incompressible and inviscid flow schemes are used. However, the general observations given in
in a straight elastic tube has demonstrated that only certain com- [39] can be used to assess whether a monolithic or partitioned
ponents of the error on the interface’s position become unstable or solution of a fluid–structure interaction problem might be more
are badly damped during Gauss–Seidel iterations between the flow effective.
solver and the structural solver. As will be explained in this work, The remainder of this paper is organized as follows. After pro-
this means that if quasi-Newton iterations are used, the approxi- viding the necessary definitions in Section 2, Section 3 explains
mate Jacobian only has to describe the reaction to those unstable why an approximation for the Jacobian can be used. Section 4 de-
or badly damped components; other components of the error will scribes the IQN-ILS technique and the matrix-free implementation
be damped anyhow during the coupling iterations. The complete of the least-squares approximation with reuse of information from
Jacobian is thus not required for fast convergence of the coupled previous time steps. In Sections 5–7, IQN-ILS is compared with,
problem; an approximation of the Jacobian can be used instead. respectively, IBQN-LS, Aitken relaxation and Interface-GMRES(R).
448 J. Degroote et al. / Computers and Structures 88 (2010) 446–457

The performance analysis of the coupling schemes is subsequently next coupling iteration. This tilde is dropped once the final value
given in Section 8, followed by the conclusion in Section 9. that will be used in the next iteration has been calculated. Depend-
ing on the context, the equality sign can denote either assignment
2. Definitions or equality.
All coupling algorithms begin the next time step from an
This section gives a definition of the functions that represent extrapolation of the interface’s position
the flow solver and structural solver, as they are used by interface 5 n 1
Newton–Raphson and fixed-point methods. The governing equa- x0 ¼ ð xÞ  2ðn1 xÞ þ ðn2 xÞ; ð4Þ
2 2
tions in the fluid and solid domain, the discretizations and the
solution techniques of the discrete equations are not described based on the previous time steps. Lower order extrapolations are
here because they are not important for the comparison between used for the first two time steps.
the coupling algorithms but they can be found in [39] or any book
on computational fluid and solid mechanics.
Often, a velocity (or position after time integration) is imposed 3. Motivation for the use of quasi-Newton methods
on the fluid side of the interface and a stress distribution is applied
on the structural side, which is a Dirichlet–Neumann decomposi- Degroote et al. [37] have presented a stability analysis of the
tion of the FSI problem [7]. The following abstract definitions Gauss–Seidel iterations between the flow solver and the structural
emphasize that the solvers are treated as black boxes. solver in a partitioned simulation of the one-dimensional (1D) un-
The function steady flow in a straight, flexible tube. In this section, only the con-
clusion of this analysis is offered as a motivation for the use of
y ¼ FðxÞ ð1Þ quasi-Newton methods. To simplify the equations, the mass of
the structure is neglected and the fluid is incompressible and invis-
is referred to as the flow solver and it concisely represents several
cid. The governing equations are linearized such that Fourier error
operations. The discretized position x 2 Ru of the fluid–structure
analysis can be conducted and the behaviour of the different wave
interface is given to the flow code, and the grid of the fluid domain
numbers in the error on the interface’s position can be analyzed.
adjacent to the interface is adapted accordingly. Subsequently, the
Although this simplified model does not include all physical de-
grid velocity is calculated and the flow equations are solved for
tails, it provides insight into the stability of coupling algorithms
the fluid state in the entire fluid domain, which also results in a
for partitioned FSI simulations.
stress distribution y 2 Rw on the interface. In an unsteady simula-
The stability analysis investigates the difference between the
tion, both x and y are at the new time level and the boundary con-
correct position of the interface and its value during the Gauss–Sei-
ditions and other settings of F are also adapted to the new time
del iterations (further denoted as the error) by decomposing it as a
level.
weighted sum of cosine functions with different wave numbers.
The structural solver is represented by the function
The amplification l of the error is written as a function of the wave
x ¼ SðyÞ: ð2Þ number and three parameters, namely the dimensionless stiffness
of the tube j, the dimensionless time step s and the number of de-
This expression indicates that the fluid stress distribution on the grees-of-freedom in the discretization of the interface u. For more
interface is given to the structural code which then calculates the details and the exact definition of these parameters, the reader is
position of the entire structure and thus also the new position of referred to [37].
the fluid–structure interface. With these definitions, the FSI prob- Fig. 1 depicts the amplification of the error as a function of the
lem is given by wave number. From this figure, it can be seen that the error ampli-
x ¼ S  FðxÞ or RðxÞ ¼ S  FðxÞ  x ¼ 0; ð3Þ fication increases if j or s decreases and that especially the low
wave numbers are unstable. However, even for a very flexible tube
in fixed-point or root-finding formulation, respectively, with R and a small time step, only some wave numbers are unstable. The
being the residual operator. physical meaning of these curves is shown in Fig. 2. The position of
If the flow problem and the structural problem are not discret- the wall of a tube, which initially has a constant cross-section and
ized in the same way on the fluid–structure interface, there has to contains fluid at rest, is perturbed with two different wave num-
be a mapping between the solvers. In this paper, it is assumed that bers. At the inlet and outlet, a zero pressure is imposed. Because
this mapping is included in one of the solvers if necessary. An over- the fluid is incompressible, a displacement of the interface with a
view of mapping methods can be found in [43]. Radial basis func- low wave number requires that the fluid is accelerated globally,
tions can be used for both interpolation on the fluid–structure which causes large pressure variations throughout the fluid and
interface [44] and mesh motion in the fluid domain [45]. thus also a violent motion of the structure. On the other hand, a
Although the solvers are used as black boxes, it is important to displacement with a high wave number only generates local fluid
keep in mind that differences between the numerical methods of motion and hence smaller pressure gradients and structural mo-
both codes can cause undesirable effects. When different time inte- tion. The pressure differences in the case of the high wave number
gration schemes are used by both solvers, it is possible that spuri- can barely be seen since the same scale has been used for both
ous oscillations in time are present in the acceleration of the fluid wave numbers. The material properties, geometrical dimensions
and the structure and thus also in the stresses on the interface. This and time step for this simulation are identical to those in Section
has been demonstrated both analytically and numerically in [46]. 8.2. Although the curves in Fig. 1 depend on the geometry and dis-
In the remainder of this paper, all values and functions are at cretization, this physical interpretation indicates that the underly-
the new time level n þ 1, unless indicated otherwise with a left ing idea would remain valid in other situations.
superscript. A right superscript indicates the coupling iteration Only the components of the error which are unstable ðl > 1Þ or
while a subscript denotes the element in a vector. Capital letters which disappear slowly ðl  1Þ in Gauss–Seidel iterations between
denote matrices, bold lower case letters indicate vectors and lower the flow solver and the structural solver, or equivalently Richard-
case letters represent scalars. Approximations are indicated with a son iterations for the FSI problem, have to be removed by means
hat. The output of the solvers F and S is indicated with a tilde be- of a quasi-Newton technique. Thus, for the quasi-Newton
cause this is only an intermediate value that is not passed on to the iterations to converge quickly, the approximate Jacobian only has
J. Degroote et al. / Computers and Structures 88 (2010) 446–457 449

IBQN-LS, Aitken relaxation and Interface-GMRES(R) are reviewed


in the following sections using the same notation.
The FSI problem reformulated as a set of nonlinear equations in
the interface’s position
RðxÞ ¼ 0 ð5Þ
can be solved by means of Newton–Raphson iterations

dR
solve Dxk ¼ r k ; ð6aÞ
dx xk
xkþ1 ¼ xk þ Dxk ð6bÞ

with the residual calculated as

rk ¼ Rðxk Þ ¼ S  Fðxk Þ  xk ¼ x
~kþ1  xk : ð7Þ
The Newton–Raphson iterations in one time step have converged
when kr k k2 6 eo with eo the convergence tolerance. However, the
exact Jacobian of R is unknown as the Jacobians of F and S are
unavailable. Moreover, a linear system (6a) with as dimension the
number of degrees-of-freedom in the position of the fluid–structure
interface has to be solved in every Newton–Raphson iteration.
Although the number of degrees-of-freedom in the interface’s posi-
tion is generally smaller than the number of degrees-of-freedom in
the entire fluid and structure domain, the Jacobian matrix dR=dx is
usually dense. As a result, the solution of the linear system (6a) cor-
responds to a significant computational cost in large simulations,
especially if a direct solver is used.
If the Jacobian dR=dx is approximated and quasi-Newtons iter-
ations are performed, black-box solvers can be used but this ap-
Fig. 1. Amplification of the error on the interface’s position as a function of the
proach does not circumvent that the linear system (6a) has to be
wave number ðhÞ. This result has been obtained by means of Fourier error analysis solved. It is more advantageous to approximate the inverse of the
on the Gauss–Seidel iterations between the flow solver and structural solver in a Jacobian by applying the least-squares technique introduced by
simulation of 1D unsteady, incompressible flow in a flexible tube [37] with (a) Vierendeels et al. [8] on a particular set of vectors as will be ex-
constant product of the dimensionless time step s and the number of degrees-of-
plained below. The quasi-Newton iterations with the approxima-
freedom u in the discretization of the interface ðsu ¼ 0:1Þ and (b) constant
dimensionless stiffness of the tube ðj ¼ 10Þ. tion for the inverse of the Jacobian can be written as

d  !1
dR 
x kþ1 k
¼x þ  ðrk Þ: ð8Þ
dx xk
to describe the reaction to a limited number of components in the
error on the interface’s position, namely the components with low It can be seen from Eq. (8) that the approximation for the in-
wave numbers. For the components which are not included in the verse of the Jacobian does not have to be created explicitly; a pro-
approximate Jacobian, the quasi-Newton iterations correspond to cedure to calculate the product of this matrix with the vector r k is
Richardson iterations, as will be explained below. sufficient. The vector r k is the difference between the desired
residual, i.e. 0, and the current residual r k and it is further denoted
4. IQN-ILS as Dr ¼ 0  r k ¼ r k . In this work, the matrix–vector product is
calculated from information obtained during the previous quasi-
In this section, the IQN-ILS method with a matrix-free imple- Newton iterations. Eq. (7) shows that the flow equations and struc-
mentation of the least-squares approximation is presented. tural equations are solved in quasi-Newton iteration k, resulting in

Fig. 2. Pressure contours in an axisymmetric tube due to two displacements of the tube’s wall with the same amplitude but a different wave number. Initially, the fluid is at
rest and the tube has a constant cross-section. The straight horizontal line is the axis of symmetry, the curved horizontal line is the tube’s wall. A displacement of the tube’s
wall with a low wave number (top) creates much larger pressure variations than a displacement with the same amplitude but a higher wave number (bottom). Only the
difference between the two calculations and not the values as such are important.
450 J. Degroote et al. / Computers and Structures 88 (2010) 446–457

~kþ1 ¼ S  Fðxk Þ and the corresponding residual r k . The vectors x


x ~ but this implementation becomes unstable if the number of col-
and r from all previous coupling iterations are also available, giving umns in the matrix V k is high which is especially encountered when
a set of known residual vectors information from previous time steps is reused. For that reason, the
so-called economy-size QR-decomposition of V k is calculated using
rk ; r k1 ; . . . ; r 1 ; r0 ð9aÞ
Householder transformations [47]
and the corresponding set of vectors ~x
V k ¼ Q k Rk ð15Þ
~kþ1 ; x
x ~k ; . . . ; x
~2 ; x
~1 : ð9bÞ uv
k
with Q 2 R an orthogonal matrix and R 2 Rv v an upper trian-
k

The differences between all vectors from previous iterations (super- gular matrix. The coefficient vector c k is then determined by solving
script i) and the most recent vector (superscript k) are calculated the triangular system
Dr i ¼ r i  r k ; ð10aÞ T
Rk c k ¼ Q k Dr ð16Þ
Dx~iþ1 ¼ x~iþ1  x~kþ1 ð10bÞ i
using back substitution. If a Dr vector is (almost) a linear combina-
for i ¼ 0; . . . ; k  1. The final result will be the same if vectors other tion of other Dr j vectors, one of the diagonal elements of Rk will (al-
than r k and x ~kþ1 are selected as the reference: choosing the first iter- most) be zero. Consequently, the equation corresponding to that
ation as reference would avoid that all differences have to be recal- row of Rk cannot be solved during the back substitution and the cor-
culated in every coupling iteration; however, choosing the last responding element of c k is set to zero. The threshold for the detec-
iteration facilitates the comparison between the IQN-ILS method tion of small diagonal elements is set to 1010  kRk k2 in this work.
and Aitken relaxation in Section 6. The Dx ~ that corresponds to Dr is subsequently calculated as a
Each Dr i corresponds to a Dx ~iþ1 and these vectors are stored as linear combination of the previous Dx ~i , analogous to Eq. (13),
the columns of the matrices giving
nþ1
V k ¼ ½ Drk1 Dr k2 . . . Dr 1 Dr 0  ð11aÞ Dx~ ¼ W k ck : ð17Þ
and From Eq. (7), it follows that
nþ1 k
W ¼ ½ Dx ~k Dx ~k1 . . . Dx ~2 ~1
Dx : ð11bÞ Dr ¼ Dx~  Dx ð18Þ
Due to the similarity between subsequent time steps, the informa-
and substitution of Eq. (17) in Eq. (18) results in
tion from the previous time steps can be reused. The matrices nþ1 V k
and nþ1 W k are then combined with those from q previous time steps Dx ¼ W k ck  Dr: ð19Þ
(if at least q time steps have already been performed), giving
k
Because the coefficients c are a function of Dr, Eq. (19) shows how
V k ¼ ½ nþ1 V k n
V ... nqþ2
V nqþ1
V ð12aÞ Dx can be approximated for a given Dr. Hence, Eq. (19) can be seen
as a procedure to calculate the product of the approximation for the
and
inverse of the Jacobian and a vector Dr ¼ r k
W k ¼ ½ nþ1 W k n
W ... nqþ2
W nqþ1
W : ð12bÞ  !1
d
dR 
niþ1 niþ1 Dx ¼  Dr ¼ W k ck þ rk : ð20Þ
The columns of the matrices V and W are calculated by dx xk
subtracting the vector of the last iteration of time step n  i þ 1
from all previous vectors in that time step, in the same way as in This matrix-free procedure requires less memory (proportional to
Eq. (10). By including the information for q previous time steps, the number of rows and columns in V k , so u  v ) and is also faster
the convergence of the coupling iterations is remarkably acceler- than the explicit creation of approximation for the inverse of the
ated. However, if information from too many time steps is reused, Jacobian as
the convergence can slow down again as information from time
d  !1
step n  q þ 1 might no longer be relevant in time step n þ 1. The dR  1 T
 ¼ W k Rk Q k  I ð21Þ
optimal value of q is problem dependent but the convergence of dx  xk
the coupling iterations does not change significantly near the opti-
mum such that the performance of the method is robust with re- with I the unity matrix in Ruu . This is a significant advantage for
spect to the parameter q. simulations with a high number of degrees-of-freedom on the
The number of columns in V k and W k is indicated with v and is interface.
generally much smaller than the number of rows u. Nevertheless, The relation between Dr and Dx is thus found by means of the
in simulations with a low number of degrees-of-freedom on the Dx~ values. One might try to relate the residual r directly to x in-
interface, it is possible that the number of columns has to be lim- stead of to x~, but this obviously will not work as the new input
ited to u by discarding the rightmost columns. for S  F would be a linear combination of the previous inputs.
The vector Dr ¼ 0  r k is approximated as a linear combination The only new information in the input of S  F would originate
of the known Dr i from numerical errors and the coupling iterations would not
converge.
Dr  V k c k ð13Þ From Eq. (16), it can be seen that if part of Dr is orthogonal to V k
and thus to Q k , the decomposition coefficients ck will be zero for
with ck 2 Rv the coefficients of the decomposition. Because v 6 u, that part of Dr. Eq. (20) shows that this component of the residual
Eq. (13) is an overdetermined set of equations for the elements of is not modified such that Gauss–Seidel iterations between the flow
ck and hence the least-squares solution to this linear system is cal- solver and the structural solver are performed for this component
culated. In [8], the solution of the least-squares problem is calcu- of the residual which corresponds to Richardson iterations for the
lated as complete FSI problem. Only the components of Dr in the span of
T T the columns of V k are reduced by means of quasi-Newton
c k ¼ ðV k V k Þ1 V k Dr ð14Þ iterations.
J. Degroote et al. / Computers and Structures 88 (2010) 446–457 451

As for any other Newton–Raphson or quasi-Newton scheme, the


IQN-ILS iterations will only converge when the initial guess is suf-
ficiently close to the final solution. For example, if the deformation
of the structure within one time step is too large and the coupling
iterations do not converge, then the time step must be adjusted.
The same remark can be made for the IBQN-LS and Interface-
GMRES coupling techniques.
The complete IQN-ILS technique is shown in Algorithm 1 and a
simplified version is shown in Fig. 3(a). Because the matrices V k
and W k have to contain at least one column, a relaxation with fac-
tor x (line 4) is performed in the second coupling iteration of the
first time step if information from the previous time steps is reused
ðq > 0Þ and in the second coupling iteration of every time step
without reuse ðq ¼ 0Þ.

Algorithm 1. IQN-ILS method


1: k ¼ 0; x~1 ¼ S  Fðx0 Þ; r 0 ¼ x
~1  x0
2: while kr k k2 > eo do
3: if k ¼ 0 and (q ¼ 0 or n ¼ 0) then
4: xkþ1 ¼ xk þ xr k
5: else
6: construct V k and W k as shown in Eqs. (10)–(12)
7: calculate QR-decomposition V k ¼ Q k Rk
8: T
solve Rk c k ¼ Q k r k
9: xkþ1 ¼ xk þ W k ck þ r k
10: end if
11: ~kþ1 ¼ S  Fðxk Þ; r k ¼ x
k ¼ k þ 1; x ~kþ1  xk
12: end while

5. IBQN-LS

The IBQN-LS method is explained in detail in Algorithm 2 and a


simplified representation is shown in Fig. 3(b). This coupling tech-
nique solves the FSI problem written as
FðxÞ  y ¼ 0; ð22aÞ
SðyÞ  x ¼ 0 ð22bÞ
with block-Newton–Raphson iterations of the Gauss–Seidel type.
The linear system
2 3
c
d F    
dx
I Dx FðxÞ  y
4 5 ¼ ð23Þ Fig. 3. Simplified representation of the (a) IQN-ILS, (b) IBQN-LS, (c) (Aitken)
I c
d S Dy SðyÞ  x relaxation and (d) Interface-GMRES algorithm for partitioned FSI simulations with
dy
black-box solvers.
is thus first solved for Dx, followed by an update of x and the right-
hand side. Subsequently, the modified system is solved for Dy and y
is updated. As a consequence, the IBQN-LS method modifies the hand side of Eq. (24) and consequently the approximate Jacobians
stress distribution that is calculated by the flow solver before trans- c
d Fk
and dcSk
do not have to be calculated explicitly; a procedure to
dx dy
ferring it to the structural solver, as opposed to the other techniques calculate the product of these matrices with a vector is sufficient.
described in this paper. With the notation for intermediate values The procedure to calculate the product of the approximate Jaco-
defined in Section 2, the input and output of the flow solver are thus bian dcFk
dx
or cdS k
dy
with a vector is similar to the procedure described
denoted as xk and y ~kþ1 and the input and output of the structural in Section 4. The matrix–vector product with dc Fk
dx
is calculated from
solver are ykþ1 and x ~kþ1 . the previous inputs
Starting from the position xk that was given as input to the flow
x0 ; . . . ; xk ð25aÞ
solver in the previous coupling iteration, the position
xkþ1 ¼ xk þ Dxk is calculated by solving the system and the corresponding outputs
!
d dF
dS d d
dS y~1 ¼ Fðx0 Þ; . . . ; y~kþ1 ¼ Fðxk Þ ð25bÞ
k k
I Dxk ¼ x~kþ1  xk þ k ~ kþ1
ðy  ykþ1 Þ ð24Þ
dy dx dy of the flow solver which are converted into differences with respect
k to the last input and output
for Dx . As opposed to the original approach of Vierendeels et al. [8],
this linear system is solved with an iterative solver like the general- Dxi ¼ xi  xk ; ð26aÞ
ized conjugate residual method [48] or the generalized minimal
Dy~iþ1 ¼ y~iþ1  y~kþ1 ð26bÞ
residual method [49] in a matrix-free way. The matrix on the left-
452 J. Degroote et al. / Computers and Structures 88 (2010) 446–457

for i ¼ 0; . . . ; k  1. All Dxi and Dy


~iþ1 from the current time step (and xkþ1 ¼ xk þ xk r k ; ð30aÞ
possibly from previous time steps) are stored as columns of the k k k ~kþ1
¼ ð1  x Þx þ x x : ð30bÞ
matrices V kf and W kf with the subscript f referring to the flow solver.
Subsequently, the economy-size QR-decomposition of V kf is calcu-
Algorithm 3. Aitken relaxation [30]
lated. To determine the product of dc dx
Fk
with a vector Dx, the trian-
gular system 1: k ¼ 0; x~1 ¼ S  Fðx0 Þ; r 0 ¼ x
~ 1  x0
2: while kr k k2 > eo do
T
Rkf c kf ¼ Q kf Dx ð27Þ 3: if k ¼ 0 then
4: x0 ¼ signðn xÞ minðjn xj; xmax Þ
is solved for c kf and the matrix–vector product is then calculated as 5: else
d 6: k1 T
xk ¼ xk1 ðr krkÞrðrk1rk2
k k1
Þ
dF k
Dx ¼ W kf ckf : ð28Þ
dx 7: end if
The product of c
dS k
dy
with a vector is calculated analogously, based on 8: xkþ1 ¼ xk þ xk r k
9: ~kþ1 ¼ S  Fðxk Þ; r k ¼ x
k ¼ k þ 1; x ~kþ1  xk
the inputs and outputs of the structural solver.
10: end while
Once xkþ1 has been obtained, k is increased, the corresponding
~kþ1 ¼ Fðxk Þ is calculated and the matrices V f ,
stress distribution y
W f , Q f and Rf are updated. To calculate the stress distribution The next input for S  F is thus a linear combination of the last out-
ykþ1 ¼ yk þ Dyk that has to be applied on the structure, the system put and the previous input. Moreover, the update of the interface’s
! position is in the direction of the residual vector, as opposed to the up-
d dS
dF d d
dF date from the IQN-ILS method on line 9 of Algorithm 1. The first relax-
k k1
I Dyk ¼ y~kþ1  yk þ k ~k
ðx  xk Þ ð29Þ
dx dy dx ation in a time step is executed with the relaxation factor from the end
of the previous time step, but limited to xmax , so x0 ¼ signðn xÞ
is solved, again with the matrix-free iterative solver. Every time the minðjn xj; xmax Þ and 0 x0 ¼ xmax . The value of xk is obtained as
solution of either the flow problem or the structural problem is cal-
ðr k1 ÞT ðrk  r k1 Þ
culated, the approximate Jacobian of the corresponding solver is im- xk ¼ xk1 ; ð31Þ
proved by means of the solver’s latest input and output. ðr k  rk1 ÞT ðrk  rk1 Þ
Analogous to the IQN-ILS technique, the matrices V ks;f and W ks;f which is interpreted in [30] as the secant method for scalars directly
have to contain at least one column to calculate the quasi-Newton applied to vectors and projected on rk  r k1 . By combining Eqs.
update; otherwise a relaxation with factor x is used for the inter- (30b) and (31), it can be seen that the update of the interface’s posi-
face’s position (line 4 in Algorithm 2) and the stress distribution is tion is given by
passed on without modification (line 13).
ðxk  xk1 ÞT ðrk  r k1 Þ
xkþ1 ¼ xk þ ðrk Þ; ð32aÞ
ðr k  rk1 ÞT ðr k  rk1 Þ
Algorithm 2. IBQN-LS method [8] " #
k ðx~kþ1  x ~k ÞT ðrk  rk1 Þ
1: k ¼ 0; y~1 ¼ Fðx0 Þ; y1 ¼ y
~1 ; x
~1 ¼ Sðy1 Þ; r 0 ¼ x
~ 1  x0 ¼x þ  1 ðr k Þ ð32bÞ
2: k ðr k  r k1 ÞT ðr k  rk1 Þ
while kr k2 > eo do
3: if k ¼ 0 and (q ¼ 0 or n ¼ 0) then for k > 0. The previous equation is similar to line 9 of Algorithm 1. If
4: xkþ1 ¼ xk þ xr k the Jacobian were created explicitly in the IQN-ILS algorithm, if the
5: else normal equations were used to solve the least-squares problem in
6: Eq. (13) and if the matrices V k and W k were limited to their last col-
construct V ks and W ks
umn, line 9 of Algorithm 1 would give
7: calculate QR-decomposition V ks ¼ Q ks Rks " #
8: ðx ~k Þðr k  rk1 ÞT
~kþ1  x
solve Eq. (24) for Dxk x kþ1
¼x þk
 I ðrk Þ: ð33Þ
9: xkþ1 ¼ xk þ Dxk ðr k  r k1 ÞT ðr k  rk1 Þ
10: end if Eqs. (32b) and (33) are, however, not identical because the coeffi-
11: k ¼ k þ 1; y~kþ1 ¼ Fðxk Þ cient of r k is a scalar in the first equation and a matrix in the sec-
12: if k ¼ 1 and (q ¼ 0 or n ¼ 0) then ond one. This proves that Aitken relaxation is fundamentally
13: ykþ1 ¼ y
~kþ1 different from the IQN-ILS method. On the other hand, it demon-
14: else strates that implementing the IQN-ILS method is hardly more com-
15: plex than implementing Aitken relaxation. However, Aitken
construct V kf and W kf
relaxation can also be seen as an interface quasi-Newton technique:
16: calculate QR-decomposition V kf ¼ Q kf Rkf if the inverse of the Jacobian in Eq. (8) is approximated by xk I, the
17: solve Eq. (29) for Dyk Aitken relaxation method is retrieved.
18 ykþ1 ¼ yk þ Dyk Section 8 will demonstrate the gain in performance obtained by
19 end if using the information from all previous iterations or even previous
20: ~kþ1 ¼ Sðykþ1 Þ; r k ¼ x
x ~kþ1  xk time steps in the approximate Jacobians instead of only from the
21: end while last two iterations in Aitken relaxation. Steepest descent relaxation
[30] is similar to Aitken relaxation, but it is not studied in this pa-
per as it exhibits stepwise or zig-zag convergence which is typical
for steepest descent methods and well understood [50].
6. Aitken relaxation
7. Interface-GMRES(R)
Aitken relaxation [30], shown in Algorithm 3 and Fig. 3(c),
determines a dynamically varying scalar relaxation factor xk for Interface-GMRES (Algorithm 4 and Fig. 3(d)) uses the Newton–
the fixed-point iterations within a time step. Raphson method to solve the nonlinear equation RðxÞ ¼ 0 for the
J. Degroote et al. / Computers and Structures 88 (2010) 446–457 453

interface’s position. It is a hybrid Newton–Krylov method [51], ap- the relaxation factor x is problem dependent and one is limited by
plied to FSI. The Newton–Raphson update Dx is obtained by solving the finite accuracy of the solvers for a lower bound and divergence
the linear system in Eq. (6a). To be able to calculate the Jacobian- for an upper bound. The orthonormalization of the Dxi from lines 7
vector product, fixed-point iterations are first performed in lines to 9 is recommended in [28] to improve the accuracy of the solu-
4–19 which results in a sequence of interface positions xi and the tion of Eq. (35).
corresponding residuals r i with i ¼ 1; . . . ; j. From these sequences, Algorithm 4 can be conceived as the construction of a model
differences Dxi ¼ xi  x0 and Dr i ¼ r i  r 0 are calculated and stored with the residual as input and the interface’s position as output. In-
as columns of V j and W j . It has been proved by induction that the stead of using the model to update the interface’s position in the
Krylov space corresponding with the linear system in Eq. (6a) is inner loop, underrelaxed fixed-point iterations are performed until
asymptotically similar to spanfDxi gji¼1 and the fixed-point itera- the model meets the user-defined tolerance 1 . To prevent that Eq.
tions thus serve as a preconditioner to the GMRES solution of Eq. (35) is solved too accurately during the first few Newton–Raphson
P
(6a) [25,28]. The residual at x þ ji¼1 ci Dxi is approximated as iterations, 1 is set to a fraction k of the current residual kr k k.
! Instead of discarding the xi and r i from the previous Newton–
X
j X
j
R xþ c i Dx i
 r0 þ c i Dr i ; ð34Þ Raphson iterations, they can be reused in the next Newton–Raph-
i¼1 i¼1 son iteration, which is called Interface-GMRESR. Algorithm 5 shows
the alterations to Algorithm 4 that bring about reuse of the fixed-
which can be considered as a finite-difference approximation.
point iterations from previous Newton–Raphson iterations.
When the minimization problem in the residual space
 
 X j  Algorithm 5. Interface-GMRESR method [28]
 
min r 0 þ c i Dr i  ð35Þ
c   1: ~1 ¼ S  Fðx0 Þ; r 0 ¼ x
~ 1  x0
i¼1 2
k ¼ 0; x
2: j ¼ 0; f ¼ kr k k2
of the GMRES method has converged sufficiently, the corresponding
3: while kr k k2 > eo do
Newton–Raphson update is calculated as
  4: ...
 
 0 X
j
i 21: k ¼ k þ 1; x ~jþ1 ¼ S  Fðx0 Þ; r k ¼ x
~jþ1  x0
c ¼ arg min r þ c i Dr  ; ð36aÞ
c   22: T
i¼1 2 solve Rj c ¼ Q j r k
X
j 23: f ¼ kr k þ V j ck2
Dx ¼ ci Dxi : ð36bÞ 24: end while
i¼1

The coefficient vector c is calculated in the same way as in the IQN-


ILS method,
 as shown  on line 17 of Algorithm 4. The expression
 P  The inner loop is executed at least once before going to the
minc r 0 þ ji¼1 ci Dr i  is called the linearized residual f, as opposed
2 Newton–Raphson update; otherwise no additional Dr and Dx are
to the true residual krk2 .
known with respect to the previous Newton–Raphson update
and the Newton–Raphson update would be ineffective. Every time
Algorithm 4. Interface-GMRES method [28] a Newton–Raphson update is performed on line 21, the reference
1: ~1 ¼ S  Fðx0 Þ; r 0 ¼ x
k ¼ 0; x ~ 1  x0 x0 is modified and the solvers are evaluated. However, this solver
2: while kr k k2 > eo do evaluation does not result in an additional difference Dx and Dr,
3: whereas IBQN-LS and IQN-ILS modify the reference in every itera-
j ¼ 0; f ¼ kr k k2
tion and obtain a difference of the input and output in every iter-
4: while f > 1 do
ation except for the first one.
5: j¼jþ1
Due to the relaxation on line 10, the Dxj in Algorithm 4 all have
6: Dxj ¼ x~ j  x0
the same norm, regardless of whether it is the first or the final iter-
7: for i ¼ 1 to j  1 do
ation in the time step. However, the step size is adapted to the ini-
8: jT
Dxj ¼ Dxj  DkxDxiDkx Dxi
i

2
tial residual. If the problem is nonlinear, the Dx and Dr can be
9: end for inaccurate because this finite-difference step size is suboptimal.
10: 0 Because the norm of the differences in IBQN-LS and IQN-LS is equal
Dxj ¼ x kkr k2
Dxj k
Dxj
2 to the step size, it automatically decreases during the coupling iter-
11: xj ¼ x0 þ Dxj ations as they converge and these methods do not require a preset
12: ~jþ1 ¼ S  Fðxj Þ
x step size.
13: Dr j ¼ ðx
~jþ1  xj Þ  r k Reuse of differences from the previous time steps and not only
14: construct V j ¼ ½Dr 1    Dr j  from the previous Newton–Raphson iterations has been intro-
15: duced by Michler [52] who mention that reuse increases the effi-
construct W j ¼ ½Dx1    Dxj 
ciency, but that it comes at the expense of robustness and
16: calculate QR-decomposition V j ¼ Q j Rj therefore has to be used with some caution. Results for only 25
17: T
solve Rj c ¼ Q j r k time steps have been presented in a simulation with a compress-
18: f ¼ kr k þ V j ck2 ible fluid. To enable reuse in Interface-GMRESR, j should not be re-
19: end while set to 0 at the beginning of the time step. The convergence plots in
20: [52] demonstrate that with reuse from the previous time steps,
x0 ¼ x0 þ W j c
there is a large discrepancy between the linearized residual and
21: ~1 ¼ S  Fðx0 Þ; r k ¼ x
k ¼ k þ 1; x ~ 1  x0
the true residual. The inner loop with fixed-point iterations con-
22: end while
verges after one iteration because the linearized residual already
is very small. Consequently, the algorithm repeatedly performs
For FSI with strong interaction, the relaxation on line 10 of Algo- one fixed-point iteration followed by a Newton–Raphson update
rithm 4 is imperative as the inner loop consists of fixed-point iter- and in this regime an additional Dx and Dr are obtained at the cost
ations which diverge fast without relaxation. The optimal value of of two solver evaluations.
454 J. Degroote et al. / Computers and Structures 88 (2010) 446–457

8.1. 2D flexible beam


2.30e+03
2.13e+03
1.96e+03 The first cases are the steady FSI1 test and the unsteady FSI2
1.79e+03 test from the benchmark in [41]. This test is well-documented in
1.62e+03
the aforementioned reference and thus it will only be described
1.45e+03
1.28e+03 briefly. As can be seen in Fig. 4, the geometry consists of a horizon-
1.11e+03 tal channel of 0.41 m high, containing a rigid cylinder with center
9.40e+02 positioned 0.2 m above the bottom of the channel. A laminar, vis-
7.70e+02
cous flow with density 103 kg/m3 and kinematic viscosity
6.00e+02
4.30e+02 103 m2/s enters the channel from the left-hand side with a para-
2.60e+02 bolic velocity profile. A constant pressure is imposed at the outlet
9.00e+01 of the channel and a no-slip boundary condition is applied on the
-8.00e+01
walls. The mean inlet velocity is 0.2 m/s for FSI1 and 1.0 m/s for
-2.50e+02
-4.20e+02 FSI2. A linearly elastic beam is attached to the right-hand side of
-5.90e+02 the cylinder and this beam has a density of 103 kg/m3 or 104 kg/
-7.60e+02 m3, respectively, in FSI1 and FSI2, a Young’s modulus of
-9.30e+02
-1.10e+03
1.40  106 N/m2 and a Poisson’s ratio of 0.4.
The finite volume flow solver uses second-order discretization
(a) for the pressure and second-order upwind for the momentum. It
solves the Navier–Stokes equations in arbitrary Lagrangian Euleri-
2.30e+03 an (ALE) formulation for the pressure and velocity simultaneously
2.13e+03 with first-order time accuracy. The mesh of the fluid domain is
1.96e+03
adapted to the position of the fluid–structure interface with a
1.79e+03
1.62e+03 spring analogy and remeshing is performed if the skewness of
1.45e+03 the cells is too large. The spring analogy means that the edges be-
1.28e+03 tween the grid nodes are replaced by springs, with a spring con-
1.11e+03
stant inversely proportional to the length of the edge. This
9.40e+02
7.70e+02 results in a pseudo-structural problem with the grid displacement
6.00e+02 as unknowns [53]. The finite element structural solver uses impli-
4.30e+02 cit time integration of plain strain elements with 8 nodes and takes
2.60e+02
9.00e+01
into account the geometric nonlinearities due to the large defor-
-8.00e+01 mation of the structure. The coupling algorithm is executed on
-2.50e+02 one processor, the flow solver and structural solver each on two
-4.20e+02
processors.
-5.90e+02
-7.60e+02
After validating the flow solver and the structural solver with
-9.30e+02 the tests listed in [41], the FSI1 and FSI2 tests have been performed.
-1.10e+03 Table 1 and Fig. 5 give the displacement in the x and y direction of
(b) the point A, which is located at the right end of the beam, and the
drag and lift force on the entire structure in the same format as in
Fig. 4. Pressure contours in the 2D unsteady FSI2 test with the flexible beam after the benchmark paper [41]. The difference between the simulations
(a) 12 s; (b) 16 s. of the FSI1 test decreases as the grid is refined and the results cor-
respond well with the benchmark values, even for the FSI2 test
considering that the flow solver is only first-order accurate in time
on moving meshes. The same results have been found with each
8. Results coupling method. Pressure contours in the vicinity of the structure
are shown in Fig. 4.
In this section, the performance of the coupling algorithms is The number of solver evaluations and the relative duration of
compared for two different cases, namely a 2D flexible beam be- the simulations are given in Table 2. The residual kr i k2 is reduced
hind a cylinder and the propagation of a pressure pulse in a 3D five orders of magnitude with respect to its initial value and the
flexible tube. number between brackets behind the name of an algorithm indi-

Table 1
Displacement and force in the 2D steady FSI1 and unsteady FSI2 test with the flexible beam for different grids. The number of degrees-of-freedom (DOF) is indicated for each grid,
separately for the flow solver and the structural solver, and this number varies slightly in the unsteady test due to remeshing. For the FSI2 test, the values are given as
mean ± amplitude [frequency].

DOF ðF þ SÞ displx [103 m] disply [103 m] forcex [N] forcey [N]


FSI1
74,418 + 7650 0.022643 0.83672 14.234 0.75416
164,394 + 16,874 0.022645 0.83489 14.254 0.75616
290,574 + 29,698 0.022651 0.83478 14.266 0.75753
Benchmark 0.022700 0.82090 14.295 0.76380
FSI2
89,760 + 7650 14.07 ± 12.37 [3.7] 1.18 ± 76.5 [1.9] 217.52 ± 84.65 [3.7] 0.74 ± 267.6 [1.9]
Benchmark 14.58 ± 12.44 [3.8] 1.23 ± 80.6 [2.0] 208.83 ± 73.75 [3.8] 0.88 ± 234.2 [2.0]
J. Degroote et al. / Computers and Structures 88 (2010) 446–457 455

(a) 0 (b) 0.1


0.08
−0.005
0.06
−0.01 0.04

displ [m]
displx [m]
0.02

y
−0.015
0
−0.02 −0.02
−0.04
−0.025
−0.06
−0.03 −0.08
13 13.2 13.4 13.6 13.8 14 13 13.2 13.4 13.6 13.8 14
t [s] t [s]

(c) 300 (d) 250


280 200
260 150
100
240

force [N]
force [N]

50
220

y
x

0
200
−50
180
−100
160 −150
140 −200
120 −250
13 13.2 13.4 13.6 13.8 14 13 13.2 13.4 13.6 13.8 14
t [s] t [s]

Fig. 5. Displacement and force in the 2D unsteady FSI2 test with the flexible beam. Displacement of node A in (a) the x-direction; (b) the y-direction. Force on the beam and
cylinder in (c) the x-direction; (d) the y-direction.

Table 2 duration of the simulation is similar for IQN-ILS(3) and IBQN-


Number of solver evaluations and relative duration for the 2D steady FSI1 test and LS(3), which are significantly faster than Aitken relaxation and
unsteady FSI2 test with the flexible beam. The data are independent of the grid Interface-GMRES(R).
refinement in the steady simulation and the number of solver evaluations per time
step in the unsteady simulation has been averaged over the last period of the
oscillation. 8.2. 3D flexible tube

Algorithm Evaluations Duration


The second test case is a 3D simulation of a straight flexible
FSI1 tube—representing an artery—with radius 0.005 m and length
IBQN-LS 5 1.18
0.05 m, as described by Fernandez and Moubachir [24], Formaggia
IQN-ILS 5 1.00
Aitken relaxation 8 1.42 et al. [54] and Gerbeau and Vidrascu [26]. The same flow solver and
Interface-GMRES 7 1.35 structural solver as in Section 8.1 have been used, but the structure
Interface-GMRESR 6 1.00 has been modeled with one layer of shell elements. The tube’s wall
FSI2 is a linear elastic material with density 1200 kg/m3, Young’s mod-
IBQN-LS 7.2 1.54 ulus 3  105 N/m2, Poisson’s ratio 0.3 and thickness 0.001 m. The
IBQN-LS(3) 4.8 1.00
structure is clamped in all directions at the inlet and outlet. The
IQN-ILS 9.4 1.84
IQN-ILS(3) 6.1 1.07
fluid is incompressible and has a density of 1000 kg/m3 and a dy-
Aitken relaxation 9.9 1.81 namic viscosity of 0.003 Pa s.
Interface-GMRES 10.5 1.94 Both the fluid and the structure are initially at rest. During the
Interface-GMRESR 14.4 2.71 first 3  103 s, an overpressure of 1333.2 N/m2 is applied at the
Interface-GMRESR(3) 12.4 2.71
inlet. The wave propagates through the tube during 102 s, simu-
lated with time steps of 104 s. The fluid model contains 37,128 de-
grees-of-freedom and the structure model 29,760, with u ¼ 4896
degrees-of-freedom in the interface’s position and w ¼ 4992 de-
cates from how many time steps information is reused. For FSI1, grees-of-freedom in the stress distribution on the fluid–structure
the number of solver evaluations is independent of the grid refine- interface. Pressure contours on the fluid–structure interface are
ment. Interface-GMRESR has been used with k ¼ 0:01 because this shown in Fig. 6 and they correspond well with the above-men-
resulted in the lowest number of solver evaluations. In the unstea- tioned references. The same results have been found with each
dy FSI2 test, Interface-GMRESR requires more coupling iterations coupling method.
than Interface-GMRES, especially when the deformation within a Table 3 gives the number of solver evaluations in a time
time step is large. Reuse of information from three time steps re- step, averaged over the entire simulation, and the relative dura-
duces the number of coupling iterations with approximately 30% tion of the simulations. In every time step, krk2 is reduced three
for both IQN-ILS and IBQN-LS. For this unsteady simulation, the orders of magnitude with respect to its initial value in that time
456 J. Degroote et al. / Computers and Structures 88 (2010) 446–457

Table 3
1.40e+03 Number of solver evaluations per time step and relative duration for the simulation of
1.31e+03 the 3D flexible tube. The number of solver evaluations per time step has been
1.22e+03
averaged over the entire simulation.
1.13e+03
1.04e+03 Algorithm Evaluations Duration
9.50e+02
8.60e+02 IBQN-LS(2) 8.2 1.29
7.70e+02 IBQN-LS(10) 6.3 1.00
6.80e+02 IQN-ILS(2) 8.4 1.30
5.90e+02 IQN-ILS(10) 6.6 1.03
5.00e+02 Aitken relaxation 26.7 4.69
4.10e+02 Interface-GMRES 16.1 2.71
3.20e+02 Interface-GMRESR(2) 11.7 1.79
2.30e+02 Interface-GMRESR(10) 9.5 1.45
1.40e+02
5.00e+01
-4.00e+01 Y
-1.30e+02
Z X
-2.20e+02
step. The results demonstrate that the performance of the cou-
-3.10e+02
-4.00e+02 pling methods is different with respect to number of solver
evaluations and CPU time. The reason for this difference is that
if the coupling algorithm predicts an irregular position of the
(a) interface or pressure distribution on the interface, it will take
the flow solver and structural solver longer to converge. For this
case, the duration of the simulation is again similar for IBQN-
1.40e+03
1.31e+03 LS(10) and IQN-ILS(10), which are faster than the other
1.22e+03 methods.
1.13e+03
1.04e+03
9.50e+02 9. Conclusion
8.60e+02
7.70e+02
6.80e+02
It has been demonstrated that the complete Jacobian is not re-
5.90e+02 quired in an interface Newton–Raphson technique for partitioned
5.00e+02 simulation of fluid–structure interaction. Based on this insight,
4.10e+02
3.20e+02
an interface quasi-Newton technique with an approximation for
2.30e+02 the product of the inverse of the Jacobian with a vector (IQN-ILS)
1.40e+02 and reuse of information from previous time steps has been devel-
5.00e+01
oped [39]. A comparison of the IQN-ILS algorithm with IBQN-LS,
-4.00e+01 Y
-1.30e+02 Aitken relaxation and Interface-GMRES(R) indicates that the IQN-
Z X
-2.20e+02 ILS method can easily be implemented in an FSI framework that
-3.10e+02 currently uses (Aitken) relaxation or Interface-GMRES(R) and that
-4.00e+02
IQN-ILS is more straightforward to implement than IBQN-LS. The
results show that the IQN-ILS method performs similarly to the
(b) IBQN-LS method and better than Aitken relaxation and Interface-
GMRES(R).

1.40e+03
Acknowledgments
1.31e+03
1.22e+03
1.13e+03 The authors gratefully acknowledge the funding of Joris Degro-
1.04e+03
ote and Peter Bruggeman by a Ph.D. fellowship of the Research
9.50e+02
8.60e+02 Foundation-Flanders (FWO). Joris Degroote acknowledges a grant
7.70e+02 for a long stay abroad at the Massachusetts Institute of Technology
6.80e+02 (MIT) of the Research Foundation-Flanders. Jan Vierendeels
5.90e+02
5.00e+02
acknowledges the funding by Research Project G027508 of the Re-
4.10e+02 search Foundation-Flanders and the Ghent University Association.
3.20e+02
2.30e+02
1.40e+02 References
5.00e+01
-4.00e+01 Y [1] Farhat C, van der Zee K, Geuzaine P. Provably second-order time-accurate
-1.30e+02
Z X
loosely-coupled solution algorithms for transient nonlinear computational
-2.20e+02 aeroelasticity. Comput Method Appl Mech Eng 2006;195:1973–2001.
-3.10e+02 [2] Cavagna L, Quaranta G, Mantegazza P. Application of Navier–Stokes
-4.00e+02 simulations for aeroelastic stability assessment in transonic regime. Comput
Struct 2007;85(11–14):818–32.
[3] Stein K, Tezduyar T, Kumar V, Sathe S, Benney R, Charles R. Numerical
simulation of soft landing for clusters of cargo parachutes. In: Niettaanmäki P,
(c) Rossi T, Majava K, Pieronneau O, editors. European congress on computational
methods in applied sciences and engineering, ECCOMAS 2004, Jyväskylä, 1–14,
Fig. 6. Pressure contours on the fluid–structure interface in a 3D simulation of the 2004.
flexible tube after (a) 103 s; (b) 5  103 s; (c) 9  103 s. [4] Lee C, Noguchi H, Koshizuka S. Fluid–shell structure interaction analysis by
J. Degroote et al. / Computers and Structures 88 (2010) 446–457 457

coupled particle and finite element method. Comput Struct 2007;85(11– [30] Küttler U, Wall W. Fixed-point fluid–structure interaction solvers with
14):688–97. dynamic relaxation. Comput Mech 2008;43(1):61–72. doi:10.1007/s00466-
[5] Potapov S, Maurel B, Combescure A, Fabis J. Modeling accidental-type fluid– 008-0255-5.
structure interaction problems with the SPH method. Comput Struct [31] Lesoinne M, Farhat C. A higher-order subiteration free staggered algorithm for
2009;87(11–12):721–34. non-linear transient aeroelastic problems. AIAA J 1998;36(9):1754–6.
[6] Riemslagh K, Vierendeels J, Dick E. Coupling of a Navier–Stokes solver and an [32] Piperno S, Farhat C, Larrouturou B. Partitioned procedures for the transient
elastic boundary solver for unsteady problems. in: Papailiou K, Tsahalis D, solution of coupled aeroelastic problems – Part I: Model problem, theory and
Périaux J, Hirsch C, Pandolfi M, editors. Fourth European computational fluid two-dimensional application. Comput Method Appl Mech Eng
dynamics conference, ECCOMAS, Athens, Greece; 1998. p. 1040–5. 1995;124:79–112.
[7] Causin P, Gerbeau J-F, Nobile F. Added-mass effect in the design of partitioned [33] Piperno S, Farhat C. Partitioned procedures for the transient solution of
algorithms for fluid–structure problems. Comput Method Appl Mech Eng coupled aeroelastic problems – Part II: Energy transfer analysis and three-
2005;194(42–44):4506–27. dimensional applications. Comput Method Appl Mech Eng 2001;190:3147–70.
[8] Vierendeels J, Lanoye L, Degroote J, Verdonck P. Implicit coupling of partitioned [34] Zuijlen Av, Bosscher S, Bijl H. Two level algorithms for partitioned fluid–
fluid–structure interaction problems with reduced order models. Comput structure interaction computations. Comput Method Appl Mech Eng
Struct 2007;85(11–14):970–6. 2007;196:1458–70.
[9] dos Santos N, Gerbeau J-F, Bourgat J-F. Partitioned FSI strategy for simulations [35] van Zuijlen A, de Boer A, Bijl H. Higher-order time integration through smooth
of a thin elastic valve. In: Wesseling P, Oñate E, Périaux J, editors. European mesh deformation for 3D fluid–structure interaction simulations. J Comput
conference on computational fluid dynamics, ECCOMAS CFD 2006, ECCOMAS, Phys 2007;224:414–30.
Delft, The Netherlands; 2006. p. 1–10. [36] Vierendeels J. Implicit coupling of partitioned fluid–structure interaction
[10] van Loon R, Anderson P, van de Vosse F. A fluid–structure interaction method solvers using reduced-order models. In: Bungartz H-J, Schäfer M, editors.
with solid–rigid contact for heart valve dynamics. J Comput Phys Fluid–structure interaction – modelling, simulation, optimisation. Lecture
2006;217:806–23. notes in computational science and engineering, vol. 53. Berlin: Springer;
[11] Dumont K, Vierendeels J, Kaminsky R, Van Nooten G, Verdonck P, Bluestein D. 2006. p. 1–18. ISBN: 3-540-34595-7.
Comparison of the hemodynamic and thrombogenic performance of two [37] Degroote J, Bruggeman P, Haelterman R, Vierendeels J. Stability of a coupling
bileaflet mechanical heart valves using a CFD/FSI model. ASME J Biomech Eng technique for partitioned solvers in FSI applications. Comput Struct
2007;129(4):558–65. 2008;86(23–24):2224–34.
[12] Heil M. An efficient solver for the fully coupled solution of large-displacement [38] Degroote J, Annerel S, Vierendeels J. Stability analysis of Gauss–Seidel
fluid–structure interaction problems. Comput Method Appl Mech Eng iterations in a partitioned simulation of fluid–structure interaction. Comput
2004;193:1–23. Struct 2010;88:263–71.
[13] Bathe KJ, Zhang H. Finite element developments for general fluid flows with [39] Degroote J, Bathe KJ, Vierendeels J. Performance of a new partitioned
structural interactions. Int J Numer Method Eng 2004;60:213–32. procedure versus a monolithic procedure in fluid–structure interaction.
[14] Jahromi H, Izzuddin B, Zdravkovic L. Partitioned analysis of nonlinear soil– Comput Struct 2009;87(11–12):793–801.
structure interaction using iterative coupling. Interact Multiscale Mech [40] Haelterman R, Degroote J, Van Heule D, Vierendeels J. The quasi-Newton least
2007;1(1):33–51. squares method: a new and fast secant method analyzed for linear systems.
[15] Elleithy W, Tanaka M. Interface relaxation algorithms for BEM–BEM coupling SIAM J Numer Anal 2009;47(3):2347–68.
and FEM–BEM coupling. Comput Method Appl Mech Eng 2003;192:2977–92. [41] Turek S, Hron J. Proposal for numerical benchmarking of fluid–structure
[16] Oñate E, Idelsohn S, Del Pin F, Aubry R. The particle finite element method. An interaction between an elastic object and laminar incompressible flow. In:
overview. Int J Comput Method 2004;1:267–307. Bungartz H-J, Schäfer M, editors. Fluid–structure interaction – modelling,
[17] Monaghan J. Simulating free surface flows with SPH. J Comput Phys simulation, optimisation. Lecture notes in computational science and
1994;110(2):399–406. engineering, vol. 53. Berlin: Springer; 2006. p. 371–85. ISBN: 3-540-34595-7.
[18] Peskin C, McQueen D. A three-dimensional computational method for blood [42] Bathe KJ. Encyclopedia of computer science and engineering. In: Wah B, editor.
flow in the heart. I. Immersed elastic fibers in a viscous incompressible fluid. J The finite element method, vol. 1. John Wiley and Sons; 2008. dx.doi.org/
Comput Phys 1989;81:372–405. 10.1002/9780470050118.ecse159.
[19] van Loon R, Anderson P, van de Vosse F, Sherwin S. Comparison of various [43] Farhat C, Lesoinne M, Le Tallec P. Load and motion transfer algorithms for
fluid–structure interaction methods for deformable bodies. Comput Struct fluid/structure interaction problems with non-matching discrete interfaces:
2007;85(11–14):833–43. momentum and energy conservation, optimal discretization and application to
[20] Hron J, Turek S. A monolithic FEM/Multigrid Solver for ALE formulation of aeroelasticity. Comput Method Appl Mech Eng 1998;157:95–114.
fluid–structure interaction with application in biomechanics. In: Bungartz H-J, [44] de Boer A, van Zuijlen A, Bijl H. Review of coupling methods for non-matching
Schäfer M, editors. Fluid–structure interaction – modelling, simulation, meshes. Comput Method Appl Mech Eng 2007;196:1515–25.
optimisation. Lecture notes in computational science and engineering, vol. [45] de Boer A, van der Schoot M, Bijl H. Mesh deformation based on radial basis
53. Berlin: Springer; 2006. p. 146–70. ISBN: 3-540-34595-7. function interpolation. Comput Struct 2007;85(11–14):784–95.
[21] Matthies H, Steindorf J. Partitioned strong coupling algorithms for fluid– [46] Vierendeels J, Dumont K, Dick E, Verdonck P. Analysis and stabilization of
structure interaction. Comput Struct 2003;81:805–12. fluid–structure interaction algorithm for rigid-body motion. AIAA J
[22] Matthies H, Niekamp R, Steindorf J. Algorithms for strong coupling procedures. 2005;43(12):2549–57.
Comput Method Appl Mech Eng 2006;195:2028–49. [47] Golub GH, Loan CFV. Matrix computations. 3rd ed. Baltimore, MD, USA: Johns
[23] Fernandez M, Moubachir M. An exact block-Newton algorithm for solving Hopkins University Press; 1996.
fluid–structure interaction problems. Compt Rend Acad Sci Ser I: Math [48] Eisenstat S, Elman H, Schultz M. Variational iterative methods for
2003;336(8):681–6. nonsymmetric systems of linear equations. SIAM J Numer Anal
[24] Fernandez M, Moubachir M. A Newton method using exact Jacobians for 1983;20(2):345–57.
solving fluid–structure coupling. Comput Struct 2005;83:127–42. [49] Saad Y, Schultz M. GMRES: a generalized minimal residual algorithm for
[25] Michler C, van Brummelen E, de Borst R. Error-amplification analysis of solving nonsymmetric linear systems. SIAM J Sci Stat Comput
subiteration-preconditioned GMRES for fluid–structure interaction. Comput 1986;7(3):856–69.
Method Appl Mech Eng 2006;195:2124–48. [50] Nocedal J, Wright S. Numerical optimization. Springer series in operations
[26] Gerbeau J-F, Vidrascu M. A quasi-Newton algorithm based on a reduced model research. Springer; 1999.
for fluid–structure interaction problems in blood flows. ESAIM: Math Model [51] Brown P, Saad Y. Hybrid Krylov methods for nonlinear systems of equations.
Numer Anal 2003;37(4):631–48. SIAM J Sci Stat Comput 1990;11(3):450–81.
[27] Gerbeau J, Vidrascu M, Frey P. Fluid–structure interaction in blood flows on [52] Michler S. Efficient numerical methods for fluid–structure interaction. PhD
geometries based on medical imaging. Comput Struct 2005;83(2–3):155–65. thesis, TUDelft; 2005.
[28] van Brummelen E, Michler C, de Borst R. Interface-GMRES(R) acceleration of [53] Batina J. Unsteady Euler airfoil solutions using unstructured dynamic meshes.
subiteration for fluid–structure-interaction problems. Report DACS-05-001; AIAA J 1990;28(8):1381–8.
2005. <http://www.em.lr.tudelft.nl/downloads/DACS-05-001.pdf>. [54] Formaggia L, Gerbeau J-F, Nobile F, Quarteroni A. On the coupling of 3D and 1D
[29] Michler C, van Brummelen E, de Borst R. An interface Newton–Krylov solver Navier–Stokes equations for flow problems in compliant vessels. Comput
for fluid–structure interaction. Int J Numer Method Fluid 2005;47(10– Method Appl Mech Eng 2001;191:561–82.
11):1189–95.

You might also like