Download as pdf or txt
Download as pdf or txt
You are on page 1of 17

SERIES A

DYNAMIC
METEOROLOGY
AND OCEANOGRAPHY
PUBLISHED BY THE INTERNATIONAL METEOROLOGICAL INSTITUTE IN STOCKHOLM

Assimilating host model information into a limited area


model
By P E R DA H L G R E N * A N D NI L S G U S T A F S S O N , Swedish Meteorological and Hydrological
Institute, Folkborgsvägen 1, 601 76, Norrköping, Sweden
(Manuscript received 31 May 2011; in final form 7 December 2011)

ABSTRACT
We propose to add an extra source of information to the data-assimilation of the regional HIgh Resolution
Limited Area Model (HIRLAM) model, constraining larger scales to the host model providing the lateral
boundary conditions. An extra term, Jk, measuring the distance to the large-scale vorticity of the host model, is
added to the cost-function of the variational data-assimilation. Vorticity is chosen because it is a good
representative of the large-scale flow and because vorticity is a basic control variable of the HIRLAM
variational data-assimilation. Furthermore, by choosing only vorticity, the remaining model variables,
divergence, temperature, surface pressure and specific humidity will be allowed to adapt to the modified
vorticity field in accordance with the internal balance constraints of the regional model. The error
characteristics of the Jk term are described by the horizontal spectral densities and the vertical eigenmodes
(eigenvectors and eigenvalues) of the host model vorticity forecast error fields, expressed in the regional model
geometry. The vorticity field, provided by the European Centre for Medium-range Weather Forecasts
(ECMWF) operational model, was assimilated into the HIRLAM model during an experiment period of 33 d in
winter with positive impact on forecast verification statistics for upper air variables and mean sea level pressure.

Keywords: data-assimilation, large-scale constraint, error covariances, limited area, host model

1. Introduction The aim of this paper is to examine a way to include


selected information from the host model in the data-
In order to run a regional Numerical Weather Prediction
assimilation procedure.
(NWP) model, an initial condition and a host model One reason for doing this is to make the initial model
providing lateral boundaries are required. In this work state more consistent with the host model that will be
we have used the HIRLAM model in which the initial
applied at the lateral boundaries during time-integration.
condition is determined by combining observations and Fa Another reason is that the global models often have more
short forecast with variational data assimilation, and the advanced assimilation techniques that use more satellite
host model is usually the operational ECMWF model.
data, for example, and therefore give a better description of
The HIRLAM data-assimilation system is based on the
the large-scale flow. Using observations inside a regional
variational technique described in Gustafsson et al. (2001)
domain only, larger scales cannot be described in a proper
and Lindskog et al. (2001). In this work it is run in
way, and will be aliased on smaller scales, so by blending
four-dimensional variational analysis (4D-Var) mode,
the large-scale information into the regional model we can
described in Huang et al. (2002) and Gustafsson (2006),
also hope to improve the regional model forecasts.
and the background error covariances include a statistical
In Guidard and Fischer (2008), information from the
balance, Berre (2000).
global Action de Recherche Petite Echelle et Grand Echelle
The variational procedure finds the optimal model state
(ARPEGE) model analysis was blended into the regional
constrained by the models internal balances and observa-
Aire Limitée Adaption Dynamique développement Inter-
tions, i.e. the host model is not taken into consideration.
National (ALADIN) model by adding the ARPEGE
analysis as a constraint in the data-assimilation. All upper
air variables and surface pressure from ARPEGE were
The review process was handled by Editor-in-Chief Harald Lejenäs
*Corresponding author. applied with a diagonal error matrix, and the spectral
email: Per.Dahlgren@smhi.se information vector was truncated such that only the large

Tellus A 2012. # 2012 P. Dahlgren and N. Gustafsson This is an Open Access article distributed under the terms of the Creative Commons Attribution- 1
Noncommercial 3.0 Unported License (http://creativecommons.org/licenses/by-nc/3.0/), permitting all non-commercial use, distribution, and reproduction in any
medium, provided the original work is properly cited. Citation: Tellus A 2012, 64, 15836, DOI: 10.3402/tellusa.v64i0.15836

(page number not for citation purpose)


2 P. DAHLGREN AND N. GUSTAFSSON

scales were affected. In this work we take a different and the general formulation of the background error
approach by selecting only the vorticity field from the host constraint, Jb, is
model, ECMWF, and we let that represent the large-scale
1 T 1
flow which we assume is better described due to its extensive Jb ¼ d^
x B d^
x (2)
2
use of satellite observations and due its use of a global
domain. Furthermore, we use a short forecast from where B contains the error covariances of x^b . All variables
ECMWF, instead of an analysis, to reduce the possibility in x^b are related to each other and therefore the B matrix
of error correlations between the host model field and the becomes quite complex. To ensure a fast convergence of the
observations used in the regional model data-assimilation. minimisation, a pre-conditioning through a variable trans-
Also, it is not even possible for us in practice to use the form is introduced. The change of variable is designed as a
ECMWF analysis in an operational context since it is not series of operators that successively removes the correla-
available in real time. Finally, the error covariances of the tions between the components in d^ x until we end up with a
ECMWF vorticity field are not assumed to be represented by new variable, called v ^, whose covariance matrix can be
a diagonal matrix. assumed to be the identity matrix.
The paper outline is briefly as follows: The series of transforms can be written as
Details of the background error constraint, Jb, in HIR-
^ ¼ c1=2
v
1=2
V Tb kb r1b FGd^
x (3)
LAM and how vorticity is treated is described in Section 2. b

This information is needed for Section 3, where the formula- G calculate vorticity and divergence.
tion of the host model constraint is explained in detail. F balance operators.
Thereafter, in Section 4, we explain how the errors of the sb background error standard deviation. Horizontal
host model constraint are modelled and compared with the average, i.e. one value per vertical level.
background error covariances. Section 5 points out some lb horizontal spectral density of the background error
details on the implementation. The impact of the new correlation. Diagonal matrix.
constraint on an analysis is also investigated with a simple Vb eigenvectors of the vertical background error correla-
three-dimensional variational analysis (3D-Var) single tion matrix. Orthonormal matrix, i.e. V 1 T
b ¼ Vb.
case test. gb eigenvalues of the vertical background error correla-
The impact on forecasts, using a multi incremental 4D- tion matrix. Diagonal matrix.
Var setting, is then shown in Section 6. In Section 7 we The balance operators F follows Berre (2000) and the
summarise and make concluding remarks. remaining operators are described in Gustafsson et al.
(2001).
After the G operator is applied, the state vector contains
2. Balance constraints and control vector (d^f, d^ ^ d^
g, d T, q, dln^ ps ). Then the F operator removes the
The following notations will be used throughout the rest of cross-correlations between the variables and the state
this article: vector is written (d^f, d^ gu , dT ^ u , d^ qu , dln^
ps ), where subscript
u means that the variables are unbalanced. dT ^ and dln^
ps
“ Variables in spectral space are denoted: x^ are often combined into one variable, and the balance
“ Subscript ls denotes data from the host model operators described as going from the unbalanced state
(ECMWF), example: xls to the model state. Following the notations in Berre
“ Subscript b denotes data from the regional model (2000), and also omitting all d and the ln for pressure to
background (HIRLAM), example: xb make the expression more compact, the balance operators
“ An increment is the deviation from the regional acts as the following
model background:dx ¼ x  x b 0 1 0 10 1
^f E 0 0 0 ^f
B g C B CB ^ C
The model state increment vector in the HIRLAM B ^ C ¼ B MH E 0 0 CB g u C (4)
assimilation is @ ðT^; p^ Þ A @ NH P E 0 A@ ðT^; p^ Þ A
s s u
q^ QH R S E q^u
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
0 1 F 1
d^u
B d^ C where E is the identity matrix, M, N, P, Q, R and S
B v C
d^ B
x ¼ B dT C
^
C (1) are vertical balance operators derived by linear regression
@ d^ q A and H is a similar horizontal balance operator that relates
dlnp^s a linearised geopotential with vorticity, including also a
horizontally varying Coriolis parameter. It should also be
ASSIMILATING HOST MODEL INFORMATION INTO A LIMITED AREA MODEL 3

noted that since F 1 is triangular, the inverse is also Bls is not a diagonal matrix which makes this formulation
triangular, so both F 1 and F have the same shape. This difficult to use. Therefore a pre-conditioning is introduced
means that when we go from model state to the control using a similar transform as for the background error
vector, or from the control vector to the model state as covariances B, but using the error characteristics of Bls:
in eq. (4), vorticity is not affected by the balance operators.
If we only consider vorticity, the transform from model L ls ¼ c1=2
ls V Tls k1=2
ls r1
ls (10)
state to the control vector can be written as Projecting both d^k and d^f through Lls will give a state
Lb ¼ c1=2 V Tb k1=2 r1 vector whose error covariance matrix is the identity matrix.
b b b
(5)
^ ¼ L b d^f
v 1
Jk ðd^fÞ ¼ ðL ls d^k þ L ls d^fÞT ðL ls d^k þ L ls d^fÞ (11)
2
but in the minimisation only the inverse and its adjoint
This pre-conditioning hopefully also helps assuring a good
are used
minimisation towards the host model vorticity field.
1=2 1=2
L 1
b ¼ rb kb V b cb
As a last step, eq. (6) is used to express Jk as a function of
(6) the control vector.
d^f ¼ L 1 v
b ^
1
vÞ ¼ ðL ls d^k þ L ls L 1
Jk ð^ b v^ÞT ðL ls d^k þ L ls L 1
b v^Þ (12)
2
3. Formulation of the host model constraint
The derivative of Jk ð^
vÞ with respect to v
^ gives a gradient
We believe that the host model is better at describing the that can be directly added to the total gradient, that now
large-scale flow of which vorticity is a good descriptor. becomes
From the previous section it is also clear that vorticity has
@J
very appealing characteristics because it is also a data- ¼ v^ þ L T b H
T
Hðxb Þ  y þ HL1 b v ^
v
@^ |{z} |fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
assimilation control variable. Therefore we add an extra rJb rJo
(13) (13)
term to the cost-function that describes the difference ^
þ L T
b L T
ls ðL d
ls k þ L 1
ls L b v ^Þ
between the regional- and host-model vorticity fields: |fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
rJk
J ¼ Jb þ Jo þ Jk
1
Jk ð^fÞ ¼ ð^f  ^fls ÞT B 1 ^ ^
ls ðf  fls Þ (7)
2 4. Error covariances
^f is represented by a short forecast from the host To implement the host model constraint Jk we need the
ls
model and Bls contains the error covariances of ^fls . For error covariances of the ECMWF model as seen in the
simplicity we will ignore the cross-correlations between HIRLAM model geometry. To achieve this, ECMWF
the errors of ^fls and x ^b as well as the cross-correlations forecasts were interpolated to the HIRLAM geometry
between the errors of ^fls and the observation errors. The and then processed with the software used to determine
cross-correlations between the errors of ^fls and b are the HIRLAM background error statistics. Both HIRLAM
certainly different from zero and will be examined in and interpolated ECMWF fields were processed simulta-
the next section. Using a host model forecast instead of neously which allowed for cross-covariances between the
a host model analysis eliminates the need to consider errors of two models to be calculated.
the cross-correlations between the errors in ^fls and the The statistics presented will focus on displaying contents
observation errors, since the observations used to deter- of importance for this work. For example, the contents
mine the analysis from which ^fls originates are at least of the L 1
b and Lls operators that will be used in the
6 h old. Also, most regional models do not have the assimilation, eqs. (6) and (10).
global host model analysis available in real time.
Now we set ^f ¼ ^fb þ d^f in eq. (7):
4.1. Regional model geometry
1
Jk ðd^fÞ ¼ ð^fb þ d^f  ^fls ÞT B 1 ^ ^ ^
ls ðfb þ df  fls Þ (8) All calculations are carried out on the HIRLAM reference
2
area, called Regular Cycle with the Reference (RCR),
where ^fb is the regional model background. Fig. 1. The horizontal grid is expressed in a rotated lat/lon
We now also denote d^k ¼ ^fb  ^fls which gives geometry defined by shifting the South pole. Model top is
1 at 10 hPa and there are 60 vertical levels. Model levels are
Jk ðd^fÞ ¼ ðd^k þ d^fÞT B 1 ^ ^
ls ðdk þ dfÞ (9) numbered from the top and down, i.e. model level 1 is at
2
4 P. DAHLGREN AND N. GUSTAFSSON

Fig. 1. Geographical coverage of the HIRLAM reference, called RCR, domain which is used in these experiments.

the top and level 60 is closest to ground. The data- 4.2. Host model geometry
assimilation calculations are carried out in spectral space
The ECMWF data used in the forecast experiments in
and therefore an area extension, following Haugen and
this work originate from a spectral T799 model with
Machenhauer (1993), is used to ensure periodic variations
91 vertical levels and the model top at 0.01 hPa. T799
in both horizontal dimensions. The horizontal resolution
and number of gridpoints are presented in Fig. 2. corresponds to a gridpoint resolution of about 25 km in
On each vertical level, an arbitrary variable, a(x,y), is a reduced Gaussian grid.
represented by a bi-Fourier series: It should be noted that the ECMWF forecasts used for
the derivation of the host model error statistics in this study
1 XX were retrieved from the ECMWF archives in a horizontal
a^kl e2piðNXLþNYLÞ
kx ly
aðx; yÞ ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi (14) resolution of 0.5 latitudelongitude. These forecast fields
NXLNYL k l
were subject to a bi-linear horizontal interpolation followed
by a vertical interpolation to the HIRLAM geometry. Due
So the variable a is in spectral space defined by the complex
to the simple interpolation procedure, one should not trust
coefficients a^kl . An elliptical truncation
statistical information concerning horizontal waves shorter
!2 !2 than approximately 150 km.
k l
þ 1 (15)
Kmax Lmax
4.3. Statistics
is used to obtain an isotropic and homogeneous resolution
Short-range forecast errors were simulated by applying an
over the whole area. Each wave-number pair (k, l) is linked
ad hoc assumption that differences between forecasts of
to a total wave-number k* by
different length valid at the same time can be taken as a
s
ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
2  2ffi proxy for such forecast errors. Thus, we applied what is
k l referred to as the ‘NMC method’. Pairs of 36 h and
k ¼ Ns þ (16)
NXL NYL 12 h forecasts, valid at the same time, were obtained
from the ECMWF and HIRLAM RCR archives for 87
where Ns is a scaling factor that is equal to the length of situations, all at 00UTC, during the period September
the extended domain divided by a map factor. November 2008. Forecast differences were relaxed towards
ASSIMILATING HOST MODEL INFORMATION INTO A LIMITED AREA MODEL 5

The cross-correlation between ECMWF and HIRLAM


vorticity forecast differences is strongest for the longest
scales, right column in Fig. 3. Since longer scales are more
predictable, and since these larger scales are to a large
extent controlled by the host model through the lateral
boundary relaxation, it is likely that both models produce
somewhat similar large-scale structures which make the
shape of the statistics reasonable. It seems like it is safe to
ignore the cross-correlations for shorter scales while it may
be more questionable for longer scales.
The horizontally averaged standard deviations, Fig. 4,
also show that the HIRLAM values exceed the ECMWF
ones on all levels. They both have rather similar shapes
Fig. 2. Integration area of the spectral HIRLAM tangent linear from the top and down to level 30 with a maximum around
and adjoint model applied in 4D-Var, together with the horizontal model level 15 (:350 hPa) which corresponds to the jet-
resolution and the number of gridpoints for the setup used in the stream level. Below, HIRLAM has another maximum
experiments presented in this work.
around level 43 (:850 hPa) which is not present in the
ECMWF statistics. These horizontally averaged standard
deviations are applied in the assimilation weights L 1 b
zero in the lateral boundary relaxation zone of the fore- and Lls but are adjusted with an empirical scaling factor.
cast model (Fig. 2), in order to avoid aliasing effects The scaling factor for sls is used to adjust the weight of
through the extrapolations to the area extension zone that the host model term, see Section 5.2.
are applied in order to get bi-periodic variations over the Figure 5 shows the horizontal spectral densities, kz ðk Þ,
extended domain. All calculations of statistics were carried as calculated by the NMC statistics software. kz ðk Þ and
out in spectral space ignoring covariances between spectral Ps(k*) are related via a simple expression:
component coefficients of different horizontal wave-num-
bers, thus implicitly assuming horizontal homogeneity with !
NXLNYL
respect to the covariances in gridpoint space. Pz ðk Þ ¼ kz ðk Þ 2pk
Ns2
The horizontal variance power spectra for vorticity,
Pz(k*), is an output entity from the NMC statistics software
that shows the contribution to the total variance for kz ðk Þ is the input to the HIRLAM data-assimilation,
different wavelengths. Pz(k*) is plotted on three different but before applied in assimilation they are scaled such that
vertical levels in the left column of Fig. 3, while the cross their sum, divided with the number of gridpoints, is equal
correlation between the HIRLAM and ECMWF vorticity to 1. It should be noted that the result of that scaling is
fields is plotted in the right column of the same figure. resolution dependent. So what we see in Fig. 5 is not
1=2
We see that HIRLAM has larger values throughout the exactly what is used in the Lls and L 1
b operators, i.e. kls
1=2
whole spectrum. Since the NMC statistics are based on and kb . In Fig. 5 we see that the ECMWF spectrum is
forecast differences, one reason could be that HIRLAM is sharper than HIRLAM on all levels. At the same time,
the ECMWF values are larger for large scales (small
more active and has a larger spread due to its higher
wave-numbers) and vice versa for short scales. For
resolution. Another reason may be difference between
ECMWF in relation to HIRLAM, this means that large
the data-assimilation methods applied in the forecasting
scales contribute more than short scales to the horizontal
systems producing the forecast fields that enter into the
correlation which gives broader horizontal structure
statistical calculations. ECMWF applied 4D-Var, while the
functions for ECMWF (not shown). This is natural due
HIRLAM fields during autumn 2008 were produced with
to the difference in resolution.
3D-Var. It is known that introduction of 4D-Var reduces Next we take a look at the eigenmodes of the vertical
the ‘jumpiness’ in the time-series of forecast evolutions, and error covariance matrix, Figs. 6 and 7. Each eigenmode
the difference between 12 h and 36 h forecasts valid at is represented by an eigenvector and a corresponding
the same time is essentially a measure of jumpiness. Also eigenvalue.
note that the horizontal resolution used in the retrieval of The eigenvector and eigenvalue for mode n and
the ECMWF forecast fields from the archives makes any wave-number k* can be denoted V zn;k . z 1, Number of
comparison of covariance spectra meaningless for horizon- vertical LEVels (NLEV) and gn,k. The sum of eigenvalues
tal wave-numbers greater than 64 (wave length 160 km). over all modes, for a specific wave-number k*, is constant
6 P. DAHLGREN AND N. GUSTAFSSON

Vorticity variance power spectra at level 05 Cross correlation at level 05


10 0.45
LAM 0.4 corr
LS
0.35
spectrum * 1012

Correlation
1 0.3
0.25
0.2
0.1 0.15
0.1
0.05
0.01 0
20 40 60 80 100 120 140 20 40 60 80 100 120 140
wavenumber, k* wavenumber, k*

Vorticity variance power spectra at level 30 Cross correlation at level 30


100 0.5
LAM 0.45 corr
LS
0.4
spectrum * 1012

10

Correlation
0.35
0.3
1
0.25
0.2
0.1 0.15
0.1
0.01 0.05
20 40 60 80 100 120 140 20 40 60 80 100 120 140
wavenumber, k* wavenumber, k*

Vorticity variance power spectra at level 50 Cross correlation at level 50


100 0.5
LAM 0.45 corr
LS 0.4
spectrum * 1012

10
0.35
Correlation

0.3
1 0.25
0.2
0.15
0.1
0.1
0.05
0.01 0
20 40 60 80 100 120 140 20 40 60 80 100 120 140
wavenumber, k* wavenumber, k*

Fig. 3. Left column: Horizontal variance power spectra, Pz ðk Þ, for vorticity forecast differences at model levels 5 (:90 hPa), 30
(:630 hPa) and 50 (:920 hPa), full lineECMWF and dotted line HIRLAM. Right column: cross-correlation of vorticity forecast
differences between HIRLAM and ECMWF.

and equal to the number of modes, which is the same as the Figure 6 shows the eigenvalues for the first six leading
number of vertical levels, thus: modes. At the bottom, the combined effect of these modes
is shown. We see that these modes include 90% of the
X
Nmod
cn;k ¼ NLEV variance for the longest scales and then gradually decrease
n¼1 as the scales get shorter. This is important to remember
and the ratio for the ECMWF data because we choose to use only the
cn;k six first, or leading, modes of Vis and cls in the operator
P
Nmod Lls, eq. (10). Lls then acts as a vertical filter that removes
cn;k part of the small scale variance from the host model,
n¼1
including also vertically small scale noise introduced by
tells us how much of the total variance that is explained by the vertical interpolation procedure. In the same figure
mode n. How the variance is distributed vertically for we also see how the higher order modes contain less
z
eigenmode n is shown by the eigenvector Vn;k . large-scale information. Fig. 7 shows the structures of some
ASSIMILATING HOST MODEL INFORMATION INTO A LIMITED AREA MODEL 7

Vorticity standard deviation mesh of 48 km or 96 km. This means that the oversampled
LAM part of the ECMWF vorticity field is never used.
LS
Second, The Jk term itself is truncated to an even lower
10 resolution as described in Section 5.2.

20 5.2. Tuning and truncation of the Jk term


In Section 4.3, a scaling of the standard deviation sls in the
Model level

operator Lls [eq. (10)] was mentioned. The purpose of this


30
scaling is to adjust the weight for the Jk-constraint in
relation to the other terms in the cost-function. In the
40
HIRLAM variational analysis code, a similar scaling is
applied to the background error standard deviation to
adjust the weight of the Jb term in relation the observation
50 term Jo. The tuning factor, here called a, is applied as such:

pffiffiffi
rscaled
ls ¼ arls
60
1 2 3 4 5 6 7 8
std [s-1]*105 and the actual tuning procedure in this study is rather
ad hoc. A few different values of scaling-factors, a1, 5,
Fig. 4. Vertical profiles of horizontally averaged standard
deviation for vorticity forecast differences. These are both scaled in 10, 100, were applied on a 3D-Var test case with
the minimisation to adjust the weight in the data-assimilation. Full J JbJoJk and the behaviour of the cost-function,
lineECMWF and dotted lineHIRLAM. and the individual terms separately, during minimisation,
were studied. a100 gave Jk no weight at all, while a1
eigenvectors. For each mode, the wave-number that is made Jk too dominating. With a5 and a10 the Jk
associated with the largest eigenvalue is chosen. The shapes and Jo terms both minimised smoothly. a5 however
are quite similar with a slight shift of phase for modes 58. gave a more distinct minimisation of the Jk term without
It should however be mentioned that the shape of the losing the fit towards observations. Therefore a5 was
eigenvectors for a specific mode varies a little between the used throughout all experiments in this study.
wave-numbers. The Jk term is also truncated at k* 45, wave length
around 230 km, to make sure only large scales are
penalised. In terms of low-resolution increments, described
5. Implementation in Section 5.1, k* 45 corresponds to a resolution about
seven times lower than the original resolution.
5.1. Low resolution increments
A short ECMWF forecast, with the geometry described in
5.3. 3D-Var single case test
the first paragraph of Section 4.2, is interpolated to the
HIRLAM geometry described in Section 4.1. The inter- As a first step, we perform a 3D-Var analysis with only
polated ECMWF wind-field is thereafter transformed to the background and Jk as constraints, i.e. J JbJk.
spectral space. It is important to mention this because it The date of the single case test was 22 October 2008 at
means that a 25 km grid is interpolated to a 16 km grid, and 06UTC, and both xb and fls are 06 h forecasts from the
the ECMWF field is thus oversampled and contains false previous cycle.
high-resolution information. The analysis converges in four iterations, not shown,
In practice, however, this is never a problem for us due to and the resolution was 48 km, i.e. three times lower than
two reasons: the original resolution but the Jk term was truncated at
First, the actual minimisation is a carried out on a k* 45 as mentioned in the previous section. Both the
grid mesh with lower resolution than the original grid first guess xb and analysis xa are then compared with
(16 km), this incremental formulation is also described in the host model field xls. This is done by computing Root
Gustafsson et al. (2001). One of the reasons for this Mean Square (RMS) values in gridpoint space for (xb xls)
is to reduce the computational cost. In this work, all and (xa xls) for the upper air variables z, u, y, T, q. The
analyzes are carried out in a resolution either three or differences are computed for every vertical level and
six times lower than the original resolution, i.e. a grid are presented as profiles in Fig. 8.
8 P. DAHLGREN AND N. GUSTAFSSON

Spectral density at level 05 Spectral density at level 10


100 100
LAM LAM
LS LS
10
Spec dens * 105

Spec dens * 105


10

1
0.1

0.01 0.1
20 40 60 80 100 120 140 20 40 60 80 100 120 140
wavenumber wavenumber

Spectral density at level 20 Spectral density at level 30


100 100
LAM LAM
LS LS
10 10

Spec dens * 105


Spec dens * 105

1 1

0.1 0.1

0.01 0.01
20 40 60 80 100 120 140 20 40 60 80 100 120 140
wavenumber wavenumber

Spectral density at level 50 Spectral density at level 60


100 10
LAM LAM
LS LS
10
Spec dens * 105
Spec dens * 105

1 1

0.1

0.01 0.1
20 40 60 80 100 120 140 20 40 60 80 100 120 140
wavenumber wavenumber

Fig. 5. Horizontal spectral densities for vorticity forecast differences, g. Full lineECMWF, dotted line HIRLAM. Before these are
applied in the assimilation they undergo a scaling so that their sum, divided with the number of gridpoints, is equal to 1.

Because Jk contains vorticity only, with no cross- It is also interesting to investigate how the host model
correlations to any other variables, the analysed vorticity constraint acts on different scales. For simplicity, we only
field must be closer to ECMWF than the first guess, look at the vorticity field and compare
 the complex
 spectral

otherwise, there would be something wrong with the coefficients of the differences ^fb  ^fls and ^fb  ^fls .
implementation. As explained in Section 4.1, these differences are repre-
We now see in the upper left plot in Fig. 8 that the sented as bi-Fourier series as shown in eq. (14), and each
analysed vorticity field is clearly closer to ECMWF wave-number pair (k, l) is linked to the total wave-number
compared with the first guess. So the Jk implementation k* [eq. (16)].
seems to do what it is supposed to. Differences in the other To be able to visualise these difference fields they
variables are produced by HIRLAM’s internal balances via have been expressed as 1-D arrays over k*. Each element is
the B matrix and do therefore not have to be closer to the average of the absolute values of all spectral coefficients
ECMWF. with wave-number pairs (k, l) that are linked to the same k*.
ASSIMILATING HOST MODEL INFORMATION INTO A LIMITED AREA MODEL 9

Eigenvalues for mode 1 Eigenvalues for mode 2


0.6 0.2
0.55 LAM LAM
LS 0.18 LS

% of total variance

% of total variance
0.5
0.45 0.16
0.4
0.35 0.14
0.3
0.25 0.12
0.2 0.1
0.15
0.1 0.08
20 40 60 80 100 120 140 20 40 60 80 100 120 140
wavenumber wavenumber

Eigenvalues for mode 3 Eigenvalues for mode 4


0.12 0.09
LAM LAM
0.11 LS 0.08 LS
% of total variance

% of total variance
0.1 0.07
0.09 0.06
0.08 0.05
0.07 0.04
0.06 0.03
0.05 0.02
20 40 60 80 100 120 140 20 40 60 80 100 120 140
wavenumber wavenumber

Eigenvalues for mode 5 Eigenvalues for mode 6


0.07 0.055
0.065 LAM 0.05 LAM
LS LS
% of total variance

% of total variance

0.06
0.045
0.055
0.05 0.04
0.045 0.035
0.04 0.03
0.035
0.025
0.03
0.025 0.02
0.02 0.015
20 40 60 80 100 120 140 20 40 60 80 100 120 140
wavenumber wavenumber

Sum over modes 1-6


0.95
0.9 LAM
LS
% of total variance

0.85
0.8
0.75
0.7
0.65
0.6
0.55
0.5
0.45
20 40 60 80 100 120 140
wavenumber

Fig. 6. Eigenvalues for the vorticity forecast difference vertical covariance matrix, the first six leading modes. Mode number 1 is the
leading mode and so on. The sum over the six modes are presented in the bottom figure.

In Fig. 9, the top


 figureshows the averaged1-D spectrum
 is drawn towards ECMWF on all scales where Jk is active.
of the difference ^fb  ^fls as a full line and ^fb  ^fls as a The size of the increments is also shown in the middle of
dotted line. It can now be seen that the analysed vorticity Fig. 9. This plot shows the average absolute value of the
10 P. DAHLGREN AND N. GUSTAFSSON

mode 1 mode 2 mode 3 mode 4


k*=2 k*=10 k*=25 k*=40

LS LS LS LS
10 LAM 10 LAM 10 LAM 10 LAM

20 20 20 20
Model level

Model level

Model level

Model level
30 30 30 30

40 40 40 40

50 50 50 50

60 60 60 60
−0.06 0 0.06 0.12 0.18 −0.2 0 0.2 0.4 −0.4 −0.2 0 0.2 −0.4−0.2 0 0.2 0.4

mode 5 mode 6 mode 7 mode 8


k*=40 k*=40 k*=40 k*=40

LS LS LS LS
10 LAM 10 LAM 10 LAM 10 LAM

20 20 20 20
Model level

Model level

Model level

Model level
30 30 30 30

40 40 40 40

50 50 50 50

60 60 60 60
−0.4−0.2 0 0.2 0.4 −0.4−0.2 0 0.2 0.4 −0.4−0.2 0 0.2 0.4 −0.4−0.2 0 0.2 0.4

Fig. 7. Eigenvectors for the first eight leading modes for the vorticity forecast difference vertical covariance matrix. The wave-number
k for each mode is the one that has the largest eigen-value, see Fig. 6.

vorticity increments d^f at the last iteration which corre- Two outer loops in HIRLAM means that the inner
sponds to the difference between the full and dotted line in loop begins in a coarse resolution and performs a number
the top plot in Fig. 9. of iterations. Then, as an intermediate step, the control
The largest increments are introduced at k* 28, vector is saved and a temporary analysis file written from
peaking at 5 and thereafter slowly decreases until k* 45 which a new, non-linear trajectory is calculated with the
where it becomes zero. forecast model to make a re-linearization of the observa-
However, the relative size of the increments is rather tion operators and for the tangent-linear model. The
uniformly distributed over the spectrum as shown in the second inner loop then continues in a higher resolution to
bottom plot of Fig. 9. finish the analysis.
Now we have verified that the implementation of the In HIRLAM the 4D-Var time-window is 93 h around
Jk term draws the analysis closer to the host model the nominal analysis time and the first outer loop uses
(ECMWF) vorticity field, and that the largest increments
the 03 h forecast from the previous cycle as first
are introduced on the longest scales.
guess. Since the control vector is updated and the cost-
function gradient is calculated at the beginning of the
6. Forecast impact time-window it is appropriate to also use a 03 h fore-
cast for the host model constraint.
6.1. Analysis setup The first loop is carried out with 96 km resolution
A 4D-Var setup with two outer loops was chosen for (six times lower than original) for a maximum of 60
the forecast impact experiment in order to be close to iterations, and the second loop is carried out in 48 km
what is used operationally at SMHI. resolution for a maximum of 30 iterations.
ASSIMILATING HOST MODEL INFORMATION INTO A LIMITED AREA MODEL 11

Vorticity Wind

RMS(xb-xls)
RMS(xa-xls)
10 10

Model level 20 20

Model level
u RMS(xb-xls)
30 30 u RMS(xa-xls)
v RMS(xb-xls)
v RMS(xa-xls)
40 40

50 50

60 60
0 0.5 1 1.5 2 2.5 3 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6
[s-1]*109 [m/s]

Temperature Specific humidity

RMS(xb-xls) RMS(xb-xls)
RMS(xa-xls) RMS(xa-xls)
10 10

20 20

Model level
Model level

30 30

40 40

50 50

60 60
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 0 1 2 3 4 5 6 7 8
[K] [kg/kg]*107

Fig. 8. Analysis impact on upper air fields for a test case using 3D-Var and only the background and host model constraints,
i.e. J ¼ Jb þ Jk and thus no observations. Full line: RMS difference between xb and xls. Dotted line: RMS difference between xa and xls. xb
is the background and first guess HIRLAM field and xls is the ECMWF 06 h forecast used as constraint in Jk. Notice that the inclusion of
Jk draws the analysed vorticity field closer to ECMWF.

6.2. Experiment period and observation usage arbitrarily, 24 January 2009 at 12UTC, and studied. The
cost-function contains three terms Jb, Jo and Jk and
The model was run up to 48 h every 6th hour from
each part is plotted separately, together with the total
24 January 00UTC to 25 February 2009 at 18UTC,
value Jtot in Fig. 10.
i.e. 33 d. For simplicity, only conventional observations
It is important to note that the cost-function values
were used in the data-assimilation. Conventional observa-
of the Jb and Jk terms are resolution dependent. Jb is
tion types include SYNOP, SHIP, BOUY, TEMP, PILOT
the scalar product of the control vector, Jb ¼ v ^T v
^, and
and AIRCRAFT.
similarly the Jk value is the scalar product described in
Two sets of assimilation experiments were carried out
eq. (12). These vectors are thus longer in the 2nd outer
throughout the 33-d period:
loop. At the same time, the horizontal spectral density,
“ ref  reference run. Only observations used in the lb and lis, undergoes a resolution-dependent scaling
analysis. described in Section 4.3, as well as the coefficients of the
“ jk_active  same as reference but with the host spectral fields. These three factors combined, cause the
model constraint activated in data-assimilation. cost-function values of Jb and Jk to make a discontinuous
jump between the 1st and 2nd outer loops. Because of
this discontinuity, the decrease of the cost-function
value should be compared with the initial value of each
6.3. 4D-Var minimisation case
loop.
As in the 3D-Var singe case test, it is interesting to see the In Fig. 10 we see that the total cost-function value, as
cost-function behaviour for a 4D-Var case with observa- well as the observation term Jo, decreases and converges in
tions. One date in the impact experiment was chosen both loops, while Jk decreases and converges, already in the
12 P. DAHLGREN AND N. GUSTAFSSON

First guess (xb) and analysis (xa) departure from ECMWF forecast (xls)
x=vorticity
3.5

Spectral coeff. * 104


(xb-xls)
3 (xa-xls)
2.5
2
1.5
1
0.5
0
0 20 40 60 80 100 120
k*

Absolute values of analysis increments, dx, for vorticity


0.7
abs(dx)
Spectral coeff. * 104

0.6
0.5
0.4
0.3
0.2
0.1
0
0 20 40 60 80 100 120
k*

Ratio between dx and (xb-xls)


0.3
dx/(xb-xls)
0.25
Ratio dx/(xb-xls)

0.2
0.15
0.1
0.05
0
0 20 40 60 80 100 120
k*

Fig. 9. Spectrum of analysis impact on vorticity for a test case using 3D-Var and only the background and host model constraints, i.e.
J ¼ Jb þ Jk and thus no observations. Top figure, full line: absolute value of spectral coefficients for the difference ð^fb  ^fls Þ. Top figure,
dotted line: absolute value of spectral coefficients for the difference . Middle figure: absolute value of the analysis increments for vorticity,
d^f which corresponds to the difference between the full and dotted line in the top plot. Bottom figure: size of analysis increments relative to
the first guess and host model vorticity fields, i.e. d^f=ð^fb  ^fls Þ.

1st loop, and is then almost constant in the 2nd. It is For surface parameters like 2 m temperature, humidity
possible that the reason for this is that Jk only acts on and 10 m wind the impact was neutral. Mean sea level
large scales which converges faster. The main conclusion pressure showed a clear positive impact, Fig. 11. Upper air
from Fig. 10 is that the analysis draws closer to both parameters, computed against radiosondes, also had a
observations and Jk simultaneously. clearly reduced RMS error for the jk_active experiment
Root Mean Square differences for this particular case, for temperature, wind, geopotential and specific humidity.
as described in Section 5.3 and Fig. 8, also show that Fig. 12 shows vertical profiles of mean and RMS errors of
the analysed vorticity is clearly closer to ECMWF (not temperature calculated at midnight, 00UTC and noon,
shown). 12UTC.

6.4. Observation verification results 6.5. One case for illustration


Forecasts were verified by computing mean and RMS Before making the concluding remarks, we also compare
errors against all available SYNOP and radiosonde sta- one case in the forecast experiment period and see what the
tions in the area. In this case it adds up to around 2000 differences looks like. The date was chosen from a time-
SYNOP and 175 radiosonde stations. series of RMS errors of 24 h forecasts of surface
ASSIMILATING HOST MODEL INFORMATION INTO A LIMITED AREA MODEL 13

Total cost-function Jo term


75000 60000
1:st loop 1:st loop
2:nd loop 55000 2:nd loop
70000
50000
65000 45000
J

J
60000 40000

35000
55000
30000

50000 25000
0 10 20 30 40 50 60 70 80 90 100 0 10 20 30 40 50 60 70 80 90 100
iter iter

Jb term Jk term
12000 16000
1:st loop 1:st loop
2:nd loop 15000 2:nd loop
10000
14000
13000
8000
12000
6000 11000
J

J 10000
4000
9000
8000
2000
7000
0 6000
0 10 20 30 40 50 60 70 80 90 100 0 10 20 30 40 50 60 70 80 90 100
iter iter

Fig. 10. Cost-function behaviour for one case in the impact experiment. Here, a 4D-Var setting with two outer loops is used.
As mentioned in Section 6.3, the resolution dependency of Jb and Jk is clearly shown here.

Selection: ALL using 2068 stations


Period: 24 Jan 2009 - 25 Feb 2009
Surface pressure Hours: {00,06,12,18}
3 300000

280000
2.5

260000
2
240000

1.5
Pressure [hPa]

RMSE ref 220000


No cases

RMSE jk_active
1 200000
BIAS ref
BIAS jk_active 180000
0.5
CASES 160000
0
140000

-0.5
120000

-1 100000
0 5 10 15 20 25 30 35 40 45 50
Forecast length [h]

Fig. 11. Verification of forecasts compared with observations of mean sea level pressure (MSLP). Upper lines are the Root Mean Square
(RMS) error and the lower lines show the mean error, or bias. Full line: reference experiment (i.e. the host model constraint is not
activated). Dotted line: same as reference except that Jk is now actively used. The amount of observations used in the statistics is also shown
using the right hand y-axis.
14 P. DAHLGREN AND N. GUSTAFSSON

183 stations Selection: ALL


Temperature Period: 24 Jan 2009 - 25 Feb 2009
Statistics at 00 UTC At {00,06,12,18} + 06 12 18 24 36 48

No cases
0 5000 10000 15000 20000 25000 30000

100

200

300
RMSE ref
RMSE jk_active
Pressure [hPa]

400
BIAS ref
BIAS jk_active
500
CASES
600

700

800

900

1000
-0.5 0 0.5 1 1.5 2 2.5
Temperature [deg C]

174 stations Selection: ALL


Temperature Period: 24 Jan 2009 - 25 Feb 2009
Statistics at 12 UTC At {00,06,12,18} + 06 12 18 24 36 48

No cases
0 5000 10000 15000 20000 25000 30000

100

200

300
RMSE ref
RMSE jk_active
Pressure [hPa]

400
BIAS ref
BIAS jk_active
500
CASES
600

700

800

900

1000
-1 -0.5 0 0.5 1 1.5 2 2.5
Temperature [deg C]

Fig. 12. Verification of temperature forecasts compared with observations from radiosondes. To the right we have the Root Mean
Square (RMS) error and to the left the mean error, or bias, averaged of all forecast lengths, valid at 00UTC (upper figure) and 12UTC
(lower figure) are shown. Full line: reference experiment (i.e. the host model constraint is not activated.) Dotted line: same as reference
except that Jk is now actively used. The amount of observations used in the statistics is also shown using the upper x-axis.

pressure, Fig. 13. Around 30 January 2009, the RMS errors experiment. Therefore the 12UTC run from 29 January
of 24 h forecasts differed with approximately 1.0 hPa was chosen for the study since the 24 h fore-
between the experiments, with advantage to the jk_active cast from that time is valid on 30 January. We are thus
ASSIMILATING HOST MODEL INFORMATION INTO A LIMITED AREA MODEL 15

Selection: ALL 2068 stations


Surface pressure
At {00,06,12,18} + 24 Window: 6h
3 3000
RMSE ref
RMSE jk_active
2.5 BIAS ref 2800
BIAS jk_active
CASES
2 2600

1.5 2400
Pressure [hPa]

1 2200

No cases
0.5 2000

0 1800

-0.5 1600

-1 1400

-1.5 1200

-2 1000
24 Jan 31 Jan 07 Feb 14 Feb 21 Feb 28 Feb
Date

Fig. 13. Time-series of surface pressure 24 h forecast errors compared with observations. The two lines at the top show the RMS
errors, and the ones at the bottom the mean error. Full line: reference experiment (i.e. the host model constraint is not activated. Dotted
line: same as reference except that Jk is now actively used.

5 d into the experiment period and the two experi- while the low outside of Newfoundland moves to the East.
ments have evolved separately through a number of data- After 24 h, Fig. 15, the low originally to the West of Ireland
assimilation cycles and forecast integrations. One cannot is located near Iceland. The differences have grown in both
expect large-scale differences only just because Jk acts spatial scale and magnitude and are still associated with the
on large scales. The non-linear forecast integration can low-pressure system near Iceland. If we compare the MSLP
generate differences on all scales. fields we see that it is the shape and the depth of the low
Figure 14 shows the analyzes of the mean sea level that have changed. The low in the reference run is now
pressure (MSLP) fields, from the experiment (jk_active) deeper and more intense than the low in the (jk_active)
and reference (ref) runs, and the difference between them. experiment. This low has thus developed differently in
The overall large-scale synoptic features are quite similar the experiments. In the reference run, the low moved
with an intense low at the coast of Newfoundland, and a northwards and intensified while the low in (jk_active)
high pressure system to the south-east, over the Atlantic. A experiment moved to the North and weakened a bit.
large trough stretches from Greenland to the south-east
with a low-pressure system just to the West of Ireland.
There is also another low-pressure minima near Iceland. 7. Summary and concluding remarks
Then there is a massive high pressure over northern Russia In this paper, we have described and tested a method to
with a ridge that stretches to the south-west into Scandi- assimilate the host model information (ECMWF) into a
navia. The difference map reveals that the most noticeable regional NWP model (HIRLAM). The vorticity field from
differences are associated with the low-pressure system to the host model was chosen to act as a constraint in the
the West of Ireland. A closer look at the MSLP fields regional model variational analysis. For simplicity, the host
shows that it is the shape of the low-pressure system that is model vorticity constraint was assumed to be univariate,
different, and that the low is bit deeper in the experiment i.e. no cross-correlations to any regional model variables or
(jk_active). observations. This, along with the characteristics of vorti-
When this is integrated forward in time we see that it is a city in the statistical balance, led to an error covariance
blocking high pressure over Russia/Scandinavia, and the matrix that could be made diagonal by using the horizontal
low to the West of Ireland moves straight to the North spectral densities and vertical eigenmodes.
16 P. DAHLGREN AND N. GUSTAFSSON

ref ref
29 Jan 2009 12z + 00h 29 Jan 2009 12z + 24h
1005
1020 10 1035 1025
10 30
15 10 10
10 20
1010 1010
10

5
101
10 10

10
05

15
10
35
103

05
1010 5 5
10 0
99 10
0

30
1010

10
100 995 1000
5 995
10 990
1025
9999050 985
0 980
98

99 97
5
5

1015 5

99 995
5 990
0 985
99

103
0
98

1000
5

1030

10
102
1020

05
5 1020
10
20
10 1005
10
10
05
5

0
10
102

101
10

10
10
5
10 101
20 10
20
10
20

MSLP MSLP
jk_active jk_active
29 Jan 2009 12z + 00h 29 Jan 2009 12z + 24h
1005 1010
1030 1010 1025
1020 1035
5 30
101 10
1010
99

10

1020

5
5

101
30
10

1005
10

10
05

35

10
10

10
35
00

10
0
10

1010 104
100 995
5
10 1025 1010 995
9999050 1000 995
99

0
98

102
5

101
5

5
990
995 995
990
0 985
99
990
1010

98
1030 5 10
05
1020 102
5
102
0
102 5
0 0 100
1 15
1025

1015 10 10
1005 10
10
5
101
102 1020 10
0 20

MSLP MSLP
MSLP Difference (jk_active-ref) MSLP Difference (jk_active-ref)
29 Jan 2009 12z + 00h 29 Jan 2009 12z + 24h
4 2
4

2
-2
-2

-4
-6

-6
2 -2
46
18111204
2
-2

18 6

2 -6-4-2
-8

-10
-12
-2

2
4 42
-2
-4
-6 42
2

MSLP difference hPa MSLP difference hPa

Fig. 14. Analyzes of mean sea level pressure (MSLP) on 29 Fig. 15. Mean sea level pressure fields for 24 h forecasts from
January 2009 12UTC. ref (top) refers to the run in which Jk is not 29 January 2009 12UTC, thus valid at 30 January 2009 12UTC.
used, jk_active (middle) is the run that uses Jk. The difference map
(bottom) is the difference between jk_active and ref. sea level pressure forecast fields compared with synop
observations.
A forecast experiment of 33 d has given very promising The study clearly indicates that the host model informa-
results with reduced RMS errors of upper air forecast fields tion is important to include when determining the initial
compared with radiosonde observations as well as mean condition.
ASSIMILATING HOST MODEL INFORMATION INTO A LIMITED AREA MODEL 17

8. Acknowledgements Gustafsson, N., Berre, L., Hörnquist, S., Huang, X.-Y., Lindskog,
M. and co-authors. 2001. Three-dimensional variational assim-
Constructive comments from Xiang-Yu (Hans) Huang, ilation for a limited area model. Part I: general formulation and
NCAR and one anonymous reviewer helped us to improve the background error constraint. Tellus 53A, 425446.
this manuscript and these contributions are acknowledged. Haugen, J.-E. and Machenhauer, B. 1993. A spectral limited-area
model formulation with time-dependent boundary conditions
applied to the shallow water equations. Mon. Weather Rev. 121,
References
26182630.
Berre, L. 2000. Estimation of synoptic and mesoscale forecast Huang, X.-Y., Yang, X., Gustafsson, N., Mogensen, K.-S. and
error covariances in a limited-area model. Mon. Weather Rev. Lindskog, M. 2002. Four-dimensional variational data assimila-
128, 644667. tion for a limited area model. HIRLAM Tech. Rep. 57, 10pp.
Guidard, V. and Fischer, C. 2008. Introducing the coupling Lindskog, M., Gustafsson, N., Navascués, B., Mogensen, K. S.,
information in a limited-area variational assimilation. Q. J. R. Huang, X.-Y. and co-authors. 2001. Three-dimensional varia-
Meteorol. Soc. 134, 723736. tional assimilation for a limited area model. Part II: observation
Gustafsson, N. 2006. Status and performance of HIRLAM handling and assimilation experiments. Tellus 53A, 447468.
4D-VAR. HIRLAM Newslett. 51, 10pp.

You might also like