Professional Documents
Culture Documents
Mapping Urban Air Pollution Using GIS - A Regression Based Approach PDF
Mapping Urban Air Pollution Using GIS - A Regression Based Approach PDF
26]
On: 06 July 2014, At: 12:49
Publisher: Taylor & Francis
Informa Ltd Registered in England and Wales Registered Number:
1072954 Registered office: Mortimer House, 37-41 Mortimer Street,
London W1T 3JH, UK
International Journal of
Geographical Information
Science
Publication details, including instructions for
authors and subscription information:
http://www.tandfonline.com/loi/tgis20
To cite this article: DAVID J. BRIGGS , SUSAN COLLINS , PAUL ELLIOTT , PAUL
FISCHER , SIMON KINGHAM , ERIK LEBRET , KAREL PRYL , HANS VAN REEUWIJK ,
KIRSTY SMALLBONE & ANDRE VAN DER VEEN (1997) Mapping urban air pollution
using GIS: a regression-based approach, International Journal of Geographical
Information Science, 11:7, 699-718, DOI: 10.1080/136588197242158
To link to this article: http://dx.doi.org/10.1080/136588197242158
Francis shall not be liable for any losses, actions, claims, proceedings,
demands, costs, expenses, damages, and other liabilities whatsoever
or howsoever caused arising directly or indirectly in connection with, in
relation to or arising out of the use of the Content.
This article may be used for research, teaching, and private study
purposes. Any substantial or systematic reproduction, redistribution,
reselling, loan, sub-licensing, systematic supply, or distribution in any
form to anyone is expressly forbidden. Terms & Conditions of access
and use can be found at http://www.tandfonline.com/page/terms-andconditions
Research Article
M apping urban air pollution using GIS: a regression-based approach
1
1. Introduction
Despite the major improvements in air quality seen in many European cities
over the last 30 40 years, the problem of urban air pollution remains. Levels of
traditional pollutants, such as smoke and sulphur dioxide (SO2 ) have declined, as a
result of industrial restructuring, technological changes and pollution control, but
the rapid growth in road tra c has given rise to new pollutants and new concerns.
Between 1970 and 1990, for example, passenger car transport in Europe increased
by ca. 34 per cent per annum, and car ownership in East European countries is
1365 8816/ 97 $1200
700
D. Briggs et al.
currently rising by 7 12 per cent per year (Stanners and Bordeau 1995 ). As a result
of these increases, emissions of many pollutants are growing: in the U.K., emissions
of nitrogen dioxide ( NO2 ) of which 45 50 per cent is derived from the transport
sector rose by 120 per cent between 1970 and 1990, before declining slightly; emissions
of volatile organic compounds ( VOCs), of which ca. 40 per cent derives from
transport, rose by 73 per cent ( Department of the Environment 1995). For the future,
these trends seem set to continue. In the absence of major shifts in policy, a further
doubling of both passenger and freight transport in Europe is anticipated by the
year 2010 (Stanners and Bordeau 1995 ). Notwithstanding the e ects of improvements
in engine design and fuel technology, this is likely to lead to at least the maintenance
of current emission levels in most West European countries. In East Europe, where
the rates of increase are higher, emissions may be expected to grow.
Against this background, there has been heightening concern about the health
e ects of tra c-related pollution. Several factors contribute to this concern. One is
the simple arithmetic of exposure. In Europe, for example, about 70 per cent of the
population is classi ed as urbanized ( UNEP 1993 ), while Flachsbart ( 1992) estimates
that between 95 and 18 million people spend a considerable part of their working
day at or near roadsides. Equally, urban areas are estimated to account for the
major proportion of emissions. The potential for human exposure to tra c-related
pollution is therefore large. Secondly, there is growing evidence from epidemiological
studies of a relationship between air pollution and respiratory illness and mortality
(e.g., Schwartz 1993, 1994) and of increased levels of respiratory symptoms in people
living close to major roads, or in areas of high tra c density (e.g., Edwards et al .
1994, Ishizaki et al . 1987, Weiland et al . 1994, Wjst et al . 1993). In a study of 1000
adults in Oslo, NILU ( 1991 ) also found a positive association between self-reported
symptoms of cough and chronic bronchitis and modelled levels of air pollution at
the place of residence. At the same time, there has been an apparent increase in
levels of respiratory illness, particularly asthma, in vulnerable groups such as children
and the old (Anderson et al . 1994, Burney 1988, Burney et al . 1990, Haahtela
et al . 1990).
In the light of these concerns, there is clearly a need for improved information
on levels of tra c-related air pollution and their potential links to human health.
This information is required for a wide range of purposes: to help investigate the
relationships involved, as inputs to health risk assessment, to assist in establishing
and monitoring air quality standards, and to help evaluate and compare transport
policies and plans. For all these purposes, information is needed not only on the
temporal trends in air pollution (as, for example, provided by data from xed-site
monitoring stations), but also on geographical variations. Maps are needed, for
example, to identify pollution `hot-spots, to de ne at-risk groups, to show changes
in spatial patterns of pollution resulting from policy or other interventions, and to
provide improved estimates of exposure for epidemiological studies.
Mapping urban air pollution nevertheless faces many problems. The complex
geography of emission sources and the equal complexity of dispersion processes in
an urban environment mean that levels of air pollution typically vary over extremely
short distances, often no more than a few tens of metres (e.g., Hewitt 1991). On the
other hand, data on both emission sources and pollution levels are often sparse. As
a result, maps of urban air pollution tend to be highly generalized, and estimates of
exposure to air pollutants subject to serious misclassi cation.
The development of GIS techniques, however, o ers considerable potential to
701
improve upon this situation. Digital data on urban road networks, for example, are
now becoming increasingly available, providing a valuable data source for pollution
modelling. The spatial analysis and overlay techniques available in GIS also provide powerful tools for pollution mapping. This paper describes and evaluates a
regression-based approach to air pollution mapping, developed as part of the
EU-funded SAVIAH ( Small Area Variations in Air quality and Health) project. This
study was a multi-centre project, involving collaborators in London and Hudders eld
( U.K.), Bilthoven ( Netherlands), Prague (Czech Republic) and Warsaw ( Poland ).
The aim of the study was to develop and validate methods for analysing relationships
between air pollution and health at the small area scale. A description of the overall
SAVIAH study is given by Elliott et al . ( 1995 ).
Traditionally, two general approaches to air pollution mapping can be identi ed:
spatial interpolation and dispersion modelling ( Briggs 1992). The former uses statistical or other methods to model the pollution surface, based upon measurements at
monitoring sites. With the development of GIS and geostatistical techniques in recent
years, a wide range of spatial interpolation methods have now become available.
Burrough ( 1986 ) divides these into global methods (e.g., trend surface analysis),
which t a single surface on the basis of the entire data set, and local methods (e.g.,
moving window methods, kriging, spline interpolation) in which a series of local
estimates are made, based on the nearest data points. Recently, particular attention
has tended to focus on kriging in its various forms (e.g., Oliver and Webster 1990,
Myers 1994 ). Nevertheless, despite a number of studies comparing this with other
techniques (e.g., Abbass et al . 1990, Dubrule 1984, Laslett et al . 1987, von Kuilenburg
et al . 1982, Weber and Englund 1992, Knotters et al . 1995 ), there is no clear consensus
to suggest that any one approach is universally optimal. Instead, performance of the
various methods tends to vary depending upon the character of the underlying
spatial variation being modelled, and the speci c characteristics of the data concerned
(e.g., sampling density, sampling distribution).
A number of these interpolation methods have found applications in pollution
mapping, albeit mainly at a relatively broad, regional scale. Linear interpolation, for
example, has been widely used to derive contour maps of pollution surfaces on the
basis of point measurements (e.g., Archibold and Crisp 1983, Muschett 1981). Kriging
in its various forms has been used to map national patterns of NO2 concentrations
(Campbell et al . 1994 ), acid precipitation ( Venkatram 1988, Schaug et al . 1993) and
ozone concentrations ( Lefohn et al . 1988, Liu et al . 1995), and to help design
continental-scale monitoring networks ( Haas 1992). Wartenberg ( 1993) also reports
the use of kriging to estimate and map exposures to groundwater pollution and
microwave radiation. Mapping of tra c-related pollution in urban areas, however,
potentially faces far more severe di culties. First and foremost is the inherent
complexity of the pollution surfaces involved. Within an urban area, emissions may
derive from a large number of intersecting line sources. The distance decay of
pollution levels away from these sources is also rapid, and greatly a ected by local
meteorological and topographical conditions. Marked variations in pollutant levels
can thus occur over distances of less than 100 metres in urban areas (e.g., Hewitt
1991 ). In contrast, the density and distribution of most monitoring networks is
generally poor. The number of stations monitoring pollutants such as NO2 , VOCs
702
D. Briggs et al.
and ne particulates on a routine basis is generally small, and the location of these
stations is often biased towards speci c pollution environments. As a consequence,
the existing monitoring networks provide only a limited picture of spatial patterns
of urban air pollution, potentially biased estimates of trends, and poor indications
of human exposure. Even where purpose-designed surveys can be conducted (e.g.,
using passive samplers), constraints of time and cost severely limit the sampling
density which is possible.
The main alternative to interpolation is the use of dispersion modelling techniques. This involves constructing a dynamic model of the dispersion processes,
taking account of all the main factors which in uence the ultimate pollution concentration eld. It is an approach which has been most widely used in relation to point
sources, but several models have also been built for road tra c pollution: examples
include the CALINE models, developed on behalf of the US EPA ( Benson 1992 )
the CAR model, developed in the Netherlands ( Eerens et al . 1992), the Highways
Agency Design Manual for Roads and Bridges ( DMRB) model, and the ADMS
model (which has been developed from DMRB by Cambridge Environmental
Research Consultants Ltd).
Dispersion modelling has much to commend it as a basis for air pollution
mapping in that it attempts to re ect the processes of dispersion and can relatively
easily be adapted to new pollutants or areas, without the need for additional monitoring. On the other hand, it also has a number of important constraints. Amongst
the most serious are the relatively severe data demands of most dispersion models:
typically, data are required not only on the distribution of the road network, but
also tra c volumes and composition, tra c speed, emission factors for all main
classes of vehicle, street characteristics (e.g., road width, building height or type), and
meteorological conditions (e.g., wind speed, wind direction, atmospheric stability,
mixing height). Rarely are these data available for a su ciently dense network of
locations in an urban area, with the result that considerable data extrapolation often
has to occur. Line dispersion models also provide estimates of pollution concentrations only within the immediate vicinity of the roadway up to a distance of only
35 metres in the CAR model, for example, and ca. 200 metres for the CALINE
models. This means that they do not easily provide information on variations in
background concentrations, at greater distances from major roads. In addition,
unlike spatial interpolation techniques, there is as yet little progress either in integrating dispersion models into GIS, or coupling the two technologies ( though the ADMS
model does have a limited interface with GIS).
Against this background, there is clearly a need to develop more practicable
techniques of air pollution mapping, which can make use of the capability o ered
by GIS, and extract the maximum amount of information from the di erent data
sets which are available within urban areas. Regression-mapping o ers particular
potential in this respect. This involves using least squares regression techniques to
generate predictive models of the pollution surface, based on a combination of
monitored pollution data and exogenous information. The technique is widely used
for exploratory and explanatory investigations, and is also used to help classify
remote sensing imagery. Regression methods have only rarely been used, however,
for mapping purposes. Examples include the development of a regression-based
model of road salt contamination (Mattson and Godfrey 1994), air pollution ( Wagner
1995 ) and soil depth ( Knotters et al . 1995).
703
Additional variables were also added locally, as required. The variables and data
sources used in each centre are summarized in table 2.
Due to variations in data availability, classi cation systems (e.g., of tra c ow
and land use) and local topography, it was not felt appropriate to use exactly the
Mixture of residential,
industrial and open space;
2
area ca. 48 km ; altitude
range 172 355 m OD;
population 163 700 ( 1991 )
Hudders eld
Prague
Description
Amsterdam
Study area
Table 1.
2. Oct 1993
3. Feb 1994
4. May 1994
1. June/July 1993
2 Oct. 1993
3. Feb. 1994
4. May 1994
1. June 1993
2. Nov 1993
3. Feb/March 1994
4. May/June 1994
1. June/July 1993
Survey
No./Date
Badges ( 80 )
Tubes ( 20 )
Tubes ( 80 )
Tubes ( 80 )
Tubes ( 80 )
Badges ( 80 )
Tubes ( 80 )
Tubes ( 20 )
Tubes ( 80 )
Tubes ( 80 )
Tubes ( 80 )
Badges ( 80 )
Tubes ( 20 )
Tubes ( 80 )
Tubes ( 80 )
Tubes ( 80 )
Device (no.)
219
200
148
56
277
95
156
105
396
282
275
124
Min
654
577
829
658
785
515
694
884
721
682
708
735
Max
396
334
367
222
476
256
331
282
518
500
413
364
Mean
Concentration (m g m
102
96
180
139
102
95
127
141
64
72
108
122
SD
704
D. Briggs et al.
705
Figure 1.
Figure 2.
Monitoring
results:
mean
annual NO2
Netherlands.
concentration,
Amsterdam,
The
706
D. Briggs et al.
Table 2.
Centre
Hudders eld
Variable
Road network
Tra c volume
Land cover
Altitude
NO2 concentration
Sample height
Site exposure
Topographical
exposure
Amsterdam
Road network
De nition
Data source
Road network
Tra c volume
Land cover
Department of
Development, Prague
Municipal Authority
Department of
Development, Prague
Municipal Authority
City planning maps
Road type
Distance to road
Land cover
NO2 concentration
Prague
Altitude
NO2 concentration
707
same regression procedure in all study areas. Instead, each centre developed its own
equation, subject only to the constraint that: (a ) it included terms for tra c volume,
land cover and topography; (b ) a similar bu ering approach was used. In each centre,
therefore, data on the relevant input variables were rst computed for a series of
bands around each monitoring site (to a distance of 300 m) using the GRID routines
in ARC/INFO. These were then entered into a multiple regression analysis using as
the dependent variable either the monitored NO2 values for a speci c survey period
(to provide a pollution map for that period alone) or the modelled annual mean
concentration (to provide an annual average pollution map). The equation thus
generated was nally used to compute the predicted pollution level at all unmeasured
sites for a ne grid of points across the study area and the results mapped.
As an example, details of the approach used in Hudders eld are presented in
gure 3. In brief, the method was as follows:
1. GIS development . Coverages were compiled in ARC/INFO as outlined in
table 2.
2. Computatio n of a weighted tra c volume factor ( Tvol 300) for the 300 metre
bu er around each monitoring site . Daytime tra c volumes (vehicle km/hour) were
estimated for each 20 m zone around each sample point (to a maximum distance of
300 metres) using the FOCALSUM command in ARC/INFO. Results were then
entered into a multiple regression analysis (in SPSS) against the modelled mean
annual NO2 concentrations and di erent combinations of band width compared.
2
The best- t combination (as de ned by the r value) was selected, and weights for
each band determined by examination of the slope coe cients. This gave two bands,
weighted as follows: 0 40 m (weight =15) and 40 300 m (weight =1 ). These were
thus combined into a compound tra c volume factor (equation ( 1).
T vol300 = 15 T vol 0
40 +
T vol 40
300
( 1)
3. Computation of a compound land cover factor ( Land 300 ) for the 300 m bu er
around each monitoring site. The area of each land use type ( hectares) within each
20 m band around each sample point (to a maximum distance of 300 m) was calculated using the FOCALSUM command in ARC/INFO. Results were then entered
into a multiple regression analysis (in SPSS) against the residuals from the previous
analysis (step 2). Di erent combinations of land cover and distance were compared
2
in terms of the r value, and the best- t non-negative combination selected. This
gave a single band ( 0 300 m) comprising two land use types: high density housing
( HDH0 300 ) and industry ( Ind0 300 ). Weights were identi ed by examination of the
slope coe cients, and a compound land use factor computed in equation ( 2).
L and 300 = 18HDH 0
( 2)
4. Stepwise multiple regression analysis was rerun using the two compound
factors ( Tvol 300 and L and 300 ), together with altitude (variously transformed), topex,
sitex and sampler height, against the modelled mean nitrogen dioxide concentrations.
Only variables signi cant at the 5 per cent con dence level were retained. A number
of equations were derived from this procedure, all explaining generally similar
proportions of variation in the monitored NO2 levels. From these, regression equa2
tion ( 3) was chosen for further analysis, because of its marginally higher r value
which was 0607 and because all variables were signi cant at the 005 level.
MeanNO2 = 1183+ ( 000398 Tvol 300 ) + ( 0268 L and 300 )
( 00355RSAlt ) +( 6777Sampht)
( 3)
Figure 3.
709
5. This equation was then used to construct a complete air pollution coverage
for the study area by applying the equation on a cell by cell basis to all locations in
the study area.
In Prague, a broadly similar approach was used. In this case, however, air
pollution data were seen to be skewed, so the data were log-transformed prior to
analysis. In this case, also, no attempt was made to produce a compound tra c
volume; instead separate tra c volume factors were computed for each zone. A
single land cover factor was computed in equation ( 4 ), representing the weighted
sum of the areas of each land cover type within the bu er zone around the site and
equation ( 5 ) was thus derived
L and = (area*density class)
( 4)
+ ( 0000554 L and 60 )
( 000152Alt)
( 5)
where: T vol 60= tra c volume (1000 vehicle km hr ) within 60 m of the site
1
T vol 120 = tra c volume ( 1000 vehicles km hr ) within 60 120 m of the site
L and 60 = land cover factor within 60 m of the site
Alt =altitude (m)
2
This equation gave r =072 for the 80 sample points. Plotting of the residuals
showed one outlier, with a large negative residual. When this was removed and the
regression analysis rerun, the equation ( 6 ) was obtained
1
( 000155Alt)
( 6)
The r value was again 072, but the plot of residuals showed a better distribution,
with no outliers. This equation was therefore used for subsequent analysis.
In Amsterdam, the lack of data on tra c volume, and the essentially at nature
of the local terrain, meant that a di erent approach was used. In this case, road
segments were classi ed into broad types, based upon the classi cation used by the
city Highways Department. Three road types were identi ed, as follows:
RD1 access road for residential areas with >25 000 people
RD2 access road for residential areas with >5000 and <25 000 people
RD3 access road for residential areas with >1000 and <5000 people
The length of each road type within a 0 50 m and 50 200 m bu er was then
computed, in GRID, giving six road variables. Distance to the nearest road serving
>25 000 people ( RD1) was also included as a variable. Land cover was de ned as
all built up land within 100 m of the site, based on planning maps. The variables
thus computed were then entered into a stepwise regression model, using the modelled
mean NO2 concentration as the independent variable. The resulting model (with
2
r =062) is given in equation ( 7).
MeanNO2 = 4164+ ( 05832RD1 50 ) + ( 06190RD2 50 )
+ ( 00723RD1 200 )
+ ( 00133RD350 )
(00570RD 2200 )
( 00348RD 3200 )
( 7)
It should be noted that the approach used here di ers from that in the other two
centres, in that no attempt was made to derive compound tra c ow variables,
710
D. Briggs et al.
based on tra c volumes in bu er zones around each site. Instead, a large number
of individual tra c-related variables were used, and allowed to enter the regression
model freely. The result is a regression equation which is to some extent counterintuitive, in that some variables (e.g., RD 2200 , RD 3200 and L and 100 ) have negative
signs. While the model thus optimizes predictions for this data set, it is less generic
in structure, and may not be readily transferable to other areas.
4. Validation
Examples of the air pollution maps for Hudders eld and Amsterdam are shown
in gures 4 and 5. As these indicate, the maps show considerable local detail, and
clearly pick out high zones of pollution along the main road network.
Validation of the air pollution maps was carried out by comparing the NO2
concentrations predicted from each of the air pollution maps with the measured
concentrations at the 8 10 `reference sites, using simple least squares regression.
( Note that these sites were not used in the generation of the original regression
equation). Results of the validation are summarized in table 3 and gure 6. Because
of di erences in the number of sample sites available for validation, and the di erent
structures of the regression models, care is needed in comparing the results. Overall,
however, it is apparent that the regression method gave extremely good predictions
2
of the pollution levels at the reference sites (as shown by the r values and mean
deviations).
In Hudders eld, for example, the regression equation tted well to the measured
2
NO2 concentrations for the 80 sites on which it was based, with an r value of 061.
Examination of the residuals showed some heterogeneity, with a tendency for overprediction at low concentrations and under-prediction at high concentrations.
Figure 4.
711
Figure 5.
Table 3.
Performance of the regression maps: r2 and standard error of estimate for reference
sites.
Centre
Number of sites
Hudders eld
Prague
Amsterdam
8
10
10
082
087
079
Standard error
of estimate
(mg m 3 )
369
467
445
Residuals showed no spatial correlation, however, and Morans I coe cients were
consistently low (ca. 008). Although attempts were therefore made to use kriging on
the residuals, no improvement in predictions was obtained.
2
Re ecting this, the r values for the `reference sites was uniformly high in all
centres ( 079 087), while the standard error of the estimate ranged from 35 47 m g
3
m (table 3). As gure 6 also shows, the slope of the relationship between predicted
and measured concentrations is close to unity, albeit with some tendency to underpredict actual concentrations in Amsterdam and to over-predict in Prague. In
Hudders eld, the relationship between measured and predicted concentrations also
tends to be non-linear, resulting in some over-prediction at low concentrations.
In Hudders eld, predictions for individual surveys were also tested by rerunning
the regression procedure outlined above, using data for surveys 2 to 4 separately
(survey 1 was not analysed, because of the small number of di usion tubes for which
2
data were available). Regression models and r values for the individual survey
periods are shown in table 4. The validity of each of these models was then further
712
Figure 6.
D. Briggs et al.
Predicted mean annual concentration from the regression map versus monitored
concentration (1993 1994 ), all centres.
tested by comparing the predicted and measured values at the 8 reference and 40
2
`variable sites. Table 4 includes the resulting r values. For all three surveys, broadly
2
similar regression equations are obtained. Notably, for the last period ( 4 ), the r for
the regression equation is relatively high (>06) and model coe cients are close to
those derived for the mean annual concentration. In the second and third surveys,
2
a reduced number of variables enter the equation, and the r value is lower
( 036 039). All three equations, however, provide good estimates of concentrations
2
at the 8 reference sites, with r values between 069 and 075. Survey 4 also gives a
2
reasonably good prediction of concentrations at the 40 `variable sites (r =052).
2
Surveys 2 and 3, however, are less strongly predictive of the variable sites, with r
values of 026 and 035 respectively. In part, this may re ect the distribution of these
variable sites in these surveys, in that they were speci cally selected to examine
variation in background concentrations, and are thus not representative of pollution
conditions across the whole study area. Overall, however, it appears that results of
regression mapping are more variable when applied to individual survey periods.
This should not cause surprise, for the tra c data used in the model refer to long-
Oct 1993
Feb 1994
May 1994
Period
Core sites
(n =80 )
0256
0350
0524
Variable sites
(n= 40 )
Table 4.
0729
0743
0690
Continuous sites
(n =8 )
713
714
D. Briggs et al.
term mean ows, and thus do not take account of short-term variations in tra c
conditions; nor is any allowance made for variations in weather conditions.
As noted earlier, in both Hudders eld and Amsterdam, pollution data were also
available for a subset of 20 sites in the following year (October 1994 September
1995 ). In Hudders eld, these were measured over 21 consecutive 2-week periods; in
Amsterdam, they were measured during four two-week campaigns. Comparing these
measured data with the estimated concentrations from the regression map allowed
the temporal stability of the pollution map to be estimated. Results are summarized
in gures 7 and 8. As can be seen, the correlation between the predicted value from
the map and the mean annual concentration measured in the following year is strong
2
in both centres (r =059 in Hudders eld and 086 in Amsterdam). In Hudders eld,
however, the map tends to underpredict actual concentrations in the following year.
This re ects the relatively hot summer and still conditions experienced in 1994 5,
which contributed to higher than average pollution levels. Nevertheless, it is apparent
that, notwithstanding the short-term variations in air pollution levels which
undoubtedly occur, the geography of pollution is relatively stable from year to year.
The air pollution maps derived from the regression method thus have longer-term
validity. On this evidence, the pollution maps should provide a basis for estimating
historic exposures, at least over recent years. In the case of health outcomes with a
relatively long lag period, this has considerable signi cance for environmental epidemiology. In addition, the maps clearly provide a useful framework for designing
air pollution monitoring systems, and identifying the areas for which individual
monitoring sites can be considered representative.
Figure 7.
Predicted NO2 concentrations from the regression map versus observed mean
annual concentration the following year: Hudders eld, UK.
Figure 8.
715
Predicted NO2 concentrations from the regression map versus observed mean
annual concentration the following year: Amsterdam, The Netherlands.
The results of this study clearly illustrate the complex nature of spatial variation
in urban air pollution, and con rm the marked variation in levels of tra c-related
pollution which may occur over small distances (<100 m). At the same time, however,
they also demonstrate that this variation is predictable, and can be modelled to a
high degree of accuracy, using information on emission sources and topography.
The spatial variations in pollution levels seen in this study have considerable
implications, both for air pollution monitoring and management, and for environmental epidemiology. It means, for example, that pollution data derived from a
single monitoring site cannot normally be considered representative of more than a
small surrounding area. This clearly limits the extent to which existing monitoring
networks based as they are on a small number of xed-site monitoring devices
can either provide spatially representative measures of levels of compliance with air
pollution legislation, or give meaningful warnings of air pollution hazards across an
urban environment. Equally, epidemiological studies based upon only a small number
of monitoring sites are likely to involve considerable misclassi cation of exposure.
In both cases, understanding of the geography of pollution within the area of interest
is vital. Signi cantly, however, this study suggests that the long term pattern of
pollution within an urban area may be reliably predicted from a relatively small
number of surveys; in this case only four two-week surveys provided good predictions
of mean annual concentrations of NO2 . The study also suggests that, whilst average
concentrations of NO2 inevitably vary from year to year, the spatial pattern of
pollution within an urban area remains broadly stable.
The method of GIS-based regression mapping used in this study appears to o er
716
D. Briggs et al.
an e ective method for mapping tra c-related pollutants, such as NO2 . The maps
produced here consistently gave good predictions of pollution levels at unsampled
points. As with any empirical approach, however, regression mapping has its limitations. It potentially su ers, for example, from being case- and area-speci c. This is
especially likely to be true when models are developed simply through a process of
statistical optimization. As the results for Amsterdam show in this study, the regression equation may then be somewhat counter-intuitive, and as such may not be
valid outside the speci c study area. Nevertheless, where the regression models are
developed from clear underpinning principles, as, here, in Prague and Hudders eld,
it may be expected that the models would be more generally valid. Recent studies,
to be reported elsewhere, do in fact demonstrate that the Hudders eld regression
model can be applied successfully to other urban areas in the UK. The empirical
nature of regression mapping is also part of its strength, for, unlike formal dispersion
modelling, it can be readily adapted to local circumstances and data availability. It
thus allows optimal use to be made of the available data. In small area studies,
where monitored data are scarce and where the need for high resolution maps is
paramount, GIS-based regression mapping thus o ers a powerful tool.
Acknowledgments
The SAVIAH study was a multi-centre project, funded under the EU Third
Framework Programme. It was led by Professor Paul Elliott ( Department of
Epidemiology and Public Health, Imperial College School of Medicine at St. Marys,
London UK formerly at the London School of Hygiene and Tropical Medicine) and
co-principal investigators were Professor David Briggs (Nene Centre for Research,
Nene College, Northampton, UK formerly at the University of Hudders eld ), Dr
Erik Lebret ( Environmental Epidemiology Unit, National Institute of Public Health
and Environmental Protection, Bilthoven, Netherlands), Dr Pawel Gorynski
( National Institute of Hygiene, Warsaw, Poland ) and Professor Bohimir Kriz
( Department of Public Health, Charles University Prague). Other members of the
project team were: Marco Martuzzi and Chris Grundy ( London School of Hygiene
and Tropical Medicine, London, UK); Susan Collins, Emma Livesely and Kirsty
Smallbone ( University of Hudders eld, UK); Caroline Ameling, Gerda Doornbos,
Arnold Dekker, Paul Fischer, L. Gras, Hans van Reeuwijk and Andre van der Veen
( National Institute of Public Health and Environmental Protection, Bilthoven, NL);
Henrik Harssema ( Wageningen Agricultural University); Bogdan Wojtyniak and
Irene Szutowicz ( National Institute of Hygiene, Warsaw, Poland ); Martin Bobak
and Hynek Pikhart ( National Institute of Public Health, Prague, CR) and Karel
Pryl (City Development Authority, Prague, CR). All members of this team made
invaluable contributions to all parts of the project, and this paper is a product of
their joint e ort and expertise. Thanks are also due to the local authorities and
health authorities in the four study areas, Amsterdam, Hudders eld, Prague and
Poznan, for their assistance in carrying out this research.
References
A bbass, T ., E l J allouli, C ., A lbouy, Y ., and D iament, M . , 1990, A comparison of surface
tting algorithms for geophysical data. T erra Nova 2, 467 75.
A nderson, H . R ., B utland, B . K ., and S trachan, D . P ., 1994, Trends in prevalence and
severity of childhood asthma. British Medical Journal 308, 1600 4.
A rchibold, O . W . , and C risp, P . T . , 1983, The distribution of airborne metals in the Illawarra
region of New South Wales, Australia. Applied Geography , 3 (4 ), 331 344.
717
B enson, P . E . , 1992, A review of the development and application of the CALINE3 and
CALINE4 models. Atmospheric Environment , 26B, 379 90.
B riggs, D . J ., 1992, Mapping environmental exposure. In: Geographical and Environmental
Epidemiology : Methods for Small-area Studies, edited by P . E lliott, J . C uzick,
D . E nglish, and R . S tern. (Oxford: Oxford University Press) pp. 158 76.
B urney, P ., 1988, Asthma deaths in England and Wales 1931 85: evidence for a true increase
in asthma mortality. Journal of Epidemiology and Community Health, 42, 316 20.
B urney, P ., C hinn, S ., and R ona, R . J ., 1990, Has the prevalence of asthma increased in
children? Evidence from the national study of health and growth 1973 86. British
Medical Journal , 300, 1306 10.
B urrough, P . A ., 1986, Principles of Geographical Information Systems for L and Resources
Assessment . Monographs on soil and resources survey, no. 12 (Oxford: Clarendon
Press).
concentrations in the United Kingdom using di usion tubes July December 1991.
Atmospheric Environment , 28 (3 ), 477 87.
D epartment of the E nvironment , 1995, Digest of Environmental Statistics. No. 17. 1995.
London: HMSO.
D ubrule, O ., 1984, Comparing splines and kriging. Computers & Geosciences, 10, 327 38.
E dwards, J ., W alters, S ., and G riffiths, R . K ., 1994, Hospital admissions for asthma in
preschool children: relationship to major roads in Birmingham, United Kingdom.
Archives of Environmental Health, 49, 223 7.
E erens, H ., S liggers, C . , and van der H out, K ., 1993, The CAR model: the Dutch method
to determine city street air quality. Atmospheric Environment , 27B, 389 99.
E lliott, P ., B riggs, D ., L ebret, E ., G orynski, P ., and K riz, B . , 1995, Small area variations
in air quality and health (the SAVIAH study): design and methods. (Abstract).
Epidemiology , 6, S32.
F lachsbart, P . G ., 1992, Human exposure to motor vehicle air pollution. In Motor Vehicle
Air Pollution: Public Health Impact and Control Measures, edited by D . T . M age , and
O . Z ali (Geneva: WHO Division of Environment ), pp. 85 113.
H aahtela, T ., L indholm, H ., B jorkstoen, F ., K oskenvuo, K ., and L aitinen, I . A ., 1990,
Prevalence of asthma in Finnish young men. British Medical Journal , 301, 266 8.
H aas, T . C . , 1992, Redesigning continental-scale monitoring networks. Atmospheric
Environment , 26A, 3323 33.
H ewitt, C . N ., 1991, Spatial variations in nitrogen dioxide concentration in an urban area.
Atmospheric Environment , 25B , 429 34.
I shizaki, T ., K oizumi, K ., I kemori, R ., I shiyama, Y ., and K ushibiki, E . , 1987, Studies of
prevalence of Japanese cedar pollinosis among the residents in a densely cultivated
area. Annals of Allergy, 58, 265 70.
K notters, M ., B rus, D . J ., and V oshaar, J . H . O ., 1995, A comparison of kriging, co-kriging
and kriging combined with regression for spatial interpolation of horizon depth with
censored observations. Geoderma , 67, 227 46.
L aslett, G . M ., M c B ratney, A . B ., P ahl, P . J ., and H utchinson, M . F . , 1987, Comparison
of several prediction methods for soil pH. Journal of Soil Science, 38, 325 70.
L ebret, E ., B riggs, D ., C ollins, S ., van R eeuwijk, H ., and F ischer, P ., 1995, Small area
variations in exposure to NO2 . (Abstract ). Epidemiology , 6, S31.
L efohn, A . S ., K nudsen, H . P ., and M c E voy, L . R ., 1988, The use of kriging to estimate
monthly ozone exposure parameters for the southeastern United States. Environmental
Pollution , 53, 27 42.
L iu, L . J . S ., R ossini, A ., and K outrakis, P ., 1995, Development of cokriging models to
predict 1- and 12-hour ozone concentrations in Toronto. (Abstract ). Epidemiology ,
6, S69.
M attson, M . D ., and G odfrey, P . J ., 1994, Identi cation of road salt contamination using
multiple regression and GIS. Environmental Management , 18, 767 73.
M uschett, F . D ., 1981, Spatial distribution of urban atmospheric particulate concentrations.
Annals of the Association of American Geographers , 71, 552 65.
M utius, E . , 1993, Road tra c and adverse e ects on respiratory health in children. British
Medical Journal , 307, 596 600.
718
313 32.