Jirs Ann

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 15

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/225583246

A Neural Network based Urban Growth Model of an Indian City

Article  in  Journal of the Indian Society of Remote Sensing · September 2010


DOI: 10.1007/s12524-009-0041-7

CITATIONS READS
51 2,001

1 author:

Sandeep Maithani
Indian Space Research Organization
38 PUBLICATIONS   775 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

research View project

Persistent Scatter Interferometric Synthetic Aperture RADAR ( PsInSAR) View project

All content following this page was uploaded by Sandeep Maithani on 28 February 2014.

The user has requested enhancement of the downloaded file.


Photonirvachak

J. Indian Soc. Remote Sens. (September 2009) 37:363–376

RESEARCH ARTICLE

A Neural Network based Urban Growth Model of an Indian City

S. Maithani

Received: 25 March 2009 / Accepted : 27 March 2009

Keywords Spatial urban growth . Artificial neural network . Multilayer perceptron . Spatial metrics . Kappa
. Urban growth potential

Abstract The aim of the study reported in this feed forward ANN architectures were evaluated in
paper is to demonstrate that the subjectivity in urban this study and finally the most optimum ANN
growth modeling and the calibration time can be architecture was selected for future growth
reduced by using objective techniques like Artificial simulation.
neural network (ANN). As a case study, the ANN- The simulated urban growth maps were evaluated
based model was applied to simulate the urban on a cell by cell matching using Kappa index and
growth of Saharanpur city in India. In the proposed three spatial metrices namely, Mean Patch Fractal
model, remote sensing and GIS were used to generate Dimension, Landscape Shape Index and Percentage
site attributes, while ANN was used to reveal the of like Adjacencies. The most optimal architecture
relationships between urban growth potential and the was then used subsequently for simulating the future
site attributes. Once ANN learnt the relationship, it urban growth. The study results thus, demonstrated
was then used to simulate the urban growth. Different that the ANN-based model can objectively simulate
urban growth, besides successfully coupling GIS,
remote sensing and ANN.
S. Maithani ( )
Human Settlement Analysis Division,
Indian Institute of Remote Sensing, Introduction
4, Kalidas Road, Dehradun – 248 001,
Uttarakhand, India The urban population of India increased from 217.6
million to 285 million in the last decade and presently
constitutes 27.8% of the total population (Census of
email : maithanis@yahoo.com India, 2001), it is expected to increase to 40% of the
maithani@iirs.gov.in total Indian population by the year 2021 (GGIM,
364 J. Indian Soc. Remote Sens. (September 2009) 37:363–376

2005). This increasing population pressure is leading Artificial Neural Network (ANN) was used to reveal
to unregulated growth of urban areas. The negative the relationships between future urban growth
impacts associated with the unregulated growth of potential and the site attributes.
urban areas are namely, permanent loss of agricultural The applicability of ANN-based urban growth
land, reduced ground water infiltration, raised models to Indian cities has also been lacking. Hence,
greenhouse gas emissions, elevated air and noise as a case study, the ANN-based model was applied
pollution levels, damage to various ecological cycles to simulate the urban growth of Saharanpur city in
etc. (Gowda and Sridhara, 2000; Kulshrestha, 2004, Uttar Pradesh state, India.
2007a, 2007b; Tayal and Bharat, 1997). Faced with
these severe negative impacts, there is an urgent need
for urban planners to develop predictive models of Artificial neural network
urban growth. These models not only provide an
understanding of the urban growth process, but also Artificial neural network (ANN) is a non-parametric
provide realizations of the numerous potential growth technique for quantifying and modeling complex
scenarios that an urban area may take in the future. behavior and patterns. Unlike the commonly used
This kind of information can be very helpful, in statistical methods, ANN has the following
regulating urban growth and proper planning can advantages (Li and Yeh, 2001):
also be done for the future urbanizable areas. a. It makes no assumptions regarding the
Over the past few years, a number of models of distribution of the data.
urban growth have been developed namely by, b. Measurement data of different types can be
White and Engelen, 1993, 1994, 1997; Barredo et al., used.
2003, 2004; Wu, 1998a, 1998b, 1999, 2002; Wu and c. They can solve highly nonlinear problems.
Webster, 1998, 2000; Li and Yeh, 2000; Ward et al.,
2000. However, these models are based on Since, urban growth is also a complex phenomena
subjectively defined rules. The rules are based on the in which a number of variables interact non-linearly
expert’s experience of the subject and study area and with each other, the application of ANN to model
therefore, contain some bias towards the assumptions urban growth appears quite logical.
made. Secondly, it is extremely time consuming to In ANN, the basic processing elements are the
find proper values for the model parameters during neurons that are arranged in different layers and work
the calibration process. To overcome these in parallel to transform input data into output entities.
limitations, an ANN-based model of urban growth The neurons in each layer are connected to the
has been proposed in the present study. The aim of neurons in the next successive layer and each
the study is to demonstrate, that the subjectivity in connection carries a weight (Atkinson and Tatnall,
rule definition and model calibration can be reduced 1997). This arrangement of neurons in layers and the
by using Artificial Neural Networks (ANN). The pattern of connection within and in between these
research also aims at demonstrating the use of remote layers is called ANN architecture.
sensing and GIS as spatial data provider and spatial In the present study, a feedforward multilayer
data handler respectively in the proposed model. In perceptron (MLP) ANN architecture has been used
the proposed model, remote sensing data were used and is shown in Fig. 1. The MLP consists of three
to provide empirical data about urban growth and types of layers, i.e., input, hidden and output layers.
other spatial information. GIS was used for handling The black circles in Fig. 1 represent the neurons, in
this spatial data and generating site attributes. input, hidden and output layers, while the lines
J. Indian Soc. Remote Sens. (September 2009) 37:363–376 365

represent the weighted connections between neurons netj = weighted input at neuron j
in these different layers. Since the ANN architecture n = number of neurons in the input layer
used is a feedforward architecture, a link is only The neuron j generates an activation signal, in
allowed from a neuron in a layer to neurons in the response to the netj. The activation signal is generated
next subsequent layer. There are no interconnections via a transfer function. Any differentiable non-linear
between neurons within the same layer or with function can be used as a transfer function, but a
neurons in the preceding layers (Kavzoglu and sigmoid function is generally used due to its useful
Mather, 2003). The ANN is described by a sequence properties like monotonicity and continuity which
of numbers indicating the number of neurons in each help in increasing the learning capacity of the ANN
layer. For example, the ANN shown in Fig. 1 has a (Haykin, 1999, Kumar, 2004; Sivanandam et al.,
4-5-1 architecture, i.e., it contains four neurons in the 2006). The sigmoid function constraints the ANN
input layer, five neurons in the hidden layer and one output between 0 and 1.
neuron in the output layer. The activation signal generated at neuron j
becomes the input to the neurons in the next layer.
Thus, all hidden and output neurons collect the
activation signals of the neurons in the previous layer
and then generate an output activation signal
themselves. This activation signal becomes the input
for neurons in the subsequent layers.
The number of neurons in the input layer depends
on the number of input data sources. The number of
hidden layers and their neurons are often determined
by trial and error or literature-driven thumb rules. The
number of neurons in the output layer depends on
Fig. 1 A feedforward MLP ANN architecture. the number of classes being mapped (Arora et al.,
1998, 2004; Kavzoglu and Mather, 2003; Kanungo
et al., 2006).
The neurons in the input layer only transmit data
to the next layer, while the hidden and output layer ANN training
neurons actively process the data. Each neuron in the
hidden and output layers responds to the weighted In order to use the ANN for prediction purposes, the
inputs it receives from the connections to the neurons ANN must be taught the characteristics of the dataset
in the preceding input layer. If i is the neuron in the being processed. This is called ANN training. Once
input layer and j is a neuron in the next layer, then the ANN learns the characteristics of the dataset, the
the weighted input (netj) that neuron j receives is ANN is considered as trained and the whole dataset
computed as, is passed through this trained ANN to generate the
output. A training dataset is usually employed for
net j = ∑ I i Wij (0 ≤ i ≤n) (1) training the ANN. The training dataset consists of
where, input values and the desired output values
Ii = input signal from neuron i corresponding to these input values. The desired
Wij = weight associated with connection output values can be obtained from field, remote
between neurons i and j sensing data and other secondary sources.
366 J. Indian Soc. Remote Sens. (September 2009) 37:363–376

Although the ANN can be trained using a number Study area


of training algorithms, in the present study, the
backpropagation (BP) learning algorithm proposed The study area consisting of Saharanpur city and its
by Rumelhart et al. (1986) has been used for training surrounding areas covers an area of about 80 km2 and
the ANN due to its simplicity and wide applicability. encompasses the geographical extents, 29°55' N to
In the BP algorithm, the ANN weights are randomly 30°0' N latitude and 77°30' E to 77°35' E. The city
initialized and the training data are fed to the input is located in the fertile tract of Upper Ganga-Yamuna
neurons. The hidden neurons collect the data from daob (tract of land between two confluent rivers) in
input neurons and generate an activation signal as the the western part of Uttar Pradesh State (India). The
input to the next layer. The ANN outputs generated city, besides being a major market of food grains also
at the output neurons, are compared with the desired acts as a service centre for the surrounding hinterland,
outputs. The error (difference) between the ANN as it provides education and medical facilities. The
outputs and the desired outputs is back propagated city is well connected by several roads and the
through the ANN and is minimized by updating the railways which provide transport to other important
interconnection weights between the layers. This cities and in its regional setting has a very significant
process of backpropogating the errors is repeated place as a transport node. Lately, due to the
iteratively with weights being recomputed in each emergence of several industries (e.g., paper,
iteration, until the error falls below a predefined strawboard, tobacco), the city has lost its agrarian
threshold value. Thus, during training, the ANN fabric and the growing numbers of its labour force
weights are continuously adjusted till they are are presently employed in the secondary and tertiary
optimized (Haykin, 1999; Kumar, 2004). sectors. This development of industry and services
In order, to evaluate the performance of the trained is attracting migrants not only from the city’s
ANN on an independent data set, a testing dataset is hinterland but also from far away areas. As a result,
also fed to the trained ANN. A minimal difference the city is expanding rapidly, onto the nearby
between the overall accuracies obtained on the surrounding fertile agricultural lands (Subudhi, 1998;
training and testing datasets indicates that the Fazal, 2000). Hence, there is an urgent need to model
trained ANN has not only learned the data the growth pattern of the city, so that proper planning
characteristics, but also performed well on the unseen measures can be taken for future urbanizable areas.
data (testing data) (Kanungo et al., 2006). The best
ANN architecture is selected based on the minimum
difference between the overall accuracies obtained Data sources used for creating spatial database
on the training and testing datasets. of study area
Since in the present study, a sigmoid activation
function was used, the ANN output values varied Built-up/ non built-up maps
from 0 to 1. The ANN values in the present case The remote sensing data used to produce the built-
represented the urban growth potential (UGP) at each up/non built-up area maps of the years 1993 and 2001
location in the study area. A value of 1 denotes (Fig. 2a and 2b) consisted of aerial photographs
maximum potential for future urban growth while a (1:10,000 scale) and remote sensing data acquired
value of 0 denotes the minimum urbanization by Indian Remote Sensing (IRS) satellite 1C, PAN
potential for future urban growth. The intermediate sensor (1:12,500 scale) respectively. Both data
values between 0 and 1, indicate different potentials products were panchromatic, so the built-up area has
for future urban growth. a blocky appearance with a light tone. The vegetated
J. Indian Soc. Remote Sens. (September 2009) 37:363–376 367

(a) (b)

Fig. 2 Built-up/ non built-up areas in the years (a) 1993 and (b) 2001 as interpreted from remote sensing data.

areas have a light to dark tone with rough texture. commercial and institution facilities in Saharanpur
Bare soil has a very light tone, while the water bodies city are located in the city core. This area constitutes
have a dark tone. The overall accuracy of the 1993 the old part of the city and has a very high built-up
and 2001 built-up/ non built-up maps prepared by area and population density.
visual interpretation of the respective data products,
was 93% and 91% respectively.
Factors driving urban growth
Generation of road network map
In the present study urban growth was defined in
A road network map depicting major and minor roads
terms of increase in built-up area, over a period of
was generated by a visual image interpretation of the
time. An urban growth map, depicting the increase
1993 aerial photographs. The guide map and master
in built-up area during the 1993–2001 period was
plan of Saharanpur city were used as secondary data
derived from the built-up/ non built-up maps of the
sources.
years 1993 and 2001, using the overlay function of
Delineation of city core ARC/INFO. A value of 1 was assigned to the cells,
which changed from non built-up to built-up during
The city core was demarcated after consultation 1993–2001 and a value of 0 was assigned to cells,
with the local planning authorities. Most of the which remained non-built during the 1993–2001
368 J. Indian Soc. Remote Sens. (September 2009) 37:363–376

period. Analysis of the urban growth map for 1993– 20 meters. The cell size was kept as 20 meters as a
2001 revealed that 630 hectares of land has changed smaller cell size increased the map size and model
from non built-up to built-up during the period. Due processing time, while a coarser cell size led to
to the presence of a defense establishment, i.e., generalization and loss of detail. The raster maps
Remount Depot, in the east direction, the growth has generated corresponding to the three causative factors
been restricted in this direction. are discussed below :
As accessibility to roads and higher level facilities i) Accessibility to roads :
(mainly located in the city core) are important factors The accessibility to roads network was measured
driving urban growth, the urban growth pattern was in terms of Euclidian distance from the nearest road.
analyzed with respect to distance from roads and the Raster maps depicting the Euclidian distance of each
city core. After consultation with the local planning cell from the nearest major and minor roads were
authorities and based on field knowledge, buffer generated using the Euclidian distance function of
zones of different distances were created around the ARC/INFO GRID.
road network and the city core. These buffer zones ii) Accessibility to the city core :
were overlaid with the urban growth map of the The accessibility to the city core was measured
1993–2001 period, in order to find out the spatial in terms of Euclidean distance from the city core,
distribution of urban growth in these different buffer where most of the higher level commercial facilities
zones The results showed that 79% of the total urban are located. A raster map depicting the Euclidian
growth in 1993–2001 had taken place within a distance of each cell from the city core was generated
distance of 400 meters from the roads. Similarly 90% using the Euclidian distance function of ARC/INFO
of the total urban growth in 1993–2001 has taken GRID.
place within a distance of 4000 meters from the city iii) Accessibility to infrastructural facilities :
core. The accessibility to infrastructural facilities has
However, besides these two factors, the been measured in terms of:
accessibility to basic infrastructure facilities (i.e.,
water supply, sewerage, electricity, banks, shopping a) Distance from existing built-up cell :
centre, medical centre, etc.) also influences the urban The cost of connecting to urban services (e.g.,
growth process (Paul and Bharat, 1997; Gupta and water supply, sewerage etc.) decreases with the
Bawa, 2004; Gowda, 1998). Thus, urban growth in distance from existing built-up areas, as these basic
the study area was defined as a function of the urban services are already present in existing built-
following three factors (hereafter causative factors), up areas. A raster map showing the Euclidean
distance of each cell from the nearest built-up cell
i) Accessibility to roads (connectivity is a major was generated using the Euclidian distance function
factor affecting the urban growth process) of ARC/INFO GRID.
ii) Accessibility to the city core, (most of the b) Amount of built–up cells in the neighbourhood :
higher level facilities are located in the city A larger proportion of built-up area in the
core). neighbourhood implies availability of localized
iii) Accessibility to infrastructural facilities. facilities, i.e., shopping centre, medical centre,
Raster maps corresponding to the three factors were banking, post office etc., necessary to support the
generated using the data layers generated in section population. A raster map showing the amount of
4. Standard GIS operations like Euclidian distance, built-up in a 500-meter circular neighbourhood of
focal sum etc., were used to create these raster data each cell was generated using the focal sum
layers. The cell size of these raster maps was kept as function of ARC/INFO GRID. The size of 500-meter
J. Indian Soc. Remote Sens. (September 2009) 37:363–376 369

neighbourhood was decided after consultation with 3. Euclidean distance of a cell from the nearest
the local planning authorities. built-up
These five raster maps formed the spatial dataset 4. Euclidian distance of each cell from the city
corresponding to the three causative factors. The five core
maps were normalized from 0 to 1 by dividing each 5. Amount of built-up in neighbourhood (500
value in the map by the maximum value contained metre)
in the map. The maps were normalized so as to make ii) Generation of maps depicting actual urban
them compatible with the sigmoid activation function growth in study area :
used in the ANN. An urban growth map depicting cells that transit
from non built-up to built-up area during 1993–2001
was generated using remote sensing data and GIS
Implementation of the ANN model tools as discussed in section 5.
iii) Generation of training and testing dataset :
The proposed ANN model is based on the dichotomy Using ERDAS IMAGINE software, 1800
of built-up and non built-up areas. The model sampling points were selected from the urban growth
simulated transition from non built-up to built-up map of 1993–2001 period. The number of training
areas with no reverse process taking place. The ANN sample points was based on the heuristic 60 Ni (Ni
processing was implemented using the Neural +1) given by Kavzoglu and Mather (2003), where
Network Tool Box of MATLAB software. The Ni is the number of input neurons. Since there are
following steps were followed in the implementation
five neurons in the input layer corresponding to the
of the ANN model :
five raster maps, so 1800 training sample points were
i) Generation of maps corresponding to factors
selected. Out of these 1800 sampling points, 900
driving urban growth in study area
points belonged to the urbanized category (cells
ii) Generation of maps depicting actual urban
which transit from non built-up in 1993 to built-up
growth in study area
in 2001 and were assigned a value of 1) while 900
iii) Generation of training and testing dataset
points belonged to the non-urbanized category (cells
from the data created in step i) and ii)
which remained non built-up in 1993 and 2001 and
iv) Design of ANNs with different network
were assigned a value of 0). This was done so as to
architectures
ensure equal representation of both the classes. These
v) Training and evaluation of ANNs
sample point coordinates were then imported into
vi) Selection of optimal ANN architecture
Arc/Info for extraction of cell attributes that were
vii) Masking of exclusionary areas
viii) Simulation of urban growth using the associated with these sampling points in the five
optimal ANN architecture raster maps (the five raster maps are discussed in
section 5). The sample function of Arc/Info was used
i) Generation of maps corresponding to factors for retrieval of the cell attributes. This dataset
driving urban growth in study area : consisting of 1800 sample points along with the
As discussed in section 5, urban growth in the associated site attributes was used as the training
study areas was expressed as a function of the data.
following five raster maps, Table 1 shows an example of the training dataset.
1. Euclidian distance of each cell from the In the table the values under f1,f2,f3,f4 and f5
nearest major road
columns show the attributes value at a cell
2. Euclidian distance of each cell from the
corresponding to the five factors maps, while the
nearest minor road
370 J. Indian Soc. Remote Sens. (September 2009) 37:363–376

target value of 1 or 0 indicates whether the cell transit driven thumb rules given by Kanellopoulos and
from non built-up to built-up or remained non built- Wilkinson (1997), Hush (1989), Hecht-Nielsen
up during the 1993–2001 period respectively. (1987), Wang (1994), Ripley (1993) and Paola
(1994).
Table 1 Example of training dataset consisting of cell attributes and the target value

f1 f2 f3 f4 f5 Target output

Sample 1 0.132 0.407 0.010 0.345 0.323 0


Sample 2 0.015 0.343 0.010 0.181 0.151 0
Sample 3 0.015 0.390 0.007 0.089 0.189 1
Sample 4 0.009 0.409 0.007 0.236 0.336 1
Sample 5 0.009 0.402 0.007 0.222 1

In order to determine the generalization capability v) Training and evaluation of ANN :


of the ANN, a testing dataset of 1800 sample points There are a number of variants of the back
was also created in the same manner as the training propogation algorithm available like, variable
data set. However, the training and testing datasets learning rate, conjugate gradient, resilient
were mutually exclusive (Foody and Arora, 1997). backpropogation, Quasi-newton and Levenberg-
iv) Design of ANNs with different network Marquardt. In the present study, the Levenberg-
architectures : Marquardt algorithm which had the fastest
In the present model a feed forward MLP ANN convergence was used for training the ANN. The
architecture having one input layer, one or two hidden details of this algorithm can be found in Hagan and
layers and one output layer was used. The number Menhaj (1994) and Demuth and Beale (2003).
of hidden layers was kept as one or two as these are Different ANN architectures were trained using the
sufficient for most of the classification like problems training data discussed in step (iii). The testing
(Kanellopoulos and Wilkinson, 1997; Arora et al., dataset was also passed through each of the ANN
2004). architectures, in order to determine the generalization
The input layer had five neurons corresponding capability of the trained ANNs.
to the five raster maps (discussed in section 5). The vi) Selection of optimal ANN architecture :
output layer had one neuron since in the output only The ANN architectures, for which the difference
one class, i.e., the UGP at each cell in the study area between the training and testing data accuracies was
was being mapped. The value of the UGP varied from minimum, were selected. Twelve ANN architectures
0–1. A value of 1 indicated maximum potential for (Table 2) were selected based on the difference
transiting from non built-up to built-up and a value between the training and testing data accuracies and
of 0 denoted minimum UGP. The intermediate values their respective network weights were stored.
between 0 and 1, indicated different potential for vii) Masking of exclusionary areas :
transiting from non built-up to built-up. The restricted areas, water bodies, public grounds
The number of neurons in the hidden layers were and gardens were treated as exclusionary zones as
determined based on trial and error or literature they had no growth potential. Hence, these areas were
J. Indian Soc. Remote Sens. (September 2009) 37:363–376 371

Table 2 ANN architectures which gave the minimum Evaluation of model results
difference between the training and testing data accuracies

S No ANN Overall training Overall testing The twelve simulated urban growth maps for 1993–
architecture accuracy accuracy 2001 were evaluated by comparing them with the
actual urban growth map of 1993–2001, using Kappa
1 5-13-1 79.22% 75.28% coefficient and spatial metrics. The Kappa coefficient
2 5-15-1 67.94% 64.78% (Congalton, 1991) evaluated the simulated urban
3 5-17-1 76.94% 73.44% growth maps on the basis of cell by cell matching of
4 5-5-9-1 83.67% 81.11%
each map with the actual growth map. However, the
Kappa coefficient is based on independent
5 5-5-11-1 73.72% 70.39%
comparison between pairs of cells. Thus, small
6 5-6-5-1 79.33% 77.94% displacements between the actual and simulated
7 5-6-8-1 74.00% 69.28% urban growth maps are considered as errors and the
8 5-6-10-1 82.78% 79.67% same error is reported even if the displacement is of
9 5-7-6-1 71.17% 69.56% ‘n’ cells or one cell (Barredo et al., 2003). The Kappa
coefficient, therefore, was unable to take into account
10 5-7-7-1 90.56% 87.83%
the patterns or distribution of urban growth. As the
11 5-7-8-1 74.28% 69.67%
usefulness of any model is not in its “correctness” to
12 5-7-9-1 75.56% 72.67% predict the exact location of future growth, but on
its generalization capability and the ability to simulate
the future growth pattern. Hence, the simulated urban
masked out and the model made no prediction in growth maps were also evaluated on the basis of three
these areas. A mask corresponding to these areas spatial metrics namely, Mean Patch Fractal
was generated based on Survey of India Dimension, Landscape Shape Index and Percentage
topographical map and guide map of the study area. of like Adjacencies. These three metrices evaluated
Since the model only simulated transition from non the similarity between the actual and simulated
built-up to built-up with no reverse process taking growth patterns.
place, so the built-up cells in 1993 were also masked
out from the analysis. Kappa coefficient
viii) Simulation of urban growth using the optimal Kappa coefficient was computed from the confusion
ANN architecture : matrices generated by cross-tabulating the cell-by-
The entire dataset consisting of the five raster cell attributes between each of the twelve simulated
maps (14093 cells in each of the five raster maps) urban growth maps and the actual growth map for
was then passed through each of the selected twelve 1993–2001. The values of the Kappa coefficient
ANN architectures. The output map obtained from obtained for each of the twelve simulated urban
each ANN architecture showed the urban growth growth maps are given in Table 2. The value of
potential (UGP) of each cell in the study area. In each Kappa varies from 0–1, a Kappa value of 0.6 can be
of these twelve UGP maps, cells which had a UGP interpreted as an indication that the simulation is 60
value of 1 were converted to built-up cells while cells per cent better than one resulting from chance
having a UGP value of less than 1 did not transit to (Lillesand and Kiefer, 2000). From Table 3, it can
built-up and remained non built-up. be observed that in general the value of Kappa
372 J. Indian Soc. Remote Sens. (September 2009) 37:363–376

coefficient lies between 0.6 and 0.7, which can be size. One of the basic types of shape index based on
termed as very good as far as the prediction capability perimeter-area relationships is the fractal dimension
of the model on cell-to-cell basis is considered. The (FD), which can be computed as 2 times the
simulated urban growth map generated using the 5- logarithm of patch perimeter (m) divided by the
7-9-1 ANN architecture had the maximum value of logarithm of patch area (m 2). FD > 1 indicates
kappa coefficient, i.e., 0.72. departure from Euclidean geometry (i.e., an increase
in shape complexity). FD near to 1 reflects shapes
Table 3 Kappa coefficient values for each of the twelve with very simple perimeters such as squares. FD
simulated urban growth maps approaches 2, for shapes with highly convoluted,
plane-filling perimeters. The fractal dimension was
ANN architecture Kappa
calculated for all the built-up patches and then it was
5-13-1 0.658 averaged.
5-15-1 0.65 B. Landscape Shape Index (LSI): It is defined as
the total perimeter of built-up class divided by the
5-17-1 0.64
perimeter of the built-up class, if it was aggregated
5-5-9-1 0.62
into one square patch. The value of LSI = 1 when
5-5-11-1 0.66 the landscape consists of a single square. The LSI
5-6-5-1 0.57 increases as the class becomes more disaggregated.
5-6-8-1 0.65 C. Percentage of Like Adjacencies (PLADJ): It
5-6-10-1 0.61 is defined as the number of like cell adjacencies
5-7-6-1 0.64
involving the urban class, divided by the total number
of cell adjacencies involving the urban class,
5-7-7-1 0.62
multiplied by 100. PLADJ measures the degree of
5-7-8-1 0.66 aggregation of the built-up patch type. A landscape
5-7-9-1 0.72 containing greater aggregation of built-up patches
(e.g., larger patches with compact shapes) will
Spatial metrics contain a higher proportion of like adjacencies than
Three spatial metrics were calculated for each of the a landscape containing disaggregated patch types
twelve simulated urban growth maps and the actual (e.g., smaller patches). As such, this index provides
growth map for 1993–2001, using the spatial pattern an effective measure of class-specific contagion.
analysis software FRAGSTATS 3.3 (freely The value of PLADJ varies from 0 to 100. PLADJ
downloadable from the University of Massachusetts equals 0 when the corresponding class is maximally
website, http://www.umass.com). The three spatial disaggregated (i.e., every cell is a different patch) and
metrics calculated were : there are no like adjacencies. A PLADJ value of 100
indicates that the landscape consists of single class
a. Mean Patch Fractal Dimension and all adjacencies are between the same class.
b. Landscape Shape Index The above mentioned three spatial metrics were
c. Percentage of Like Adjacencies calculated for each of the twelve simulated maps and
A. Mean Patch Fractal Dimension (FRAC_MN): actual urban growth map. The percentage difference
A patch is defined as a group of pixels that are between the values of the three metrics for each
contiguous to each other and have the same attribute simulated map and the actual growth map was
value. Thus, different pockets of built-up area in a calculated using Equation 4 and the results were
map constitute urban patches of different shape and plotted as shown in Fig. 3.
J. Indian Soc. Remote Sens. (September 2009) 37:363–376 373

Fig. 3 Percentage difference between values of three spatial metrics calculated for the actual and simulated urban
growth maps for the year 2001.

Percent difference = [ (SM s - SM r) / SM r ]*100 (4) of cell-to-cell basis was not found to be realistic
SM s = Spatial metrics calculated from because of the complexities and uncertainties of the
simulated urban growth map real world. It was felt that the model should be able
SM r = Spatial metrics calculated from actual to simulate the overall future shape or growth pattern
growth map of the city, instead of a pixel-to-pixel match between
the actual and simulated map. So the 5-7-7-1 network
architecture was selected for future prediction,
Results and discussion instead of the 5-7-9-1 network which produced the
highest kappa coefficient.
From Fig. 3 it can be observed that the 5-7-7-1 ANN A visual comparison of the simulated built-up area
architecture best simulated the urban growth for 2001 (Fig. 4a) with the actual growth for year
pattern compared to other network architectures, as 2001 revealed, that the model was able to simulate
the per cent difference between the values of the three the actual urban growth pattern which had taken place
spatial metrics is least for the 5-7-7-1 ANN in a contiguous and concentric manner around the
architecture compared to other ANN architectures. existing built-up area. However, some growth had
While in terms of kappa coefficient (cell to cell taken place in an isolated and patchy form in the city
matching) the 5-7-9-1 ANN architecture gave the fringe areas, mainly in the south and north east
highest value of 0.72. Since urban growth is a directions. The model is not able to predict these
stochastic process, so the strength of a predictive types of growth quite accurately. Thus, the model was
urban growth model is in its ability to simulate the able to simulate the growth pattern in areas having a
future urban growth patterns, if not the exact location compact, dense and contiguous growth (city core)
of future growth. Hence, an exact simulation in terms more accurately as compared to the areas which had
374 J. Indian Soc. Remote Sens. (September 2009) 37:363–376

(a) (b)
Fig. 4 Simulated built-up areas using 5-7-7-1 ANN architecture for year (a) 2001 and (b) 2009.

a dispersed and isolated growth pattern (city fringe As observed from the existing actual growth
areas). trends in the study area, (i.e., in year 2009), the city
The built-up area for year 2009 was also simulated is growing mainly in the north and south directions
using the trained 5-7-7-1 ANN architecture (Fig. 4b). with a number of residential areas and other facilities
It can be observed from Fig. 4(b), that the future coming up in these areas. Besides, growth is also
urban growth will take place in a compact form and taking place around the areas which have developed
contiguous to the existing built-up areas. The areas in a patchy manner around the city. These field
in the north, north-west, south and south-east might observations thus, also validate the predictions made
experience more growth, as these areas are well by the model for 2009.
served by the road network and most of the roads
connecting the city to other parts of the state are Conclusions
located also in these areas. Growth is also expected
to occur around patches of built-up areas located in A number of models have been built by various
the vicinity of the main city. According to the model, authors, to explain urban spatial growth in terms of
these fringe areas might grow in future and finally various variables. The definition of rules for these
merge with the city. The area in the east direction is models is very crucial as the model performance
mainly occupied by defense establishments therefore, depends on them. Most of the models follow a
not much of future growth is expected in this subjective procedure for definition of these rules,
direction. which is a very tedious process. In the present study,
J. Indian Soc. Remote Sens. (September 2009) 37:363–376 375

the definition of rules is done directly by the ANN. Barredo JI, Demicheli L, Lavalle C, Kasanko M and
The user has to give the input variables or causative McCormick N (2004) Modeling future urban scenarios
factors (which are derived from remote sensing data), in developing countries: an application case study in
Lagos, Nigeria. Environment and Planning
once the ANN has learnt the growth pattern as a
B: Planning and Design 32: 65–84
function of these causative factors, the trained ANN Census of India, 2001 (New Delhi: Government of India).
can then be used for future prediction. As a result, Congalton RG (1991) A review of assessing the accuracy
the model calibration time is reduced and not much of classifications of remotely sensed data. Remote
computing power is required. Sensing of Environment 37: 35–46
In the present work, instead of using the traditional Demuth H and Beale M (2003) Neural networks toolbox:
back propagation training algorithm (BP), a faster User guide, version 4. The Mathworks, Inc., USA
version of BP the Levenberg- Marquardt was used, Fazal S (2000) Urban expansion and loss of agricultural
land – a GIS based study of Saharanpur city, India.
which made it possible to evaluate a number of ANN
Environment & Urbanization 12(2): 133–149
architecture in less time. The simulated urban growth Foody GM and Arora MK (1997) An evaluation of some
map had a kappa index greater than 0.6, which factors affecting the accuracy of classification by an
demonstrated that the ANN based model was able to artificial network. International Journal of Remote
predict the location of the built-up cells quite Sensing, 18(4): 799–810
accurately. For evaluating the simulated urban growth Gowda K (1998) Forces of urban form, In International
pattern three spatial metrics were used. The values Symposium on Changing Environmental Scenario in
of these three metrics calculated for the simulated and South Asia During the Past Five Decades, 20–22
December 1998, Calcutta
the actual urban growth map matched very closely.
Gowda K and Sridhara MV (2000) Sustainable city:
Thus, it can be inferred that the model was able to experiences and perspectives. In International
simulate both the location and pattern of urban Conference on urbanisation and housing 2000,
growth. Finally, it can be concluded that the model Bangalore
has successfully coupled GIS, remote sensing and GGIM, Good governance India magazine, available online
ANN, while reducing the subjectivity in urban growth at: www.infochangeindia.org (accessed on 4 April
modelling at the same time. 2005)
Gupta JK and Bawa P (2004) Selecting sites for urban
development projects. Spatio-Economic Development
Record 11(3): 29–33
References Hagan MT and Menhaj M (1994) Training feedforward
networks with the Marquardt algorithm. IEEE Trans.
Arora MK, Narsimham G and Mohanty B (1998) Neural Neural Networks 5(6): 989–993
network approach for the classification of remotely Haykin S (1999) Neural Networks: A Comprehensive
sensed data. Asian Pacific Remote Sensing and GIS Foundation. New Jersey: Prentice Hall
Journal 10(2): 11–18 Hecht-Nielsen R (1987) Kolmogorov’s mapping neural
Arora MK, Das Gupta AS and Gupta RP (2004) An network existence theorem. In Proceedings of the first
artificial neural netwok approach for landslide hazard IEEE International conference on neural networks. 21–
zonation in the Bhagirathi (Ganga) valley, Himalayas. 24 June 1987, San Diego, CA
International Journal of Remote Sensing 25: 559–572 Hush DR (1989) Classification with neural networks: a
Atkinson PM and Tatnall ARL (1997) Neural networks in performance analysis. In Proceedings of the IEEE
remote sensing. International Journal of Remote International conference on systems engineering, Ohio,
Sensing 18(4): 699–709 USA, pp 277–280
Barredo JI, Kasanko M, McCormick N and Lavalle C Kanellopoulos I and Wilkinson GG (1997) Strategies and
(2003) Modeling dynamic spatial process: simulation best practice for neural network image classification.
of urban future scenarios through cellular automata. International Journal of Remote Sensing 18(4): 711–
Landscape and Urban Planning 64: 145–160 725
376 J. Indian Soc. Remote Sens. (September 2009) 37:363–376

Kanungo DP, Arora MK, Sarkar S and Gupta RP (2006) Sivanandam SN, Sumathi S and Deepa SN (2006)
A comparative study of conventional ANN black box, Introduction to neural networks using Matlab 6.0 New
fuzzy and combined neural and fuzzy weighting Delhi: Tata McGraw-Hill
procedures for landslide susceptibility zonation in Subudhi AP (1998) Monitoring and mapping: Urban
Darjeeling Himalayas. Engineering Geology 85: 347– growth and changing land use pattern in and around
366 Saharanpur city. Ph.D thesis, Ch. Charan Singh
Kavzoglu T and Mather PM (2003) The use of back University, Meerut, India
propagation artificial neural networks in land cover Tayal T and Bharat A (1997) Managing urban environment:
classification. International Journal of Remote Sensing a case of Bhopal. In International Conference on
24: 4097–4938 Planning for 21st century (concepts & systems), 2–4
Kulshrestha SK (2004) Some Suggestions on the Mixed October 1997, Roorkee, pp 77–86
Land Use Approach for Madhya Pradesh. In 53 rd Wang F (1994) The use of artificial neural networks in
National Town & Country Planners Congress, 27–29 geographical information system for agricultural land-
December 2004, Indore, pp 177–181 suitability assessment. Environment and Planning A,
Kulshrestha SK (2007a) Emerging challenges in spatial 26: 265–284
development and need for reforms in urban and Ward DP, Murray AT and Phinn SR (2000) A stochastically
regional planning legislations. In 55th National Town constrained cellular model of urban growth.
& Country Planners Congress, 9–11 January 2007, Computers, Environment and Urban Studies
Kerala, pp 5–11 24:539–558
Kulshrestha SK (2007b) Peri-Urban areas: The concept, White R and Engelen G (1993) Cellular automata and
concerns, and planning and development policies. In fractal urban form: a cellular modeling approach to the
56th National Town & Country Planners Congress, 20– evolution of urban land use patterns. Environment and
22 December 2007, Kolkata, pp 31–37 Planning A 25: 1175–1199
Kumar S (2004) Neural Networks: A Classroom Approach. White R and Engelen G (1994) Cellular dynamics and
(New Delhi: Tata McGraw Hall) GIS: modeling spatial complexity. Geographical
Lillesand TM and Kiefer RW (2000) Remote sensing and Systems 1: 237–253
image interpretation New York: John Wiley & Sons White R and Engelen G (1997) Cellular automata as the
Paola JD (1994) Neural network classification of basis of integrated dynamic regional modeling.
multispectral imagery. MSc thesis, The University of Environment and Planning B 24: 235–246
Arizona, Tucson, USA Wu F (1998a) An empirical model of intrametropolitan
Paul R and Bharat A (1997) Solutions for metropolitan land use changes in a Chinese city. Environment and
development. In International Conference on Planning Planning B 25: 245–263
for 21st century (concepts & systems), 2–4 October Wu F (1998b) Simland: A prototype to simulate land
1997, Roorkee, pp 257–265 conversion through the integrated GIS and CA with
Pijanowski BC, Shellito BA, Pithadia S and Alexandridis, AHP-derived transition rules. International Journal of
K (2002b) Forecasting and assessing the impact of Geographical Information Science 12: 63–82
urban sprawl in coastal watersheds along eastern lake Wu F (1999) GIS-based simulation as an exploratory
Michigan. Lakes & Reservoirs: Research and analysis for space-time processes. The Journal of
Management 7: 189–199 Geographical Systems 1(3): 199–218
Pijanowski BC, Pithadia S, Shellito BA and Alexandridis, Wu F (2002) Calibration of stochastic cellular automata:
K (2005) Calibrating a neural network-based urban the application to rural-urban land conversions.
change model for two metropolitan areas of the Upper International Journal of Geographical Information
Midwest of the United States. International Journal Science 16(8): 795–818
of Geographical Information Science 19(2): 197–215 Wu F and Webster C (1998) Simulation of land
Ripley BD (1993) Statistical aspects of neural networks. development through the integration of cellular
In Networks and Chaos –Statistical and Probabilistic automata and multicriteria evaluation. Environment
Aspects, Barndorff-Nielsen OE, Jensen JL and Kendall and Planning B 25: 103–126
WS (Eds), pp 40–123 London: Chapman & Hall Wu F and Webster C J (2000) Simulation of artificial
Rumelhart DE, Hinton GE and Williams (1986) Learning cities in a GIS environment: urban growth under
representatons by Back-propogating errors. Nature 23: alternative regulation regime. International Journal of
533–536 Geographical Information Science 14(7): 625–648

View publication stats

You might also like