Int. J. Remote Sensing, 1997, vol. 18, no. 4, 699–709
Introduction
Abstract. Over the past decade there have been considerable increases in both the quantity of remotely sensed data available and the use of neural networks. These increases have largely taken place in parallel, and it is only recently that several researchers have begun to apply neural networks to remotely sensed data. This paper introduces this special issue, which is concerned specifically with the use of neural networks in remote sensing. The feed-forward back-propagation multi-layer perceptron (MLP) is the type of neural network most commonly encountered in remote sensing and is used in many of the papers in this special issue. The basic structure of the MLP algorithm is described in some detail, while some other types of neural network are mentioned. The most common applications of neural networks in remote sensing are considered, particularly those concerned with the classification of land and clouds, and recent developments in these areas are described. Finally, the application of neural networks to multi-source data and fuzzy classification is considered.
1. Background
The current generation of Earth observation sensors is producing data with great potential for use in scientific and technological investigations, in very large and ever-increasing quantities. Whilst such data provide a considerable resource with which to address many fundamental environmental issues, they also present new challenges of data processing and data interpretation. These challenges must be tackled if the full potential of the data is to be realised. Not only is this necessary for efficient use of the present data, it also provides an important constraint on the need for, and an influence on the design of, instruments proposed for future sensor platforms. It is in the context of these requirements that artificial neural networks (ANNs) are currently being applied in a wide variety of remote sensing applications. Good introductions to neural networks are provided in texts such as Kohonen (1988), Beale and Jackson (1990), Simpson (1990), Bishop (1995) and Aleksander and Morton (1991).
The use of artificial neural networks for remote sensing data interpretation has been motivated by the realisation that the human brain is very efficient at processing vast quantities of data from a variety of different sources. Neurons in the human brain receive inputs from other neurons and produce an output (if the sum of the inputs is above a certain threshold) which is then passed to other neurons. For some time it has been recognized that a mathematical approach based on the actions of biological neurons may be implemented to process and interpret many different types of digital data. While it is not possible or desirable to reproduce the complexity of the human brain on a computer, artificial neural networks that are based on an architecture of simple processing elements like neurons are proving successful for a wide range of applications, including processing and interpreting remotely sensed data.

0143–1161/97 $12.00 © 1997 Taylor & Francis Ltd
P. M. Atkinson and A. R. L. Tatnall
In the above sense, neural networks are an artificial intelligence (AI) technique and, therefore, come from the same family as expert systems and knowledge-based approaches to learning (Key et al. 1989). However, whereas expert systems are based on symbolic representation and, therefore, incorporate qualitative data into the estimation through prior programming of the learning algorithm, neural networks employ a connectionist approach in which computer code is required only to run the network (Hepner et al. 1990).
Neural networks, in the simplest sense, may be seen as data transformers (Pao 1989), where the objective is to associate the elements in one set of data with the elements in a second set. When applied to classification, for example, they are concerned with the transformation of data from feature space to class space. Neural networks, therefore, belong to the same class of techniques as automated pattern recognition (Ritter et al. 1988), regression, and spectral (and textural) classification. Given the importance of these techniques, it is not surprising that neural networks are finding increasing application in remote sensing.
The rapid uptake of neural approaches in remote sensing is due mainly to their widely demonstrated ability to:

(i) perform more accurately than other techniques such as statistical classifiers, particularly when the feature space is complex and the source data have different statistical distributions (Benediktsson et al. 1990, 1993, Schalko 1992);

(ii) perform more rapidly than other techniques such as statistical classifiers (Bankert 1994, Côté and Tatnall 1995);

(iii) incorporate a priori knowledge and realistic physical constraints into the analysis (Brown and Harris 1994, Foody 1995a, b);

(iv) incorporate different types of data (including those from different sensors) into the analysis, thus facilitating synergistic studies (Benediktsson et al. 1993, Benediktsson and Sveinsson 1997).
Given this list of benefits, it is clear that one of the main opportunities offered by neural networks is to allow the efficient handling of the large quantities of remotely sensed data which are currently being produced.

In the remainder of this paper, one of the main differences between statistical and neural approaches is discussed, a common type of neural network employed in remote sensing is introduced, and several applications of neural networks in remote sensing are considered. The objective is not to review neural networks in remote sensing, but rather to provide a context for the papers in this special issue. A useful review may be found in Paola and Schowengerdt (1995), while some general guidance is provided in Kanellopoulos and Wilkinson (1997).
data for training (Lee et al. 1990). One of the main advantages of neural networks for classification is that they are distribution-free, that is, no underlying model is assumed for the multivariate distribution of the class-specific data in feature space. It is, therefore, possible for a single class to be represented in feature space as a series of clusters (rather than a single cluster). A fundamental difference between statistical and neural approaches to classification is, therefore, that statistical approaches depend on an assumed model, whereas neural approaches depend on data. It is for this reason that neural networks are suitable for integrating data from different sources. More recent work has concentrated on the incorporation of additional knowledge into the neural network (for example, Foody 1995b).
biological neurons). Each node is a simple processing element that responds to the weighted inputs it receives from other nodes. The arrangement of the nodes is referred to as the network architecture (figure 1).
The MLP can separate data that are not linearly separable because it is 'multi-layer', and it generally consists of three (or more) types of layers. In this paper, it has been assumed that the number of layers in a network refers to the number of layers of nodes and not to the number of layers of weights. The latter definition can also be found in the literature (Bishop 1995). The first type of layer is the input layer, where the nodes are the elements of a feature vector. This vector might consist of the wavebands of a data set, the texture of the image (Kaminsky et al. 1997) or other more complex parameters (Chen et al. 1997). In the paper by Logar et al. (1997), a large number of possible feature elements are considered and a way of selecting the most appropriate is described.
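As an illustration of how such an input feature vector might be assembled, the following sketch stacks the waveband values of a pixel with a simple texture measure (local variance over a 3 × 3 window). The window size, the choice of variance as the texture measure, and all function names are illustrative assumptions, not methods taken from the papers cited above.

```python
def local_variance(image, row, col, half_window=1):
    """Variance of pixel values in a (2h+1) x (2h+1) window,
    used here as a simple texture measure (illustrative choice)."""
    values = [image[r][c]
              for r in range(row - half_window, row + half_window + 1)
              for c in range(col - half_window, col + half_window + 1)]
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)

def feature_vector(bands, row, col):
    """Input-layer feature vector for one pixel: one spectral
    element per waveband plus one texture element per band."""
    spectral = [band[row][col] for band in bands]
    texture = [local_variance(band, row, col) for band in bands]
    return spectral + texture

# Two toy 3 x 3 'wavebands' for a single-pixel illustration.
band1 = [[10, 12, 11], [13, 10, 12], [11, 14, 10]]
band2 = [[40, 41, 40], [40, 42, 41], [43, 40, 41]]
fv = feature_vector([band1, band2], 1, 1)
print(len(fv))  # 2 spectral + 2 texture elements
```

Each element of the returned vector would feed one node of the input layer.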
The second type of layer is the internal or 'hidden' layer, so called because it contains neither input nor output units. There are no firm rules for choosing the number of hidden layers, but theory shows that one hidden layer can represent any Boolean function. An increase in the number of hidden layers enables the network to learn more complex problems, but the capacity to generalize is reduced and there is an associated increase in training time (Foody 1995b). Lippmann (1987) suggests that if a second hidden layer is used, the maximum number of nodes in the second hidden layer should be three times the number in the first hidden layer. Foody and Arora (1997) assess several factors affecting the accuracy of a neural network, one of which is the number of nodes in the hidden layer.
The third type of layer is the output layer, and this presents the output data (for example, for image classification, the number of nodes in the output layer is equal to the number of classes, c, in the classification). Some researchers have suggested that if the number of nodes in the output layer is actually greater than c, then greater classification accuracies may result (Benediktsson et al. 1993).

Each node in the network is interconnected to the nodes in both the preceding and following layers by connections. These connections have associated with them weights (or synaptic efficacies, following the biological analogy).
associated with the connection. The receiving node sums the weighted signals from all nodes to which it is connected in the preceding layer. Formally, the input that a single node j receives is weighted according to:

net_j = Σ_i v_ji o_i    (1)

where v_ji represents the weight between node i and node j, and o_i is the output from node i. The output from a given node j is then computed from:

o_j = f(net_j)    (2)

The function f is usually a non-linear sigmoid function that is applied to the weighted sum of inputs before the signal passes to the next layer. When the signal reaches the output layer it forms the network output. In traditional hard classification (where pixels are assigned to a single class only), the output of one node (that of the chosen class) is set to one, while all other nodes in the output layer are equal to zero.
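The feed-forward computation of equations (1) and (2) can be sketched as follows, using a logistic sigmoid for f, arbitrary layer sizes (4 input wavebands, 6 hidden nodes, c = 3 classes) and random initial weights. All of these choices are illustrative assumptions.

```python
import math
import random

def sigmoid(x):
    # Non-linear activation applied to the weighted sum (equation (2))
    return 1.0 / (1.0 + math.exp(-x))

def forward(weights, features):
    """Feed a feature vector forward through an MLP.

    weights[l][j][i] is the weight v_ji between node i of layer l
    and node j of layer l + 1.
    """
    outputs = features
    for layer_weights in weights:
        # net_j = sum_i v_ji * o_i (equation (1)), then o_j = f(net_j)
        outputs = [sigmoid(sum(v * o for v, o in zip(row, outputs)))
                   for row in layer_weights]
    return outputs

# Random initial weights, as is usual before training.
random.seed(0)
layers = [4, 6, 3]
weights = [[[random.uniform(-1, 1) for _ in range(n_in)]
            for _ in range(n_out)]
           for n_in, n_out in zip(layers, layers[1:])]

out = forward(weights, [0.2, 0.5, 0.1, 0.9])
print(len(out))  # one output node per class
```

Before training, the three outputs are arbitrary values in (0, 1); training adjusts the weights so that the node of the correct class approaches one and the others approach zero.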
3.4. Training
The aim of network training is to build a model of the data-generating process so that the network can generalize and predict outputs from inputs that it has not seen before. For the MLP, a training pattern is presented to the network and the signals are fed forward as described above. Then, the network output is compared with the desired output (a set of training data, for example, of known classes) and the error computed. This error is then back-propagated through the network and, generally, the weights of the connections (which are usually set randomly at the start) are altered according to what is known as the generalized delta rule (Rumelhart et al. 1986):

Δv_ji(n+1) = η(δ_j o_i) + αΔv_ji(n)    (3)

where η is the learning rate parameter, δ_j is an index of the rate of change of the error, and α is the momentum parameter. This process of feeding forward signals and back-propagating the error is repeated iteratively until the error of the network as a whole is minimized or reaches an acceptable magnitude. It is through the successive modification of the (adaptive) weights that the neural network is able to learn.
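A single application of the generalized delta rule of equation (3) might be sketched as follows. The learning rate and momentum values are illustrative assumptions, and the error terms δ_j (which come from the back-propagation step itself) are assumed to have been computed already.

```python
def delta_rule_update(weights, deltas, outputs, prev_updates,
                      learning_rate=0.2, momentum=0.9):
    """One application of the generalized delta rule (equation (3)).

    weights[j][i]      : weight v_ji between node i and node j
    deltas[j]          : back-propagated error term delta_j for node j
    outputs[i]         : output o_i of node i in the preceding layer
    prev_updates[j][i] : previous change, carried by the momentum term
    """
    new_updates = []
    for j, row in enumerate(weights):
        new_row = []
        for i, _ in enumerate(row):
            # Delta v_ji(n+1) = eta * delta_j * o_i + alpha * Delta v_ji(n)
            change = (learning_rate * deltas[j] * outputs[i]
                      + momentum * prev_updates[j][i])
            weights[j][i] += change
            new_row.append(change)
        new_updates.append(new_row)
    return new_updates

# One update step for a 2-node layer fed by 2 preceding nodes.
w = [[0.5, -0.3], [0.1, 0.4]]
prev = [[0.0, 0.0], [0.0, 0.0]]
upd = delta_rule_update(w, deltas=[0.1, -0.2], outputs=[1.0, 0.5],
                        prev_updates=prev)
```

The returned updates would be passed back in as prev_updates on the next iteration, so that the momentum term smooths successive weight changes.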
3.5. Generalizing
Several factors affect the capabilities of the neural network to generalize, that is, the ability of the neural network to interpolate and extrapolate to data that it has not seen before. These include:

(i) Number of nodes and architecture
If a large number of simple processing elements are used, the mathematical structure can be made very flexible and the neural network can be used for a wide range of applications. This may not be necessary for all applications. For example, very simple topologies using a small number of data points have been investigated (Yahn and Simpson 1995). In general terms, the larger the number of nodes in the hidden layer(s), the better the neural network is able to represent the training data, but at the expense of the ability to generalize.
(ii) Size of training set
The data set used must be representative of the entire distribution of values likely to be associated with a particular class. If the extent of the distribution of the data in feature space is not covered adequately, the network may fail to classify new data accurately. A consequence of this for the MLP algorithm is that large quantities of data are often required for training, and researchers are often concerned with finding the minimum size of data set necessary (for example, Hepner et al. 1990).

The requirement for large training data sets also means that training times may be long. To speed up the training process, several modifications to the MLP algorithm have been introduced, including the momentum term (see above), the delta-bar-delta rule, and optimization procedures (Benediktsson et al. 1993). Paola and Schowengerdt (1995), in a comparison of the back-propagation neural network and maximum-likelihood classifiers, identified the choice of training times necessary for mean square error minimization as problematic.
(iii) Training time
The time taken for training also affects the generalizing capabilities of the network. The longer that a network is trained on a specific data set, the more accurately it will be able to classify those data, but at the expense of the ability to classify previously unseen data. In particular, it is possible to overtrain a network so that it is able to memorize the training data but is not able to generalize when it is applied to different data.
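One common safeguard against overtraining, not specific to this paper, is to monitor the error on a held-out data set and stop training once that error ceases to improve. A minimal sketch follows; train_step and validation_error are hypothetical callables standing in for one epoch of weight updates and an error measure on unseen data, and the patience value is an illustrative assumption.

```python
def train_with_early_stopping(train_step, validation_error,
                              max_epochs=1000, patience=10):
    """Stop training once the held-out error stops improving.

    train_step()       : runs one epoch of weight updates (hypothetical)
    validation_error() : error on data the network has never seen
    """
    best_error = float("inf")
    epochs_without_improvement = 0
    for epoch in range(max_epochs):
        train_step()
        error = validation_error()
        if error < best_error:
            best_error = error
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break  # further training would mostly memorize
    return best_error

# Toy check: validation error falls then rises, as with overtraining.
errors = iter([0.9, 0.5, 0.3, 0.31] + [0.4] * 50)
best = train_with_early_stopping(lambda: None, lambda: next(errors))
```

In this toy run the held-out error bottoms out at 0.3 and training halts shortly after it begins to rise, rather than continuing to memorize the training data.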
4.1. Land
Howald (1989), McClelland et al. (1989), Hepner et al. (1990), and Downey et al. (1992) all applied neural networks to classify land cover from Landsat Thematic Mapper (TM) imagery, and all found to varying degrees that the neural approach was more accurate than traditional statistical classification. Kanellopoulos et al. (1992) conducted an experiment to estimate twenty land cover classes from SPOT High Resolution Visible (HRV) imagery, and again found that the neural approach was more accurate. Decatur (1989) applied neural networks to classify terrain from synthetic aperture radar (SAR) imagery, and Ersoy and Hong (1990) applied a hierarchical network to classify airborne multispectral scanner system (MSS) imagery. Civco (1993) found that in certain circumstances neural networks were actually less accurate than conventional statistical approaches for classifying land cover.
4.2. Clouds
A similar approach has been taken to evaluate neural networks for identifying and classifying clouds. For example, Lee et al. (1990) used a feed-forward back-propagation neural network to classify different categories of clouds, including cirrus, stratocumulus, and cumulus, from Landsat MSS data. The results showed a significant improvement over classical methods and gave an overall accuracy of 93 per cent. Several different types of neural network were compared by Welch et al. (1992) for classifying cloud data. They found the feed-forward back-propagation neural network to be accurate, but slow to train compared to other types of network, particularly the probabilistic neural network. This network makes use of Bayesian classification, assigning each case to the class with the largest value of the posterior class probability density function. The probabilistic type of network was successfully used by Bankert (1994) to classify clouds in maritime regions. In these studies, one goal is to automate the handling of satellite sensor imagery, and there is growing recognition that use must be made of cloud shape, size, texture, and context (Pankiewicz 1995, Lewis et al. 1997). Another goal is that of cloud screening to assist other work (Yahn and Simpson 1995, Logar et al. 1997).
the incorporation of a priori knowledge and data from different sources into the estimation. Neural networks are finding use in a wide range of applications in remote sensing, and new applications are being proposed frequently. In this special issue, some of the applications identified above are investigated further, and several new applications are reported.
References
Aleksander, I. and Morton, H., 1991, An Introduction to Neural Computing (London: Chapman and Hall).
Anderson, J. A. and Rosenfeld, E., 1988, Neurocomputing (Cambridge, MA: MIT Press).
Arora, M. K. and Foody, G. M., 1997, An evaluation of variables affecting the accuracy of probabilistic, fuzzy and neural network classifications with log-linear modelling. International Journal of Remote Sensing, 18, 785–798 (this issue).
Atkinson, P., Cutler, M. and Lewis, H., 1997, Mapping sub-pixel variation in land cover in the U.K. from AVHRR imagery. International Journal of Remote Sensing, 18.
Fiset, R. and Cavayas, F., 1997, Automatic comparison of a topographic map with remotely sensed images in a map updating perspective: the road network case. International Journal of Remote Sensing, 18, 991–1006 (this issue).
Fisher, P. F. and Pathirana, S., 1990, The evaluation of fuzzy membership of land cover classes in the suburban zone. Remote Sensing of Environment, 34, 121–132.
Foody, G. M., 1995a, Using prior knowledge in artificial neural network classification with a minimal training set. International Journal of Remote Sensing, 16, 301–312.
Foody, G. M., 1995b, Land cover classification using an artificial neural network with ancillary information. International Journal of Geographical Information Systems, 9, 527–542.
Foody, G. M. and Arora, M. K., 1997, An evaluation of some factors affecting the accuracy of classification by an artificial neural network. International Journal of Remote Sensing, 18, 799–810 (this issue).
Foody, G. M., Lucas, R. M., Curran, P. J. and Honzak, M., 1997, Non-linear mixture modelling without end-members using an artificial neural network. International Journal of Remote Sensing, 18, 937–953 (this issue).
Gopal, S. and Woodcock, C., 1994, Theory and methods for accuracy assessment of thematic maps using fuzzy sets. Photogrammetric Engineering and Remote Sensing, 60, 181–188.
Heerman, P. D. and Khazenie, N., 1992, Classification of multi-spectral remote sensing data using a back-propagation neural network. IEEE Transactions on Geoscience and Remote Sensing, 30, 81–88.
Hepner, G. F., Logan, T., Ritter, N. and Bryant, N., 1990, Artificial neural network classification using a minimal training set: comparison to conventional supervised classification. Photogrammetric Engineering and Remote Sensing, 56, 469–473.
Hopfield, J. J. and Tank, D. W., 1985, Neural computation of decisions in optimization problems. Biological Cybernetics, 52, 141–152.
Howald, K. J., 1989, Neural network image classification. Proceedings of the ASPRS-ACSM Fall Convention (Falls Church, VA: American Society for Photogrammetry and Remote Sensing), pp. 207–215.
Ito, Y. and Omatu, S., 1997, Category classification using a self-organizing neural network. International Journal of Remote Sensing, 18, 829–845 (this issue).
Jin, Y.-Q. and Liu, C., 1997, Biomass retrieval from high-dimensional active/passive remote sensing data by using an artificial neural network. International Journal of Remote Sensing, 18, 971–979 (this issue).
Kaminsky, E. J., Barad, H. and Brown, W., 1997, Textural neural network and version space classifiers for remote sensing. International Journal of Remote Sensing, 18, 741–762 (this issue).
Kanellopoulos, I. and Wilkinson, G. G., 1997, Strategies and best practice for neural network image classification. International Journal of Remote Sensing, 18, 711–725 (this issue).
Kanellopoulos, I., Varfis, A., Wilkinson, G. G. and Mégier, J., 1992, Land-cover discrimination in SPOT HRV imagery using an artificial neural network: a 20-class experiment. International Journal of Remote Sensing, 13, 917–924.
Key, J., Maslanik, J. A. and Schweiger, A. J., 1989, Classification of merged AVHRR and SMMR Arctic data with neural networks. Photogrammetric Engineering and Remote Sensing, 55, 1331–1338.
Kohonen, T., 1984, Self Organization and Associative Memory (Berlin: Springer-Verlag).
Kohonen, T., 1988, An introduction to neural computing. Neural Networks, 1, 3–16.
Lee, J., Weger, R. C., Sengupta, S. K. and Welch, R. M., 1990, A neural network approach to cloud classification. IEEE Transactions on Geoscience and Remote Sensing, 28, 846–855.
Lee, J. J., Shim, J. C. and Ha, Y. H., 1994, Stereo correspondence using the Hopfield neural network of a new energy function. Pattern Recognition, 27, 1513–1522.
Lewis, H. G., Côté, S. and Tatnall, A. R. L., 1997, Determination of spatial and temporal characteristics as an aid to neural network cloud classification. International Journal of Remote Sensing, 18, 899–915 (this issue).
Lippmann, R. P., 1987, An introduction to computing with neural nets. IEEE ASSP Magazine, 2, 4–22.
Logar, A., Corwin, E., Alexander, J., Lloyd, D., Berendes, T. and Welch, R., 1997, A
Neural networks in remote sensing