Professional Documents
Culture Documents
6 Data Model PDF
6 Data Model PDF
Sanjeevan Shrestha
Introduction
Data Models
• Viewing an image only as ‘data’ disconnects the remote sensing
analyst from the underlying physical processes that creates the ‘data’.
• Specific data models can provide the link between the physics of
remote sensing and the design of image processing algorithms
Designation of Pixel in image?
• Image as an array of numbers, with indexes i and j, where the array
values are the pixel DNs.
• A pixel value at row i and column j would be denoted as 𝐷𝑁𝑖𝑗
• Rows and columns are conveniently numbered from (1,1) at the
upper left to the (N,M) at the lower right of the image array
Image Statistics
Remote Sensing pixel and basic statistics
Many different ways to check the pixel values and statistics:
• Looking at the frequency of occurrence of individual brightness values
(or Digital Number) in the image displayed in a histogram
• Viewing on a computer monitor the individual pixel brightness value
or DN at specific locations or within a geographic area
• Computing univariate descriptive statistics to determine if there are
unusual anomalies in the image data
• Computing multivariate statistics to determine the amount of
between band correlation to identify redundance
RS data distribution
• Large sample drawn randomly from natural populations usually
produce a symmetrical frequency distribution; most values are
clustered around some central values, and the frequency of
occurrence declines away from this central point- bell shaped, and is
also called a normal distribution.
• Many statistical tests used in the analysis of remotely sensed data
assume that the brightness values (DN) recorded in a scene are
normally distributed.
• Unfortunately, remotely sensed data may not be normally distributed
and analyst must be careful to identify such conditions. In the
instances, non-parametric statistical theory may be preferred.
Univariate Image Statistics
• This is generally apply to the single band images.
• Univariate image statistics may be further classified into:
• Histogram
• Cumulative Histogram
DN Histogram
• Describes the statistical distribution of image pixels in terms of the
number of pixels at each DN.
• Measures brightness distribution
• It is calculated simply by counting the number of pixels in each DN
‘bin’ and divided by total number of pixels in the image, N.
ℎ𝑖𝑠𝑡𝐷𝑁 = 𝑐𝑜𝑢𝑛𝑡(𝐷𝑁)/𝑁
• This is analogous to the continuous Probability Density Function (PDF)
of statistics
ℎ𝑖𝑠𝑡𝐷𝑁 ≈ 𝑃𝐷𝐹(𝐷𝑁)
DN Histogram
• The histograms of larger images of land areas are typically unimodal
i.e. they have a single peak
• Usually skewed, with a tail towards the higher DNs.
• Histogram contains no direct information about the spatial
distribution of pixels
• However, spatial information can be inferred from the spatial
distribution from such pixels i.e. strongly bimodal histogram usually
indicates two dominant materials in the scene.
• What we cannot say is that how two materials are spatially
connected.
DN Histogram
• The image histogram is the useful tool for the contrast enhancement.
• A common contrast enhancement techniques stretches the range of
DNs and clips or thresholds it at one or both ends resulting in a
certain percentage of saturated pixels.
• The appropriate DN thresholds can be obtained from the histogram
percentages of the total number of pixels in the image.
Cumulative Histogram
• Some image processing algorithms, notably histogram equalization,
histogram matching etc. require a function, the cumulative histogram
𝐷𝑁
𝑐ℎ𝑖𝑠𝑡𝐷𝑁 = ℎ𝑖𝑠𝑡𝑚𝑖𝑛
𝐷𝑁= 𝐷𝑁𝑚𝑖𝑛
• The cumulative histogram is the fraction of pixels in the image with a
DN less than or equal to the specified DN.
• This is monotonic function of DN, since it can only increase as each
histogram value is accumulated.
• This is also called the Cumulative Distribution Function (CDF)
Statistical Parameters
• The mode is the value that occurs most frequently in a distribution
and is usually the highest point on the curve (histogram). It is
common, however, to encounter more than one mode in a remote
sensing dataset.
• The median is the value midway in the frequency distribution. One
half of the area below the distribution curve is to the right of the
median, and one half is to the left.
Statistical Parameters
• The mean is the arithmetic average and is defined as the sum of all
brightness value observations divided by the number of observations.
• This can be defined as the weight of each DN by the corresponding
histogram value (the fraction of the image that has that DN) and sum
of the weighted DNs.
𝑁 𝐷𝑁=𝐷𝑁𝑚𝑎𝑥
1
𝜇= 𝐷𝑁𝑝 = 𝐷𝑁 × ℎ𝑖𝑠𝑡𝐷𝑁
𝑁
𝑝=𝑗 𝐷𝑁= 𝐷𝑁𝑚𝑖𝑛
Statistical Parameters
• The image standard deviation can also be used as a measure of image
contrast since it is a measure of the histogram width i.e. the spread in
DNs.
Statistical Parameters
• Skewness
• Measure of asymmetry
• Is zero for any symmetric histogram
• A histogram with a long tail toward larger DNs has a positive/negative
skewness and this is typical of remote sensing images.
• If a distribution has a long right tail of larger values, it is positively skewed and
if it has a long left tail of small values, it is negatively skewed.
𝑁 3 𝐷𝑁𝑚𝑎𝑥 3
1 𝐷𝑁𝑝 − 𝜇 𝐷𝑁 − 𝜇
𝑠𝑘𝑒𝑤𝑛𝑒𝑠𝑠 = = × ℎ𝑖𝑠𝑡𝐷𝑁
𝑁 𝜎 𝜎
𝑝=1 𝐷𝑁= 𝐷𝑁𝑚𝑖𝑛
Statistical Parameters
• Kurtosis
• Measure the sharpness of peak relative to the normal distribution
• Is zero for the normal distribution
• If a histogram has a positive kurtosis, then the peak is sharper than that of a
gaussian
• A negative kurtosis means the peak is less sharp than that of gaussian
𝑁 4
1 𝐷𝑁 𝑝 − 𝜇
𝑘𝑢𝑟𝑡𝑜𝑠𝑖𝑠 = −3
𝑁 𝜎
𝑝=1
𝐷𝑁𝑚𝑎𝑣 4
𝐷𝑁 − 𝜇
= × ℎ𝑖𝑠𝑡𝐷𝑁 − 3
𝜎
𝐷𝑁= 𝐷𝑁𝑚𝑖𝑛
Statistical Parameters
• Both skewness and kurtosis are normalized by the standard deviation
and are unitless, unlike the mean and standard deviation.
• Skewness and kurtosis are quite sensitive to outliers, pixels with DNs
far removed from the majority distribution, because of their high
order.
Multivariate Image Statistics
• Remote sensing research is often concerned with the measurement
of how much radiant flux is reflected or emitted from an object in
more than one band.
• It is useful to compute multivariate statistical measures such as
covariance and correlation among the several bands to determine
how the measurements covary.
• Later it will be shown that variance-covariance and correlation
matrices are used in remote sensing principal components analysis
(PCA), feature selection, classification and accuracy assessment.
Scatterplot
• One way to visualize two or three
dimensional data is the
scatterplot.
• This is binary plot which shows
the dot if a particular
multispectral vector has a
histogram count of at least one.
• However, The number of pixels
with a particular vector is not
shown.
• This 3d nature of scatterplot can
help to reveal different features in
the data and for image
interpretation.
Scatterplot
𝑐𝑠𝑡𝑑 = 𝜎𝐷𝑁
Statistical Measure for Image quality
• Modulation
Another easily measured image property is modulation, M, is defined
as
𝐷𝑁𝑚𝑎𝑥 − 𝐷𝑁𝑚𝑖𝑛
𝑀=
𝐷𝑁𝑚𝑎𝑥 + 𝐷𝑁𝑚𝑖𝑛
2
𝜎𝑠𝑖𝑔𝑛𝑎𝑙
𝑆𝑁𝑅𝑣𝑎𝑟 =
𝜎𝑛𝑜𝑖𝑠𝑒
Signal to Noise Ratio
• The SNR expressed in decibels (dB) is given by
𝑆𝑁𝑅𝑑𝐵 = 10log(𝑆𝑁𝑅)