Professional Documents
Culture Documents
Lecture 3 Introduction To Spatial Data Analysis
Lecture 3 Introduction To Spatial Data Analysis
Analysis
https://landsat.gsfc.nasa.gov/nasa-studies-details-of-a-greening-arctic/
1
1
http://www.statcan.gc.ca/pub/16-002-x/2007001/map/5008062-eng.htm
http://thamesriver.on.ca/watershed-health/watershed-report- http://thamesriver.on.ca/education-community/watershed-friends-of-
cards/watershed-map/ projects/stoneycreek/stoney-creek-watershed/
local examples
2
Distance
Calculated Use
Trickier for Numerous
from two Pythagoras
the sphere variants
coordinates formula
3
Other distance metrics
✓ Number of nearest
entities
4
Neighborhood
Means
1. The region of space
around some objects
2. A set of objects
considered to be
neighbors of that object
Hypothesis testing
5
Learning Objectives
6
Sample vs. population
Sample Population
Size n N
_
Mean X µ
Standard
s σ
deviation
Variance s2 σ2
Draw Sample
POPULATION Calculate
Sample
Statistics
𝑋ത = 78
Infer sd = 7
Population
Parameters
7
10
9 8 8
10
11
9 9
10 10
12
11
10
µ = 10 X =9
8
Scientific method in geography
17
18
9
As n increases, sample
means become
more numerous,
approaching a
continuous normal
distribution.
https://stats.libretexts.org/Textbook_Maps/Map%3A_Introductory_Statistics_(Shafer_and_Zhang)/06%3A_Sampl
ing_Distributions/6.2%3A_The_Sampling_Distribution_of_the_Sample_Mean
Example: Central
limit theorem
20
10
The Central Limit Theorem
x= s
n
21
22
11
Sample Size, Standard Deviation and
Sampling Error
Small Standard Deviation of Population Large
Large
Sampling Error
Small
Small Large
23
Sample Size
This can be used to construct a confidence interval for a population mean, by taking a
sample, calculating its mean and SD, and combining the information to state that with
(say) 95% confidence, the mean lies in the interval from A to B.
One other IMPORTANT thing: for the central limit theorem to hold, we generally
assume that a sample size n = 30 is required.
Hypothesis testing is related to this idea, but starts to add in issues of establishing
theories by statistical testing.
12
Distribution of Sample Means
(Nx = 10)
25
Sampling Distribution
of Sample Means
1. Distribution is
centered on the
population mean
2. Unlikely to find
• very low sample
means
• very high sample
means
3. Most likely to
find intermediate
sample means
• These results
derive directly from
the concepts of
probability and
randomness being
applied to a normal
distribution
13
Z-scores are simply transformations of X-scores
to a zero- plus-minus system, where zero = the
mean of the X-scores.
27
• Based on z-score
• Upper and lower bounds of confidence interval
• Confidence level (CI)
✓ p that confidence interval includes true population mean
✓ Examples: 0.90, 0.95, 0.99, 90%, 95%, 99%
• Significance level (α)
✓ p that confidence interval fails to include true population
mean
✓ Examples: 0.10, 0.05, 0.01, 10%, 5%, 1%
28
14
When determining probability, The Normal Distribution table
we are concerned with both only provides values for one
tails of the normal distribution. tail.
15
What is a hypothesis?
• Any statement about reality.
16
The alternative hypothesis, HA
17
Possible errors
• Two errors are possible:
35
H0 True H0 false
Statistical power: (1 - β)
Accept H0 1-α β
= the probability of rejecting a
false null hypothesis. It is the
chance of finding an effect
that truly exists in the Reject H0 α 1-β
population
http://www.wadsworth.com/psychology_d/templates/student_resources/workshops/stat_workshp/statpower/statpower_23.html
36
18
Statistical power H0 True H0 false
Accept H0 1-α β
Reject H0 α 1-β
http://www.wadsworth.com/psychology_d/templates/student_resources/workshops/stat_workshp/statpower/statpower_23.html
37
Distribution
of Sample
Means
Area of
Type II
Error
2-Tail
Critical
Regions
38
19