Professional Documents
Culture Documents
Screenshot 2023-12-04 at 11.27.14
Screenshot 2023-12-04 at 11.27.14
14.2°
Ice Cream Sales
$215
16.4° $325
11.9° $185
The local ice cream shop keeps
track of how much ice cream 15.2° $332
they sell versus the noon 18.5° $406
temperature on that day. Here 22.1° $522
are their figures for the last 12
days: 19.4° $412
25.1° $614
23.4° $544
18.1° $421
22.6° $445
17.2° $408
Interpolation
• is where we find a
value inside our set of
data points.
Extrapolation
• is where we find a
value outside our set of
data points.
Correlation
• Correlation is a statistical measure that expresses the extent to
which two variables are linearly related
• meaning they change together at a constant rate
• Correlation is Positive when the values increase together, and
• Correlation is Negative when one value decreases as the other
increases
Correlation
Pearson Correlation
• 2 continuous variables
• Linear relationship
• Association between height and weight
• Measures the degree of linear
association between two interval scaled
variables.
• Analysis of the relationship between
two quantitative outcomes like height
and weight
Examples
1. The consumption of ice-cream increases during the summer months. There is a strong
correlation between the sales of ice-cream units. In this particular example, we see there
is a causal relationship also as the extreme summers do push the sale of ice-creams up.
2. Ice-creams sales also have a strong correlation with shark attacks. Now as we can see
very clearly here, the shark attacks are most definitely not caused due to ice-creams. So,
there is no causation here.
The table below
demonstrates
how to interpret
the size (strength)
of a correlation
coefficient
SR NO AGE (X) WEIGHT (Y)
1 40 78
• 6 people having a different age and
different weights given below for 2 21 70
5 38 80
6 47 66
Spearman's rank correlation
• It is a nonparametric measure of rank correlation (statistical dependence between
the rankings of two variables). It assesses how well the relationship between two
variables can be described using a monotonic function.
• Pooja participating in a beauty pageant
• Overall, there were 7 participants in the
beauty pageant JUDGE 1 JUDGE 2 RANK 1 RANK 2
(SCORES) (SCORES) (BASED ON (BASED ON
• Two judges were judging the SCORES OF
JUDGE 1)
SCORES OF
JUDGE 2
participants and each one of them were
ranked by respective judges based on 85 87 7 6
the scores
95 84 2 7
• Scores and ranks given to beauty
contestants by respective judges 88 88 5 5
85 87 7 6 1 1
• Table to calculate spearman’s
rank correlation coefficient 95 84 2 7 5 25
88 88 5 5 0 0
86 95 6 1 5 25
6 ∗ 52
𝜌=1− = .07143 92 89 4 4 0 0
7 7 −1
97 91 1 2 1 1
93 90 3 3 0 0
52
Linear Regression
Define
• Modeling and establishing the relationship between one
dependent variable and one independent variable is known as
Simple Linear Regression.
Find linear regression
equation for the
following two sets of
data:
x 2 4 6 8
y 3 7 5 10
Construct the
following table
Calculation
Calculate predicted y
• Substitute in the equation
• State bank of India recently
established a new policy of
linking savings account interest
rate to Repo rate, and the auditor
of the state bank of India wants
to conduct an independent
analysis on the decisions taken by
the bank regarding interest rate
changes whether those have
been changes whenever there
have been changes in the Repo
rate. Following is the summary of
the Repo rate and Bank’s savings
account interest rate that
prevailed in those months are
given below.
• The auditor of state bank has
approached you to conduct an
analysis and provide a
presentation on the same in the
next meeting. Use regression
formula and determine whether
Bank’s rate changed as and when
the Repo rate was changed?
Excel Question