Professional Documents
Culture Documents
A CA5103 Module 2 Notes
A CA5103 Module 2 Notes
e.g., introducing a new product (forecasting e.g., daily sales of bottled water might depend
demand is difficult due to the lack of any on the average temperature, the average
historical sales data) humidity, and so on.
- Moving Averages
- Consumer market survey – a method that
- Exponential Smoothing
solicits input from customers or potential
- Trend Projections
customers about their future purchasing
- Decomposition
plans; help not only in preparing a forecast
but also in improving product design and 3. Causal Methods
planning for new products.
- Regression Analysis
- Multiple Regression
MANSCI – MODULE 2
Professor: Prof. Mary Jane A. Castilla
Transcribed by: Tyrone Villena
Regression Analysis (Render et al., 2016) - Scatter plots showing various correlation
coefficient values:
- A forecasting technique with generally two
purposes:
1. To understand the relationship between two
variables;
2. To predict the value of one based on the other.
- The sample correlation coefficient (like Pearson’s - The hypothesized relationship may be linear,
r and Spearman rho) measures the degree of quadratic or some other form.
linearity in the relationship between two random - Simple Linear Model is commonly referred to as
variables X and Y, with values in the interval simple regression equation.
[-1.1].
MANSCI – MODULE 2
Professor: Prof. Mary Jane A. Castilla
Transcribed by: Tyrone Villena
Σ𝑌
𝑌̅ = = Average (mean) of Y values
𝑛
𝑏0 = 𝑌̅ − 𝑏1 𝑋̅
- One of the main uses of regression is to make - The standard deviation of the variation of
predictions. Once we have fitted regression observations around the regression line is
equation that show the estimated relationship estimated by:
between X and Y, we can plug in any value of X 2
𝑆𝑆𝐸 ∑𝑛 (𝑌𝑖 − 𝑌̂𝑖 )
(within the range of our sample X values) to obtain 𝑆𝑌𝑋 =√ = √ 𝑖=1
𝑛−2 𝑛−2
the prediction for Y.
- Linearity – the relationship between X and Y is - 𝑆𝑌𝑋 is a measure of the variation of observed Y
linear values from the regression line.
- Independence of errors – the error values - The magnitude 𝑆𝑌𝑋 should always be judged
(difference between observed and estimated relative to the size of Y values in the sample data.
values) are statistically independent.
- Normality of Error – the error values are normally
distributed for any given value of X. Inferences about the slope using the t-test
- Equal variance or homoskedasticity – the
- The t-test for a population slope is used to
probability distribution of the errors has constant
determine if there is a linear relationship between
variance.
X and Y.
- Null and alternative hypotheses
𝐻0 : 𝛽1 = 0 (no linear relationship)
Assessing Fit: Coefficient of determination, 𝑅 2 𝐻1 : 𝛽1 ≠ 0 (linear
relationship does Where:
- The coefficient of determination is the portion of
exist) 𝑏1 = regression slope coefficient
the total variation in the dependent variable that is 𝛽1 = hypothesized slope
𝑏1 − 𝛽1
explained by the variation in the independent 𝑡= 𝑆𝑏1 = standard error of the slope
𝑆𝑏1
variable.
𝑑. 𝑓. = 𝑛 − 2
- It is also called r-squared and is obtained by:
𝑆𝑆𝑅 𝑟𝑒𝑔𝑟𝑒𝑠𝑠𝑖𝑜𝑛 𝑠𝑢𝑚 𝑜𝑓 𝑠𝑞𝑢𝑎𝑟𝑒
𝑟2 = =
𝑆𝑆𝑇 𝑡𝑜𝑡𝑎𝑙 𝑠𝑢𝑚 𝑜𝑓 𝑠𝑞𝑢𝑎𝑟𝑒𝑠
0 ≥ 𝑟2 ≤ 1
2
𝑆𝑆𝑅 = Σ(𝑌̂ − 𝑌̅)
𝑆𝑆𝑇 = Σ(𝑌 − 𝑌̅)2
2
𝑆𝑆𝐸 = Σ(Y − 𝑌̂)