Professional Documents
Culture Documents
Cost and Estimation
Cost and Estimation
CONTENTS
— Cost Estimating Relationship.
— Cost Estimation Method
— Regression Analysis
— Evaluation of the Regression Model
— Multiple Regression
— Learning Curve Theory
Cost Estimating
Cost estimating is a technique used to estimate the behaviour of a particular cost (or price) using an
established relationship with an independent variable(s). In this case, an independent variable is a cost
driver and entails:
— Define the dependent variable or perhaps the cost to be predicted. The choice depends on the
purpose of the cost function. It may also be referred to as a response variable.
— Select the cost drivers (independent, explanatory or predictor) variable. A cost driver is any
factor whose change causes a change in the total cost of an activity. Cost drivers could be direct
labour hours, Machine hours, Number of units, Number of production runs, Number of orders,
etc.
— Collect data concerning the relationship between the dependent and independent variables.
Collecting data is usually the most difficult and time-consuming element of CER development.
All data must be checked and double-checked to ensure that all observations are relevant,
comparable, and relatively free of unusual costs.
— Graph the data (scatter diagram) to indicate the general relationship between the dependent
variable and the cost driver and give a visual indication as to whether a linear cost function can
approximate the cost behaviour. It will also highlight extreme observations (outliers).
— Select the relationship that best predicts the dependent variable.
— Test the reliability of the cost function: logical relationship tests, the goodness of fit test and
model Specification tests
Engineering method: These methods are based on the use of engineering analysis of the
technological relationship between inputs and outputs e.g. method studies and time and motion studies.
The procedure in such a study is to make an analysis based on direct observation of the underlying
physical quantities required for an activity and then to convert the result into a cost estimate. This method
is useful for estimating costs of repetitive processes where the input and output relationship is clearly
defined e.g. the cost associated with direct materials, direct labour and machine time.
High low method (Two-point method): Under this method, records of costs in the previous period are
reviewed and the costs of 2 periods are selected. These are the period with the highest level of outputs
and the period with the lowest output. A line passing through these two points is then established and
used in estimating costs.
Illustration 2.1
The production manager of XYZ Company is concerned about the apparent fluctuation inefficiency
and wants to determine how labour costs (in Sh.) are related to volume. The following data presents
the results of the 12 most recent weeks.
Simple Regression
The regression analysis determines mathematically the regression line of best fit. It is based on the
principle that the sums of squares of the vertical deviation from the line established are the least
possible I.e. (Y Yˆ ) 2
is minimized. Where Y is the observed value of the dependent variable
Ŷis the predicted value of Y. The equation can be solved by the use of normal equations and these are:
1. Y = an + b (x)
My = a (x) + b (x2)
From these normal equations:
b = n my – x y
Nx 2– (x) two
a= Y - bx
N n
Looking at illustration 2.1, then we first compute the sum of X, Y, XU, X2 and Y2
Illustration 2.2
Assume that the company (in illustration 2.1) intends to spend Sh.400 on labour cost next period.
Compute the number of units that the company may produce.
We require a regression of X on Y. i.e. X = g(Y) to answer the above question. The general format of
the equation is: X = a1 + b1 Y
b = n xy – x y
nY 2– (Y)2
a = X - bY
n n
a1 = 430 - 0.0926(3549)
12 12
a1 = 8.3286
Therefore the predicting equation is X̂ = 8.33 + 0.093Y. Thus if the Company intends to spend
Sh.400 on labour, the number of units to be produced will be: X̂ = 8.33 + 0.093(400)= 45.56 units
Approximately 46 units
Logical relationship tests: These tests also referred to as economic plausibility tests, are used to
determine whether there is an expected logical relationship between the independent and the dependent
variable. To carry out this test, it is important to understand the input-output relationship in the company.
For the illustration, there is an expected logical relationship between the number of units and the labour
cost mainly because the higher the number of units, the higher the number of labour hours and therefore
the higher the labour cost.
Goodness of fit tests tests can be divided into two:Tests for the whole model and Testing the slope
r2 = explained variation
Total variation
= (Ŷ - ý)2
(Y- ý)2
r2 = (n xy - x x)2 ________
[n x2 -( x)2] [n y2 – (y)2]
= 0.565
About 56.5% of the variations in labour cost can be explained by variations in units produced while
about 43.5% of the variation in labour cost is explained by other independent variables and the error
term.
Note
The higher the r2, the better the function is. As a rule of thumb, r 2 must be at least equal to 0.8.
(Y Yˆ ) 2
Se = n2
For computation purposes,
Se =
Y 2
a Y b XY
n k 1
= 48.95
The sample size, n, is reduced by 2 because 2 variables ‘a’ & ‘b’ in the regression equation had to be
estimated from the sample observations.
The calculation of the standard error is necessary because the least square line was calculated from
sample data. Other samples would probably result in different estimates. Obtaining the least square
calculation over all the possible observations that might occur would result in the calculation of the true
least square line. The question is “How close does the sample estimate of least square line come to the
true least square line.
The standard error is similar to the standard deviation in normal probability analysis. It is a measure of
variability around the regression line. The std error of estimates enables us to establish a range of values
of the dependent variable within which we may have some degree of confidence that the true value lies.
We can use the following equation to establish this range:
Ŷ – tcSe ≤Y ≤ Ŷ + tcSe
From illustration 2.1, where Ŷ34 = 284.48, the 95% confidence interval can be calculated as follows:
≤ Y ≤ 393.6
We are 95% confident that if X is estimated to be 34 units next period, the true labour cost will lie
between 175.4 and 393.6. Note tc from the student T tables, with 10 degrees of freedom and 5%
significance level, is equal to 2.2281.
The F –test
The significance of the regression results can be tested by using the F- statistics. The F-statistics is a
ratio that compares the explained sum of squares and the unexplained sum of squares.
F= r2 /K_____
(1 – r2)/n-k-1
F statistics can then be used to test the hypothesis that the relationship between the dependent
variables and all the independent variables is not significant.
F= r2 /K_____
(1 – r2)/n-k-1
4.965
Computation of F statistics
F= 0.565/1
(1-0.565)/(12 – 1 –1)
= 12.989
Conclusion
Since the computed F > Fc then we reject Ho. Therefore the relationship between the labour cost and
the number of units is significant.
r = nxy - x y_________
[nx2 – (x)2 ] [ny2 - (y)2]
If the degree of association between two variables is very close then it would be almost possible to
plot the observation on a straight line and r will be almost equal to one.
-1≤ r ≤ 1
If r= -1, then the two variables are said to be perfectly negatively correlated
If r = +1, then the two variables are perfectly positively correlated
If r = 0, then there is no correlation between the two variables.
The reliability of the estimate of the regression coefficient ‘b’ (i.e. the variable cost), is important
since the analyst usually focuses on the rate of variability rather than the absolute level of prediction.
This can be established by the use of the standard error of the slope.
Se 48.95
Sb
x) / n
2
2 430
16234 -
12
1.70
We can then use Sb to construct confidence intervals using t distributions such that the true variable
cost, B, will be:
b – tcSb ≤ B ≤ b + tcSb
For illustration 1.2, the 95% Confidence Interval for the true variable cost will be:
Z or t Statistics
If n ≥30 we use Z, if, n < 30 we use t statistics. These statistics can be used to test the hypothesis:
Ho: B = O that is, there is no relationship between X and Y
HA: B≠ O There is a significant relationship between X and Y
T = 6.1 = 3.59
1.7
Conclusion
Reject Ho since computed T > tc. This means that there is a significant relationship between labour
costs and the number of units.
SPECIFICATION TESTS
These tests are used to test the validity of the regression assumptions. The necessary assumptions in
linear regression are:
Be normally distributed.
Have an expected value (mean) of Zero (0).
Have a constant variance. This is referred to as homoscedasticity. If not constant we have
heteroscedasticity.
Be independent i.e. they are not serially correlated or there is no autocorrelation.
The Specification tests can be done for illustration 2.1 as follows:
Ŷ = 77.08 + 6.10 x
To test whether the observation is normally distributed we can construct a histogram of the
observation.
Independence of observations
An important assumption for the simple linear regression model is the independence of errors. In
many time series models, this assumption is violated because of the correlation of errors in successive
observations. This is referred to as autocorrelation.
Autocorrelation occurs if a positive error is followed by another positive error and a negative error is
followed by another negative error. If autocorrelation occurs then time should be considered as an
important independent variable and therefore time varies analysis should be used.
We can use Durbin Watson ‘D’ statistics to determine whether observations are independent.
D = (ei – ei-1)2
ei2
The Durbin Watson statistics provide a measure of association between successive values of the error
term. The computed statistics are compared against two tabulated values du and dl that depends on the
desired confidence level of the test and the degrees of freedom of the data.
If the computed Durbin Watson “D” statistics is greater than D u, then we can conclude that there’s no
positive correlation between error terms.
If dl ≤ D≤ du then the test is inconclusive and therefore we can neither accept nor reject the null
hypothesis.
Note
A rule of thumb, with uncorrelated errors then D approaches a value of 2. If errors are highly positively
correlated, the D would be less than 1.5 and can be very near to zero (0).
For negatively correlated errors, the value of D will be above 2.5 with an upper limit of 4.
D = (ei – ei-1)2
ei2
= 49461.6 =2.09
Since the calculated value of D is greater than du, then we can accept the null hypothesis, that there is
no positive serial correlation.
The omission of an important variable such as the seasonal effect. (misspecification error).
A shift in the production process may be caused by a change in equipment that has not been shown in
the model.
Multiple Regression
The least-square regression equation discussed above was based on the assumption that total cost was
determined by only one activity-based variable. However, other variables are likely to affect labour
costs such as labour hours, material costs, machine hours, etc. These may affect labour costs.
The equation for the simple regression can be expanded to include more than one independent variable
as shown below:
Ŷ = a + b1X1 + b2 X2 + b3 X3 +………+bn Xn
Ŷ = a + b1X1 + b2 X2
y = na + b1 X1 + b2x2
X1Y = ax1 + b1X12 + b2 X1 X2
X2 Y = ax2 + b1 X 1X2 +b2 X2
Multi-collinearity
Multiple regression analysis is based on the assumption that the independent variables are not correlated
with each other, When the independent variables are highly correlated with each other then it is very
difficult to isolate the effect of each one of these on the dependent variables. This occurs when there is
a simultaneous movement of two or more independent variables in the same direction and almost at the
same time. This condition is called multi-collinearity.
We can use the correlation matrix to determine whether 2 independent variables are highly correlated.
If a correlation value of more than 0.8 exists between two independent variables, then the problem of
multi-colinearity is bound to occur. Alternatively, if the correlation coefficient between the two variables
is greater than the multiple correlation coefficient, then a multi-colinearity problem will occur. To
remove the problem of multicollinearity, we drop one of the correlated variables. You can drop any of
the variables.
Cumulative quantity
A learning curve
Average cost
Cumulative output
The effect of experience on cost is summarised by a learning ratio (improvement ratio or learning rate)
defined by the following;
Interpretation:
Every time cumulative output doubles, the average cost declines to 80 percent of the previous amount.
Since the average cost of the first 1000 units was sh.10, the average cost of the first 2000 units will be
expected to be 20% or sh.8 per unit.
Y = abx
Incremental cost
If producing a second 1000 units is to reduce cumulative average cost from sh.10 to sh.8, the cost of
the second 1000 units will have to be only sh.6000, or sh.6 each. Hence;
Total Cost No. of Units Average Cost (sh.)
(Sh.)
First 1000 units 10,000 1,000 10
Second 1000 units 6,000 1,000 6
16,000 2,000 8
Defining the learning curve in terms of this incremental relationship would be more useful but is more
difficult to work with. As a result, learning curve improvement ratios are usually stated as percentage
reductions in cumulative average labour cost.
Required:
Estimate the total manufacturing cost of 1, 2, 3, 4 units of the product
Solution:
Number of labour hours
Units
X Average (y) Total* Marginal cost Computations
1 1250 1250 1250 Y= 1250 x 1-0.322 = 1250
(i) In Inventory valuation- failing to recognise learning effects can have some unexpected
consequences. (See example below).
Example 3:
Production of a new product starts in January and continues through the year. Direct material cost is
sh.100 per unit throughout the year. Because of the learning effect, the labour hours per unit drop
from 1 hour (at sh.160 per hour) in January to 0.25 hours in December. Manufacturing overhead is all
fixed at sh.80,000. If 1000 units will be produced in January the overhead application rate is sh.80 per
hour. This rate is (mistakenly) applied throughout the year.
January December
Sh. Is (Sh.) Should be (Sh.)
Direct materials 100 100 100
Direct labour 160 40(0.25x160) 40
Overheads applied 80 20(0.25x80) 80
(ii) Decision making – A product newly launched may at a glance appear to be unprofitable,
however, because of the learning effect, the variable costs would drop by the end of the period
making the product profitable.
(iii) Performance evaluation. A bank has developed labour time and cost standards for some of its
clerical activities. These activities are subject to the learning curve effect. The management has
also found that the time on these activities exceeded the standard. On investigation, it was found
that there was high personnel turnover meaning the activities were done by inexperienced people.
Changes were made in personnel policy and the personnel turnover was reduced. The time spent
on clerical activities no longer exceeded standards.
A customer has asked your company to prepare a bid on supplying 800 units of a new product.
Production will be in batches of 100 units. You estimate that costs for the first batch of 100 units will
average to sh100 a unit. You also expect that a 90 percent learning curve will apply to the cumulative
labour costs on this contract.
Solution:
Average cost decreases by 10 percent every time the cumulative total production doubles. Therefore:
Average cost of first 200 units = 0.9 x Average cost of first 100 units
Average cost of first 400 units = 0.9 x Average cost of first 200 units
Average cost of first 800 units = 0.9 x Average cost of first 400 units
Combining these, we have; Average cost of the first 800 units = 0.9 x 0.9 x 0.9 x sh.100 = sh72.90
Total cost = sh (72.90 x 800) = sh. 58,320
Because this increase will not increase cumulative production to twice some figure we already have, we
need to Use the formula. Average cost = sh10,000 x 9x
x = -0.0458 = - 0.15216
0.301
Hence;Log av. cost = log 10,000 – 0.15216 log 9: Average cost = sh. 71.5833/unit
Question One
CB plc produces a wide range of electronic components including its best-selling item, the Laser Switch.
The company is preparing the budgets for Year 5 and knows that the key element in the Master Budget
is the contribution expected from the Laser Switch. The records for this component for the past four
years are summarized below:
Year 1 Year 2 Year 3 Year 4
Sale (unit) 150,000 180,000 200,000 230,000
£ £ £ £
Sale revenue 292,820 346,060 363,000 448,800
Variable costs 131,080 161,706 178,604 201,160
Contribution 161,740 184,354 184,396 247,640
It has been estimated that sales in Year 5 will be 260,000 units. Required:
a) As a starting point for forecasting Year 5 contribution, to project the trend, using linear
regression; To calculate the 95% confidence interval of the individual forecast for Year 5 if the
standard error of the forecast is £14,500 and the appropriate t value is 4,303, and to interpret the
value calculated; To comment on the advantages of using linear regression for forecasting and
limitations of the technique.
b) The theory of the experience curve is that an organization may increase its profitability through
obtaining greater familiarity with supplying its products or services to customers. This reflects
the view that profitability is solely a function of market share. Required: Discuss the extent to
which the application of experience curve theory can help an organization to prolong the life
cycle of its products or services.