Professional Documents
Culture Documents
First Submission Artefical Neural Network and Cost
First Submission Artefical Neural Network and Cost
Elias G/Hiwot
Advisor
A Thesis submitted to the school of Construction Technology Management in
Partial fulfillment for the requirements for the award of Masters of
Construction Technology Management
April, 2024GC.
Addis Ababa, Ethiopia.
Declaration
This is to certify that the thesis proposal prepared by Elias G/Hiwot entitled Building
Student
Advisor
__________________ __________________
Name Signature Date
Contents
CHAPTER ONE.................................................................................................................6
INTRODUCTION..............................................................................................................6
1.5 Scope.........................................................................................................................8
1.6 Significance..............................................................................................................8
CHAPTER TWO..............................................................................................................10
LITERATURE REVIEW..................................................................................................10
2.3.1 Perceptron........................................................................................................19
2.3.5 Backpropagation..............................................................................................22
CHAPTER 3.....................................................................................................................27
METHODOLOGY...........................................................................................................27
3. 2 Research Design....................................................................................................28
3.8.2 Reliability.........................................................................................................31
Schedule............................................................................................................................32
REFERENCES.................................................................................................................33
ABSTRACT
The construction sector has seen notable progress in economic development, local output,
employment, and meeting societal needs, however it still faces challenges hindering its
full potential in the economy. Developing an effective cost estimation model and system
from the design to construction stages is also essential for successful project outcomes.
Current cost estimation techniques (e.g., traditional and probabilistic methods) cannot
satisfy the requirement of the construction industry due to the need for a more accurate
result, more and more scholars gradually focus on the usage of machine learning
techniques to develop innovative cost estimation models. In order to overcome this
problem, a modern preliminary estimate (modern technique) to shall to be studied to
prove its efficiency over the traditional methods. Therefore, the author aims to investigate
on how can estimate building construction cost using Artificial neural network and come
up with digital platform. The researcher will identify top ranked factors determining
construction cost estimation, identify data set required and demonstrate construction cost
estimation using Artificial neural network. The collected data will analyze using
quantitative data analysis technique and presented using tables, and graphs and discuss
the results. After that author conducted analysis using a software SPSS (Statistical
Package for the Social Sciences) to identify top ranked factors that determines
construction cost estimation, the artificial neural network modeling will proceed based on
findings using NeuroSolution 6 platform.
INTRODUCTION
1.1 Background of the Study
The construction and housing industry are key players in Ethiopia's economy,
providing housing, amenities, and infrastructure for both the public and commercial
sectors (Assefa, 2023). According to Assefa, While the sector has seen notable progress
in economic development, local output, employment, and meeting societal needs, it still
faces challenges hindering its full potential in the economy.
Sae-Hyun and colleagues (2019) described cost estimation as the evaluation and
projection of the overall expenses associated with completing specific tasks within a set
timeframe, utilizing all pertinent project data and resources. Hafez et al (2023) stated that
accurate cost estimation during the design phase is crucial for project success, especially
as projects become larger and more complex, leading to increased uncertainties and
costs. Developing an effective cost estimation model and system from the design to
construction stages is also essential for successful project outcomes. According to Hafez
et al construction cost factors are large in number and diversified nature. This makes
difficult for users who directly perform the cost estimation. Therefore, construction Cost
models require to have the accuracy of prediction, efficiency, and sustainability that can
accommodate cost variance factors. Since inaccurate estimation of project uncertainties
is the underlying cause of project cost overruns in construction.
There are construction cost estimation methods and these methods can be conduct
at different stage of the project. Pre-tender estimates are susceptible to inaccuracies
(biases) because they are often prepared within a limited timeframe, and with limited
information about project scope. The trained ANN model can be used as a decision-
making tool when forecasting building cost at the pretender stage (Ajibade & Dharma,
2011). The model can be queried with the characteristics of a new project in order to
quickly predict the error in the estimate of the new project. According to Ajibade &
Dharma the predicted error represents the additional contingency reserve that must be set
aside for the project in order to cater for possible cost overruns. The model can also be
extended to forecast the likely cost of a project
In Ethiopia even though the government and the people are deeply involved and
have started reconstructing the country. The Ethiopian construction industry, like that in
most developing countries, faces challenges that impede its development”.in addition to
that (Laedre & Hansen, 2017) also stated “The architectural, engineering, and
construction industry is facing challenges related to increased productivity and improved
quality. Most of these challenges need to be dealt with early in the design phase of the
projects. This requires an effective design process, increases the need for effective
building design management and documentation. One of key factor for building
construction is the accuracy of construction cost estimation. Therefore, the author aims to
study building construction cost estimation using Artificial neural network.
Faten and colleagues (2022) identified that the causes of cost estimation
inaccuracy. These are insufficient time, poor tender documentation, and insufficient
tender document analysis. According to Waled & Hamid (2006) Engineers estimate their
projects in the preliminary stage using traditional methods, i.e. unit price of square meter
and analogous methods, which often leads to inaccurate estimates. Waled & Hamid
stated that the traditional 10% rule of thumb for estimating contingency is subjective -
based on experience and expert judgment, and are often inadequate. Current cost
estimation techniques (e.g., traditional and probabilistic methods) cannot satisfy the
requirement of the construction industry due to the need for a more accurate result, more
and more scholars gradually focus on the usage of machine learning techniques to
develop innovative cost estimation models. In order to overcome this problem, a modern
preliminary estimate (modern technique) to shall to be studied to prove its efficiency
over the traditional methods.
2. To identify data set required to estimate the construction cost estimation using
Artificial neural network.
3.To demonstrate the process to estimate construction cost using Artificial neural
network.
2. What are the data set required to estimate the construction cost estimation using
Artificial neural network?
1.5 Scope
It focuses on buildings sector of construction projects in Addis Ababa Ethiopia
specifically on pretender stage of construction projects. Due to time constraint, this
research is limited to pretender/ early stage of building construction project, consultants
and contractors registered for architectural and engineering service in Addis Ababa.
The study population is limited only to principal architects and engineers registered for
architectural and engineering service in Addis Ababa.
1.6 Significance
This research holds significant important to understand the factors that influence
the construction cost estimation, accuracy and transparency of cost estimation and bring
modern solutions. Through cost and time overruns government building projects often
suffer from cost overruns and delays, leading to budgetary constraints and negative
impacts on the overall economy. This study will enable policymakers and project
managers to take necessary measures to adopt accurate construction cost estimation
technique in future government projects. The research findings have the potential to
positively impact the construction industry, public service delivery, and the overall
development of Ethiopian’s construction. The findings have practical implications for
stakeholders in the construction engineering and management field, providing insights
for decision-making.
Chapter II is about the review of related literature. The major and central themes
addressed in this chapter are based on specific objectives.
In chapter III this chapter author presented the research methodology: research method,
technique (sampling, data collection, and analysis techniques), and approach as well as
data collection tools are presented.
CHAPTER TWO
LITERATURE REVIEW
Yasamin et al (2022) stated that the purpose of engineering construction is to
build structures in the most efficient, safe, sustainable, and low actual cost. Cost as
defined by (Stewart, 1991), as the total amount of all the resources required to perform
the activity. However, the price is the total amount paid for that activity. Mathematically,
price equals the cost plus the desired profit. The cost estimation provides the basis for
project management, business planning, budget preparation and cost and schedule
control (Marjuki, 2006). The sum of all costs, direct and indirect, inherent in converting a
design plan for material and equipment into a project ready for start-up, but not
necessarily in production operation; the sum of field labor, supervision, administration,
tools, field office expense, materials, equipment, taxes, and subcontracts (AACE
International, 2007). This cost needs to be counted in accurate forms so that the owner of
the project can make sure that there is value in return for money spent on the projected
cost which can be divided based on the type of project construction. It is essential to
calculate the costs on a basis of per unit, utilizing comparable projects as a guide, with
assumptions for different sorts of locations and unusual things.
Kim, et al (2013) stated that cost estimation in the preliminary stage is not accurate due
to incomplete drawings. Consequently, ten variables were adopted to be examined for
school projects in the UK in order to estimate the costs in the preliminary stage. Those
variables are year, budget, school levels, land acquisition, class number, building area,
gross floor area, story, basement floor, and floor height. Three different techniques have
been applied and compared in terms of the accuracy of three estimating techniques
(regression analysis (RA), neural network (NN), and support vector machine techniques
(SVM)) by performing estimations of construction costs. Using historical data, the NN
model shows more accurate estimation results than the RA and SVM models. At the
same time, Cheng, Tsai, and Hsieh (2009) used the neural network model to estimate the
house projects relying on ten quantitative and qualitative variables for their model, which
are floors underground, total floor area, floors aboveground, site area, number of
households, households in adjacent buildings (quantitative factors), soil condition,
seismic zone, interior decoration, and electromechanical infrastructure (qualitative
factors).
Area of shear wall depend on three factors mainly: availability of formwork and labor,
risk of formwork opening, and time frame for execution.
2.2.10 Location of Project (Country, City)
The location of the project can be classified in the top five in terms of impact on the cost
estimation. It is linked to most of the other important factors and its impact will consist
of the below points to be able to provide a decent cost estimate. Either the project
location in an urban, suburb or in the city, isolated area or near material sources,
previously recognized or new to the company, the political situation and customs and
regulations if foreign country. These factors will contribute to a noticeable rise in cost
because of unknown risk and in procurement of insurances to mitigate risks. It can
include overhead and indirect costs such as staff housing if isolated area, challenges in
transportation, might require building of temporary roads and service networks, raw
materials (gravel, cement, sand, and others) might need higher time to deliver or need to
look other suppliers with higher prices, area labor might be scarcer, etc.
A good Management is an important factor for project execution but not as important to
cost estimation process. A well-established management will have better contingency
plans, better project execution methods, less risks and abortive works.
Availability of skilled labor is a critical factor in cost estimation. Locations with scarce
labor may delay project execution and might even reach to stop the works. Recruitment
in such zones will be more expensive and might require the organization to outsource
labors from abroad, which in turn costs housing, meals, transportation, and higher
salaries because of expatriation.
2.2.19 Trend and Value of Market Indices (Stock Market Index, Construction
Index, Inflation)
The trend of market and inflation can have a big impact on cost estimation. For example,
the economic boom of projects causes a big impact on prices of steel, aluminum,
bitumen, and many other materials. Moreover, it led to a market shortage in skilled labor
and engineers, increasing prices which affects the cost estimation especially for fixed
cost projects.
2.2.20 External Social Conditions (I.E. Diseases, Wars and Conflicts).
A factor that is sometimes counted on force majeure and its impact will highly depend on
the contract type is external social conditions such as conflicts and wars and the impact
on the price of raw materials.
ANN’s are biologically inspired; that is viewing at the organization of the brain
considering network configurations and algorithms. The human nerve system, built of
cells called neurons is of staggering complexity. It contains approximately ten thousand
million (1011) basic neurons. Each of these neurons is connected to about ten thousand
(10) others. The connection of each neuron with other neurons forms a densely network
called a neural network. These massive interconnections provide an exceptionally large
computing power and memory. The neuron accepts many inputs, which are all added up
in some fashion. If enough active inputs are received at once, then the neuron will be
activated at once, then the neuron will be activated and “fire”; if not, then the neuron will
remain in its inactive, quit state.
Fig.1 Model representation of a biological neuron versus artificial neuron network model
2.3.1 Perceptron
K.-L. Du (2016) stated that perceptronas a McCulloch-Pitts neuron or linear threshold gate,
is the earliest and simplest neural network model.
In the above figure, for one single observation, x0, x1, x2, x3...x (n) represents various
inputs (independent variables) to the network. Each of these inputs is multiplied by a
connection weight or synapse. The weights are represented as w0, w1, w2, w3….
w(n). Weight shows the strength of a particular node. b is a bias value. A bias value
allows you to shift the activation function up or down.
In the simplest case, these products are summed, fed to a transfer function (activation
function) to generate a result, and this result is sent as output.
Multilayer Perceptron’s are essentially feed-forward neural networks with three types of
layers: input, output, and hidden. The input layer receives the input signal for
processing. The output layer performs tasks such as classification and prediction.
Multilayer Perceptron’s' accurate computational engine consists of an arbitrary number
of hidden layers between input and output layers. Similarly, the data flow from the
input layer to the output layer in a Multilayer Perceptron. The neurons in the Multilayer
Perceptron’s are trained using the backpropagation learning algorithm. Multilayer
Perceptron’s are designed to approximate any continuous function and can solve
problems that are not linearly separable.
2.3.2 Activation function
Ravish (n.d) defined Activation function decides whether a neuron should be activated or
not by calculating the weighted sum and further adding bias to it. The motive is to
introduce non-linearity into the output of a neuron. If activation function did not apply
then the output signal would be simply linear function (one-degree polynomial). Now, a
linear function is easy to solve but they are limited in their complexity, have less power.
Without activation function, model cannot learn and model complicated data such as
images, videos, audio, speech, etc.
Types of Activation Functions:
Threshold Activation Function also called Binary step function
A Binary step function is a threshold-based activation function. If the input value is
above or below a certain threshold, the neuron is activated and sends exactly the same
signal to the next layer.
The Sigmoid function is differentiable, means we can find the slope of the curve at any 2
points. The drawback of the sigmoid activation function is that it can cause the neural
network to get stuck at training time if strong negative input is provided.
2.3.3 Backpropagation
Back propagation is one of the best-known ANNs. The typical back-propagation ANN
always has at least one hidden layer. There is no theoretical limit on the number of
hidden layers, but typically there are no more than two. During the learning process, the
error information is propagated back from the output layer through the network to the
first hidden layer.
Fig 8: Backpropagation
2.3.4 Training Data set using Artificial neural network
NeuroSolution 6 for Excel use to build the ANN model is easier and more
flexible in use for both training and testing. After the data were prepared, the sequential
steps were to create the initial network by choosing the multilayer perceptron (MLP),
which consisted of inputs (independent variables), one hidden layer, and output
(dependent variable). To perform the analysis, the data should be set into three sets,
namely, training set – 70 %; cross-validation set – 15 %, and test set – 15 % (Dowler,
2008).
The training set and cross-validation set were used to train the model through
learning to modify the network weights in order to minimize the network error through
monitoring with cross validation dataset. The “Back Propagation” algorithm, which was
used to train the network, belongs to the realm of supervised learning. Consequently, it
was adopted in this study to train the multilayer network, which is concerned with feed
forward the network structure (Ashwood, 2013). The error of training can be expressed
by the mean squared error (Willumsen, Oehmen, Stingl, & Geraldi). To conduct the
training phase, the normalization of the training data was recognized to improve the
training performance of the network by Neuro Solution software. Moreover, separately,
the test dataset was used to measure the generalization of the network as well as the
network’s performance. Furthermore, the testing set was used to confirm that the
network had learned (Shehatto, 2013). To determine the accuracy of the estimate in the
testing phase, many tests should perform: - Mean Absolute Error (MAE); - Mean
Absolute Percentage Error; - Mean Squared Error; - Root Mean Squared Error (RMSE);
- Correlation Coefficient (R). After the best model was determined, the sensitivity
analysis was done in order to evaluate the effect of each input on the output (Günaydin &
Doğan, 2004; Principe et al., 2010).
Complexity
Project type
Area of floors
Number of stores
Type of foundation
Number of elevators
Slab type
Type of external finishing
interior decoration
Preliminary cost estimate Type of HVAC system
Type of tiles
Type of electricity works
Type of mechanical works
Basement floor
Floor height
Site area
Project location
There were some constraints in using the historical data in the ANN (Shehatto,
2013), which were the inputs of variables were limited to the collected data, Sufficient
number of projects should be available for each variable and any new variable that does
not belong to the adopted model would not be handled. Therefore, the limitations of the
ANN model could be summarized according to the inputs of variables; when the input
was registered in only one case (136 projects), the variable should have been excluded
from the analysis, but it was still present in the ANN model implicitly. On the other hand,
any inputs that did not appear in the collected data were excluded from the range of
variable inputs and the variable still was present in the ANN analysis.
METHODOLOGY
In this chapter, the author briefly describes the Research Area, Research Design,
Research Approach, Sampling Technique, analysis techniques Data collection
techniques, Data type, Source of Data, Validation and Reliability and Ethical
Consideration according to research objectives.
3. 2 Research Design
The author adopted a descriptive research design to identify factors determines the
construction cost estimation, required data set and demonstrate how the artificial neural
network assist the estimation. This research design also portrayed the perception of the
population about the problem and it also enabled the researcher to establish the link
between study variables and the problem.
The sample size for the study calculated according to the formula recommended by
Yamane ‘s (1967) with 95% confidence and 5% acceptable sampling error. The formula
is presented below: To take a population sample, we must use a formula to figure out
what sample size we need to take. Using Slovene ‘s formula to figure out what sample
size, we need to take, which is written as:
n= N / (1+ N * (e) 2)
Based on the above formula, the study has n respondents and gather data through
questionnaire. Based on the calculated sample size, the study assigns the number of
respondents for each branch proportionally.
n = 240/(1+240x0.0025) = 150
The author will use Relative Importance Indices (RII). For the research undertaken, it is
to be observed that feedbacks were received on a (1– 5) Likert scale. Relative
importance index method was used for determining the relative importance of
sustainable criteria. Relative Importance Index (RII) is a non-parametric technique
widely used by construction and facilities management researchers for analyzing
structured questionnaire responses for data involving ordinal measurement of attitudes.
For this part of the questionnaire, the five-point Likert scale of 1 to 5 1 = not at all
important, 2 = low important, 3 = neutral, 4 = very important 5 = extremely important
this adopted and the Relative Importance Indices (RII) for each of the sustainable
criteria. Formula for finding RII for each of sustainable criteria is as follows,
RII = ΣW / (A*N)
The RII value ranges from 0 to 1 with 0 not inclusive. It shows that higher the value of
RII, more important was the sustainable criteria and vice versa. The comparison of RII
with the corresponding importance level is measured from the transformation matrix as
proposed by Chen et al. (2010).
Neuro Solutions and other Neuro Dimension products can be used to apply neural
network technology to real-world applications. Neural network technology performs
"intelligent" tasks similar to those performed by the human brain. It acquires knowledge
through learning and then stores that knowledge within inter-neuron connection strengths
known as synaptic weights. In this active work provide how to use neuro solutions
software to solve all techniques of Neural Networks by using neuro solutions software.
The interaction between neuro solutions software and excel file and also, how to get
training, validation and testing. The goal is to establish the techniques understandable
and easily accessible and to allow continued focus on the chemistry and physics of the
problem. Computational elegance and efficiency are gladly sacrificed for ease.
3.8 Validity and reliability
3.8.1 Validity
Validity is referring to the extent to which an empirical adequately reflects the real
meaning of the concept under consideration. And also, it refers to the degree to which a
statistical instrument measures what it is intended to measure. It emphasizes the accuracy
of a measurement instrument (Saunders, et al; 2009).
3.8.2 Reliability
Reliability it refers to the extent to which your data collection techniques or analysis
procedure will yield consistent findings. It focuses on whether the research method and
design are accurate (Saunders, et al; 2009). The reliability test is an important instrument
to measure the degree of consistency of an attribute which is supposed to measure. It
measures the internal consistency of the item in a scale. The normal range of Cronbach’s
coefficient alpha value ranges between 0-1 and the higher value reflects a higher degree
of internal consistency. Cronbach's Alpha is preferable to the split-half procedure
because it supports correlation for all possible ways of dividing the measure into two
halves (Polit and Hungler, 1978). The reliability coefficient of the scale will establish by
Cronbach's Alpha using the SPSS package after data collection, which reflected the Alfa
coefficient to be in the range from 0.584 to 0.929.
Arab, M. Z. M. (2011). A function-based cost model for early cost advice on new-build
schools projects. Ph.D. thesis, Heriot-Watt University.
Arafa, M., & Alqedra, M. (2011). Early-Stage Cost Estimation of Buildings Construction
Projects Using Artificial Neural Networks, Journal of Artificial Intelligence, 4(1),
63–75. https://doi.org/10.3923/jai.2011.63.75
Brook, M. (2011). Estimating and tendering for construction work (forth ed.).
Amsterdam:
Cheng, M.-Y., Tsai, H.-C., & Hsieh, W.-S. (2009). Web-based conceptual cost estimates
for construction projects using Evolutionary Fuzzy Neural Inference Model,
Automation in Construction, 18(2), 164–172.
https://doi.org/10.1016/j.autcon.2008.07.001
Dowler, J. D. (2008). Using neural networks with limited data to estimate manufacturing
cost, M.Sc. Thesis, Ohio University, Industrial and Systems Engineering, USA.
Retrieved from http://www.ohiolink.edu/etd/view.cgi?ohiou1211293606
Emsley, M. W., Lowe, D. J., Duff, A. R., Harding, A., & Hickson, A. (2002). Data
modelling and the application of a neural network approach to the prediction of
total construction costs, Construction Management & Economics, 20(6), 465–
472. https://doi.org/10.1080/01446190210151050
Emsley al. (2002), ‘Residential Building Projects: Building Cost Indicators and Drivers’.
Elhag and Boussabaine (1998) ‘Data modelling and the application of a neural network
approach to the prediction of total construction costs’.
Günaydin, H. M., & Doğan, S. Z. (2004). A neural network approach for early cost
estimation of structural systems of buildings, International Journal of Project
Management, 22(7), 595–602. https://doi.org/10.1016/j.ijproman.2004.04.002
Gunduz et al. (2011) Parametric cost estimation system for light rail transit and me’tro
trackworks’
Hammarberg. K., Kirkman. M., & Lacey, S. (2015). Qualitative research methods: when
to use them and how to judge them. Human Reproduction, Vol.31, No.3 pp.
498–501,2016
Hunter, K. (2014). Estimating preconstruction services costs for highway projects, M.Sc.
thesis, Iowa State University.
Islam, M., Zhou, L., & Li, F. (2009). Application of artificial intelligence (artificial
neural network) to assess credit risk: a predictive model for credit card scoring.
M.Sc. Thesis, Blekinge Institute of Technology.
Ibrahim Mahamid, (2011) ‘Risk Matrix for Factors Affecting Time Delay in Road
Construction Projects’.
Kim, G.-H., Shin, J.-M., Kim, S., & Shin, Y. (2013). Comparison of School Building
Construction Costs Estimation Methods Using Regression Analysis, Neural
Network, and Support Vector
Machine, JBCPR Journal of Building Construction and Planning Research, 01(01), 1–7.
https://doi.org/10.4236/jbcpr.2013.11001
Kirkham, R. (2014). Ferry And Brandon's Cost Planning of Buildings, (9th ed.). Wiley:
WileyBlackwell.
http://www.dkriesel.com/en/science/neural_networks
Kshirsagar, P., & Rathod, N. (2012). Artificial neural network, International Journal of
Computer Applications, MPGI National Multi Conference, (MPGINMC-2012).
Kudirat Ayinla1, Abdullahi Saka3 Rafiu Seidu2, and Upeksha Madanayake4 (2023). The
Impact Of Artificial Intelligence On Construction Costing Practice. School of the
Built Environment and Architecture, London South Bank University, Borough
Road, London, SE1 0AA, UK
Petroutsatou et al. (2006). Early Cost Estimating of Road Tunnel Construction Using
Neural Networks’.
Petroutsatou al. (2012), ‘Early Cost Estimating of Road Tunnel Construction Using
Neural Networks.Original’.
Principe, J., Lefebvre, C., Lynn, G., Fancourt, C., & Wooten, D. (2010). Neuro Solution
Help: Neuro Dimension, Inc.
Shehatto, O. M. (2013). Cost estimation for building construction projects in Gaza Strip
using Artificial Neural Network (ANN). M.Sc. Thesis, The Islamic University,
Gaza.
Stephanie Glen. "Sample Size in Statistics (How to Find it): Excel, Cochran’s
Formula, General Tips" From StatisticsHowTo.com: Elementary Statistics
for the rest of us! https://www.statisticshowto.com/probability-and-
statistics/find-sample-size/
Yasamin Ghadbhan Abeda, Taha Mohammed Hasana, Raquim Nihad Zehawib (2022).
Machine learning algorithms for constructions cost prediction: A systematic
review. Int. J. Nonlinear Anal. Appl. 13 (2022) 2, 2205–2218 ISSN: 2008-6822
(electronic) http://dx.doi.org/10.22075/ijnaa.2022.27673.3684
Willumsen, P., Oehmen, J., Stingl, V., & Geraldi, J. (2019). Value creation through
project risk management, International Journal of Project Management, 37(5),
731–749. https://doi.org/10.1016/j.ijproman.2019.01.007
Yu, A. T. W., Shen, Q., Kelly, J., & Hunter, K. (2005). Application of value management
in project briefing, Facilities, 23(7/8), 330–342.
https://doi.org/10.1108/02632770510600281