Detection of Forest Fires Using Machine Learning Technique: A Perspective

2015 Third International Conference on Image Information Processing

Aditi Kansal1, Yashwant Singh2, Nagesh Kumar3, Vandana Mohindru4
Department of Computer Science & Engineering
Jaypee University of Information Technology
Waknaghat, Solan - 173234 (H.P.), India
1aditi.kansal4@gmail.com, 2yashu_want@yahoo.com

Abstract- Wireless Sensor Networks (WSN) have gained attention because they are useful in warning about disasters. Natural disasters such as hailstorms, fires and rainfall are infrequent and stochastic, so predicting them with WSN is an important topic of research. Detection of these disasters should be fast and accurate, as they may cause damage and destruction on a large scale. In this paper, various machine learning techniques such as SVM, regression, decision trees and neural networks are compared for the prediction of forest fires. The proposed approach shows how regression detects forest fires with high accuracy when the dataset is divided. It also detects forest fires in less time than the other machine learning techniques.

Keywords: Wireless Sensor Networks, Regression, SVM, Sensor, Decision Tree, Classification.

I. INTRODUCTION

A Wireless Sensor Network (WSN) is a network in which sensors are deployed to monitor specific environmental conditions (e.g. fire, climate) and send the data to a base station for processing. A WSN comprises data processors (sensors) with limited battery life [1]. A WSN has its own design and resource constraints, which include a limited amount of energy, short communication range, low bandwidth and limited computation. The ideal sensor node is smart, consumes little power, is capable of fast data acquisition, and is reliable, inexpensive and low-maintenance.

Event detection is an important component of most applications, as it facilitates proper and efficient sensing of the physical world using a wireless sensor network. An event is classified as a repeated occurrence or as a pattern of infrequent or abnormal occurrence [2]. Harsh environments may cause faults in pattern matching and pattern monitoring. Event detection is one of the applications of data observation in wireless sensor networks. Massive sensor data is semantically processed and only the relevant information is sent to the user. Event processing runs on wireless sensor networks with streaming data, taking into consideration the hardware constraints and the nature of the data. The primary objective of event detection is to create perceptions by processing and mining physical-world data captured by the sensors and by adapting background knowledge to relate it to possible events. An event can be a simple (or atomic) event or a composite event [3]. Conditions such as temperature > 90 or light > 50 are simple events. Events such as explosion detection, which combine two or more simple events, are composite events. These events are detected by taking readings from multiple sensors present on the sensor node. Simple events require the participation of only a single sensor (say, a light sensor), whereas composite events require two or more sensors for their detection.

The areas in which WSN has been applied have grown considerably in recent years. Real-life applications include environment monitoring, habitat monitoring, health and medical monitoring, target tracking and surveillance monitoring [4]. Due to global warming, climatic conditions have changed and, as a result, the chance of fire has increased. Fire is a disastrous event that endangers living beings. It may be caused by climatic changes or by human negligence. WSN helps to detect such events with high accuracy. An alarm should be generated if fire has been detected by the sensors [5]. Therefore, research on detecting fires with high accuracy and reliability is important.

Traditional algorithms for pattern matching have been adapted to embedded devices, which apply these algorithms in distributed sampling of sensor nodes and optimize the detection accuracy [6]. Here, data is gathered from many multifunctional sensors and then matched with previous data. Mapping the patterns of an event is significant for the detection of events in WSN.

Detection accuracy can be improved by using machine learning techniques. Machine learning is an effective approach to study the correlation between the output and the input features considered. The machine learning procedure has four steps: 1) feature selection and output labeling, 2) sample collection, 3) offline training and 4) online classification [7].

Feature selection is the method in which the most significant parameters are chosen to describe the problem simply. The parameters are chosen so that they explain the maximum variability in the model. In output labeling, outputs are classified according to expert knowledge. The second step is sample collection, in which data is gathered for training and a back-end system is kept as a database for collecting the data. This is also called centralized learning. However, in many real-life scenarios it is not applicable, so decentralized or distributed learning is preferred. The third step is offline training, in which classification is done between two or more solutions and they do not have

978-1-5090-0148-4/15/$31.00 © 2015 IEEE

resource constraints. Lastly, online classification differs from offline training: offline training relies on a back-end server, whereas online classification considers only the learning overhead. In a few WSN applications, the learning overhead arises in the training phase. It is therefore more efficient and distributed, which reduces communication overhead [8].

The machine learning techniques considered are Support Vector Machines (SVM), Artificial Neural Networks (ANN), Decision Trees (DT), Regression (REG) and Clustering [9]. In this paper, various machine learning techniques for fire detection are discussed. The proposed approach explains how regression can be used to classify the data in a manner similar to SVM. A comparative analysis is presented at the end. The remainder of this paper is organized as follows. Section II discusses various machine learning techniques for the detection of fires. Section III presents the proposed algorithm. Section IV discusses the experimental setup and the comparative analysis.

II. RELATED WORK

A. Event Detection for Forest Fires using Support Vector Machines

There are various technologies and methods for detecting events in WSN. To detect events such as fire, a threshold should be defined and specific parameters such as temperature and relative humidity should be predefined. If the sensor readings and the threshold value are not equal, an alarm is generated. The approach works in two phases, the base phase and the meta phase [10]. Ensemble learning is implemented at all the cluster heads, which collect and store the information they acquire from their respective clusters. The decision-making and learning techniques are applied in the meta phase. The base station receives all the information from the cluster heads, and the information is processed there. The processing involves checking the data against the statistical record of previously collected data. SVM, a supervised machine learning technique, is applied at the base station with a polynomial kernel function. The deployed sensors can sense carbon dioxide, temperature, humidity and carbon monoxide. The data in tabular form, or clustered data, is generated by CluStream. The SVM is then applied to detect fire [11].

B. Circle-based Approximation to Forest Fires with Distributed Wireless Sensor Networks using Clustering

A disaster management system deals with situations in which various sensors are deployed in a distributed network and data is collected from different sources. The main objective is to make decisions about the occurrence of events. Previous work focused on forest fires and the EIDOS (Equipment Destined for Orientation and Safety) system [12]. Its main goals were to decrease the risk of fires and to increase the detection accuracy. A large sensor network was deployed in the area affected by forest fires, which helped the fire-fighters gather information and increased their safety [13]. The aim of the data gathered by the sensor nodes was to obtain the location and position of the fire fronts active at that time. The working of the EIDOS system can be explained as follows. First, the multifunctional sensors are deployed in an environment. The fire-fighters carry devices such as smartphones, which help to check the outcomes of the distributed algorithms implemented in the sensor network. The mobile sensors were able to display the fire map in the form of a graphical user interface. In this setup, the system worked correctly with connectivity to the central node. Further, the topology of the nodes was irregular and unknown. One of the tasks was to determine the geographical position of every node, which was obtained by a localization process. The localization process was range-free, and the sensor nodes used connectivity information to estimate their position. It was assumed that the sensed parameters must lie within a range and that the threshold values were predefined. So, an area is burnt at a particular temperature a_b such that a_b > t_d, where t_d is the position after burning. To reduce energy consumption, the WSN nodes do not maintain any hierarchy and do not have any predefined information. EIDOS was implemented within these limitations. Each node builds and maintains a local approximate value, starting from the detection until the occurrence of the fire [14].

C. Data Mining Approach to Predict Forest Fires using Meteorological Data

This work describes a novel data mining methodology for forest fires in which real-time and meteorological data were used. Real-time data gathered from the northeast region of Portugal was used to predict the burnt area of forest fires [15]. Various models with different advantages and capabilities have been used for the regression task. The simplest and most classical approach is the Multiple Regression (MR) model, but it learns only a linear mapping. To overcome this disadvantage, nonlinear functions such as neural networks and support vector machines should be used. Tree structures such as decision trees and random forests can also be used, but these are difficult to implement with large data. This approach considers four parameters: rain, wind, temperature and relative humidity. It predicts the burnt area of the fires that account for the majority of fire occurrences. This helps to make decisions regarding fire occurrences. A regression dataset consists of $h \in \{1, \ldots, K\}$ examples. Each example maps an input vector $(a_1^h, \ldots, a_k^h)$ to a predefined target $b_h$. The error is given by $e_h = b_h - \hat{b}_h$, where $\hat{b}_h$ denotes the predicted value for the h-th input pattern. The overall performance is measured by two global metrics, the Mean Absolute Deviation (MAD) and the Root Mean Squared Error (RMSE):

$$\mathrm{MAD} = \frac{1}{K} \sum_{i=1}^{K} \left| b_i - \hat{b}_i \right|$$

$$\mathrm{RMSE} = \sqrt{\frac{\sum_{i=1}^{K} \left( b_i - \hat{b}_i \right)^2}{K}}$$

The lower these values, the better the results. However, RMSE is more sensitive to large errors. Regression models can also be compared using the Regression Error Characteristic (REC) curve, which relates the error tolerance to the percentage of burnt-area points predicted within that tolerance [15].
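The two metrics described above, MAD and RMSE, can be illustrated with a minimal Python sketch. The function names `mad` and `rmse` and the toy values are assumptions made for illustration; they are not taken from the cited work.

```python
import math


def mad(targets, predictions):
    """Mean Absolute Deviation: the average of |b_i - b_hat_i|."""
    if len(targets) != len(predictions) or not targets:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(b - p) for b, p in zip(targets, predictions)) / len(targets)


def rmse(targets, predictions):
    """Root Mean Squared Error: sqrt of the average of (b_i - b_hat_i)^2."""
    if len(targets) != len(predictions) or not targets:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((b - p) ** 2 for b, p in zip(targets, predictions))
    return math.sqrt(squared / len(targets))


# Hypothetical burnt-area targets and predictions (illustrative values only).
targets = [0.0, 1.0, 4.0]
predictions = [0.0, 2.0, 1.0]

print("MAD :", mad(targets, predictions))
print("RMSE:", rmse(targets, predictions))
```

In this toy example the single large error (4.0 against 1.0) raises RMSE more than MAD, which is the sensitivity to large errors mentioned in the text.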


D. Event Detection in Wireless Sensor Networks for Disaster Management

A decision tree is a tree in which the inputs are either discrete or continuous values. The decision tree is constructed in two steps: the training phase and the testing phase [16]. In real-time data both the values are different. In the training phase the data is sent as input. If the depth of the tree is small, the time complexity is low, and the memory occupied by inputs and outputs is also reduced.

All the sensor readings are aggregated and sent to the base station, which acts as a voter using a technique called reputation-based voting. The reputation of the nodes determines which node has detected the event or which node is closest to the actual readings already stored. The node whose reputation value is greater than the others is chosen. The procedure for choosing the node with the largest value is as follows. It is assumed that all the data collected by the sensors and all the events that occurred are correct. The readings are then passed from one node to another. A node can pass its reading only to nearby nodes within a specific range. Communication takes place between the nodes within a given area and leads to the selection of the highest value. Further, the changes between the values are checked, i.e. the difference between the value of a particular node and those of other nearby nodes. These differences are compared with the predefined threshold values. The voter gives a yes vote if the chosen reading is less than the predefined threshold value, and vice versa. The readings are then stored in a table called the NDVT table. The highest-reputation node is then chosen by one of two reputation techniques. In the first technique, every sensor reading is evaluated and the sensor node value is compared with the values of nearby nodes [18]. The mean of all values is calculated, and the result is multiplied by the reputation value of each sensor node, which serves as a weight. In the second technique, two predefined threshold values, th1 and th2, are assigned manually. The reputation value Vn is then compared with these thresholds. The decision is considered perfect if Vn >= th1, OK if th1 > Vn > th2, and poor if Vn < th2 [17].

III. PROPOSED APPROACH

A. Algorithm

Input: A dataset of parameters acquired from sensors, having n instances, for which the accuracy has to be maximized.

MAXIMUM_SENSOR_ACCURACY (Level)
1) Initially include all instances and attributes in the model.
2) Give the appropriate level to the class attribute according to the 'Level' given.
3) [equation missing] where Maxlevel_i is the maximum value in level i.
4) Select the attributes that yield the highest significance or show the highest variability.
5) Divide the dataset based on months (e.g. fire could be likely or unlikely in a specific month).
6) Build a regression equation for each month based on the attributes selected in step 4.
7) Predict the level of fire and raise an alarm based on the level of fire.

Output: A highly accurate and robust method of detecting fire.

The proposed algorithm was run on the forest fire dataset to obtain a final model that has the minimum mean square error and yields the maximum accuracy, explaining the maximum variability in the model, which increases the predictive power.

1) Attribute selection based on the proposed approach
All the attributes used for prediction in this model are considered significant predictor attributes, except the one that defines the level of the fire, which is the class attribute. This is because, when stepwise regression and forward regression were used to select the best predictors for the model, they showed that all the attributes were significant. The comparison has been done using all these attributes for all the models.

2) Division and better prediction
The algorithm divides the dataset based on months (in another application this could be some other categorical predictor). This is done because the observations in a particular month are similar in some way; for example, large fires are more probable in May than in August. Thus, certain levels are common in a particular month. This makes prediction easy and fast, as the value can be estimated even without running the algorithm for that particular observation.

3) Regression technique for forest fire
Regression is a technique used to study the relationship between a dependent attribute, such as the level of fire, and the attributes it depends on, such as humidity and rain. We derive an equation that gives the value of the level of fire.

The regression equation is

$$\mathrm{Level} = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \cdots + \beta_p X_p$$

where each β is a coefficient and each X is a predictor.

A more precise and accurate value can be obtained by increasing the number of levels. These levels help to decide whether there is danger or not. This approach can also be extended to other applications in the field of wireless sensor networks.
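The core steps of the proposed algorithm (divide the data by month, fit a regression per month, predict the level, raise an alarm) can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: it fits ordinary least squares with a single predictor (temperature) rather than all attributes, and the names `fit_per_month`, `predict_level`, `should_alarm`, the `alarm_level` threshold and the toy records are all hypothetical.

```python
from collections import defaultdict


def fit_line(xs, ys):
    """Ordinary least squares for level = b0 + b1 * x with one predictor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        # All predictor values are identical: fall back to the mean level.
        return mean_y, 0.0
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b1 = sxy / sxx
    return mean_y - b1 * mean_x, b1


def fit_per_month(records):
    """Group (month, predictor, level) records by month and fit one line each."""
    by_month = defaultdict(list)
    for month, x, level in records:
        by_month[month].append((x, level))
    return {
        month: fit_line([x for x, _ in rows], [y for _, y in rows])
        for month, rows in by_month.items()
    }


def predict_level(models, month, x):
    """Evaluate the regression equation fitted for the given month."""
    b0, b1 = models[month]
    return b0 + b1 * x


def should_alarm(level, alarm_level=2.0):
    """Raise an alarm at or above a (hypothetical) danger level."""
    return level >= alarm_level


# Hypothetical toy records: (month, temperature, fire level).
records = [
    ("aug", 20.0, 1.0), ("aug", 30.0, 2.0), ("aug", 40.0, 3.0),
    ("jan", 5.0, 0.0), ("jan", 10.0, 0.0), ("jan", 15.0, 0.0),
]
models = fit_per_month(records)
aug_level = predict_level(models, "aug", 35.0)
jan_level = predict_level(models, "jan", 12.0)
print("aug:", aug_level, should_alarm(aug_level))
print("jan:", jan_level, should_alarm(jan_level))
```

Because each month has its own fitted equation, a prediction only needs the model for that month, which is the motivation the text gives for dividing the dataset. Extending the sketch to several predictors would require a multiple regression solver in place of `fit_line`.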


Table 1: Comparison of machine learning techniques

Machine Learning Technique    R-squared    RMSE         Analysis Time
Decision Tree                 16.40%       0.204558     00:00:19
Linear Regression             2.30%        0.221132     00:00:17
GRNN                          0.92%        0.222693     00:06:45
SVM                           0.003%       0.2237171    29:44:49
Proposed Approach             69.21%       0.079785     00:00:12

4) Predictor variation in the model
Sometimes, measuring a particular parameter is highly complex. In the proposed model, the number of attributes can be reduced while still obtaining an R-squared that explains the model approximately as well. Thus, a regression equation can be built from those attributes and used to predict the level of the fire.

The regression equation is

$$\mathrm{Level} = 0.945 + 0.000131\,\mathrm{DMC} - 0.000027\,\mathrm{DC} - 0.00133\,\mathrm{ISI} + 0.00391\,\mathrm{temp} + 0.00522\,\mathrm{wind} - 0.0109\,\mathrm{rain} - 0.00013\,\mathrm{FFMC}$$

IV. RESULTS AND DISCUSSIONS

A. Data Collection
The dataset used in this paper was obtained from the UCI repository, which provides datasets and data generators from various fields that can be used for machine learning.
The following dataset has been used in this paper:
Forest Fire Data Set: This multivariate dataset has real-valued attributes, 517 samples and 13 features, and aims to predict the burnt area [15]. There are 11 predictors in this dataset used for the prediction of forest fires.
In this work, R-squared is calculated from the sum of squared errors (SSE) and the term n - k - 1.

Table 1 shows the comparison between simple linear regression, decision tree, general regression neural network (GRNN) and support vector machine on the basis of root mean square error. It can be observed that the proposed algorithm builds the model with the least root mean square error. It also eases the prediction of fire, as it defines particular levels that can serve as the final concept class for a particular month. Thus, if only the month is given, the common fire levels can be stated without running the algorithm for that particular observation, using the values obtained by the algorithm for previous observations.

From Fig 1, we can see that its R-squared is also high. It can also be observed that the proposed approach explains the maximum variability in the model, as it has the highest R-squared value.

In Fig 2, we can see that the proposed approach has the least RMSE (Root Mean Squared Error), which shows that it is more accurate than the other machine learning approaches. Also, from Table 1 it can be seen that the analysis time of the proposed approach is less than that of the other techniques.

Thus, the proposed approach is more accurate because its R-squared value is much larger than that of the other machine learning techniques. The R-squared values increase in the order SVM < GRNN < Linear Regression < Decision Tree < Proposed Approach.

Fig 1: R-squared values comparison

Fig 2: Graph of performance analysis (RMSE)
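For readers who want to reproduce this kind of comparison, the sketch below computes R-squared from the sum of squared errors, together with the adjusted variant that uses the n - k - 1 term. It uses the standard textbook definitions, which is an assumption: the paper's exact formula is not recoverable here and may differ. The function names and sample values are illustrative only.

```python
def r_squared(targets, predictions):
    """Standard coefficient of determination: 1 - SSE / SST."""
    if len(targets) != len(predictions) or not targets:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_t = sum(targets) / len(targets)
    sse = sum((t - p) ** 2 for t, p in zip(targets, predictions))
    sst = sum((t - mean_t) ** 2 for t in targets)
    if sst == 0:
        raise ValueError("targets have zero variance; R-squared is undefined")
    return 1.0 - sse / sst


def adjusted_r_squared(r2, n, k):
    """Adjusted R-squared for n samples and k predictors."""
    if n - k - 1 <= 0:
        raise ValueError("need n > k + 1")
    return 1.0 - (1.0 - r2) * (n - 1) / (n - k - 1)


# Hypothetical fire levels and model predictions (illustrative values only).
targets = [1.0, 2.0, 3.0, 4.0]
predictions = [1.1, 1.9, 3.2, 3.8]

r2 = r_squared(targets, predictions)
print("R-squared         :", r2)
print("Adjusted R-squared:", adjusted_r_squared(r2, n=len(targets), k=1))
```

A higher R-squared means the model explains more of the variability in the target, which is the sense in which the techniques are ranked in the text.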


The time complexity of the proposed algorithm comes out to be O(n). Thus, this approach has linear complexity.

V. CONCLUSION

In this paper, an algorithm for the detection of fire has been proposed that uses regression and divides the dataset according to months. The algorithm achieves a low root mean square error and a high R-squared. The strength of the algorithm is that it can give a result without performing the computation on the whole dataset. In future, this approach can be extended to other disasters as well. Applying certain transformations might also improve the model efficiency.

VI. REFERENCES

[1] N. Dziengel, G. Wittenburg, and J. Schiller, "Towards distributed event detection in Wireless Sensor Networks," in Adjunct Proc. of 4th IEEE/ACM Intl. Conf. on Distributed Computing in Sensor Systems (DCOSS '08), Santorini Island, Greece, 2008.
[2] S. Jarupadung, "Distributed event detection and semantic event processing," in The 6th ACM International Conference on Distributed Event-Based Systems (DEBS 2012) (Doctoral Symposium), 2012.
[3] Y. Li and L. E. Parker, "Detecting and monitoring time-related abnormal events using a Wireless Sensor Networks and mobile robot," in Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on. IEEE, 2008, pp. 3292-3298.
[4] P. Radivojac, U. Korad, K. M. Sivalingam, and Z. Obradovic, "Learning from class-imbalanced data in Wireless Sensor Networks," in Vehicular Technology Conference, 2003. VTC 2003-Fall. 2003 IEEE 58th, vol. 5. IEEE, 2003, pp. 3030-3034.
[5] Z.-J. Zhang, J.-S. Fu, H.-P. Chiang, and Y.-M. Huang, "A novel mechanism for fire detection in subway transportation systems based on Wireless Sensor Networks," International Journal of Distributed Sensor Networks, vol. 2013, 2013.
[6] D. Margineantu, W.-K. Wong, and D. Dash, "Machine learning algorithms for event detection: A special issue of Machine Learning," Springer, 2010.
[7] M. Di and E. M. Joo, "A survey of machine learning in wireless sensor networks from networking and application perspectives," in Information, Communications & Signal Processing, 2007 6th International Conference on. IEEE, 2007, pp. 1-5.
[8] A. Foerster and A. L. Murphy, "Machine learning across the wsn layers," 2010.
[9] V. Pande, W. Elmannai, and K. Elleithy, "Classification and detection of fire on wsn using imb400 multimedia sensor board," in Systems, Applications and Technology Conference (LISAT), 2013 IEEE Long Island. IEEE, 2013, pp. 1-6.
[10] Y. Singh, S. Saha, U. Chugh, and C. Gupta, "Distributed event detection in Wireless Sensor Networks for forest fires," in Computer Modelling and Simulation (UKSim), 2013 UKSim 15th International Conference on. IEEE, 2013, pp. 634-639.
[11] L. Yu, N. Wang, and X. Meng, "Real-time forest fire detection with Wireless Sensor Networks," in Wireless Communications, Networking and Mobile Computing, 2005. Proceedings. 2005 International Conference on, vol. 2. IEEE, 2005, pp. 1214-1217.
[12] M. A. Serna, A. Bermudez, R. Casado, and P. Kulakowski, "A convex hull-based approximation of forest fire shape with distributed Wireless Sensor Networks," in Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP), 2011 Seventh International Conference on. IEEE, 2011, pp. 419-424.
[13] Y. Li, Y. Wang, and G. He, "Clustering-based distributed support vector machine in Wireless Sensor Networks," Journal of Information & Computational Science, vol. 9, no. 4, pp. 1083-1096, 2012.
[14] M. Angeles Serna, A. Bermudez, and R. Casado, "Circle-based approximation to forest fires with distributed Wireless Sensor Networks," in Wireless Communications and Networking Conference (WCNC), 2013 IEEE. IEEE, 2013, pp. 4329-4334.
[15] P. Cortez and A. d. J. R. Morais, "A data mining approach to predict forest fires using meteorological data," 2007.
[16] S. Ortmann, M. Maaser, and P. Langendoerfer, "Adaptive pruning of event decision trees for energy efficient collaboration in event-driven wsn," in Mobile and Ubiquitous Systems: Networking & Services, MobiQuitous, 2009. MobiQuitous '09. 6th Annual International. IEEE, 2009, pp. 1-11.
[17] M. Bahrepour, N. Meratnia, M. Poel, Z. Taghikhaki, and P. J. Havinga, "Distributed event detection in Wireless Sensor Networks for disaster management," in Intelligent Networking and Collaborative Systems (INCOS), 2010 2nd International Conference on. IEEE, 2010, pp. 507-512.
[18] G. Wittenburg, N. Dziengel, C. Wartenburger, and J. Schiller, "A system for distributed event detection in Wireless Sensor Networks," in Proceedings of the 9th ACM/IEEE International Conference on Information Processing in Sensor Networks. ACM, 2010, pp. 94-104.
