Professional Documents
Culture Documents
Seminar Presentation
Seminar Presentation
LEARNING
BY
OGHU EMUGHEDI
(PG/MAS/19/229)
A SEMINAR REPORT PRESENTED TO THE DEPARTMENT OF COMPUTER SCIENCE, FACULTY
OF SCIENCES
FEDERAL UNIVERSITY, LOKOJA
SUPERVISOR : DR. TAIWO KOLAJO
SEPTEMBER, 2021
1
INTRODUCTION
The organized method of policing the country Nigeria has failed and there is urgent need to
seek for alternative approach in addressing this life threating societal ill. Ameh et al. (2020)
stated that rising incidence of kidnapping have become so prevalence and have been a source
of concerns to the government, members of the public and international community.
Benjamin and Daniel (2014) stated that Bokoharam kidnapped over 200 schoolgirls sitting for
their final exams in the town of Chibok in Borno State.
Nnam and Otu (2015) opined several studies on kidnapping in Nigeria revealed that the
floodgate of kidnapping activities was opened on 12th January, 2006, when MEND (Movement
for the Emancipation of the Niger Delta) captured four (4) expatriate oil workers in Ekeremo
Local Government Area of Bayelsa State and the current pattern or trend of kidnapping in
contemporary Nigeria is a basket case; little wonder therefore that the latest global ranking
placed Nigeria as the fourth most notorious kidnapping nation where people can be easily
kidnapped with impunity.
The activities of these men of underworld have made the life of an average Nigerians
meaningless and unpredictable 2
INTRODUCTION CONT
The use of Big Data and Deep Learning have been fully utilized in USA, UK,
India and other Western World in using to handle crime problems to a
manageable level (Manengadan et al., 2021; Bello-Orgaz, 2016).
Using the concept of data mining we can extract previously unknown, useful
information from an unstructured data (Werner, Yang & McConky, 2017).
Further work on the researched publication Employing Deep Learning and Time
Series Analysis to Tackle the Accuracy and Robustness of the Forecasting
Problem is expected to be performed on scalability analysis and implement the
proposed method for different datasets( Haseeb et al.2021)
3
STATEMENT OF RESEARCH
PROBLEM
Major crimes in Nigeria include rape, kidnapping, murder, burglary, fraud,
terrorism, robbery, cyber-crimes, bribery and corruption, money laundering
and so on have made life of an average Nigeria unbearable and life
threatening.
These crimes occur in every part of Nigeria in hourly, daily, weekly, monthly
and yearly basis (Oguntunde et al., 2018; Ameh et al., 2020).
Automatic crime prediction using deep learning algorithms has played an
increasingly critical role in criminal justice systems and crime prevention
efforts.
Previous work, such as (Luo et al., 2017), shows that deep learning can assist
decision-making by processing a very high volume of data that are difficult for
human researchers to analyze efficiently (Khan et al., 2021). 4
AIM AND OBJECTIVES OF THE
STUDY
The aim of the study is to propose a framework for crime prediction
and prevention in Nigeria using deep learning
The objectives are:
3 Alkesh and Train a model for Data collection. The model The current
Sarvanaguru prediction. Data Preprocessing. predicts the problem faced
(2018) Building the Feature selection. type of is the aspect
model will be Building and Traning crime with of maintaining
done using Model. accuracy of a proper
better Prediction. 0.789. dataset of
algorithm Visualization. crime
9
4 ToppiReddy visualizing techniques Data collection & visualization Future plan
et al. (2018) and machine learning preprocessing. techniques and for applying
algorithms are Data visualization classification other
adopted for predicting Visualization of Crime algorithms that classification
the crime distribution Data Using Google can be used for algorithms on
over an area. Maps. predicting the the crime
Accurate real-time Visualization of Exact crimes
crime predictions help Location of Crime with
to reduce the crime 3D View.
rate but remains Visualization based on
challenging problem type of Crime.
10
METHODOLOGY
This section involves the use of Method, material, tools and techniques and technologies to explain the fig 3.1.0 the framework
on crime prediction and prevention Model to be extracted from crime dataset. The following are various stages in crime
dataset analysis:
Data Collection: Data for the research is expected to be collected from crime data’s from Nigeria police, Prisons
database(Correctional database) and from Twitter API(Application Program Interface) save in comma separated value
file(csv). Featured Attributes For each entry of crime incidents in the datasets, the following 13 featured attributes are
included: 1) IncidentNum , 2) Dates, 3) Category, 4) Descript to mention a few
Data preprocessing: is the process of preparing the raw data involves removing of null values using df = df.dropna() where
df is the data frame. The categorical attributes (Location, Block, Crime Type, Community Area) are converted into numeric
using Label Encoder. The date attribute is splitted into new attributes like month and hour which can be used as feature for
the and making it suitable for deep learning modelling:
Features Extraction: is a type of dimensionality reduction where large numbers of pixels of the image are efficiently
represented in such a way that interesting parts are captured effectively. Dimensionality reduction techniques reduce the high
dimensional data to low-dimensional data. In this study, we have applied the principal component analysis (PCA) method
which provides linear mapping based on an eigenvector search. PCA provides different approaches to reduce the feature space
dimensionality [35, 36].
(ii)Data cleaning is the process of fixing or removing incorrect, corrupted, incorrectly formatted, duplicate, or
incomplete data within a dataset.
11
Data collection
Data collection
Feature Data Cleaning
Extractions
ARIMA
SES Crime prediction models
HW
RNN-LSTM
Evaluate forecasting performance
Figure 3.1.0: Proposed Methodology
METHODOLOGY CONTINUE(1)
Data conversion in time series: A key function to help transform time series data into a supervised
learning problem is the Pandas shift() function.
Time series are stationary if they do not have trend or seasonal effects.
Structured data: comprised of clearly defined data types with patterns that make them easily searchable.
Unstructured Data Analysis Information Extraction
unstructured data, which is comprised of data from emails, social media feeds and crimes related calls. Analyze semi- structured and
unstructured data sets for improved crime prediction.
As an outcome, organizations have to analyze semi- structured and unstructured crime datasets to extract structured data insights to make
improved crimes predictions.
Use of big data analytics tools and technology such Hadoop, NoSQL databases are being expected to deploy.
Training data: is the data you use to train an algorithm or deep learning model to predict the outcome
you design your model to predict. 80% of data are usually taken as training dataset.
Testing data:20% Test data provides a final, real-world check of an unseen dataset to confirm that the
deep Learning algorithm was trained effectively.
Various Models that Successfully with proposed Crime prediction model are: 13
METHODOLOGY CONTINUE(2)
ARIMA (AutoRegressive Integrated Moving Average) Model
ARIMA is an acronym that stands for AutoRegressive Integrated Moving Average. It is a
generalization of the simpler AutoRegressive Moving Average and adds the notion of
integration.
This acronym is descriptive, capturing the key aspects of the model itself. Briefly, they are:
AR: Autoregression. A model that uses the dependent relationship between an observation and
some number of lagged observations.
I: Integrated. The use of differencing of raw observations (e.g. subtracting an observation from
an observation at the previous time step) in order to make the time series stationary.
MA: Moving Average. A model that uses the dependency between an observation and a
residual error from a moving average model applied to lagged observations.
14
METHODOLOGY(3)
• Simplest Exponentially Smoothing(SES)
This method is suitable for forecasting data with no clear trend or seasonal pattern. The data do
not display any clear trending behaviour or any seasonality.
. Holt–Winters Exponential Smoothing Method. Holt–Winters(HW) exponential smoothing
method was designed in 1960 by extending the exponential smoothing method. HW is applied
when data are in the stationary form. For the calculation of the prediction measures, all the data
values need to be in series. This method is suitable when data are with the trend and
seasonality.
Long short-term memory (LSTM) networks are modified versions of the RNN that can help to
solve the short- and long-term dependencies which make it easier to remember previous data.
LSTM networks are trained using backpropagation through time which helps to overcome the
vanishing gradient problem.
Traditional neural networks have neurons, while LSTM networks have memory blocks connected
through sequential layers. Each module contains gates that can handle module status and outputs.
The gated formation of the LSTM network manages its memory state 15
CONTRIBUTION
This researched have provided a framework, that would
assist in solving the ever increasing crimes in Nigeria, using
deep learning.
Neil, Nandish & Manan (2021). Crime forecasting: a machine learning and computer vision approach to crime
prediction and prevention, Journal of Visual Computing for Industry, Biomedicine, and Art, article 9, pp-1-5
Haseeb et al. (2021)Employing Deep Learning and Time Series Analysis to Tackle the Accuracy and Robustness
of the Forecasting Problem, Hindawi Security and Communication Network, pp-1-10
Luo et al (2017). Learning to predict charges for criminal cases with legal basis. In
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing,
pp. 2727–2736
19
End
Thank you
20