Professional Documents
Culture Documents
Mesh PR Ofile Est Imation Using Logistic Regressi On and Random Forest
Mesh PR Ofile Est Imation Using Logistic Regressi On and Random Forest
o n u s in g
s tim a t i
P ro f ile E
Me sh s io n a n d
R e g r e s
Lo g is tic
F o r e st
Rand o m i n ( UC SM)
Hn e t Yee
b y Hs u L
n t ed
Prese
19
25/12/20
Outline
Abstract
Data Preparation
Feature Engineering
Experimental Results
References
Abstract
The composed 4 files are “calendar.csv”, “dynamic_train.csv”,
“dynamic_test.csv” and “meta_data.csv”.
Class labels are Residence, Station, Park, EventHall, Office.
Firstly, all of features and records from these files are aggregated.
The dynamic_test.csv also has the same records like train data for
20 mesh.
The metadata.csv give the target variable for each mesh id in train
data.
Exploratory Data Analysis
There has 5 different class label. The following are class labels
in train dataset.
Weekend and Weekday stay for Weekend and Weekday move for
each class label each class label
Exploratory Data Analysis
EventHall
Office
Weekend Weekday
Park
stay:move stay:move
7.5:2.5 7.5:2.5
Residence
Station
The unnecessary features are category, date_time, date, time, hd_flag, dow.
75 % 70
70 %
%
https://python-graph-gallery.com/11-grouped-barplot/
https://scikit-learn.org/stable/modules/generated/
sklearn.model_selection.RandomizedSearchCV.html
Thank You