Presentation Slides TeYang Lau
HDB Resale Price
TeYang Lau
Contents
1. HDB Resale Prices Dataset
Understanding the Data
HDB Resale Prices Dataset
Median HDB Resale Prices Over the Years
Figure: median resale price (SGD), adjusted for inflation to 2019 and unadjusted for inflation
◉ Adjusted using the Consumer Price Index for Housing and Utilities (singstat.gov.sg)
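A CPI adjustment of this kind is usually done by rescaling each nominal price by the ratio of the base-year index to the index of the sale year. The sketch below assumes an annual index lookup. The index values are made up for illustration and are not SingStat figures, and `adjust_to_base_year` is a hypothetical helper name, not something named in the slides.

```python
# Hypothetical annual CPI values, for illustration only (not SingStat data).
cpi_by_year = {2017: 98.0, 2018: 99.0, 2019: 100.0}


def adjust_to_base_year(price, sale_year, cpi, base_year=2019):
    """Express a nominal price in base-year dollars using a CPI series."""
    return price * cpi[base_year] / cpi[sale_year]


# A flat sold in 2017, restated in 2019 dollars.
adjusted = adjust_to_base_year(400_000, 2017, cpi_by_year)
print(round(adjusted, 2))
```

A price from the base year itself is left unchanged, since the ratio of indices is 1.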
2. Other Data Sources
Feature Engineering
Process
Nearest Amenities Distance & Number in 2km Radius
| Flat | Nearest Primary School | Nearest School Distance | Num School | Nearest Hawker | Nearest Hawker Distance | Nearest Mall | Nearest Mall Distance | Nearest MRT | Nearest MRT Distance | Distance From Dhoby |
|---|---|---|---|---|---|---|---|---|---|---|
| 3A UPP BOON KENG RD | Bendemeer Primary School | 1.21 | 6 | North Bridge Road Market & Food Centre | 1.25 | Aperia | 1.02 | KALLANG MRT STATION | 0.28 | 3.41 |
| 126 SERANGOON NTH AVE 1 | Zhonghua Primary School | 0.69 | 6 | Chomp Chomp Food Centre | 0.62 | Hougang 1 | 1.35 | KOVAN MRT STATION | 1.60 | 7.95 |
| 536 UPP CROSS ST | Cantonment Primary School | 1.22 | 2 | Market Street Food Centre | 0.04 | People's Park Centre | 0.20 | CHINATOWN MRT STATION | 0.13 | 1.58 |
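The two amenity features on this slide (distance to the nearest amenity, and the number of amenities within a 2 km radius) can be sketched as follows. This assumes each flat and amenity has a (latitude, longitude) pair and uses the haversine great-circle distance. The slides do not say which distance formula was used, and the coordinates and the names `haversine_km` and `nearest_and_count` are hypothetical.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(radians, (a[0], a[1], b[0], b[1]))
    h = (sin((lat2 - lat1) / 2) ** 2
         + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * asin(sqrt(h))


def nearest_and_count(flat, amenities, radius_km=2.0):
    """Return (nearest name, its distance in km, number within radius_km)."""
    distances = {name: haversine_km(flat, coord)
                 for name, coord in amenities.items()}
    nearest = min(distances, key=distances.get)
    count = sum(d <= radius_km for d in distances.values())
    return nearest, distances[nearest], count


# Made-up coordinates, for illustration only.
flat = (1.3200, 103.8700)
schools = {
    "School A": (1.3250, 103.8700),
    "School B": (1.3300, 103.8800),
    "School C": (1.3600, 103.9000),
}
name, dist_km, num_within_2km = nearest_and_count(flat, schools)
print(name, round(dist_km, 2), num_within_2km)
```

The same function can be reused for hawker centres, malls and MRT stations by passing a different amenity dictionary.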
3. Exploratory Data Analysis
By Each Feature
Resale Price by Town
North: SGD 351,798
North-East: SGD 410,850
West: SGD 371,119
Central: SGD 485,659
East: SGD 413,631
Resale Price by Distance From Dhoby Ghaut MRT
Resale Price by Flat Model
(figure labels: 2 Room, Maisonette)
Resale Price by Floor Area
Resale Price by Flat Type
4. Model and Results
Linear Regression
Random Forest
Data Preparation
Year: 44064, -
Floor Area: 20, 19
Nearest Hawker Distance: 11, 8
Nearest Park Distance: 6, 6
Dummy Encoding
◉ Flat Model (baseline: Standard)
◉ Region (baseline: Central)
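Dummy encoding with a baseline category, as listed on this slide for Flat Model (baseline Standard) and Region (baseline Central), can be sketched with pandas. The column names `flat_model` and `region` and the sample rows are hypothetical. The idea is that the baseline level of each feature gets no column of its own, so a row at both baselines is encoded as all zeros.

```python
import pandas as pd

# Made-up rows, for illustration only.
df = pd.DataFrame({
    "flat_model": ["Standard", "Improved", "Maisonette", "Standard"],
    "region": ["Central", "North", "East", "West"],
})

# Baseline level per categorical feature, as named on the slide.
baselines = {"flat_model": "Standard", "region": "Central"}

# One 0/1 column per level, then drop each feature's baseline column so the
# remaining coefficients are read relative to that baseline.
dummies = pd.get_dummies(df, columns=list(baselines), dtype=int)
dummies = dummies.drop(
    columns=[f"{col}_{base}" for col, base in baselines.items()]
)
print(dummies.columns.tolist())
```

Dropping the baseline explicitly, rather than relying on `drop_first=True`, keeps the choice of reference level under the analyst's control instead of leaving it to alphabetical order.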
Data Preparation: Outlier Detection
Linear Regression
◉ OLS Regression
◉ 23 Features
◉ Statsmodels package
Random Forest
Random Forest Feature Importance
◉ Random Forest Regression
◉ 26 Features
◉ scikit-learn (sklearn) package
◉ Train-test split: 9:1
◉ Out-of-bag score (R2: 0.957)
◉ Grid Search Cross Validation (R2: 0.961)
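The workflow in these bullets (9:1 train-test split, out-of-bag score, grid search with cross-validation, feature importance) can be sketched with scikit-learn. The data are synthetic, and the hyperparameter grid, number of trees and fold count are assumptions, since the slides do not state them.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic data, for illustration only.
rng = np.random.default_rng(0)
X = rng.uniform(size=(300, 3))
y = 3 * X[:, 0] + 2 * X[:, 1] + rng.normal(0, 0.1, 300)

# 9:1 train-test split.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.1, random_state=0
)

# Out-of-bag R2: each tree is scored on the training rows it did not sample.
forest = RandomForestRegressor(n_estimators=100, oob_score=True, random_state=0)
forest.fit(X_train, y_train)
oob_r2 = forest.oob_score_

# Grid search with cross-validation over an assumed hyperparameter grid.
param_grid = {"max_depth": [4, None], "min_samples_leaf": [1, 5]}
search = GridSearchCV(
    RandomForestRegressor(n_estimators=100, random_state=0),
    param_grid, cv=3, scoring="r2",
)
search.fit(X_train, y_train)

test_r2 = search.score(X_test, y_test)  # R2 of the refit best model on held-out data
importances = search.best_estimator_.feature_importances_
print(oob_r2, search.best_params_, test_r2, importances)
```

The out-of-bag score gives a validation estimate without a separate hold-out set, which makes it a useful cross-check against the cross-validated score.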
SHAP Values
Predicted Low Resale Price
Conclusion