Machine Learning in Thai
What is machine learning?
Machine
logic
(code)
Spam
? ?
?
machine learning
logic input
output
Machine learning
machine learning
Machine learning is divided into 3 types:
1. Supervised learning
2. Unsupervised learning
3. Reinforcement learning
Supervised learning
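In supervised learning, the data used to train the ML model comes with a class label for every sample. Supervised learning is divided into Classification and Regression: in classification the label is a discrete class label, while in regression the label is continuous.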
Unsupervised learning
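Unsupervised learning looks for patterns in the input when no label is provided.
Reinforcement learning
In reinforcement learning the learner receives a reward, e.g. reward=1 or reward=0, and the reward is fed back to it as input.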
https://www.youtube.com/watch?v=xM62SpKAZHU
AI Flappy Bird
(
) AI
concept ML
Dataset
Dataset
(feature)
dataset
digitize
A classic dataset in ML is the Iris dataset. It contains samples of 3 species of iris flower, Setosa, Versicolor and Virginica, 150 samples in total. Each sample has 4 features:
1. petal width
2. petal length
3. sepal width
4. sepal length
Source: Wikipedia, Iris flower data set, https://en.wikipedia.org/wiki/Iris_flower_data_set
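In tabular form, the columns are the Features (also called Attributes) and the rows are the Samples (also called Instances). The last column (e.g. Setosa) is the Class label. Each sample therefore has 4 features and a class label, so the dataset can be stored as a matrix.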
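As a rough illustration of this matrix layout, the sketch below stores three Iris samples as a feature matrix plus a list of class labels. The names `feature_names`, `X` and `y` are assumptions made for this example. The three rows are the first Setosa, Versicolor and Virginica records of the public Iris data set (in cm), with columns ordered petal width, petal length, sepal width, sepal length.

```python
# Column order is an assumption chosen for this sketch.
feature_names = ["petal_width", "petal_length", "sepal_width", "sepal_length"]

# Feature matrix: one row per sample (instance), one column per feature.
X = [
    [0.2, 1.4, 3.5, 5.1],  # Setosa
    [1.4, 4.7, 3.2, 7.0],  # Versicolor
    [2.5, 6.0, 3.3, 6.3],  # Virginica
]

# Class labels: one entry per row of X.
y = ["Setosa", "Versicolor", "Virginica"]

n_samples = len(X)
n_features = len(X[0])
print(n_samples, "samples x", n_features, "features")

for row, label in zip(X, y):
    print(dict(zip(feature_names, row)), "->", label)
```

The full dataset would simply have 150 rows instead of 3.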
Machine learning
Pre-processing
Learning
Evaluation
Pre-processing
(structure)
Iris flower dataset
structure
feature feature
unstructured
pixel
attribute
input generate
feature
prediction
Feature extraction feature
feature
Feature selection
Learning
data input
data input
Training
training
Ref: http://www.datasciencecentral.com/profiles/blogs/a-tour-of-machine-learning-algorithms-1
train
prediction
Model
Cross-validation model
(accuracy) error
Overfitting train
feature model predict
sensitive noise
cross-validation
Leave-one-out 2 train
k-fold dataset k 5-fold 4-fold
train 1-fold test 3-fold train 2-fold test
combinational
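The k-fold idea can be sketched in a few lines. This is a minimal illustration under stated assumptions, not a library implementation: the helper name `k_fold_splits` is hypothetical, samples are referred to by index only, and no shuffling is done. Leave-one-out corresponds to setting k equal to the number of samples.

```python
def k_fold_splits(n_samples, k):
    """Yield (train_indices, test_indices) once for each of the k folds."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size


# 5-fold on 10 samples: each round trains on 4 folds and tests on 1 fold.
splits = list(k_fold_splits(n_samples=10, k=5))
for train, test in splits:
    print("train:", train, "test:", test)

# Leave-one-out is the special case k == n_samples.
loo_splits = list(k_fold_splits(n_samples=4, k=4))
```

Every sample ends up in the test set exactly once, so the scores from the k rounds can be averaged into a single estimate.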
Evaluation
Accuracy
90%
Confusion matrix
accuracy
classify classes class
predict classify
2 classes confusion matrix confusion table
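To make the link between accuracy and the confusion matrix concrete, here is a small sketch for 2 classes. The function names and the labels `"no"` / `"yes"` are hypothetical, and the data is made up for illustration: a classifier that always predicts `"no"` on a set with 9 `"no"` samples and 1 `"yes"` sample.

```python
def confusion_matrix(y_true, y_pred, labels):
    """Rows are the true class, columns are the predicted class."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for t, p in zip(y_true, y_pred):
        matrix[index[t]][index[p]] += 1
    return matrix


def accuracy(matrix):
    """Fraction of samples on the diagonal (correctly classified)."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


# Hypothetical data: 9 "no" samples, 1 "yes" sample, everything predicted "no".
y_true = ["no"] * 9 + ["yes"]
y_pred = ["no"] * 10

cm = confusion_matrix(y_true, y_pred, labels=["no", "yes"])
acc = accuracy(cm)
print(cm)
print(acc)
```

Accuracy comes out at 90% here even though the "yes" class is never detected, which is the kind of detail the confusion matrix exposes and a single accuracy figure hides.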
ML
learning
Entropy
Entropy (
)
Feature selection entropy
Entropy
The concept of entropy comes from Claude Shannon, the Father of Information Theory, who introduced it in the study of communication and information.
Entropy (impurity)
(transmission)
channel (purity)
Entropy 0-1
- 100 entropy
50 50
() entropy
: = 90:10 70:30
1 entropy
2 entropy
Entropy
class content
(
) log 2 class
A-Z 26 1/26
- 1/44
entropy
log 2
x class 1 class
entropy
Writing the probability of each class as p_i, entropy takes the form:

$$H = -\sum_{i=1}^{n} p_i \log_2 p_i$$

where the sum runs over the classes i = 1 to n.
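A short sketch of the entropy calculation, assuming the standard Shannon form (minus the sum of p times log base 2 of p over the classes). The function name `entropy` and the use of raw class counts as input are choices made for this example. The class mixes are the ones mentioned in the text (50:50, 70:30, 90:10), plus a pure 100:0 set and the 26 equally likely letters A-Z.

```python
import math


def entropy(counts):
    """Shannon entropy in bits, given the number of samples in each class."""
    total = sum(counts)
    # Classes with zero samples contribute nothing (p * log2(p) -> 0 as p -> 0).
    return sum(-(c / total) * math.log2(c / total) for c in counts if c > 0)


for counts in ([50, 50], [70, 30], [90, 10], [100, 0]):
    print(counts, round(entropy(counts), 3))

# 26 equally likely letters: entropy equals log2(26).
letters = entropy([1] * 26)
print(round(letters, 3))
```

For two classes the value runs from 0 (pure) to 1 (an even 50:50 mix). With more than two classes it can exceed 1, as the A-Z case shows.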
entropy ?
classification instances/samples
instance features
features train
model predict accuracy
features
content , features 2
train entropy
log 2 1 0
entropy classification binary-classification
2-classes classification (Yes or
No), (Male or Female),
Decision tree
node Yes No
Information gain
entropy information
gain
Resources:
StackOverflow: https://stackoverflow.com/questions/1859554/what-is-entropy-and-information-gain
Data Mining: http://dataminingtrend.com/2014/data-mining-techniques/feature-selection-information-gain/
Entropy (Information Theory): https://www.khanacademy.org/computing/computer-science/informationtheory/moderninfotheory/v/information-entropy
https://en.wikipedia.org/wiki/Entropy_(information_theory)
Information gain
IG
Entropy
- 10
2
class (child 1 child 2)
: ()
entropy
IG
weight child
IG
0.256
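Information gain can be sketched as the parent's entropy minus the size-weighted entropy of the children. The split below is hypothetical (10 samples, 5 per class, divided into two children). It is not the example that produced the 0.256 figure, because that example's counts are not recoverable here. The function names are assumptions for illustration.

```python
import math


def entropy(counts):
    """Shannon entropy in bits, given the number of samples in each class."""
    total = sum(counts)
    return sum(-(c / total) * math.log2(c / total) for c in counts if c > 0)


def information_gain(parent_counts, children_counts):
    """Parent entropy minus the weighted average entropy of the children."""
    total = sum(parent_counts)
    weighted = sum(sum(child) / total * entropy(child) for child in children_counts)
    return entropy(parent_counts) - weighted


# Hypothetical split: parent has 5 samples of each class.
parent = [5, 5]
children = [[4, 1], [1, 4]]  # child 1 and child 2, class counts in each

ig = information_gain(parent, children)
print(round(ig, 3))
```

A feature whose split yields purer children gets a higher IG, which is why IG can be used to rank features for feature selection.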
Real-world examples
dataset
?
feature (feature
selection) Information Gain
feature
model
entropy top comment
What is entropy and information gain?
Linear Regression 1
Linear Regression is a predictive model whose output is continuous, e.g. 1-100 or 0.01-0.99, whereas a classification model predicts a discrete output such as 0 or 1.
2 ,
,
y () x
( )
2
(linear)
x=15 y
model
(
)
linear regression
(fit)
( )
Training data ( training set, examples )
training data (Unknown
data/Unseen data) x=15 y ~= 12
?
Error error
( make sense )
optimize error model
h hypothesis
(
)
parameter optimize ()
training data
h hypothesis index
m h 1
error 2
(Sum square error) training data
Cost function
Square Error J
(error
)
optimize
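The hypothesis and the squared-error cost function can be sketched as below. This assumes the common single-feature form h(x) = theta0 + theta1 * x and a cost J equal to the sum of squared errors divided by 2m. The parameter names `theta0` and `theta1`, the 1/(2m) scaling and the three training points are all assumptions made for this example.

```python
def hypothesis(theta0, theta1, x):
    """Straight-line hypothesis h(x) = theta0 + theta1 * x."""
    return theta0 + theta1 * x


def cost(theta0, theta1, xs, ys):
    """Squared-error cost J: sum of squared errors over m samples, divided by 2m."""
    m = len(xs)
    squared_errors = ((hypothesis(theta0, theta1, x) - y) ** 2 for x, y in zip(xs, ys))
    return sum(squared_errors) / (2 * m)


# Hypothetical training set lying exactly on the line y = x.
xs = [1.0, 2.0, 3.0]
ys = [1.0, 2.0, 3.0]

# Try a few slopes with the intercept fixed at 0.
for theta1 in (1.0, 0.5, 0.0):
    print(theta1, round(cost(0.0, theta1, xs, ys), 3))
```

The cost is 0 when the line passes through every training point and grows as the line moves away from them, so optimizing the model means finding the parameters with the smallest J.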
Linear Regression 2
concept
1. hypothesis
2. represent cost function
3. minimize cost function
hypothesis h
concept
h
1, 0.5 0
sum square
cost
function
(bowl shape) J
0 training set
( training set )
optimize model
optimize
math
optimize
iterative algorithm
Gradient Descent
Gradient Descent
Linear Regression task
optimize Cost function
model fit training set
(bowl shape) 2
2 3
Global minimum
Global
Minimum optimize
Gradient
Descent
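repeat until convergence {
  $\theta_j := \theta_j - \alpha \frac{\partial}{\partial \theta_j} J(\theta_0, \theta_1)$   (for j = 0 and j = 1)
}
Substituting the cost function of linear regression into this rule and applying the chain rule to the partial derivatives gives the concrete updates for $\theta_0$ and $\theta_1$. Both parameters must be updated simultaneously, and := denotes assignment.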
Batch Gradient Descent
(alpha)
Learning
rate gradient
gradient
Overshoot
cost function
optimize
training set global
minimum (
)
local minimum
()
optimize optimize global minimum
gradient
global minimum
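Batch gradient descent for single-feature linear regression can be sketched as follows. This is a minimal illustration, assuming the hypothesis h(x) = theta0 + theta1 * x and a squared-error cost scaled by 1/(2m), whose partial derivatives are the two averages computed in the loop. The function name, the starting point (0, 0), the learning rate 0.1, the iteration count and the training data are all assumptions chosen for this example.

```python
def gradient_descent(xs, ys, alpha=0.1, iterations=2000):
    """Fit theta0 + theta1 * x by batch gradient descent on the squared-error cost."""
    m = len(xs)
    theta0, theta1 = 0.0, 0.0
    for _ in range(iterations):
        # "Batch": every training sample is used in each update step.
        errors = [theta0 + theta1 * x - y for x, y in zip(xs, ys)]
        grad0 = sum(errors) / m                             # dJ/dtheta0
        grad1 = sum(e * x for e, x in zip(errors, xs)) / m  # dJ/dtheta1
        # Simultaneous update: both gradients were computed from the old values.
        theta0, theta1 = theta0 - alpha * grad0, theta1 - alpha * grad1
    return theta0, theta1


# Hypothetical training set lying exactly on the line y = x.
xs = [1.0, 2.0, 3.0]
ys = [1.0, 2.0, 3.0]

theta0, theta1 = gradient_descent(xs, ys)
print(theta0, theta1)
```

On this data the parameters approach theta0 = 0 and theta1 = 1. Because the squared-error cost of linear regression is bowl shaped, there is a single global minimum to converge to, provided the learning rate is small enough not to overshoot.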
Underfitting VS Overfitting
cross-validation
complex handle
generalize predict data cross-
validation model
model
linear regression
training data
x,y
model (cost function train)
model
training data
handle data
complex
handle Underfitting
underfitting train
cost function 2, 3
noise outlier
training data edge case
Overfitting
error
overfitting 1) data train model
training data
2)
custom training data generalize
overfitting model
training data
testing data
Machine learning ?
document
machine
learning concept
download machine learning library
sample code, document
document
programming
language
( )
machine learning
machine learning
document library
-?
feature train ?
?
?
train optimize ?
paper math ?
linear algebra, discrete math, calculus
, matrix,
vector, probability, set, logic, combination, graph, partial
derivative etc.
probability, conditional probability, normal
distribution etc.
cross-validation, confusion matrix error
, precision, recall, F1 score etc.
information theory
math
hypothesis
cost function
optimize cost function
model fit
train
tune parameters cross-validation
model under fit over fit
(
)