Professional Documents
Culture Documents
6COM1044 2023 2024 General ML and PCA
6COM1044 2023 2024 General ML and PCA
Yi Sun
▶ Machine Learning:
is the scientific study of algorithms and statistical
models that computer systems use to perform a specific task
without using explicit instructions, relying on patterns
and inference instead [4].
▶ Approaches
▶ Supervised learning
▶ Unsupervised learning: identify natural clusters in the data.
▶ Reinforcement learning
−2 2
−4 −4
Consider a small data set X = (x1 , x2 ) =
4 4
2 −2
0 0
var (x) = σ 2
σ12
cov (x1 , x2 ) · · · cov (x1 , xd )
cov (x2 , x1 ) σ22 · · · cov (x2 , xd )
Σ= ........................................
cov (xd , x1 ) cov (xd , x2 ) · · · σd2
1 1
projection on PC1 = x11 ∗ u11 + x12 ∗ u12 = −2 × √ + 2 × √ = 0.
2 2
1 1 4
projection on PC2 = x11 ∗u21 +x12 ∗u22 = −2×(− √ )+2× √ = √ .
2 2 2
1.0000 0.7350 0.6618 0.6453 0.6051
1.0000 0.6737 0.7685 0.5290
Σ=
1.0000 0.7632 0.5263
1.0000 0.6066
1.0000
Here Σ in fact is a correlation matrix, since data have zero-mean
and unit-variance.
▶ trace(Σ) = 5.0000;
▶ λ1 = 3.6160, λ2 = 0.5315, λ3 = 0.3864, λ4 = 0.3016, λ5 =
0.1645;
▶
P
λi = 5.0000
Non−survivor
1.5 Survivor
0.5
0
PC2
−0.5
−1
−1.5
−2
−2.5
−3
−5 −4 −3 −2 −1 0 1 2 3 4 5
PC1
[4] https://en.wikipedia.org/wiki/Machine_learning