Professional Documents
Culture Documents
Principal Component Analysis (PCA) : Shusen Wang
Principal Component Analysis (PCA) : Shusen Wang
Shusen Wang
Some Statistical Concepts
Mean and Variance (Scalar)
• Let 𝑋 be a random scalar variable and 𝑝(⋅) be the probability
density function (PDF).
• Mean: 𝜇 = 𝔼 𝑋 = ∫ 𝑥 ⋅ 𝑝 𝑥 𝑑𝑥.
• Variance: 𝜎 . = 𝔼 𝑋 − 𝜇 .
=∫ 𝑥−𝜇 .
⋅ 𝑝 𝑥 𝑑𝑥.
Mean and Variance (Scalar)
• Let 𝑋 be a random scalar variable and 𝑝(⋅) be the probability
density function (PDF).
• Mean: 𝜇 = 𝔼 𝑋 = ∫ 𝑥 ⋅ 𝑝 𝑥 𝑑𝑥.
• Variance: 𝜎 . = 𝔼 𝑋 − 𝜇 .
=∫ 𝑥−𝜇 .
⋅ 𝑝 𝑥 𝑑𝑥.
0
A = ∑3670 𝐱 6
• Sample mean: 𝛍
3
PCA Explained
Step 1: subtract the mean
0
A = ∑3670 𝐱 6
• Sample mean: 𝛍
3
• Translation: 𝐱 6D = 𝐱 6 − 𝛍
A
translation
PCA Explained
Step 1: subtract the mean
0
A = ∑3670 𝐱 6
• Sample mean: 𝛍
3
• Translation: 𝐱 6D = 𝐱 6 − 𝛍
A
After translation
PCA Explained
Step 2: rotation
=×=
• Find a rotation matrix 𝐕 ∈ ℝ
the 2nd greatest variance direction
• Rotation: 𝐱 6DD = 𝐕 ? 𝐱 6 ′ ∈ ℝ=
Rotation
• Find a rotation matrix 𝐕 ∈ ℝ=×=
• Rotation: 𝐱 6DD = 𝐕 ? 𝐱 6 ′ ∈ ℝ=
Annual cycle
The 2nd Principal Components
Annual cycle
The 3rd Principal Components
El Nino
The 4th Principal Components
La Nina
Summary
Dimensionality Reduction
• Applications
• Visualization and analysis
• Data processing (to make downstream ML more efficient or accurate)
Dimensionality Reduction
• Applications
• Visualization and analysis
• Data processing (to make downstream ML more efficient or accurate)
• Methods
• Principal component analysis (PCA)
• Kernel PCA
• Manifold learning
• Autoencoder
• Linear discriminant analysis (LDA)
3 Steps of PCA