Welcome to Scribd!

Advantages and Disadvantage of PCA

Uploaded by

0% found this document useful (0 votes)

19 views2 pages

PCA can be used for dimensionality reduction, feature selection, data visualization, dealing with multicollinearity, noise reduction, data compression, and outlier detection. However, it has disadvantages such as difficulty interpreting principal components, sensitivity to data scaling, potential information loss, assumption of linear relationships, computational complexity for large datasets, and risk of overfitting.

Original Description:

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

0% found this document useful (0 votes)

19 views2 pages

Advantages and Disadvantage of PCA

Uploaded by

Shobha Kumari Choudhary

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

Jump to Page

You are on page 1of 2

Search inside document

Advantages of Principal Component Analysis

1. Dimensionality Reduction: PCA is a popular technique used

for dimensionality reduction, which is the process of reducing the number
of variables in a dataset. By reducing the number of variables, PCA
simplifies data analysis, improves performance, and makes it easier to
visualize data.
2. Feature Selection: PCA can be used for feature selection, which is the
process of selecting the most important variables in a dataset. This is
useful in machine learning, where the number of variables can be very
large, and it is difficult to identify the most important variables.
3. Data Visualization: PCA can be used for data visualization. By reducing
the number of variables, PCA can plot high-dimensional data in two or
three dimensions, making it easier to interpret.
4. Multicollinearity: PCA can be used to deal with multicollinearity, which
is a common problem in a regression analysis where two or more
independent variables are highly correlated. PCA can help identify the
underlying structure in the data and create new, uncorrelated variables that
can be used in the regression model.
5. Noise Reduction: PCA can be used to reduce the noise in data. By
removing the principal components with low variance, which are assumed
to represent noise, PCA can improve the signal-to-noise ratio and make it
easier to identify the underlying structure in the data.
6. Data Compression: PCA can be used for data compression. By
representing the data using a smaller number of principal components,
which capture most of the variation in the data, PCA can reduce the
storage requirements and speed up processing.
7. Outlier Detection: PCA can be used for outlier detection. Outliers are
data points that are significantly different from the other data points in the
dataset. PCA can identify these outliers by looking for data points that are
far from the other points in the principal component space.

Disadvantages of Principal Component Analysis

1. Interpretation of Principal Components: The principal components

created by PCA are linear combinations of the original variables, and it is
often difficult to interpret them in terms of the original variables. This can
make it difficult to explain the results of PCA to others.
2. Data Scaling: PCA is sensitive to the scale of the data. If the data is not
properly scaled, then PCA may not work well. Therefore, it is important to
scale the data before applying PCA.
3. Information Loss: PCA can result in information loss. While PCA
reduces the number of variables, it can also lead to loss of information.
The degree of information loss depends on the number of principal
components selected. Therefore, it is important to carefully select the
number of principal components to retain.
4. Non-linear Relationships: PCA assumes that the relationships between
variables are linear. However, if there are non-linear relationships between
variables, PCA may not work well.
5. Computational Complexity: Computing PCA can be computationally
expensive for large datasets. This is especially true if the number of
variables in the dataset is large.
6. Overfitting: PCA can sometimes result in overfitting, which is when the
model fits the training data too well and performs poorly on new data.
This can happen if too many principal components are used or if the
model is trained on a small dataset.

Data Preprocessing in Data Mining
Document4 pages
Data Preprocessing in Data Mining
Dhananjai Saini
No ratings yet
Unit 3
Document31 pages
Unit 3
wansejalm527
No ratings yet
Need of PCA
Document6 pages
Need of PCA
Simi Jain
100% (1)
Chapter Five Principal Comonent Analysis (PCA)
Document33 pages
Chapter Five Principal Comonent Analysis (PCA)
Ruun Mohamed
No ratings yet
Pca 1692550768
Document13 pages
Pca 1692550768
kan luc N'guessan
No ratings yet
Principal Component Analysis
Document2 pages
Principal Component Analysis
Shobha Kumari Choudhary
No ratings yet
The Intuition Behind PCA: Machine Learning Assignment
Document11 pages
The Intuition Behind PCA: Machine Learning Assignment
Palash Ghosh
No ratings yet
DS Ca2 PPT 3010 3017
Document10 pages
DS Ca2 PPT 3010 3017
Apurva Jarwal
No ratings yet
Week 4
Document3 pages
Week 4
MANISH P
No ratings yet
21BPS1595 Iot Lab-9
Document4 pages
21BPS1595 Iot Lab-9
yallsuckbruh
No ratings yet
Introduction To Dimensionality Reduction
Document5 pages
Introduction To Dimensionality Reduction
Shobha Kumari Choudhary
No ratings yet
Dealing With Highly Dimensional Data Using Principal Component Analysis (PCA)
Document14 pages
Dealing With Highly Dimensional Data Using Principal Component Analysis (PCA)
Arthur Patrocínio
No ratings yet
Dimensionality
Document9 pages
Dimensionality
Vijayalakshmi Govindarajalu
No ratings yet
3.2 Pca
Document27 pages
3.2 Pca
Javada Javada
No ratings yet
Week 4
Document5 pages
Week 4
madhu
No ratings yet
Things To Remember - Principal Component Analysis (PCA)
Document2 pages
Things To Remember - Principal Component Analysis (PCA)
kmkatariya
No ratings yet
Pattern Recognition Techniques
Document13 pages
Pattern Recognition Techniques
Sonu Gangwani
No ratings yet
Principal Component Analysis
Document1 page
Principal Component Analysis
ariani.accseahaven
No ratings yet
ML Mod 6
Document5 pages
ML Mod 6
MSD
No ratings yet
Things To Remember - Principal Component Analysis
Document2 pages
Things To Remember - Principal Component Analysis
Umaprasad
No ratings yet
pca شرح
Document8 pages
pca شرح
Omer Ahmed
No ratings yet
Linear Algebra
Document5 pages
Linear Algebra
Chaitanya Birla
No ratings yet
Mloa Exp2 C121
Document20 pages
Mloa Exp2 C121
Devanshu Maheshwari
No ratings yet
Machine Learning Qs
Document10 pages
Machine Learning Qs
onkarxo
No ratings yet
Sunil
Document9 pages
Sunil
gsunilprasad
No ratings yet
Devoir PCA
Document13 pages
Devoir PCA
Mono job
No ratings yet
Advanced Data Analytics Assignment
Document6 pages
Advanced Data Analytics Assignment
Olwethu N Mahlathini (Lethu)
No ratings yet
Featuer Extraction
Document3 pages
Featuer Extraction
jewellerycatinfo
No ratings yet
T Statistics HW-1 (Arefin-2824475)
Document6 pages
T Statistics HW-1 (Arefin-2824475)
Mir Shahnewaz Arefin
No ratings yet
Dimensionality Reduction
Document47 pages
Dimensionality Reduction
bka212407
No ratings yet
Principal Component Analysis
Document10 pages
Principal Component Analysis
Deeksha Manoj
No ratings yet
INTRODUCTION
Document9 pages
INTRODUCTION
Ashitha Ashi
No ratings yet
Unit 5 - Machine Learning - WWW - Rgpvnotes.in PDF
Document14 pages
Unit 5 - Machine Learning - WWW - Rgpvnotes.in PDF
Pratik Gupta
No ratings yet
DhBqO7 - vRayQaju - 71WsBg - Intro To Clinical Data Study Guide - M4
Document9 pages
DhBqO7 - vRayQaju - 71WsBg - Intro To Clinical Data Study Guide - M4
Rishabh Masuta
No ratings yet
DMBAR Chapter 4 Dimension Reduction
Document25 pages
DMBAR Chapter 4 Dimension Reduction
ANAM AFTAB 22GSOB2010404
No ratings yet
Data Science Practical 3
Document7 pages
Data Science Practical 3
SLnger
No ratings yet
Principal Components Analysis: Contents at A Glance
Document17 pages
Principal Components Analysis: Contents at A Glance
api-3798296
No ratings yet
Chapter6 - Unit IV2024
Document84 pages
Chapter6 - Unit IV2024
aadhya.1si21is002
No ratings yet
DR Pca
Document22 pages
DR Pca
adarsh.tripathi
No ratings yet
DM&W File
Document18 pages
DM&W File
Prerna Rawal
No ratings yet
Aiml - 07 - 28
Document4 pages
Aiml - 07 - 28
darshil shah
No ratings yet
40 Interview Questions On Machine Learning - AnalyticsVidhya
Document21 pages
40 Interview Questions On Machine Learning - AnalyticsVidhya
Kaleab Tekle
100% (1)
Dimensionality reduction-PCA
Document60 pages
Dimensionality reduction-PCA
inspiring 123
No ratings yet
Feature Engineering: Short Study: Indian Institute of Space Science and Technology, Department of Mathematics
Document6 pages
Feature Engineering: Short Study: Indian Institute of Space Science and Technology, Department of Mathematics
goci
No ratings yet
Assignment
Document24 pages
Assignment
Santhi Palanisamy
No ratings yet
Question Bank DMC
Document28 pages
Question Bank DMC
achal sahare
No ratings yet
Principal Component Analysis - Intro - Towards Data Science
Document4 pages
Principal Component Analysis - Intro - Towards Data Science
Alan Picard
No ratings yet
PCA Analysis Validation Guide
Document2 pages
PCA Analysis Validation Guide
patryk langer
No ratings yet
Datamining Unit 2 Part2
Document11 pages
Datamining Unit 2 Part2
Bhasutkar Mahesh
No ratings yet
2intern1 2
Document10 pages
2intern1 2
Kritika
No ratings yet
Core Data Science Concepts 1629081058
Document24 pages
Core Data Science Concepts 1629081058
Abhishek Prasoon
No ratings yet
Anomaly Detection in LTE KPI With Machine Laerning
Document15 pages
Anomaly Detection in LTE KPI With Machine Laerning
satyajit
No ratings yet
Principal Component Analysis
Document20 pages
Principal Component Analysis
Shahnawaz sahil
No ratings yet
Exp1 Data Visualization: 1.line Chart 2.area Chart 3.bar Chart 4.histogram
Document6 pages
Exp1 Data Visualization: 1.line Chart 2.area Chart 3.bar Chart 4.histogram
Nihar Chalke
No ratings yet
A Machine Learning Application For Classification of Chemical Spectra
Document14 pages
A Machine Learning Application For Classification of Chemical Spectra
Fabian
No ratings yet
Principal Component Analysis Limitations and How To Overcome Them Let's Talk A
Document5 pages
Principal Component Analysis Limitations and How To Overcome Them Let's Talk A
Hamza Zabet
No ratings yet
1 SM
Document11 pages
1 SM
Didi Quaydi
No ratings yet
Python Machine Learning for Beginners: Unsupervised Learning, Clustering, and Dimensionality Reduction. Part 1
From Everand
Python Machine Learning for Beginners: Unsupervised Learning, Clustering, and Dimensionality Reduction. Part 1
Tom Lesley
No ratings yet
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Understanding Linear Regression - 12
Document2 pages
Understanding Linear Regression - 12
Shobha Kumari Choudhary
No ratings yet
Binomial Distribution-8
Document4 pages
Binomial Distribution-8
Shobha Kumari Choudhary
No ratings yet
K Means Clustering
Document11 pages
K Means Clustering
Shobha Kumari Choudhary
No ratings yet
Implementing PCA in Python With Scikit
Document6 pages
Implementing PCA in Python With Scikit
Shobha Kumari Choudhary
No ratings yet
Difference Between K Means and Hierarchical Clustering
Document2 pages
Difference Between K Means and Hierarchical Clustering
Shobha Kumari Choudhary
No ratings yet
Conclusions 5
Document2 pages
Conclusions 5
Shobha Kumari Choudhary
No ratings yet
Overview of Data Cleaning
Document17 pages
Overview of Data Cleaning
Shobha Kumari Choudhary
No ratings yet
Linear Regression-3
Document3 pages
Linear Regression-3
Shobha Kumari Choudhary
No ratings yet
Linear Equations-2
Document2 pages
Linear Equations-2
Shobha Kumari Choudhary
No ratings yet
SQL Sequences
Document3 pages
SQL Sequences
Shobha Kumari Choudhary
No ratings yet
Linear Models Basics-1
Document2 pages
Linear Models Basics-1
Shobha Kumari Choudhary
No ratings yet
SQL Part 1
Document4 pages
SQL Part 1
Shobha Kumari Choudhary
No ratings yet
SQL Query Processing10
Document3 pages
SQL Query Processing10
Shobha Kumari Choudhary
No ratings yet
SQL UNION Clause
Document3 pages
SQL UNION Clause
Shobha Kumari Choudhary
No ratings yet
SQL WITH Clause
Document3 pages
SQL WITH Clause
Shobha Kumari Choudhary
No ratings yet
EDU6950 Advance Statistics in Education Assignment 2-Multiple Regression Analysis
Document14 pages
EDU6950 Advance Statistics in Education Assignment 2-Multiple Regression Analysis
khalifa
No ratings yet
Economics For Managers: The Demand Equation Estimation Continued
Document25 pages
Economics For Managers: The Demand Equation Estimation Continued
Romit Banerjee
No ratings yet
Jurnal Fauzan 03-05-2021
Document17 pages
Jurnal Fauzan 03-05-2021
irma rohimah
No ratings yet
Quiz Statistik 1
Document4 pages
Quiz Statistik 1
M. FADHIL
No ratings yet
Final Analisis Cuantitativo - LFS
Document5 pages
Final Analisis Cuantitativo - LFS
ISABEL CRISTINA ECHEVERRY OSORIO
No ratings yet
Machine Learning Laboratory Record Book: 1 Find S Algorithm
Document22 pages
Machine Learning Laboratory Record Book: 1 Find S Algorithm
Newton Fernandis
No ratings yet
w16 - Multiple Linear Regression
Document33 pages
w16 - Multiple Linear Regression
Muhammad Areeb
No ratings yet
Manufacturing Planning and Control: MPC 6 Edition
Document39 pages
Manufacturing Planning and Control: MPC 6 Edition
Khaled Toffaha
No ratings yet
Statistics - Xii Chapter - 3 Correlation Test MCQ - Marks 30 35 Min
Document5 pages
Statistics - Xii Chapter - 3 Correlation Test MCQ - Marks 30 35 Min
api-232747878
No ratings yet
477 - Statistical Inference STS412
Document20 pages
477 - Statistical Inference STS412
Collins Musera
No ratings yet
Sesi 2 Akuntansi Manajemen: Perilaku Biaya Aktivitas
Document34 pages
Sesi 2 Akuntansi Manajemen: Perilaku Biaya Aktivitas
Dian Permata Sari
No ratings yet
REPORT Split Plot ANOVA (SPANOVA)
Document13 pages
REPORT Split Plot ANOVA (SPANOVA)
Mhild Gandawali
No ratings yet
Practical Exercise 8 - Solution
Document3 pages
Practical Exercise 8 - Solution
midori
No ratings yet
Peso Estatura Xi Yi Xiyi Xi
Document4 pages
Peso Estatura Xi Yi Xiyi Xi
anon-494447
No ratings yet
Recursive - and - Non - Recursive - Models - Lecture
Document14 pages
Recursive - and - Non - Recursive - Models - Lecture
Tram Anh
No ratings yet
Data Analysis Project 2 Due 5:00 PM Nov 21 1 Instructions
Document3 pages
Data Analysis Project 2 Due 5:00 PM Nov 21 1 Instructions
S
No ratings yet
AIML 2m
Document2 pages
AIML 2m
sonasinghmaurya0527
No ratings yet
Course Syllabus Introduction To SPSS
Document2 pages
Course Syllabus Introduction To SPSS
Anissa Negra Akrout
No ratings yet
Kepribadian Dan Efikasi Diri Dengan Motivasi Belajar Siswa Kelas V Sekolah Dasar
Document15 pages
Kepribadian Dan Efikasi Diri Dengan Motivasi Belajar Siswa Kelas V Sekolah Dasar
R Ronal
No ratings yet
Bstat 3322 Test Study Guide
Document8 pages
Bstat 3322 Test Study Guide
dsgd
No ratings yet
Mitrushina (2005) Rey O Meta Norms
Document11 pages
Mitrushina (2005) Rey O Meta Norms
Pavel Litvin
No ratings yet
Stock Price Trend Forecasting Using Supervised Learning Methods
Document17 pages
Stock Price Trend Forecasting Using Supervised Learning Methods
SHAIK AZAJUDDIN 18MIS7192
No ratings yet
Pengaruh Pengalaman Kerja Dan Kerja Sama Tim Terhadap Kinerja Karyawan Pada Restaurant International and Convention Hall Pematangsiantar
Document17 pages
Pengaruh Pengalaman Kerja Dan Kerja Sama Tim Terhadap Kinerja Karyawan Pada Restaurant International and Convention Hall Pematangsiantar
Muhammad Habib
No ratings yet
1 Choosing The Lag Length For The ADF Test
Document11 pages
1 Choosing The Lag Length For The ADF Test
Burak Cem Bıçkın
No ratings yet
Regression Modeling Strategies - E-Bok - Frank E Harrell ... - Bokus
Document2 pages
Regression Modeling Strategies - E-Bok - Frank E Harrell ... - Bokus
asdlasjdlka
No ratings yet
Prior-To-Class Quiz 10 - Statistics For Business-T123PWB-1
Document6 pages
Prior-To-Class Quiz 10 - Statistics For Business-T123PWB-1
Minhh Hằngg
No ratings yet
Mplus Users Guide v6
Document758 pages
Mplus Users Guide v6
Marco Bardus
No ratings yet
A New Modified Ridge-Type Estimator For The Beta Regression Model: Simulation and Application
Document23 pages
A New Modified Ridge-Type Estimator For The Beta Regression Model: Simulation and Application
أبن الرمادي
No ratings yet
STA302 Week09 Full
Document39 pages
STA302 Week09 Full
tianyuan gu
No ratings yet
Selvanathan-7e 17
Document92 pages
Selvanathan-7e 17
Linh Chi
No ratings yet