Factor Analysis
Factor analysis, a multivariate statistical technique, is used primarily to examine the structure
of data by explaining the correlations among variables. Factor analysis summarizes data into
a few dimensions by condensing a large number of variables into a smaller set of latent
variables or factors. Factor analysis is commonly used in the social sciences, market research,
and other industries that use large data sets.
Consider a credit card company that creates a survey to evaluate customer satisfaction. The
survey is designed to answer questions in three categories: timeliness of service, accuracy of
the service, and courteousness of phone operators. The company can use factor analysis to
ensure that the survey items address these three areas before sending the survey to a large
number of customers. If the survey does not adequately measure the three factors, then the
company should re-evaluate the questions and retest the survey before sending it to
customers.
Example
Five factors describe these data perfectly, but the goal is to reduce the number of factors
needed to explain the variability in the data. The proportion of variability explained by the
last two factors is minimal (0.019 and 0.002, respectively), so they can be dropped as
unimportant. The first two factors together represent 86% of the variability, while three
factors explain 98% of the variability. The question is whether to use two or three factors.
The next step might be to perform separate factor analyses with two and three factors and
examine the communalities to see how individual variables are represented. If there were one
or more variables not well represented by the more parsimonious two-factor model, you
might select a model with three or more factors.
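The proportions discussed in this example come from the eigenvalues of the correlation matrix. The following is a minimal sketch in Python with NumPy, assuming principal components extraction. The data are simulated purely for illustration (they are not the data behind the figures quoted above), and the function name variance_proportions is hypothetical.

```python
import numpy as np

def variance_proportions(R):
    """Eigenvalues of a correlation matrix R (largest first), plus the
    proportion and cumulative proportion of total variance for each."""
    eigvals = np.linalg.eigvalsh(R)[::-1]  # eigvalsh returns ascending order
    prop = eigvals / eigvals.sum()         # the eigenvalues of R sum to p
    return eigvals, prop, np.cumsum(prop)

# Simulated data (an assumption for illustration): 5 observed variables
# driven by 2 latent factors plus noise.
rng = np.random.default_rng(0)
F = rng.standard_normal((500, 2))
L = np.array([[0.9, 0.0], [0.8, 0.2], [0.7, 0.3], [0.1, 0.9], [0.2, 0.8]])
X = F @ L.T + 0.4 * rng.standard_normal((500, 5))

R = np.corrcoef(X, rowvar=False)
eigvals, prop, cum = variance_proportions(R)
for k, (ev, pr, c) in enumerate(zip(eigvals, prop, cum), start=1):
    print(f"factor {k}: eigenvalue={ev:.3f}  proportion={pr:.3f}  cumulative={c:.3f}")
```

Reading down the cumulative column shows where additional factors stop adding much, which is the same judgment the example makes between two and three factors.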
Factor analysis model
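X = μ + L F + ε,
where X is the p x 1 vector of measurements, μ is the p x 1 vector of means, L is the p x m
matrix of loadings, F is the m x 1 vector of common factors, and ε is the p x 1 vector of errors.
Under the factor analysis model, the p x p covariance matrix of the data, X, is:
Cov(X) = L L' + Ψ,
where Ψ is the p x p diagonal matrix of specific variances.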
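The covariance identity can be checked numerically. The sketch below simulates data from the model and compares the sample covariance with the implied one. It assumes the usual orthogonal factor model conditions (factors with mean zero and identity covariance, errors uncorrelated with the factors and with each other); the loadings, means, and specific variances are made-up values, not taken from any real data set.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000  # large sample so the sample covariance is close to its target

# Hypothetical parameters: p = 5 variables, m = 2 factors.
mu = np.array([10.0, 20.0, 30.0, 40.0, 50.0])
L = np.array([[0.9, 0.0], [0.8, 0.2], [0.7, 0.3], [0.1, 0.9], [0.2, 0.8]])
psi = np.array([0.19, 0.32, 0.42, 0.18, 0.32])  # diagonal of the error covariance

F = rng.standard_normal((n, 2))                   # factors: mean 0, covariance I
eps = rng.standard_normal((n, 5)) * np.sqrt(psi)  # independent errors
X = mu + F @ L.T + eps                            # one observation per row

implied = L @ L.T + np.diag(psi)                  # covariance implied by the model
sample = np.cov(X, rowvar=False)
max_abs_diff = np.abs(sample - implied).max()
print(f"largest absolute difference: {max_abs_diff:.4f}")
```

The difference shrinks as n grows, since the sample covariance converges to the implied matrix under these assumptions.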
Key Points
The goal of factor analysis is to find a small number of factors, or unobservable variables,
that explain most of the data variability and yet make contextual sense. You need to decide
how many factors to use, and find loadings that make the most sense for your data.
Number of factors
The choice of the number of factors is often based upon the proportion of variance explained
by the factors, subject matter knowledge, and reasonableness of the solution. Initially, try
using the principal components extraction method without specifying the number of
components. Examine the proportion of variability explained by different factors and narrow
down your choice of how many factors to use. A scree plot may be useful here in visually
assessing the importance of factors. Once you have narrowed this choice, examine the fits of
the different factor analyses. Communality values, the proportion of variability of each
variable explained by the factors, may be especially useful in comparing fits. You may decide
to add a factor if it contributes to the fit of certain variables.
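Communalities for competing fits can be compared directly. The sketch below assumes principal components extraction, where the loadings are the leading eigenvectors of the correlation matrix scaled by the square roots of their eigenvalues. The data are simulated and the helper names pc_loadings and communalities are hypothetical.

```python
import numpy as np

def pc_loadings(R, m):
    """Loadings for the first m factors under principal components extraction:
    each eigenvector scaled by the square root of its eigenvalue."""
    eigvals, eigvecs = np.linalg.eigh(R)   # eigenvalues in ascending order
    top = np.argsort(eigvals)[::-1][:m]    # indices of the m largest
    return eigvecs[:, top] * np.sqrt(eigvals[top])

def communalities(loadings):
    """Row sums of squared loadings: the share of each variable's variance
    explained by the retained factors."""
    return (loadings ** 2).sum(axis=1)

# Simulated data (an assumption for illustration): 6 variables, where the
# fifth depends mostly on a third latent factor.
rng = np.random.default_rng(1)
F = rng.standard_normal((500, 3))
L = np.array([[0.9, 0.0, 0.0], [0.8, 0.1, 0.0], [0.1, 0.9, 0.0],
              [0.0, 0.8, 0.1], [0.2, 0.2, 0.8], [0.5, 0.5, 0.1]])
X = F @ L.T + 0.4 * rng.standard_normal((500, 6))
R = np.corrcoef(X, rowvar=False)

h2 = communalities(pc_loadings(R, 2))  # two-factor fit
h3 = communalities(pc_loadings(R, 3))  # three-factor fit
for j, (a, b) in enumerate(zip(h2, h3), start=1):
    print(f"variable {j}: communality m=2 {a:.3f}   m=3 {b:.3f}")
```

A variable whose communality rises sharply when a factor is added is one that the smaller model represents poorly.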
Rotation
Once you have selected the number of factors, you will probably want to try different
rotations. A similar result from different methods can lend credence to the solution you have
selected. At this point you may wish to interpret the factors using your knowledge of the data.
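As one concrete case, the sketch below implements varimax, a widely used orthogonal rotation, with the common SVD-based iteration. The text does not name a specific rotation, so this choice is an assumption; the loading matrix and the function names varimax and varimax_criterion are hypothetical.

```python
import numpy as np

def varimax_criterion(loadings):
    """Sum over factors of the variance of the squared loadings."""
    return (loadings ** 2).var(axis=0).sum()

def varimax(loadings, max_iter=100, tol=1e-8):
    """Orthogonal varimax rotation. Returns (rotated loadings, rotation matrix)."""
    p, m = loadings.shape
    T = np.eye(m)
    d = 0.0
    for _ in range(max_iter):
        Lr = loadings @ T
        # Gradient-like target of the varimax criterion, projected back onto
        # the set of orthogonal matrices with an SVD.
        target = loadings.T @ (Lr ** 3 - Lr @ np.diag((Lr ** 2).sum(axis=0)) / p)
        u, s, vt = np.linalg.svd(target)
        T = u @ vt
        d_new = s.sum()
        if d != 0 and d_new / d < 1 + tol:  # stop when the gain is negligible
            break
        d = d_new
    return loadings @ T, T

# Made-up loadings: a clean two-factor pattern turned by 30 degrees so that
# every variable loads on both factors before rotation.
simple = np.array([[0.9, 0.0], [0.8, 0.1], [0.85, 0.0], [0.0, 0.9], [0.1, 0.8]])
theta = np.deg2rad(30)
turn = np.array([[np.cos(theta), -np.sin(theta)],
                 [np.sin(theta),  np.cos(theta)]])
unrotated = simple @ turn

rotated, T = varimax(unrotated)
print(np.round(rotated, 3))
```

Because the rotation is orthogonal, the communalities and the product L L' are unchanged; only the way variance is distributed across factors differs, which is why rotation affects interpretation but not fit.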