Professional Documents
Culture Documents
SMA 2437 L1-2 Introduction
SMA 2437 L1-2 Introduction
Introduction • Test for mean vector: one-sample and Hotteling T tests based on
tkaranjah@2023 tkaranjah@2023
Introduction Introduction
Ordinarily the variables are measured simultaneously We seek to express what is going on in terms of a reduced
on each sampling unit. Typically, these variables are set of dimensions.
correlated. If this were not so, there would be little Such multivariate techniques are exploratory; they
use for many of the techniques of multivariate essentially generate hypotheses rather than test them.
analysis. On the other hand, if our goal is a formal hypothesis test,
we need a technique that will
(1) allow several variables to be tested and still preserve
We need to untangle the overlapping information
the significance level and
provided by correlated variables and peer beneath
the surface to see the underlying structure. Thus the (2) do this for any intercorrelation structure of the
goal of many multivariate approaches is variables.
simplification. Many such tests are available.
tkaranjah@2023 tkaranjah@2023
1
1/11/2023
Introduction Introduction
multivariate analysis is concerned generally with two areas, These linear functions may also be useful as a follow-up
descriptive and inferential statistics. In the descriptive to inferential procedures.
realm, we often obtain optimal linear combinations of
variables. When we have a statistically significant test result that
The optimality criterion varies from one technique to compares several groups, for example, we can find the
another, depending on the goal in each case. Although linear combination (or combinations) of variables that led
linear combinations may seem too simple to reveal the to rejection of the hypothesis.
underlying structure, we use them for two obvious reasons: Then the contribution of each variable to these linear
1. they have mathematical tractability (linear combinations is of interest.
approximations are used throughout all science for the
same reason) and
2. they often perform well in practice.
tkaranjah@2023 tkaranjah@2023
tkaranjah@2023 tkaranjah@2023
Mean And Variance Of A Univariate Random Variable Mean And Variance Of A Univariate Random Variable
The density function f (y) indicates the relative frequency If f (y) is unknown, the population mean μ will ordinarily
of occurrence of the random variable y. remain unknown unless it has been established from
Thus, if f (𝒚𝟏 ) > f (𝒚𝟐 ), then points in the neighborhood of extensive past experience with a stable population.
𝒚𝟏 are more likely to occur than points in the If a large random sample from the population represented
neighborhood of 𝒚𝟐 .
by f (y) is available, it is highly probable that the mean of
The population mean of a random variable y is defined the sample is close to μ.
(informally) as the mean of all possible values of y and is
denoted by μ.
The mean is also referred to as the expected value of y,
or E(y). If the density f (y) is known, the mean can
sometimes be found using methods of calculus.
tkaranjah@2023 tkaranjah@2023
2
1/11/2023
Mean And Variance Of A Univariate Random Variable Mean And Variance Of A Univariate Random Variable
The sample mean of a random sample of n observations However, is considered a good estimator for μ because
𝒚𝟏 , 𝒚𝟐 , . . . , 𝒚𝒏 is given by the ordinary arithmetic average E( ) = μ and var( ) = 𝜎 2 /n, where 𝜎 2 is the variance
of y. In other words, is an unbiased estimator of μ and
has a smaller variance than a single observation y.
Generally, will never be equal to μ; by this we If every y in the population is multiplied by a constant a,
the expected value is also multiplied by a:
mean that the probability is zero that a sample
will ever arise in which is exactly equal to μ.
tkaranjah@2023 tkaranjah@2023
Mean And Variance Of A Univariate Random Variable Mean And Variance Of A Univariate Random Variable
The sample mean has a similar property The sample variance is defined as
tkaranjah@2023 tkaranjah@2023
Mean And Variance Of A Univariate Random Variable Covariance And Correlation Of Bivariate Random Variables
The sample variance is generally never equal to the
population variance (the probability of such an Covariance
occurrence is zero), but it is an unbiased estimator, that If two variables x and y are measured on each research
is,
unit (object or subject), we have a bivariate random
variable (x, y).
If each y is multiplied by a constant a, then population Often x and y will tend to co-vary; if one is above its
variance is given as mean, the other is more likely to be above its mean,
and vice versa.
Similarly, if
For example, height and weight were observed for a
sample of 20 college-age males. The data are given in
then the sample variance of z is given by
Table 1.
tkaranjah@2023 tkaranjah@2023
3
1/11/2023
Covariance And Correlation Of Bivariate Random Variables Covariance And Correlation Of Bivariate Random Variables
Table 1. Height and Weight for a Sample of 20 College-age Males The values of height x and weight y from Table 1
are both plotted in the vertical direction in Figure1.
The tendency for x and y to stay on the same side
of the mean is clear in Figure 1.
tkaranjah@2023 tkaranjah@2023
Covariance And Correlation Of Bivariate Random Variables Covariance And Correlation Of Bivariate Random Variables
Figure 1. Two variables with a tendency to covary.
tkaranjah@2023 tkaranjah@2023
Covariance And Correlation Of Bivariate Random Variables Covariance And Correlation Of Bivariate Random Variables
To obtain the sample covariance for the height Now we have
and weight data in Table 1, we first calculate
4
1/11/2023
Covariance And Correlation Of Bivariate Random Variables Covariance And Correlation Of Bivariate Random Variables
tkaranjah@2023 tkaranjah@2023
Covariance And Correlation Of Bivariate Random Variables Covariance And Correlation Of Bivariate Random Variables
tkaranjah@2023 tkaranjah@2023
Covariance And Correlation Of Bivariate Random Variables Covariance And Correlation Of Bivariate Random Variables
Thus we have
tkaranjah@2023 tkaranjah@2023
5
1/11/2023
tkaranjah@2023 tkaranjah@2023
All n observation vectors 𝒚𝟏 , 𝒚𝟐 , . . . , 𝒚𝒏 can be Since n is usually greater than p, the data can be
transposed to row vectors and listed in the data more conveniently tabulated by entering the
matrix Y as follows: observation vectors as rows rather than columns.
, which can be
transposes to obtain
tkaranjah@2023 tkaranjah@2023
6
1/11/2023
In populations, the mean of y over all possible It can be shown that the expected value of each
values in the population is called the population
mean vector or expected value of y. It is defined
as a vector of expected values of each variable,
tkaranjah@2023 tkaranjah@2023
To find the mean vector, we simply calculate the The sample covariance matrix
average of each column and obtain is the matrix of sample variances and covariances
of the p variables:
tkaranjah@2023 tkaranjah@2023
7
1/11/2023
tkaranjah@2023 tkaranjah@2023
If y is a random vector taking on any possible The population covariance matrix can also be
value in a multivariate population, the population found as
covariance matrix is defined as
8
1/11/2023
tkaranjah@2023 tkaranjah@2023
The sample correlation matrix is analogous to the The second row, for example, contains the
covariance matrix with correlations in place of correlation of y2 with each of the y’s (including
covariance's: the correlation of y2 with itself, which is 1).
tkaranjah@2023 tkaranjah@2023
The correlation matrix can be obtained from the Then by use of matrix approach,
covariance matrix, and vice versa.
Define;
tkaranjah@2023 tkaranjah@2023
9
1/11/2023
where
Note that .865 > .493 > .327, which is a different order
tkaranjah@2023
than that of the covariances in S
tkaranjah@2023
Mean Vectors And Covariance Matrices For Subsets Of Variables Mean Vectors And Covariance Matrices For Subsets Of Variables
Mean Vectors And Covariance Matrices For Subsets Of Variables Mean Vectors And Covariance Matrices For Subsets Of Variables
For the sample of n observation vectors, the mean To illustrate, let p = 2 and q = 3. Then
vector and covariance matrix have the form
tkaranjah@2023 tkaranjah@2023
10
1/11/2023
Mean Vectors And Covariance Matrices For Subsets Of Variables Mean Vectors And Covariance Matrices For Subsets Of Variables
Example 3.
The following data In Table 3 contain measure of five
variables in a comparison of normal patients and diabetics
patients in an experimental set up. We give partial data for
normal patients only. The three variables of major interest
were
tkaranjah@2023 tkaranjah@2023
Mean Vectors And Covariance Matrices For Subsets Of Variables Mean Vectors And Covariance Matrices For Subsets Of Variables
tkaranjah@2023 tkaranjah@2023
Mean Vectors And Covariance Matrices For Subsets Of Variables Mean Vectors And Covariance Matrices For Subsets Of Variables
tkaranjah@2023 tkaranjah@2023
11
1/11/2023
Mean Vectors And Covariance Matrices For Subsets Of Variables Mean Vectors And Covariance Matrices For Subsets Of Variables
Mean Vectors And Covariance Matrices For Subsets Of Variables Mean Vectors And Covariance Matrices For Subsets Of Variables
tkaranjah@2023 tkaranjah@2023
12