Professional Documents
Culture Documents
Gmath Module 7
Gmath Module 7
Our target learning outcomes for this module are a) use a variety of statistical tools
to process and manage numerical data; b) use the methods of linear regression and
correlation to predict the value of a variable given certain conditions; c) advocate the use
of statistical data in making important decisions
What is Statistics?
Statistics is the science of collecting, organizing, presenting, analyzing, and
interpreting data to assist in making more effective decisions.
A. Divisions of Statistics
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 112
B. Population and Sample
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 113
C. Sample Size Determination
1. Cochran (1977) presented a set of formulas that can be used to determine the
sample size.
Estimating a Population
Mean
Estimating a Population
Proportion
where n is the sample size, 𝒁𝜶 is the two-tailed 𝑧-score corresponding to the level of
𝟐
significance 𝛼, s is the known standard deviation, e is the margin of error, p is the
past estimate of the population proportion, and q=1-p
NOTE
a. The level of significance 𝛼 can take any of the standard values namely: 0.01,
0.05, and 0.10. Theoretically, the level of significance is the probability of the
type 1 error in hypothesis testing.
b. The following table presents the values of 𝒁𝜶 corresponding to the standard
𝟐
values of 𝛼
𝛼 𝒁𝜶
𝟐
0.01 2.575
0.05 1.96
0.10 1.645
c. The standard deviation, s, can be estimated from a pilot data set or the value
can be adopted from a previous study that considered the same or similar
population.
d. In the same manner as s, p can be the past estimate of the population
proportion or can be computed from a pilot data set.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 114
2. Yamane’s Formula (Simplified Formula for Proportions)
If the behavior of the population is not certain or the researcher is not familiar
with the population‟s behavior, Yaro Yamen’s formula (1980) or Taro Yamane’s
formula (1967) may be used. The formula is:
𝑁
𝑛=
1 + 𝑁𝑒 2
Example 1.41% of Jacksonville residents said that they had been in a hurricane. How many
adults should be surveyed to estimate the true proportion of adults who have been in a
hurricane, with a 95% confidence interval and 3% margin of error?
Solution
41% is a past estimate of population proportion; Unknown population size. Hence,
we use the following formula.
𝛼 =0.05
p=0.41
q=1-0.41=0.59
𝑍𝛼 = 1.96
2
2
𝒁𝜶 𝑝𝑞
𝟐
𝑛≥
𝑒2
𝟏. 𝟗𝟔 2 (0.41)(0.59)
𝑛≥
(0.03)2
𝑛 ≥ 1,032.54 ≈ 1.033
Example 2. From a population of 10,000 individuals of a certain town, what sample size is
needed in order to get an accurate result for a certain study using a margin of error of 3%.
Solution
𝑁
𝑛=
1 + 𝑁𝑒 2
10,000
𝑛=
1 + (10,000)(0.03)2
𝑛 = 1,000
Hence, the sample size needed in order to get an accurate result for a certain study
using a margin of error of 3% is 1000 individuals.
D. Sampling Techniques
Sampling is the process of selecting units, like people, organizations, or objects from
a population of interest in order to study and fairly generalize the results back to the
population from which the sample was taken. The two types of sampling are
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 115
1. Random Sampling Techniques
Members from the population are selected in such a way that each individual
member in the population has a chance of being selected.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 116
We use Proportional Allocation to draw a sample from each stratum to
reach the desired sample size:
Department 𝑁 𝑛
Business Administration (BA) 1,500 140
Management(M) 1,200 112
Finance(F) 850 80
Entrepreneurship(E) 200 19
Culinary Arts(CA) 150 14
Total 3,900
Solution:
1,500
𝑛𝐴𝐵 = 363 = 139.62 ≈ 140
3,900
1,200
𝑛𝑀 = 363 = 111.69 ≈ 112
3,900
850
𝑛𝐹 = 363 = 79.12 ≈ 80
3,900
200
𝑛𝐸 = 363 = 18.62 ≈ 19
3,900
150
𝑛𝐶𝐴 = 363 = 13.96 ≈ 14
3,900
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 117
Sample Size Round-off
Rule: When the calculated sample size is not a whole number, it should
be rounded up to the next higher whole number. Rounding up a
sample size calculation for conservativeness ensures that your sample
size will always be representative of the population.
d. Cluster Random Sampling. Divide the population into sections (or clusters),
then randomly select some of those clusters, and then choose all
members from those selected clusters.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 118
1. Modal Instance Sampling. When we do modal instance sampling, we
are sampling most frequent cases. The problem with modal instance
sampling is identifying the “modal” case.
2. Expert Sampling. It involves the assembling of a sample of persons with
known or demonstrable experience and expertise in some area.
3. Quota Sampling. Selecting items non randomly according to some
fixed quota.
4. Snowball Sampling. Begin by identifying someone who meets the
criteria for inclusion in your study. You ask them to recommend others
who they may know who also meet the criteria.
E. Statistical Data
Statistical data are the raw materials of research or any statistical investigations
usually obtained by counting or measuring items. Data are categorized:
A. according to description:
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 119
Example 7.
The numbers of eggs that hens lay are discrete data
because they represent counts.
The number of students of a given ethnic group in a
class.
The number of books on a shelf.
B. according to source:
2. Ordinal Scale. It is the simplest scale which orders people, objects, or events
along some continuum. Values of variables measured at the ordinal level
offer at least a rough indication of quantitative differences; they can also be
categorized and ranked, numbers are used only to place objects in order.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 120
Example 10.
year level
job position
3. Interval Scale. It is the scale on which zero is arbitrary. It does not reflect the
absence of an attribute. Data gathered from variables measured at an
interval scale can be categorized, ranked, and can be added or
subtracted.
Example 11
IQ Scores
temperature
4. Ratio Scale. It possesses all of the characteristics of interval scales but has a
true zero point. Thus, a case where 0 is on a scale indicates the total absence
of the property being measured. For values at this level, differences and ratios
are both meaningful.
Example 12.
Distances (in km) traveled by cars (0 km represents no distance
traveled, and 400 km is twice as far as 200 km.)
Prices of books(P0.00 does represent no cost, and a P300.00
book does cost twice as much as a P150.00 book.)
height
weight
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 121
Negative:
1. There‟s a strong possibility of non-response, especially when
questionnaires are mailed.
2. Questions not easily understood may not be answered.
D. Experiment Method - this method is used when the objective is to determine the
cause-and-effect relationship of certain phenomena under controlled conditions. It
is usually used by scientific researchers.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 122
METHODS OF DATA PRESENTATION
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 123
2. Frequency Polygon. One type of statistical graph involves the class midpoints.
A frequency polygon uses line segments connected to points located directly
above class midpoint values. A variation of the basic frequency polygon is
the relative frequency polygon, which uses relative frequencies (proportions
or percentages) for the vertical scale. When trying to compare two data sets,
it is often very helpful to graph two relative frequency polygons on the same
axes.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 124
4. Pie chart is a graph that depicts qualitative data as slices of a circle, in which
the size of each slice is proportional to the frequency count for the category.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 125
MEASURES OF CENTRAL TENDENCY AND OTHER LOCATIONS
These are numerical values that tend to locate in some sense the middle of a set of
data when arranged in increasing or decreasing order. The term average is often
associated with these measures: mean, median, mode, midrange.
1. Mean 𝝁 or x
Example 14: Suppose you chose ten people who entered the campus
and whose ages are as follows: 15 25 18 20 25 18 18 20 20 25
What is the mean age of this sample?
Solution:
Example 15: A student was taking 5 subjects last semester. Find his average
if his final grades were as follows:
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 126
Solution:
Characteristics of Mean
1. Interval and ratio measurements
2. All the scores or measurements are considered in the computation of the
mean.
3. Very high or very low scores or measurements affect the mean.
2. Mode 𝝁 or 𝒙
It is the value in the distribution with the highest frequency. It locates the point
where the observation values occur with the greatest density. It can be used for
quantitative as well as qualitative data.
A data set can have one mode, more than one mode, or no mode.
When two data values occur with the same greatest frequency, each
one is a mode and the data set is bimodal.
When more than two data values occur with the same greatest
frequency, each is a mode and the data set is said to be multimodal.
When no data value is repeated, we say that there is no mode.
Example 16
Observe the given ungrouped data below:
a. 1,2,3,4,5,6,7 (No Mode)
b. 15.2, 12.3, 4.6, 12.3, 6.5, 12.3, 5.5 (𝒙=12.3)
c. 15,12,4,15,4,6,5 (𝒙=15 and 𝒙= 4)
d. 3,4,5,1,3,2,4,5,7,10 (𝒙=3, 𝒙=4, and 𝒙= 5)
Characteristics of Mode
1. It is very easy to compute but is seldom used because it is very unstable.
2. When a rough or quick estimate of a central value is wanted.
3. It is most appropriate for nominal scale as a measure of popularity.
3. Median 𝝁 or 𝒙
It is a value that divides the distribution into two equal parts (after arranging
the values in ascending or descending order). As such, it is a positional average. The
median is defined by
Example 17: During the first marking period, Nicole's math quiz scores were
90, 92, 93, 88, 95, 88, 97, 87, and 98. What was the median quiz score?
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 127
Solution:Ordering the data from least to greatest, we get:
Since n =9 (odd),
The median quiz score is 92. (Four quiz scores were higher than 92 and four
were lower.)
Example 18: The ages of 10 college students are listed below. Find the median.
18, 24, 20, 35, 19, 23, 26, 23, 19, 20
.
Solution: Ordering the data from least to greatest, we get:
Characteristics of Median
1. Ordinal or ranked measurements
2. Only the middle scores or measurements are considered in the computation
of the median.
3. Very high or very low scores do not affect the median.
4. When there are extreme cases, thus the distribution is markedly skewed.
5. When we desire to know whether the cases fall within the upper halves or the
lower halves of a distribution
1. Percentiles arevalues that divide a set of observations into 100 equal parts. These
values, denoted by 𝑃1 , 𝑃2 , … , 𝑃99 , are such that 1% of the data falls below 𝑃1 , 2% falls
below 𝑃2 , …, and 99% falls below 𝑃99 .
The 𝑘th percentile, 𝑃𝑘 (𝑘 = 1, 2, 3, … ,99), can be determined using the
following procedure:
a. Arrange the data in increasing order and compute the value of the index
𝑘
𝑖= 𝑛, where 𝑛 is the number of observations.
100
𝑥 𝑖 +𝑥 𝑖+1
b. If 𝑖 is an integer, 𝑃𝑘 = . If 𝑖 is not an integer, use the rounded up value for 𝑖
2
and take 𝑃𝑘 = 𝑥𝑖 . Note that 𝑥𝑖 here pertains to the score in the data set.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 128
2. Deciles are values that divide a set of observations into 10 equal parts. These
values, denoted by 𝐷1 , 𝐷2 , … , 𝐷9 , are such that 10% of the data falls below 𝐷1 , 20%
falls below 𝐷2 , …, and 90% falls below 𝐷9 .
The 𝑘th decile, 𝐷𝑘 (𝑘 = 1, 2, … ,9), can be determined using the following
procedure:
a. Arrange the data in increasing order and compute the value of the index
𝑘
𝑖= 𝑛, where 𝑛 is the number of observations.
10
𝑥 𝑖 +𝑥 𝑖+1
b. If 𝑖 is an integer, 𝐷𝑘 = . If 𝑖 is not an integer, use the rounded up value for 𝑖
2
and take 𝐷𝑘 = 𝑥𝑖 .
3. Quartiles. are values that divide a set of observations into 4 equal parts. These
values, denoted by 𝑄1 , 𝑄2 , and 𝑄3 , are such that 25% of the data falls below 𝑄1 , 50%
falls below 𝑄2 and 75% falls below 𝑄3 .
The 𝑘th quartile, 𝑄𝑘 (𝑘 = 1, 2, 3), can be determined using the following
procedure:
a. Arrange the data in increasing order and compute the value of the index
𝑘
𝑖= 𝑛, where 𝑛 is the number of observations.
4
𝑥 𝑖 +𝑥 𝑖+1
b. If 𝑖 is an integer, 𝑄𝑘 = . If 𝑖 is not an integer, use the rounded up value for 𝑖
2
and take 𝑄𝑘 = 𝑥𝑖 .
Example 19: As part of a quality-control study aimed at improving a production line, the
weights (in ounces) of 50 bars of soap are measured. The results are as follows, sorted from
smallest to largest. Find, first quartile, the 9th decile, and the 43rd percentile.
11.6 12.6 12.7 12.8 13.1 13.3 13.6 13.7 13.8 14.1
14.3 14.3 14.6 14.8 15.1 15.2 15.6 15.6 15.7 15.8
15.8 15.9 15.9 16.1 16.2 16.2 16.3 16.4 16.5 16.5
16.5 16.6 17.0 17.1 17.3 17.3 17.4 17.4 17.4 17.6
17.7 18.1 18.3 18.3 18.3 18.5 18.5 18.8 19.2 20.3
a. 43rd Percentile
b. 9th Decile
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 129
(𝑥45+𝑥46)
Since i is an integer, 𝐷9 = . From the data set, 𝑥45 = 18.3 and 𝑥46 = 18.5
2
18.3+18.5
Thus, we have , 𝐷9 = = 18.4
2
Hence, 90% of the values lie below 18.4.
c. First quartile
It indicates the extent to which individual items in a series are scattered about the
average. It is used to determine the extent of the scatter so that steps may be taken to
control the existing variation. General Classifications of Measures of Variation are
• Measures of Absolute Dispersion
• Measures of Relative Dispersion
2. Variance and the Standard Deviation are the most common and useful
measures of variability. These two measures provide information about how
the data vary about the mean.
The variance 𝜎 2 or 𝑠 2 is a measure of variation which considers the
position of each observation relative to the mean of the set.
2
x
i
2
or
2
N xi
2
x i
2
N N2
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 130
Given a finite sample 𝑥1 , 𝑥2 , ..., 𝑥𝑛 ,, the sample variance is given
by
x x n xi x
2 2 2
i i
s2
or
2
n1 nn 1
x
2
i
N
Sample Standard Deviation
x
2
i x
s
n1
where: population standard deviation
xi i th observation
s sample standard deviation
population mean
x sample mean
N population size
n sample size
If the data are clustered around the mean, then the variance and the
standard deviation will be small. If, however, the data are widely scattered about
the mean, the variance and the standard deviation will be somewhat large.
15, 28 25 48 22 43 49 34 22 33 27 25 22 20 39
Find the Range, Standard Deviation, and Variance for the above data.
Solution: For this data set, 𝑛 = 15, 𝑥 = 452, and 𝑥 2 = 15160. These can be
computed using a scientific calculator.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 131
For the Standard deviation:
𝑛 𝑥𝑖 2 − 𝑥𝑖 2
𝑠=
𝑛(𝑛 − 1)
15 15160 − 452 2
𝑠=
15(15 − 1)
𝒔 = 𝟏𝟎. 𝟒𝟖𝟕𝟏𝟖
For the variance:
15 15160 − 452 2
𝑠2 =
15 15 − 1
𝟐
𝒔 = 𝟏𝟎𝟗. 𝟗𝟖𝟎𝟗𝟓
For Range:
The range is 34. The high score is 49; the low score is 15. 49 − 15 = 34.
𝑅𝑎𝑛𝑔𝑒 = 34
Population:
𝜎
𝐶𝑉 = × 100%
𝜇
Sample:
𝑠
𝐶𝑉 = × 100%
𝑥
Example 21: If we have a standard deviation of 1.5 and a mean of 5, the ratio of
the standard deviation to the mean is 0.3. In other words, the standard deviation
is 30% of the mean. When comparing two data sets, the general rule of thumb
you should follow is:
The higher the coefficient of variation, the higher the variability of the data
set This means that, when comparing two or more data sets, the one with the
highest coefficient of variability can be said to have the highest variation.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 132
CORRELATION AND REGRESSION
A. Correlation
n xy x y
r
n x 2
x 2
n y 2 y 2
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 133
Perfect positive Perfect negative
correlation correlation
Example 22. Given the following data on the number of hours of study (x) for an
examination and the scores (y) received by a random sample of 10 students,
compute for the Pearson correlation coefficient.
Student 𝒙 𝒚 𝒙𝒚 𝒚𝟐 𝒙𝟐
1 8 56 448 3136 64
2 5 44 220 1936 25
6 5 54 270 2916 25
9 2 33 66 1089 4
10 8 65 520 4225 64
n xy x y
r
n x 2
x 2
n y 2 y 2
r 0.9625
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 134
There is a very strong positive linear relationship between the number of hours
of study (𝑥) for an examination and the scores (𝑦) received by a random sample of
10 students.
Student 1 2 3 4 5 6 7 8 9 10
Math 5 8 10 12 12 14 15 16 18 20
Score
Stat 2 7 8 9 10 12 14 10 16 12
Score
Student 1 2 3 4 5 6 7 8 9 10 Total
Math Score 5 8 10 12 12 14 15 16 18 20 130
(𝒙)
Stat Score 2 7 8 9 10 12 14 10 16 12 100
(𝒚)
𝒙𝒚 10 56 80 108 120 168 210 160 288 240 1440
r 0.8692
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 135
There is a very strong positive linear relationship between math and stat scores.
B. Regression
The slope is
n xy x y
b
n x 2 x 2
The y-intercept
a
y b x
n n
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 136
It is interpreted as the proportion of the variance in the dependent variable that is
predictable from the independent variable.
The fraction of data points falls on the regression line
𝑹𝟐 =1 (all points lie exactly on a straight line with no points scattered about the line)
means that the dependent variable is perfectly predicted without error using the
independent variable X
𝑹𝟐 =0 means that the dependent variable cannot be predicted using the
independent variable X.
An 𝑹𝟐 between 0 and 1 indicates the extent to which the dependent variable is
predictable.
An 𝑹𝟐 of 0.10 means that 10 percent of the variance in Y is predictable from X;
an 𝑹𝟐 of 0.20 means that 20 percent is predictable; and so on.
SSR
R2 100
SSY
SPXY x i y i
x y i i
SSY y i
2 y i
2
SSX xi
2
x i
2
Example 23: The paired data below consist of the costs of advertising (in thousands
of pesos) and the number of products sold (in thousand units).
Cost # Products xy x 2
y 2
(x) Sold
(y)
9,000.00 85,000.00 765,000,000.00 81,000,000.00 7,225,000,000.00
2,000.00 52,000.00 104,000,000.00 4,000,000.00 2,704,000,000.00
3,000.00 55,000.00 165,000,000.00 9,000,000.00 3,025,000,000.00
4,000.00 68,000.00 272,000,000.00 16,000,000.00 4,624,000,000.00
2,000.00 67,000.00 134,000,000.00 4,000,000.00 4,489,000,000.00
5,000.00 86,000.00 430,000,000.00 25,000,000.00 7,396,000,000.00
9,000.00 83,000.00 747,000,000.00 81,000,000.00 6,889,000,000.00
10,000.00 73,000.00 730,000,000.00 100,000,000.00 5,329,000,000.00
Total 44,000.00 569,000.00 3,347,000,000.00 320,000,000.00 41,681,000,000.00
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 137
2. Find the equation of the regression line to predict weekly sales from
advertising expenditures.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 138
3. Estimate the number of products sold when advertising costs is P4,500.
𝑦=55788.25+2.7885𝑥
𝑦=𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑝𝑟𝑜𝑑𝑢𝑐𝑡𝑠 s𝑜𝑙𝑑=55788.25+2.7885(4,500)
𝑦=𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑝𝑟𝑜𝑑𝑢𝑐𝑡𝑠 s𝑜𝑙𝑑=68,336.50 units
Therefore, 50.08 % of the variance in the number of products sold is predictable from
the cost of advertising.
Write your solutions (if any) and answers on a clean sheet of paper. Submit the image of
your HANDWRITTEN SOLUTIONS as a single pdf file in the submission bin for this activity in the
Classroom. You may use image scanning apps on your phone (CamScanner or Tap
Scanner) to save several images into 1 pdf file, or place your images in a document and
save as a pdf file.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 139
___________________ 2. The scores of students in a statistics quiz
___________________ 3. The main source of income
___________________ 4. The birth order of children in the family
___________________ 5. Age of students availing a discount
___________________ 6. Weights of a sample of bags of raw materials for the
production of a certain product, measured in grams.
___________________ 7. The natural eye color of a sample of 100 children.
___________________ 8. The economic status of a sample of families in a certain
area.
___________________ 9. The final grade of graduate students taking up Statistics.
___________________ 10. The school in which a graduate student is enrolled in.
D. In each of the following situations, identify the population, each variable, and
determine if the variable is qualitative or quantitative.
1. A quality control worker with Sweet-Tooth Candy weighs every 100 candy th
E. To assign workers to two stores, the owner has the workers count off by two to divide
them into teams. Is this (team) a qualitative or quantitative variable?
F. A school is studying its students‟ test scores by grade. Explain how the characteristic
„grade‟ could be considered either a categorical or a numerical variable?
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 140
G. Which of the following situations will result in probability or non-probability sampling?
1. Population: All residents of a big city
Sampling technique: For one week, researchers stop every fourth person who
passes by a busy downtown street corner.
2. Population: All students in a large high school
Sampling technique: selecting the first 50 students reporting to school on a
Wednesday morning.
3. Population: All the 72 guests at a birthday party.
Sampling technique: The name of each person is written on a slip of paper
then all are placed in a box, mixed, then drawn one after the other for the
available ten door prizes.
4. Population: Business owners with less than 15 employees
Sampling technique: Get information from the DTI (business permits section),
then select a sample size of about 30% from each of the included 12
barangays
5. A market analyst would like to compare the durability, in terms of mean time
before wear, of two leading brands of car tires.
I. The numbers of incorrect answers on a true or false competency test for a random
sample of 15 students were recorded as follows: 2, 1, 3, 0, 1, 3, 6, 0, 3, 3, 5, 2, 1, 4, and 2.
Find a. mean, b. median, c. mode. Show your solution.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 141
J. A student was taking five subjects in college during the first semester. Find his average
grade if his final grades were as follows. Show your solution.
Units 3 5 3 2 4
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 142