Professional Documents
Culture Documents
Statistics A Review
Statistics A Review
Statistics A Review
A REVIEW
Learning Outcomes
1. Use a variety of statistical tools to process and manage
numerical data.
Levels of Measurement
Definition
Nominal Level - characterized by data that consist of names, labels, or
categories only. The data cannot be arranged in an ordering scheme (such
as low to high).
Example:
Survey responses as yes, no, undecided
Ordinal Level - involves data that may be arranged in some order, but
differences between data values either cannot be determined or are
meaningless.
Example:
Course grades A, B+, B, C+, C, D, F or AF
Interval Level - like the ordinal level, with the additional property that
the difference between any two data values is meaningful. However, there
is no natural zero starting point (where none of the quantity is present).
Example:
IQ, Temperature
Ratio Level - possesses the characteristic of interval level, and there exist
a true zero. Differences and ratios are meaningful.
Example:
Height, Salary, Time
A B B AB O O O
B AB B B B O A
O A O O O AB AB
A O B A
Categorical Frequency Distribution
A B B AB O O O
B AB B B B O A
O A O O O AB AB
A O B A
B IIIII-II 7 28%
O IIIII-IIII 9 36%
AB IIII 4 16%
33 – 41 32.5 – 41.5 II 2
51 - 59 50.0 – 59.5 II 2
60 - 68 59.5 – 68.5 I 1
Total 36 36
Data Presentation
Histogram - a bar graph in which the horizontal scale represents classes of data values
and the vertical scale represents frequencies. The heights of the bars correspond to the
frequency values, and the bars are drawn adjacent to each other (without gaps).
Sample Mean ()
1 + 2 + 3+ … + n
= =
Population Mean (µ)
1+ 2 + 3+ … + N
µ= =
12 + 10 + 15 + 8 + 5 68
= = = 11.333…
6 6
if is odd if is even
2
= middle score =
Example:
1. Six students borrowed these numbers of books in a library:
1, 2, 3, 4, 6, 7 (3 and 4 are the middle scores)
= = 3.5
2
2. Five students borrowed these numbers of books in a library:
2, 2, 3, 3, 3
=3
Mode (Mo)
- most frequently occurring number in a data set
Examples:
1. 2 5 8 3 2 2 3 1 (Mode is 2)
2. 1 1 3 5 7 8 3 4 (Bimodal, Mode is 1 and 3)
3. 2 4 6 8 9 3 1 5 (No mode)
Midrange (MR)
- the value midway between the highest and lowest values in the original data
set
highest score + lowest score
MR =
2
Weighted Mean
- multiply each value by its corresponding weight and dividing the sum of
the products by the sum of the weights.
11 + 22+ … + nn
= =
1 + 2 + n …+
where w1, w2, w3, ..., wn are the weights and X1, X2, X3, ..., Xn are the values.
Example:
Numerical
Subject Units (w) Letter Grade Value (X)
PE 2 A 4
Calculus 5 C+ 2.5
English 3 B 3
2 . 4 + 5 .2.5 + 3 . 3
X= = 2.95
10
the QPI is 2.95
Measures of Dispersions
• Range
- difference between highest and lowest value
• Standard Deviation
- always positive - zero if all the data are the same - uses the same
units as the original data set - can be influenced by outliers
• Variance
- does not use the same units as the data
Population Standard Deviation (σ)
σ= =
1
• Deciles - divide the data set into 10 equal groups
2
1
Image taken from the internet
2
Image taken from the internet
• Quartiles - divide the data set into quarters
3
• z-score (standard score)
- the z-score for a given data value x is the number of
standard deviations that x is above or below the mean of the data
population: sample:
z= z=
3
Image taken from the internet
Example:
A basketball player Carl is 78 inches tall and a volleyball player Jane is 76
inches tall. Carl is obviously taller by 2 inches, but which player is relatively
taller? Does Carl’s height among men exceed Jane’s height among women?
Men have mean height of 68 inches and a standard deviation of 2.8 inches
while women have mean height of 63.6 inches and a standard deviation of 2.5
inches.
Carl : z = = = 3.21
Jane : z = = = 4.96
Carl’s height is 3.21 standard deviations above the mean, but Jane’s height is a
whopping 4.96 standard deviations above the mean.
∴ Jane’s height among women is relatively greater than Carl’s height among
men.
Exercises
2. The mean time to download pdf file is 12 min with a standard deviation of 4 min. Belle’s
download time is 20 min. John’s download time is 6 min. How can you compare Belle’s
download time compare with John?
3. Cheryl has taken two quizzes in her history class. She scored 15 on the first quiz, for which
the mean of all scores was 12 and the standard deviation was 2.4. Her score on the second
quiz, for which the mean of all scores was 11 and the standard deviation was 2.0, was 14.
In comparison to her classmates, did Cheryl do better on the first quiz or the second quiz?
4. Roland received a score of 70 on a test for which the mean score was 65.5. Roland has
learned that the z-score for his test is 0.6. What is the standard deviation for this set of test
scores?
Box and Whisker Plot (box plot)
- used to provide a visual summary of a set of data
- it shows the median, the 1st and 3rd quartiles, and the
minimum and maximum values of a data set
4
Image taken from the internet
Example:
Construct a box-and-whisker plot for the following data:
Number of Rooms Occupied in a Resort during an 18-day period
86 77 58 45 94 96 83 76 75
65 68 72 78 85 87 92 55 61
References:
1. Richard Aufmann, et. al. Mathematics Excursion, Third edition.
2. Allan Bluman, Elementary Statistics, A Step by Step Approach.
Eight Edition.
3. Raja Almukkahal, et. al. CK-12 Advanced Probability and
Statistics, Concept Collection.
God Bless!