Professional Documents
Culture Documents
Chapter 1 QM (PC)
Chapter 1 QM (PC)
Presentation
Introduction to Statistics
➢ Statistics is the science of collecting, organizing, presenting,
analyzing and interpreting numerical data to assist in making
more effective decisions.
➢ Purposes of Statistics
• Statistical techniques are used extensively by marketing
managers, accountants, consumers, educators, politicians,
physicians, etc.
• Statistical techniques are used to make many decisions that
affect our lives. Regardless what your future line of work is,
you will make decisions that involved data.
Each, every
1
➢ Type of variables:
2
Data Presentation
➢ Raw data
• Data collected that have not been organized or processed are
called raw data.
• When every observed value of the random variables is listed,
the data are called ungrouped data.
• Grouping is one of the most common methods of organizing
data. When we group data we are actually frequency
distributions for the raw data.
➢ Frequency distribution
• A table in which possible values for a variable are grouped
Age f into non-overlapping classes, and the number of observed
1-3 4
2-4 5 values which fall into each class is recorded.
1-3 3 • Data organized in a frequency distribution are called
4-6 4
grouped data.
4
Eg 2: The following is a record of the number of books borrowed per
week in the library for 30 weeks.
5
Eg 3: The amount of rain fall (in cm3) for a small town was recorded
for the month of December.
6
Eg 4:
Eg 5:
Amount of rain Class boundaries Class size Class
fall (cm3) Midpoint, x
19 - < 22 19 – 22 3 20.5
22 - < 25 22 – 25 3 23.5
25 - < 28 25 – 28 3 26.5
28 - < 31 28 – 31 3 29.5
31 - < 34 31 – 34 3 32.5
7
Histogram
➢ A graph in which the classes are marked on the horizontal axis
and the class frequencies on the vertical axis.
➢ Shapes of histogram
❖ Symmetric
❖ Skewed
❖ Uniform or a rectangular
8
Eg 6: Construct a histogram for the frequency distribution of the
number of books read borrowed per year in the library by 30
students.
Solution:
Number of f Class
books boundaries
10 – 29 8 9.5 – 29.5
30 – 49 7 29.5 – 49.5
50 – 69 4 49.5 – 69.5
70 – 89 7 69.5 – 89.5
90 – 109 4 89.5 – 109.5
9
Eg 7: Construct a histogram for the frequency distribution the amount
of rainfall in the month of December.
Solution:
10
➢ For frequency distribution of unequal class size, the height of
each bar is drawn proportional to the adjusted frequency of each
𝑐𝑜𝑚𝑚𝑜𝑛 𝑐𝑙𝑎𝑠𝑠 𝑠𝑖𝑧𝑒 ×𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦
bar where Adjusted frequency =
𝑐𝑙𝑎𝑠𝑠 𝑠𝑖𝑧𝑒
11
➢ Cumulative Frequency Distribution
• Used to determine how many or what proportion of the data
values are below or above a certain value.
• The “Less than” cumulative frequency distribution is a table
showing the total frequency of all values less than the upper
class boundary of each class.
Eg 9:
Number of f Class Upper cf or F
books boundaries boundary
9.5 0
10 – 29 8 9.5 – 29.5 29.5 8
30 – 49 7 29.5 – 49.5 49.5 15
50 – 69 4 49.5 – 69.5 69.5 19
70 – 89 7 69.5 – 89.5 89.5 26
90 – 109 4 89.5 – 109.5 109.5 30
Eg 10:
Amount of f Class Upper cf
rain fall boundaries boundary
(cm3)
19 0
19 - < 22 4 19 – 22 22 4
22 - < 25 10 22 – 25 25 14
25 - < 28 12 25 – 28 28 26
28 - < 31 4 28 – 31 31 30
31 - < 34 1 31 – 34 34 31
12
The “Less than” Cumulative frequency polygon (Ogive)
is a line chart of a cumulative frequency distribution that shows the
cumulative frequency less than the upper class boundary plotted
against the upper class boundary of a class.
13
Solution:
Output Number of Class Upper cf
(units) employees, f boundaries boundary
0.5 0
1–5 1 0.5 – 5.5 5.5 1
6 – 10 2 5.5 – 10.5 10.5 3
11 – 15 3 10.5 – 15.5 15.5 6
16 – 20 9 15.5 – 20.5 20.5 15
21 – 25 5 20.5 – 25.5 25.5 20
14
From the ogive,
(i) number of employees producing output less than 13 units = 4.5
(ii) proportion of employees producing output more than 22 units
20−16.5 3.5
= = = 0.175
20 20
15
(i) Data collection and pre-processing are always the first steps of
BA project.
(ii) Data often need to be collected, cleansed and combined with
other sources, - not all current and historical data stored contains
all the information required for a certain analysis.
(iii) Descriptive analytics- data will be analyzed and patterns
(insight/information) are found.
(iv) Predictive analytics- insight found from predictive phase used in
this phase to predict what is likely to happen in the future, if the
situation remains the same.
(v) Prescriptive analytics- alternative decisions are determined that
change the situation and which will lead to desirable outcomes.
(vi) -Decision has to be implemented, this requires various skills such
as knowledge of change management.
-Some of the steps above need to be repeated depending on the
outcome. For eg. If predictions are not accurate enough for a
particular application, then extra data is required to improve them.
-Not all BA projects include all the steps above. For eg.
Prescriptive analytics are not included if the project achieve
prediction goal. The project finish after descriptive or predictive
steps.
16
Example:
A hotel chain analyzes its reservations to look for patterns: which are
the busiest days of the week? What is the impact of events in the city?
Is there a seasonal pattern? Etc. The outcomes are used to make a
prediction for the revenue in the upcoming months. By changing the
pricing of the rooms in certain situations (such as sports events or
school holiday), the expected revenue can be maximized.
17