Lecture 2 - Descriptive Statistics - Part 1
Descriptive statistics
Associate Prof. Mohamed El Ashhab
Statistics and Data
• What is Statistics science?
Data
Statistics is the science of:
2. Dot plot
3. Scatter plot
4. Frequency distribution
5. Histogram
Pareto Diagram
Construction of Pareto Diagram
Pareto in Excel
Dot Plot
[Figure: dot plot titled "Ages of Students"; horizontal axis marked from 15 to 57 in steps of 3]
From this graph, we can conclude that most of the values lie
between 18 and 32.
Scatter Plot
• When each entry in one data set corresponds to an entry in another data
set, the sets are called paired data sets.
• In a scatter plot, the ordered pairs are graphed as points in a coordinate
plane. The scatter plot is used to show the relationship between two
quantitative variables.
• The following scatter plot represents the relationship between the number
of absences from a class during the semester and the final grade.
[Figure: scatter plot; horizontal axis: Absences (x), marked from 0 to 16 in steps of 2]
From the scatter plot, you can see that as the number of absences increases, the final
grade tends to decrease.
Frequency Distributions
Consider the following heights of 50 nano-pillars, measured in nanometers (nm)
during the fabrication of a new transmission-type electron multiplier on a flat
silicon membrane.
245 333 296 304 276 336 289 234 253 292
366 323 309 284 310 338 297 314 305 330
266 391 315 305 290 300 292 311 272 312
315 355 346 337 303 265 278 276 373 271
308 276 364 390 298 290 308 221 274 343
Constructing a Frequency Distribution
2‐ Calculate the range of the data (Range = Max – Min).
Starting point of the first class is arbitrary and should be less than or
equal to the minimum value.
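The range calculation and the rule for the starting point can be sketched in Python using the 50 nano-pillar heights listed earlier. This is only an illustrative sketch: the function name frequency_distribution, the starting point of 220 and the class width of 25 are assumptions chosen for the example and are not taken from the lecture. Each class is treated as including its lower limit and excluding its upper limit.

```python
# Heights of the 50 nano-pillars (nm), copied from the data listed above.
heights = [
    245, 333, 296, 304, 276, 336, 289, 234, 253, 292,
    366, 323, 309, 284, 310, 338, 297, 314, 305, 330,
    266, 391, 315, 305, 290, 300, 292, 311, 272, 312,
    315, 355, 346, 337, 303, 265, 278, 276, 373, 271,
    308, 276, 364, 390, 298, 290, 308, 221, 274, 343,
]


def frequency_distribution(data, start, width):
    """Return a list of (lower, upper, count) for classes [lower, upper).

    `start` and `width` are chosen by the analyst (hypothetical values
    are used below); `start` must not exceed the minimum value.
    """
    if start > min(data):
        raise ValueError("start must be less than or equal to the minimum")
    classes = []
    lower = start
    # Keep adding classes until the maximum value is covered.
    while lower <= max(data):
        upper = lower + width
        count = sum(1 for x in data if lower <= x < upper)
        classes.append((lower, upper, count))
        lower = upper
    return classes


# Range = Max - Min
data_range = max(heights) - min(heights)
print("Range:", data_range)

# Assumed starting point (220 <= minimum) and assumed class width (25).
table = frequency_distribution(heights, start=220, width=25)
for lower, upper, count in table:
    print(f"{lower} to under {upper}: {count}")
```

A different starting point or class width gives a different but equally valid table, as long as every value falls into exactly one class.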
Frequency distribution ‐ Example
Frequency density distribution ‐ Example
Students’ mid‐term exam grade percentage
Stem‐and‐Leaf display
• Generally, a stem and leaf plot, or stem plot, is a technique used to
classify either discrete or continuous variables.
• A stem and leaf plot is used to organize data as they are collected.
17 24 27 32 34 15 42 21 28 37
• If we wanted to avoid the loss of information
inherent in the preceding table, we could keep track
of the last digits of the readings within each class,
getting
This can also be written as
• The left-hand column forms the stem, and the numbers to the left of the vertical line are the stem
labels, which in our example are 1, 2, . . . , 5. Each number to the right of the vertical line is a leaf.
• The numbers in a row, the leaves, have the unit 1.0. In the last step, the leaves are written in
ascending order.
• There should not be any gaps in the stem even if there are no leaves for that particular value.
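The rules above can be sketched in Python using the ten two-digit readings listed earlier in this section (17, 24, 27, ...). This is a minimal sketch, not the lecture's own procedure: the function name stem_and_leaf is an assumption, and it only handles non-negative integers whose leaf is the units digit. Leaves are written in ascending order, and stems with no leaves are still printed so that the stem has no gaps.

```python
from collections import defaultdict


def stem_and_leaf(values):
    """Return the rows of a stem-and-leaf display as strings.

    Assumes non-negative integers; the stem is the value without its
    units digit and the leaf is the units digit (leaf unit 1.0).
    """
    stems = defaultdict(list)
    for v in sorted(values):          # sorting puts leaves in ascending order
        stems[v // 10].append(v % 10)
    rows = []
    # Walk every stem from smallest to largest so that no stem is skipped,
    # even when it has no leaves.
    for stem in range(min(stems), max(stems) + 1):
        leaves = " ".join(str(leaf) for leaf in stems.get(stem, []))
        rows.append(f"{stem} | {leaves}".rstrip())
    return rows


readings = [17, 24, 27, 32, 34, 15, 42, 21, 28, 37]
for row in stem_and_leaf(readings):
    print(row)
```

For these readings the stems run from 1 to 4, so the display has four rows.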