Professional Documents
Culture Documents
Dessertation Design &data Collection
Dessertation Design &data Collection
Dessertation Design &data Collection
Dr. Ghassan M
Ghassan Mohamed
Ghassan was born on November 23, 1986, in Giza, Egypt. He graduated from the faculty of science!
Then he earned a master’s degree in statistical quality control and quality assurance from Cairo university
with an excellent degree. Later, He earned a doctorate degree in statistical quality control and quality
assurance from Cairo university with an excellent degree.
The educational journey did not end there. This year, he earned a diploma in governance, TOT, and
Organizational excellence diploma. Ghassan realized his biggest dream: He finally became a lecturer at Cairo
University to complete his learning journey. He although an assessor in government agencies for the Egypt
Award and Egypt's Vision 2030.
However, he achieved one of his dreams to become a member of CHG family, teaching quality, statistics,
and 7 habits besides his daily work as a quality supervisor.
Today, Ghassan encourages a promising people—to study the different sciences. his advice to follow your
dreams, no matter how great.
CONTENTS
Dissertation
01 Dissertation & Dissertation design
Data
02 Data Types and how to dealing with it
Exam
04
00 Learning Objectives
What is a dissertation?
A dissertation is...
1 Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Check your guidelines!
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Structuring a dissertation or thesis
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Title page
✓ Dissertation title
✓ Your name
✓ Type of document
✓ Department and institution
✓ Degree program
✓ Date of submission
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Acknowledgements
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Acknowledgements
Abstract
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Abstract
2- Your methodology – how did you go about investigating the topic and finding answers to your research
question(s)?
4- Your conclusions – based on your findings, what conclusions did you draw? What answers did you find to
your research question(s)?
Table of contents
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Table of contents
Lists of figures and tables
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Lists of figures and tables
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Lists of figures and tables
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
List of abbreviations
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
List of abbreviations
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Glossary
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Glossary
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Introduction
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Literature review
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Theoretical framework
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Methodology
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Discussion
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Conclusion
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Reference list or bibliography
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
APA Style
Parenthetical citation
It is important to avoid plagiarism in academic writing (Smith, 2020, p. 15).
Narrative citation
Smith (2020, p. 15) states that it is important to avoid plagiarism in academic writing.
Multiple authors
1. Author
2. Date
3. Title
4. Source
The author component
Retrieval date for online sources that Retrieved December 3, 2020, from …
are continually updated
The title component
Plain text for sources within The evolving European model of sports finance.
sources (articles, chapters,
web pages)
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
What is Statistics mean?
Statistics is the science of collecting, organizing,
summarizing and analyzing information in order to
draw conclusions.
The Process of Statistics
Many studies evaluate batterer treatment programs, but there are few experiments designed to
compare batterer treatment programs to non-therapeutic treatments, such as community service.
Researchers designed an experiment in which 376 male criminal court defendants who were accused
of assaulting their intimate female partners were randomly assigned into either a treatment group or a
control group. The subjects in the treatment group entered a 40-hour batterer treatment program
while the subjects in the control group received 40 hours of community service. After 6 months, it
was reported that 21% of the males in the control group had further battering incidents, while 10% of
the males in the treatment group had further battering incidents. The researchers concluded that the
treatment was effective in reducing repeat battering offenses.
Source: The Effects of a Group Batterer Treatment Program: A Randomized Experiment in Brooklyn by Bruce G. Taylor, et. al. Justice Quarterly,
Vol. 18, No. 1, March 2001.
Step 1: Identify the research objective.
The researchers randomly divided the subjects into two groups. Group 1
participants received the 40-hour batterer program, while group 2 participants
received 40 hours of community service. Group 1 is called the treatment group
and the program is called the treatment. Group 2 is called the control group.
Six months after the program ended, the percentage of males that battered their
intimate female partner was determined.
Step 3: Organize and summarize the information.
We extend the results of the 376 males in the study to all males who batter
their intimate female partner. That is, males who batter their female partner and
participate in a batter treatment program are less likely to batter again.
Types of data
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Data Collection Techniques
Observations,
Tests,
Surveys,
Document analysis
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Key Factors for High Quality
Experimental Design
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
What Makes a Good Quantitative Research Design?
4 Key Elements
1. Freedom from Bias
2. Freedom from Confounding
3. Control of Extraneous Variables
4. Statistical Precision to Test Hypothesis
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Bias: When observations favor some
individuals in the population over others.
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Precision Vs Accuracy
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Both Accurate Accurate
and Precise Not precise
Not accurate
But precise
Neither accurate
nor precise
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Interpreting Results of Experiments
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Overall Methodology:
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Introduction to Sampling
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Sampling
Distributions:
When you form a sample, you often show it by a plotted
distribution known as a histogram .
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Interpreting quantitative findings
Descriptive Statistics
Dr. Ghassan M
Ghassan_mohamed@yahoo.com
Example
• Measures of dispersion are descriptive statistics that describe how similar a set of
scores are to each other
• The more similar the scores are to each other, the lower the measure of dispersion will be
• The less similar the scores are to each other, the higher the measure of dispersion will be
• In general, the more spread out a distribution is, the larger the measure of dispersion
will be
81
Measures of Dispersion
125
• Which of the distributions of 100
75
scores has the larger dispersion? 50
25
0
1 2 3 4 5 6 7 8 9 10
Histogram
Pareto Chart
Check Sheet
Scatter Diagram
Control Chart
Dr. Ghassan Mohamed
WHY
GRAPHS ?
• To reveal a trend or comparison of a data
• Easily understood
93
Bar
Graph
• Bar graph is drawn on an x-y graph and it has labelled horizontal or
vertical bars that show different values
• The size, length and color of the bars represent different
values.
94
Preferred
use(Limitation)
Non continuous data
Comparing or contrasting the size of the different categories of the
data provided.
95
Exampl
e
96
Stem and Leaf Plot
• Stem and leaf plot also called as stem plot are connected with quantitative data
such that it helps in
• Displaying shapes of the distributions,
• Organize numbers and
• Set it as comprehensible as possible.
4. Add a key
5
6
Go through the marks in turn and put in the units figures of each mark
in the proper row. These are the ‘LEAVES’
4
8
5
6 3 1
When all the marks are entered the diagram will look like this:
4 0 4 7 9
8 2 9 4 7 9 3 7
5
5 9 3 4 8 5 8 1
6 3 1
5 0 6 4 0
Rewrite the diagram so that the units figures in each row are in order:
4 0 4 7 9
2 3 4 7 7 8 9 9
5
1 1 3 3 4 5 5 8 8 9
6
0 0 4 5 6
Add a KEY:
4 0 4 7 9
5|2 = 52 2 3 4 7 7 8 9 9
5
1 1 3 3 4 5 5 8 8 9
6
0 0 4 5 6
Remember:
- Always put in a Key
- Always put your data in Order
Median:
- to work out the median, you must find the middle value
- if there are two middle values, you need the average
Range:
- to work out the Range, subtract the smallest number from the
biggest
The stem & leaf diagram below shows the masses in kg of some people in a lift.
(a) How many people were weighed?
(b) What is the range of the masses?
(c) Find the median mass.
Stem Leaf
7 2 2 7 (b) 86 – 31 = 55 kg
8 1 6
(c) 56 kg Dr. Ghassan Mohamed
Frequency Polygon
• The frequency polygon has most of the properties of a histogram, with an extra
feature. Here the mid point of each class of the x-axis is marked. Then the
midpoints and the frequencies are taken as the plotting point. These points are
connected using line segments.
• We also complete the graph, that is, it's closed by joining to the x-axis. Frequency
polygon gives a less accurate representation of the distribution, than a histogram,
as it represents the frequency of each class by a single point not by the whole class
interval.
10
7
Dr. Ghassan Mohamed
Example
10
8
Dr. Ghassan Mohamed
Final Product
10
9
Dr. Ghassan Mohamed
Frequency Curve
• The frequency polygon consists of sharp turns, and ups and downs which are not in
conformity with actual conditions.
• To remove these sharp features of a polygon, it becomes necessary to smooth it. No
definite rule for smoothing the polygon can be laid down.
• It should be understood very clearly that the curve does not, in any way, sharply
deviate from the polygon.
• In order to draw a satisfactory frequency curve, first of all, we need to draw a
frequency histogram the frequency polygon and ultimately the frequency curve.
11
0
Example
11
1
Cumulative Frequency
(OGIVE)
• Cumulative frequency is a graph plotting cumulative frequencies on the y-axis and class
scores on the x-axis.
• The difference between frequency curve and an ogive is that in the later we plot the
cumulative frequency on the y-axis rather than plotting the individual frequencies.
11
2
Dr. Ghassan Mohamed
Example
11
3
Dr. Ghassan Mohamed
Example
11
4
Dr. Ghassan Mohamed
Pareto Diagram
Joseph Juran used the item “ Vital few & Trivial many”
He noted that 20 % of the quality problems caused 80 % of the dollar loss.
• A scatter plot or scatter graph is a type of graph which is drawn in Cartesian coordinate to
visually represent the values for two variables for a set of data. It is a graphical representation
that shows how one variable is affected by the other.
12
5
Dr. Ghassan Mohamed
Scatter plot
Use to illustrate the relationship between two variables by potting one against the
other.
Random Assignable
causes causes
Common Special
causes causes
Special Cause :
Variation can be exhibited within or out control limits e.g
( Trends, Step, Functions, shift ……..etc.)
• Correlation Analysis
• Coefficient of Determination
In this chapter we carry this idea further. We are going to calculate numerical
measures (Correlation Analysis) to express the strength of relationship
between two variables.
• The Independent Variable provides the basis for estimation. It is the predictor
variable (denoted by x).
Examples: Dependent
• The first step in correlation analysis is drawing a scatter diagram to portray the
relationship between the two variables.
•It can range from -1.00 to 1.00 according to the scale below:
non linear
relationship
Scatter diagram:
X: sales calls; Y:copiers sold
Positive linear
relationship, is it
strong or moderate?
r=0.759
There is a direct linear relationship between the number of sales calls and the number of copiers
sold. The association is strong.
Note that this doesn’t mean that more sales calls cause more copiers sales. we have not
demonstrated cause and effect here, only that the two variables—sales calls and copiers sold—are
related.
Correlation Vs Causality just because two things occur together does not mean that one is
the cause of the other. For example: an increase in ice-cream consumption in the summer is
correlated with increased rate of drowning deaths. But this doesn’t mean that ice cream
consumption causes drowning.
Applied Statistical Methods (BMTH113)-----
Dr. Rasha El Kholy----Eslsca
147
https://www.e-education.psu.edu/marcellus/node/636
2- Coefficient of Determination
The coefficient of determination (r2) is the proportion of the total variation in
the dependent variable (Y) that is explained by the variation in the
independent variable (X). It is the square of the coefficient of correlation.
• It ranges from 0 to 1.
r2= (0.759)2=0.576
151
3- Simple Linear Regression Model: Example
Recall the example of Copier Sales. Find the linear regression line to express the
relationship between the two variables. What is the expected number of copiers sold by a
representative who made 20 calls?
Absolute
value of r
r2
(b) Using the regression line, what would be the predicted college GPA
if a student has a high school GPA?
Possible values
Corresponding
of the
probabilities
Random
variable
Example…cont’d
Also we can represent the Probability distribution using a chart
What is a Probability
Distribution?
A probability distribution of a random variable is a listing of
all
possible values of the random variable and the probability
associated with each outcome.
CHARACTERISTICS OF A PROBABILITY
DISTRIBUTION
Examples
Tossing a coin three times and counting the number of heads.
A bank counting the number of credit cards carried by a group of customers.
Number of students in a class.
1- Discrete Random
Variable
The probability distribution for the number of cards carried can take the
following form:
Examples
The time between flights between Atlanta and LA are 4.67 hours, 5.13
hours, and so on.
The annual snowfall in Minneapolis, measured in inches.
Sales, in dollars, of a certain company.
Normal Probability
Distribution
The normal probability distribution is a continuous distribution that is widely used in
theory and in practice.
Used to model natural phenomena as height, IQ scores,…
It is asymptotic, meaning the curve approaches but never touches the X-axis
The area and the curve equals to 1
It is symmetric at zero
Area to the left of zero equals the area to the right of zero
How to read the Z
table?
How to read the Z
table?
P(0 < Z< 0.56)
P(Z >
1.96)
P(Z<
0.56)
Example:
Find
(a) Find P(Z>2)
900−1000 1100−1000
P($900 < X < $1,100) = P <𝑍<
100 100
= P −1 < 𝑍 < 1 =2(0.3413)=0.6826
P(0<Z<1) = 0.3413 so
0.3413 * 2 =0.6826 or about 68%
P(0<Z<2) =0.4772 so
0.4772 * 2 = 0.9544 or about 95%
P(0<Z<3) = 0.4987 so
0.4987 * 2 = 0.9974 or about 99.7%
The Empirical Rule…
Example
As part of its quality assurance program, the Autolite Battery
Company conducts tests on battery life. For a particular D-cell
alkaline battery, the mean life is 19 hours. The useful life of the
battery follows a normal distribution with a standard deviation
of 1.2 hours.
1. About 68% of the batteries failed between what two values?
μ ±1 σ; 19 ± 1(1.2) hours;
About 68% of batteries will fail between 17.8 and 20.2 hours.
2. About 95% of the batteries failed between what two values?
μ ±2 σ; 19 ± 2(1.2) hours;
About 95% of batteries will fail between 16.6 and 21.4 hours.
3. Virtually all of the batteries failed between what two values?
μ ±3 σ; 19 ± 3(1.2) hours;
Practically all (99.7%)of the batteries will fail between 15.4 and 22.6
hours.
Finding the value of Z given a
probability
Find the value of z satisfying each of the
following:
(a) P(Z>z)=0.5
(b) P(Z<z)=0.8643
(c) P(-z<Z<z)=0.9
(d) P(-z<Z<z)=0.99
(e) P(Z<z)=0.33
One method of controlling the quality of a product is 100% inspection which requires huge
expenditure in terms of time, money and labor. Moreover due to boredom and fatigue involved in
repetitive inspection process, there exists a possibility to overlook and some defective products
may pass the inspection point
• (i) The method is applicable in those industries where there is mass production and the industries follow a
set production procedure.
• (vi) Products of destructive nature during inspection can be easily inspected by sampling.
• (vii) Due to quick inspection process, scheduling and delivery times are improved
• (i) It does not give 100% assurance for the confirmation of specifications so there is always
some likelihood/risk of drawing wrong inference about the quality of the batch/lot.
• (ii) Success of the system is dependent on, sampling randomness, quality characteristics to
be tested, batch size and criteria of acceptance of lot.
Is ‘d’ ≤ ‘c’
Tightene
Reduced Normal
d
Discussion
% 100 Thank you