Professional Documents
Culture Documents
Quantitative Analysis
Quantitative Analysis
Descriptive Statistics
Akanksha Patra (TF, RMSSD)
Types of Quantitative Analysis
1. Univariate Analysis: analysis of a single variable ; serves the purpose
of description
Eg: frequency distributions, measures of central tendency & dispersion
• Example:
The scores of a student enrolled in RMSSD’23 over 7 in-class quizzes
are as follows:
12, 15, 10, 17, 17, 13, 18
What is the mean?
Weighted Mean
• For a larger data set, it is easier to summarize data in a frequency
table before calculating the mean. In this case, you need to weigh
each possible value by the frequency of the value to calculate the
total.
• Example: Consider the following scores by students on in-class quiz 3.
What is the average performance of the class?
Scores No. of Students
0 8
5 12
7 14
8 11
9 9
10 14
12 9
Measures of Central Tendency: Mean
• The mean is sometimes represented by the letter M and is also called
the typical, average, or most central score.
• A small n represents the sample size for which the mean is being
computed. A large N would represent the population size.
• The sample mean is the measure of central tendency that most
accurately reflects the population mean.
• It is very sensitive to extreme scores. An extreme score can pull the
mean in one or the other direction and make it less representative of
the set of scores and less useful as a measure of central tendency.
Measures of Central Tendency: Median
• This is also an average of a different kind. It is the midpoint in a set of
scores.
• Example:
The scores of a student enrolled in RMSSD’23 over in-class quiz 2 are as
follows:
7, 12, 15, 10, 17, 19, 13, 18, 16, 9, 5
What is the median?
Percentile Points
• Percentile points are used to define the percentage of cases equal to
and below a certain point in a distribution or set of scores.
• For example, if a score is at the 75th percentile, it means the score is
at or above 75% if the other scores in the distribution.
• The median is the 50th percentile.
• The 25th percentile is Q1, the 75th is Q3
• What is Q2?
Measures of Central Tendency: Median
• Why use the median instead of the mean? It is insensitive to extreme
scores, also known as outliers.
• When you have a set of scores in which one or more scores are
extreme, the median better represents the centremost value of that
set of scores than any other measure of central tendency.
• Example: Consider the list of five incomes:
• Rs. 135,456 Rs. 54,365 Rs. 37,668 Rs. 32,456 Rs. 25,500
Measures of Central Tendency: Median
• Rs. 135,456 Rs. 54,365 Rs. 37,668 Rs. 32,456 Rs. 25,500
• The mean of the set of five scores you see above is the sum of the set
of five divided by 5, which turns out to be Rs. 57,089. On the other
hand, the median for this set of five scores is Rs. 37,668. Which is
more representative of the group?
• When dealing with a large set of data points, it is usually unlikely, but
possible. Are the categories mutually exclusive?
When to Use What Measure of Central
Tendency
• Depends on the type of data you are describing. Remember the
various scales of measurement? Nominal, Ordinal, Interval, Ratio
• A measure of central tendency for qualitative, categorical, or nominal
data (such as gender, voting preference, hair color) can be described
using Mode.
• Median and Mean are best used with quantitative data, such as
height, income level (in currency, not categories), age, IQ scores etc.
• Mean is more precise than Median. Median is more precise than the
Mode. This means Mean is the most often used measure of central
tendency.
Measures of Variability or Dispersion
• Consider the following data sets:
• 7, 6, 3, 3, 1
• 3, 4, 4, 5, 4
• 4, 4, 4, 4, 4
Measures of Variability or Dispersion
• Variability (or spread, or dispersion) is a measure of how different
data points or scores are from one another.
• It is even more accurate (and easier) to think of variability as how
different scores are from one particular score. Which one do you
think that is?
• Three measures of variability are used to reflect the degree of
variability: range, standard deviation and variance.
Range
• Range is how far apart scores are from one another. The range is
computed simply by subtracting the lowest score from the highest.
• This is also called the exclusive range. The Inclusive Range is highest
score minus the lowest sore plus 1 (h-l+1).
Standard Deviation
• The most frequently used measure of variability. It represents the
average amount of variability in a set of scores.
• It is the average distance from the mean. The larger the standard
deviation, the larger average distance each data point is from the
mean of the distribution and the more variable the set of scores is.
Standard Deviation
• Consider this data set:
• 5, 8, 5, 4, 6, 7, 8, 8, 3, 6
Paths to description: