Professional Documents
Culture Documents
Written Assignment Unit 2
Written Assignment Unit 2
University of People
2
Abstract
This study examined the distribution of the number of pairs of sneakers owned by forty
randomly selected students. The dataset provided the frequency of students owning each
respective number of pairs of sneakers, ranging from 1 to 7. Descriptive statistics such as the
mean, sample standard deviation, quartiles, median, and percentiles were calculated to gain
insights into the distribution of ownership.
The mean number of pairs of sneakers owned was found to be approximately 3.175, with a
sample standard deviation of approximately 1.53. Quartile analysis revealed that 25% of the
students owned 3 or fewer pairs of sneakers (Q1), while 75% owned 4 or fewer pairs (Q3). The
median, representing the middle value of the dataset, was determined to be 4 pairs.
Additionally, the study found that 2.5% of the students owned at least five pairs of sneakers. The
40th percentile of ownership corresponded to 2 pairs, indicating that 40% of the students owned
2 or fewer pairs. Moreover, the 90th percentile indicated that 90% of the students owned 5 or
fewer pairs of sneakers.
These findings provide valuable insights into the distribution of sneaker ownership among the
student population, which can inform decisions related to inventory management, marketing
strategies, and student preferences. Further research could explore factors influencing sneaker
ownership trends and their implications for student demographics and consumer behavior.
quartiles, median, percentiles, sample standard deviation, mean, data analysis, distribution
The mean ( ) is the average value of a dataset. It's calculated by summing up all the values and
dividing by the total number of values.
x1=1, f1=2
x2=3, f2=5
x3=3, f3=8
x4=4, f4=12
x5=5, f5=12
x6=6, f6=0
x7=7, f7=1
n=40 (total number of observations)
So, the mean number of pairs of sneakers owned by the forty students is approximately 3.175.
The sample standard deviation (ss) measures the amount of variation or dispersion in a dataset.
It's calculated using the formula:
Where:
The relative frequency (fi) is the frequency of each value divided by the total number of
observations (N).
The cumulative relative frequency (CRF) is the sum of the relative frequencies up to each value
of X.
4
The first quartile (Q1) is the value below which 25% of the data fall. We find Q1 by locating
the cumulative relative frequency (CRF) that is closest to, but greater than, 0.25.
The median is the middle value of the dataset when arranged in ascending order. Since there are
40 observations, we locate the 20th and 21st observations. The median is the average of these
two values.
The 20th and 21st observations are both 4. Therefore, the median is 4
Similarly, the third quartile is the value below which 75% of the data fall. We locate the
cumulative relative frequency (CRF) closest to, but greater than, 0.75.
To find the percentage of students who owned at least five pairs of sneakers, we look at the
cumulative relative frequency (CRF) corresponding to X=5, which is 0.975. This means that
97.5% of the students owned five or fewer pairs of sneakers.
To find the percentage of students who owned at least five pairs, we subtract 0.975 from 1 (since
1 represents 100% of the data), which gives us 0.025 or 2.5%. Therefore, 2.5% of the students
owned at least five pairs of sneakers.
The 40th percentile is the value below which 40%40% of the data fall. We locate the cumulative
relative frequency (CRF) closest to, but greater than, 0.40.
Similarly, the 90th percentile is the value below which 90% of the data fall. We locate the
cumulative relative frequency (CRF) closest to, but greater than, 0.90.