Professional Documents
Culture Documents
Quantitative Analysis Applied To International Business
Quantitative Analysis Applied To International Business
Quantitative Analysis Applied To International Business
Ignacio Ocampo
Introduction
This essay analyses certain data for a given population in a Spanish province. The size of the population is
23,537 individuals for which the age, gender and home size has been determined. We will take and analyze
a sample of 300 individuals and compare the data obtained from the sample to that in the population and
draw conclusions regarding the accuracy of the calculations.
Below we can see the age pyramid (histogram) for certain given age brackets, which gives us an
approximate distribution of the age, by gender. The first bracket starts at the age of 16, since this is the
lowest value found in the population. I have determined six 10-year brackets and one +75 bracket. We
can see some slight differences between the age distributions of the male and female groups. While the
female´s by far largest group is 36-45, for men this group is the largest as well but very similar in numbers
to the 46-55 bracket, which in part explains the slightly higher average age for men. The highest age tiers
contain significantly more women than men, even though the male population is significantly larger than
the female. This may be explained by the general assumption that the life expectancy is higher for women
than for men.
The complete set of data extracted from the population is analyzed on the table below, in which we can
see all the relevant statistical indicators.
Age
Mean 42.147
Standard Error 0.076
Median 42
Mode 48
Standard Deviation 11.617
Variance 134.946
Kurtosis -0.014
Skewness 0.236
Range 82
Minimum 16
Maximum 98
Count 23537
Gender
The proportion of genders is far from balanced: 60.4% of the population are male and 39.6% are female.
This is represented in the pie chart below.
GENDER PROPORTION
39.6%
Male
60.4% Female
Home Size
The average home size is 112.44 with a standard deviation of 50.99. There seems to be no significant
correlation between age and home size: the correlation coefficient is 0.08. The chart below shows how
home sizes are distributed across age levels. As we can see in the trendline (Red), there is only a very weak
correlation between age and home size. In this chart we can clearly spot a number of outliers that can
move the mean in one direction or another, but when it comes to age, the outliers (few largest homes)
seem to be somewhat evenly distributed across the different age levels.
Home size
1000
900
800
700
600
500
400
300
200
100
0
0 20 40 60 80 100 120 140
Also, the difference between home size for men and women seems to be almost negligible: the average
home for men is 112.1 sq. meters, while for women it is a bit larger, 112.8 sq. meters.
In the table below, we can see the complete set of descriptive statistics for the home size population
dataset. One of the key observations is that the distribution is not symmetrical, and values tend to
concentrate in the lower range. This is rather obvious as smaller homes are more common than
excessively large homes. For this reason, we can see that the median is lower than the mean (outliers
are pulling the mean upwards) and the skewness value shows us this fact as well.
Home Size
Mean 112.443
Standard Error 0.332
Median 100.000
Mode 90.000
Standard Deviation 50.990
Variance 2599.949
Kurtosis 36.929
Skewness 4.278
Range 935
Minimum 15
Maximum 950
Count 23537
Sampling
A sample of 300 individuals was extracted of the total population (without replacement). The method
used to obtain a completely random sample was to assign a random number to each of the rows in the
population, then sort by number in an increasing order (could have been decreasing as well) and then
select only the first 300. Out of that sample, the following values were calculated:
Indicator Value
Mean Age 41.43
Age Variance 135.26
Age St Deviation 11.63
In the table below, the indicators obtained from the sample are compared to those extracted from the
population. This enables us to see the accuracy of the sample and the error
Sample Population
Mean Age 41.43 42.15
Age Variance 135.26 134.95
Age St Deviation 11.63 11.62
Given that we have the real values from the population we can see that the real means of the population
and fall within the established ranges for both confidence levels. However, in the case of male/female
proportion, the actual population mean is outside the established range at a 90% confidence. There was
a 10% chance that the actual mean was outside of those limits, and it actually was. However, when the
range is extended to the 95% confidence level, the population mean is within the range.