Professional Documents
Culture Documents
Descriptive Stat Intro
Descriptive Stat Intro
VEENA GOSWAMI
Getting started
In simple words, statistics is a way to get information from data. More
precisely, it is a science of collecting and analyzing data for the purpose
of drawing conclusions and making decisions.
Inferential statistics:
Inferential statistics are the methods that use the data collected from a
small group to draw conclusions about a larger group.
Types of Variables
Categorical variables (also known as qualitative variables) have values
that can only be placed into categories such as yes and no.
“Do you currently own bonds?” (yes or no) and the level of risk of a
bond fund (below average, average, or above average) are examples of
categorical variables.
For example, there are six colors of candies in a bag of M&M's candies.
Levels of measurement
Levels of measurement
MUTUALLY EXCLUSIVE:
A property of a set of categories such that an individual or object is
included in only one category.
EXHAUSTIVE:
A property of a set of categories such that each individual or object
must appear in a category.
Levels of measurement
Nominal= data that is classified into categories and cannot be arranged in any
particular order.
EXAMPLES: eye color, gender, religious affiliation.
To summarize, nominal-level data have the following properties:
1. Data categories are mutually exclusive and exhaustive.
2. Data categories have no logical order.
Ordinal= involves data arranged in some order, but the differences between data
values cannot be determined or are meaningless.
EXAMPLE: During a taste test of 4 soft drinks, Mellow Yellow was ranked
number 1, Sprite number 2, Seven-up number 3, and Orange Crush number 4.
The properties of ordinal-level data are:
1. The data classifications are mutually exclusive and exhaustive.
2. Data classifications are ranked or ordered according to the particular trait they
possess.
Levels of measurement
Interval= similar to the ordinal level, with the additional
property that meaningful amounts of differences between data
values can be determined. There is no natural zero point.
EXAMPLE: Temperature on the Fahrenheit scale.
Taos, Acoma, Zuni, and Cochiti are the names of four Native American
villages from the population of names of all Native American villages in
Arizona and New Mexico.
Solution:
These data are at the nominal level. Notice that these data values are
simply names. By looking at the name alone, we cannot determine if
one name is “greater than or less than” another. Any ordering of the
names would be numerically meaningless.
Example 2: Levels of measurement
In a high school graduating class of 319 students, Jim ranked 25th, Kim
ranked 19th, Walter ranked 10th, and Julia ranked 4th, where 1 is the
highest rank.
Solution:
These data are at the ordinal level. Ordering the data clearly makes
sense. Walter ranked higher than Kim. Jim had the lowest rank, and
Julia the highest.
For instance, if you looked at grade point average, Walter and Julia may
have had a large gap between their grade point averages, whereas Kim
and Jim may have had closer grade point averages.
Solution:
These data are at the interval level. We can certainly order the data,
and we can compute meaningful differences. However, for Celsius-scale
temperatures, there is not an inherent starting point.
The value 0C may seem to be a starting point, but this value does not
indicate the state of “no heat.”
Furthermore, it is not correct to say that 20C is twice as hot as 10C.
Example 4: Levels of measurement
Length of fish swimming in the Yellowstone River.
Solution:
These data are at the ratio level. An 18-inch fish is three times as long
as a 6-inch fish. Observe that we can divide 6 into 18 to determine a
meaningful ratio of fish lengths.
Example: Levels of measurement
What is the level of measurement for each of the following variables?
a. Distance students travel to class.