Professional Documents
Culture Documents
Fundamentals of Statistics 1
Fundamentals of Statistics 1
Ratio
You might have heard of the sequence of terms to describe data : Nominal, Ordinal,
Interval and Ratio. They were used quite extensively but have begun to fall out of favor.
These terms are used to describe types of data and by some to dictate the appropriate
statistical test to use. Most statistical text books still use this hierarchy so students
generally end up needing to know them.
Nominal basically refers to
categorically discrete data such as name of
your school, type of car you drive or name
of a book. This one is easy to remember
because nominal sounds like name(they
have the same Latin root).
Who Cares?
Where did this all come from you ask and why do we care? Well, the short answer is, we
should care most about identifying nominal data--which is categorical data. If it isn't
nominal, then it's quantitative. So why all the fuss? In the 1940's when behavioral
science was in its infancy, there was much concern about trying to make the practice as
legitimate as possible. Psychology and other Social and Behavioral Sciences are
considered soft sciences as opposed to the hard sciences of Chemistry and Physics. It was
thought that by applying some of the same thinking from the hard sciences, it would
improve the legitimacy of these soft sciences--as well as the veracity of the claims made.
One approach was to map types of scaling to more natural laws (something akin to the
physical laws of gravity and motion). This classification system was proposed in 1946 by
SS Stevens. In the article Stevens went so far as to say that you should only take
averages on at least interval and ratio data. Nominal and Ordinal data should only be
counted and described in frequency tables--no means and standard deviations.
One of the more famous articles showing the fallacy of such rigid thinking was by an
eminent statistician named Lord who in his article: "On the statistical Treatment of
Football Numbers" showed how the means of nominal data can be meaningful too!
In practice, rating scales are ubiquitous in behavioral sciences and rarely have they been
shown to have interval, much less ratio scales (what is the 0 point of customer
satisfaction ?)
So means, standard deviations, t-tests, regressions and ANOVA are run daily and the
results are published without much concern for these categories (to the chagrin of a few
purists). What this classification system does remind us of is to not make interval and
ratio claims about ordinal data. So if the average customer satisfaction on Product A is 4.0
and the Average on B is 2.0, we need to be careful in thinking the difference in
satisfaction is twice. We can say there is a difference, but we're less sure if it's two times.
In summary, it's generally OK to take means and apply statistical tests to ordinal data,
just be careful about making interval claims such as "twice as satisfied."
http://www.usablestats.com/lessons/noir