Lesson 1

I.Q.
Lesson 1
Terminology
Data
Data (singular datum) is whatever we

collect in a study. It can be numerical or not
A datum may be referred to as a data point
in a table or graph. It may also be referred
to as a score
Data set refers to all the data collected in an
experiment
Types of Data
Data are first categorized by numerical, and non

numerical
Quantitative data can be interval or ratio
In research, numerical is called QUANTitative, and

non-numerical is called QUALitative
The difference is that ratio data have an absolute 0
and standard increases, so for example, 4 would be
twice as more than 2
Qualitative data can be nominal or categorical
This distinction is irrelevant, as the analyses for

qualitative data are all the same
Variable
A variable is what we collect data of. It

determines what type of data we have
We call it a variable because it can have
different values that can be measured
e.g. IQ is a quantitative variable: we collect
numerical data from it
e.g. Hair colour is a qualitative variable: we
collect non-numerical data from it
Population
The population is the complete group from

which we collect data, and what we make
conclusions about in a study.
A population can be anything, from the total
people of a country, to a class of
psychology students, to the total number of
Oak trees in the world.
Sample
However, populations are usually to large to

collect data from, so we select a group from the
population at random, called a sample
If the sample is randomly selected from the
population, we can assume that it represents the
population, and we can make conclusions about
the population from the data we collect from the
sample.
Population random sample collect data
conclusion about the population.
Individual
The individual is a single member of the

population/sample.
Samples consist of a group of individuals
selected from the population
Populations consist of ALL individuals we
want to study
Descriptive Statistics
Before we can do inferential statistics to

make conclusions, we need to know things
about the population first: We need to
describe it.
Descriptive statistics are calculated from the
data set, or raw data (the data before ANY
calculations or manipulations are done to it)
Central Tendency
The first type of descriptive statistic we calculate

is a measure of central tendency.
Central tendency essentially means the center
point of a data set, or the number that all datum
have a tendency to be close to.
Measures of central tendency include the mean,
the median, and mode, and each serve different
purposes.
The Mean
The mean is the standard measure of central

tendency, and the most useful for doing inferential
statistics
The mean is simply the average: all the scores added
together then divided by the total number of scores
Sometimes the mean does not adequately measure
the central tendency, such as when one or more
scores are extreme compared to the rest.
e.g. 3, 5, 6, 30: 30 is extreme (a single score

significantly changes the mean)
The Median
In cases of extreme scores (called

outliers), we need a different measure of
central tendency that is not affected by
outliers.
That's the median: the center of a data set
when the scores are listed from smallest to
greatest, or vice versa
Smallest to largest is the standard and

expected way to find the median
The Mode
Mean and median only work for numerical data, like

age or IQ scores.
Sometimes we collect data that is not numerical, like
hair colour or gender.
We cannot find the average number hair

colour.
But, we can find the most common hair colour

The mode is simply whichever category appears the
most
Frequency
Frequency means something specific in

stats
It is the number of times a score appears in
a data set.
Recall, the mode is the score with the

highest frequency
Later we'll learn how to construct a

frequency table
Variance
Recall that measures of central tendency are

based on variables: things that have different
values (it varies between individuals)
This means there are differences between
individuals, resulting in different scores for each
individual
We call these differences variance.
Data comes from a variable, which varies,

resulting in variance
Distribution
A distribution is a graphical representation

of a data set
It gives us a picture of how the data are
distributed in the sample or population
It is created using, and gives us a picture
of, the central tendency, frequency, and
variance
It is used in the final steps of an analysis
Inferential Statistics
Using descriptive statistics (mean, variance, etc.),

we can conduct an analysis
An analysis (plural analyses) is a mathematical
procedure that takes numerical information
(descriptive statistics) about a population, and
determines a probability
Based on the results of analyses, we make
conclusions about the population, called
inferences.
Probability
Probability is the most basic form of inferential

statistics, and often discussed within the context
of descriptive statistics
Probability is simply the likelihood of a specific
outcome.
e.g. If you flip a coin, there are 2 possible
outcomes: heads or tails. Therefore, the
probability of getting heads is 50%, and 50% for
tails.
Conclusion
Hopefully by now you have a rudimentary
understanding of the terms we use in statistics.
We will use the terms from this lesson quite a bit in
future lessons, so make sure you are clear on the
terms. If you need to, come back to this slide
show, or save it as a reference.
Referring back to these as you go will help you to
better memorize the terms by using them, rather
than just reading them over and over again.

Lesson 1

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Lesson 1

Uploaded by

Copyright:

Available Formats

I.Q.

Data (singular datum) is whatever we

Data are first categorized by numerical, and non

Quantitative data can be interval or ratio

In research, numerical is called QUANTitative, and

Qualitative data can be nominal or categorical

This distinction is irrelevant, as the analyses for

A variable is what we collect data of. It

The population is the complete group from

However, populations are usually to large to

The individual is a single member of the

Before we can do inferential statistics to

The first type of descriptive statistic we calculate

The mean is the standard measure of central

e.g. 3, 5, 6, 30: 30 is extreme (a single score

In cases of extreme scores (called

Smallest to largest is the standard and

Mean and median only work for numerical data, like

We cannot find the average number hair

But, we can find the most common hair colour

Frequency means something specific in

Recall, the mode is the score with the

Later we'll learn how to construct a

Recall that measures of central tendency are

Data comes from a variable, which varies,

A distribution is a graphical representation

Using descriptive statistics (mean, variance, etc.),

Probability is the most basic form of inferential

You might also like