Professional Documents
Culture Documents
Apuntes UNIT 1
Apuntes UNIT 1
Apuntes UNIT 1
WHAT IS STATISTICS?
Data is almost always an imperfect measure of what we are interested in because of:
We need to find ways to extract insights from these imperfect measures. Anything we
choose to measure will vary from person to person, from time to time, and from place
to place. We need to extract insights from this variability.
- What can we learn from the way a variable changes from person to person?
- what can we learn from the way a variable changes from place to place?
- Can you think of examples?
Generally, statistical literacy (the ability to understand and critically evaluate statistical
results) is fundamental in the “age of data”
Ex: We measure the height (in cm) of 50 UEM students chosen at random:
Mean= 170, S.D. = 10 ----------------It describes the average height (and
variability) in that group of 50 students
We come up with an objective and clear definition that allows us to assign numbers to
objects.
For example:
WHAT IS A VARIABLE?
TYPES OF VARIABLES
CONTINUOUS: Can take on almost any numerical value using decimals (1,0; 1,1; 1,2…)
Ex: weight
CLASSIFICATION OF VARIABLES
CATEGORICAL VARIABLES:
INTERVAL: it is used with numbers that are equally spaced. Zeros are not
meaningful
Ex: IQ score
RATIO: they are similar to interval variables, but zero has meaning (absence of
the thing you are measuring)
Ex: drug dosage in mg
OPERATIONS
NOMINAL:
We can only count the number of observations In each category and report that
number.
Ex:
- Nationalities: How many people come from France? How many from Spain?
- Marital status: how many people are married? single? Divorced?
ORDINAL:
Ex:
INTERVAL:
We can calculate differences in values, because numbers are meaningful and equally
spaced.
Ex:
- IQ score
- A person with an IQ= 110 is 10 points above the mean (100). 110-100= 10
RATIO:
Same properties as interval data, but we can also multiply and divide values.
EX: We randomly assign patients diagnosed with depression (with similar depression
levels) to two different treatments.
If we see a difference in recovery, then we can say that one treatment is more
effective than the other.
EX: we want to compare cognitive and group therapy but we cannot randomly assign
patients. We need to apply each type of therapy in groups that are given to us
(patients attend different psychotherapy centers)
If we see a difference in recovery, we can say that one treatment is more effective
than the other. But we should also consider that the psychotherapy center has an
effect on results
EX: a recent study found that people who engage in cultural activities (going to
museums, going to the opera, cinema) lived longer than people who did not do so.
SIMPSON’S PARADOX
The total results are affected by the amount of patients in poor health who arrived in
hospital B
Radelet and Pierce (1991): the lower percentage of total death sentences for African-
Americans is due to the fact that cases in which the victim is black are less likely to get
death sentences and same-race violence is more prevalent.