Professional Documents
Culture Documents
2020 - Files Sent To The Students - Compressed
2020 - Files Sent To The Students - Compressed
●
Data
Qualitative Quantitative
Discrete Continuous
VARIABLES
AND
DATA
A
variable
is
a
characteris9c
that
changes
or
varies
over
9me
and/or
for
different
individuals
or
objects
under
considera9on.
For
example,
body
temperature
is
a
variable
that
changes
over
9me
within
a
single
individual;
it
also
varies
from
person
to
person.
Religious
affilia9on,
ethnic
origin,
income,
height,
age,
and
number
of
offspring
are
all
variables—characteris9cs
that
vary
depending
on
the
individual
chosen.
An
experimental
unit
is
the
individual
or
object
on
which
a
variable
is
measured.
A
single
measurement
or
data
value
results
when
a
variable
is
actually
measured
on
an
experimental
unit.
A popula1on is the set of all measurements of interest to the inves9gator.
A
sample
is
a
subset
of
measurements
selected
from
the
popula9on
of
interest.
When
a
variable
is
actually
measured
on
a
set
of
experimental
units,
a
set
of
measurements
or
data
result.
Univariate
data
result
when
a
single
variable
is
measured
on
a
single
experimental
unit.
d 9/6/11 7:28 PM Page 9
If
you
measure
the
body
temperatures
of
148
people,
the
resul9ng
data
are
univariate
Bivariate
data
result
when
two
variables
are
measured
on
a
single
experimental
unit.
1.2 TYPES OF VARIABLES ❍ 9
Mul1variate
data
result
when
more
than
two
variables
are
measured.
●
five
ents
five
variables
were
measured
on
each
student,
resul9ng
in
mul+variate
data.
The second variable measured on the students is gender, in column C. This variable
TYPES
OF
VARIABLES
Qualita1ve
variables
measure
a
quality
or
characteris9c
on
each
experimental
unit.
Quan1ta1ve
variables
measure
a
numerical
quan9ty
or
amount
on
each
experimental
unit.
1.2 TYPES OF VARIABLES ❍ 9
Qualita9ve
variables
produce
data
that
can
be
categorized
according
to
similari9es
The second or
variable
differences
in
kind;
measured onhtheence,
they
aisre
gender,
students o\en
cin
alled
categorical
column C. Thisdvariable
ata.
is somewhat different from GPA, since it can take only one of two values—male (M)
The
variables
gender,
year,
and
major
in
the
table
are
qualita9ve
variables
that
or female (F). The population, if it could be enumerated, would consist of a set of Ms
produce
c ategorical
d ata.
H ere
a re
s ome
o ther
examples:
and Fs, one for each student at the university. Similarly, the third and fourth variables,
•
Poli9cal
year and major, affilia9on:
Republican,
generate nonnumerical Democrat,
data. Year Independent
has four categories (Fr, So, Jr, Sr),
•
Tmajor
and aste
ranking:
has one ecategoryxcellent,
for
good,
eachfair,
poor
undergraduate major on campus. The last variable,
•
Color
current of
an
Mof&M’S®
number units cenrolled,
andy:
brown,
yellow,
red,
is numerically valued,orange,
set
of numbers
green,
balue
generating
rather than a set of qualities or characteristics.
Although we have discussed each variable individually, remember that we have
Quan9ta9ve
variables,
o\en
represented
by
the
leber
x,
produce
numerical
data,
such
as
those
listed
here:
A
discrete
variable
can
assume
only
a
finite
or
countable
number
of
values.
A
con1nuous
variable
can
assume
the
infinitely
many
values
corresponding
to
the
points
on
a
line
interval.
The
name
discrete
relates
to
the
discrete
gaps
between
the
possible
values
that
the
variable
can
assume.
Variables
such
as
number
of
family
members,
number
of
new
car
sales,
and
number
of
defec9ve
9res
returned
for
replacement
are
all
examples
of
discrete
variables.
On
the
other
hand,
variables
such
as
height,
weight,
9me,
distance,
and
volume
are
con+nuous
because
they
can
assume
values
at
any
point
along
a
line
interval.
clearly and understandably to your audience!
●
Data
Qualitative Quantitative
Discrete Continuous
type of GRAPHS
variableFOR
youCATEGORICAL
have measured. DATA
When the variable of interest is qualitative or categorical, th
What
categories
of
the values
of
the
vbeing
ariable
have
been
measured
considered along with a measure of
•
How
o\en
each
value
has
occurred
occurred. You can measure “how often” in three different way
A
sta+s+cal
table
can
be
used
to
display
the
data
graphically
as
a
data
distribu1on.
• The frequency, or number of measurements in each cat
•
The
frequency,
or
number
of
measurements
in
each
category
•
The
rela1ve
• frequency,
The relativeor
propor9on
of
measurements
frequency, in
each
category
or proportion of measurements
•
The
percentage
of
measurements
in
each
category
• The percentage of measurements in each category
If
you
let
Ifn
you
be
the
lettotal
n be number
the totalof
measurements
number of in
tmeasurements
he
set,
you
can
find
in
the
the
rela9ve
set, you
frequency
and
percentage
using
these
rela9onships:
quency and percentage using these relationships:
Frequency
Relative frequency ! ""
n
Percent ! 100 # Relative frequency
The
sum
of
the
frequencies
is
always
n,
the
sum
of
the
rela9ve
frequencies
is
1,
and
the
sum
of
the
percentages
is
100%.
Example:
In
a
survey
concerning
public
educa9on,
400
school
administrators
were
asked
to
rate
the
quality
of
educa9on
in
the
United
States.
Their
responses
are
summarized
in
Table
1.1.
Construct
a
pie
chart
and
a
bar
chart
for
this
set
of
data.
Solu9on
To
construct
a
pie
chart,
assign
one
sector
of
a
circle
to
each
category.
The
angle
of
each
sector
should
be
propor9onal
to
the
propor9on
of
measurements
(or
rela+ve
frequency)
in
that
category.
Since
a
circle
contains
360°,
you
can
use
this
equa9on
to
find
the
angle:
Angle
︎=
Rela9ve
frequency
X
︎
360°
The visual impact of these two graphs is somewhat different. The pie chart is used to
display the relationship of the parts to the whole; the bar chart is used to emphasize the
actual quantity or frequency for each category. Since the categories in this example are
ordered “grades” (A, B, C, D), we would not want to rearrange the bars in the chart to
change its shape. In a pie chart, the order of presentation is irrelevant.
●
1.3 D A
B 260 260/400 ! .65 65%FOR CATEGORICAL
1.3 GRAPHS 234.0º DATA ❍ 13
C 93 93/400 ! .23 23% 82.8º
D 12 12/400 ! .03 3% 10.8º
Calculations for the Pie Chart in Example 1.3
● Total 400 1.00 100% 360º
Rating Frequency Relative Frequency Percent Angle
A 35 35/400 ! .09 9% .09 " 360 ! 32.4º
B 260
The visual 260/400
impact .65 two graphs
of !these 65% is234.0º
somewhat different. The pie chart is used
C 93 93/400 ! .23 23% 82.8º
D
display
12
the relationship of
12/400 ! .03
the parts
3%
to the whole;
10.8º
the bar chart is used to emphasize t
Total
actual400quantity or
1.00
frequency for each
100%
category.
360º
Since the categories in this example a
ordered “grades” (A, B, C, D), we would not want to rearrange the bars in the chart
change its shape. In a pie chart, the order of presentation is irrelevant.
The visual impact of these two graphs is somewhat different. The pie chart is used to
3 display the relationship of the parts to the whole; the bar chart is used to emphasize the
● quantity or frequency for each category. Since the categories in this example are
actual
mple 1.3 D
we would A8.8%
ordered “grades” (A, B, C, D), 3.0% not want to rearrange the bars in the chart to
change its shape. In Ca pie chart, the order of presentation is irrelevant.
23.3%
●
.3 D A
3.0% 8.8%
C
23.3%
B
65.0%
4
● B
ordered “grades” (A, B, C, D), we would not want to rearrange the bars in the chart to B
Pie
chart
In a pie chart, the order of presentation is irrelevant.Bar
Diagram
65.0%
change its shape.
FIGURE 1.4
● ●
D A
Bar chart for Example 1.3
3.0% 8.8% 250
C
23.3% 200
Frequency
150
100
50
0
A B C D
Rating
B
65.0%
150 the bars. To emphasize that brown is the most frequent col
quan9ty
or
frequency
for
each
category.
orange, we Since
ordertthehe
barscategories
from largest to smallest and crea
in
this
example
are
ordered
“grades”
bar(chart
A,
Bin,
Cwhich
,
D),
thewbars
e
ware
ould
100
ordered from largest to sm
50
not
want
to
rearrange
the
bars
in
the
chart
to
change
its
0
shape.
In
a
pie
chart,
the
order
of
presenta9on
is
irrelevant.
A B C D
Rating
1.1
Experimental
Units
Iden9fy
the
experimental
units
on
which
the
following
variables
are
measured:
a.
Gender
of
a
student
b.
Number
of
errors
on
a
midterm
exam
c.
Age
of
a
cancer
pa9ent
d.
Number
of
flowers
on
an
azalea
plant
e.
Color
of
a
car
entering
a
parking
lot
ANSWER:
a:
student
A
variable
is
a
characteris9c
that
changes
or
varies
over
9me
and/or
for
different
B:
exam
individuals
or
objects
under
considera9on.
C:
pa9ent
D:
Plant
E:
car
dren in the class, respectively. Variable 4, the winnin
1.2
Qualita9ve
or
Quan9ta9ve?
Iden9fy
each
variable
as
quan9ta9ve
or
horse, is the only continuous variable in the list. The wi
sured with sufficient accuracy, could be 121 seconds, 1
qualita9ve:
or any values between any two times we have listed.
FIGURE 1.2
Answer:
Types of data
●
Data
Discrete:
a,d
Qualitative
Quantitative
Con9nuous:
b,c,e
Qualita1ve
variables
measure
a
quality
or
Discrete Continuous
Answer:
(a) Sample
;
(b)
age
of
the
death
(
c)
quan9ta9ve
A
popula1on
is
the
set
of
all
measurements
of
interest
to
the
inves9gator.
A
sample
is
a
subset
of
measurements
selected
from
the
popula9on
of
interest.
Qualita1ve
variables
measure
a
quality
or
characteris9c
on
each
experimental
unit.
Quan1ta1ve
variables
measure
a
numerical
quan9ty
or
amount
on
each
experimental
unit.
1.7
Voter
Awtudes
You
are
a
candidate
for
your
state
legislature,
and
you
want
to
survey
voter
awtudes
regarding
your
chances
of
winning.
Iden9fy
the
popula9on
that
is
of
interest
to
you
and
from
which
you
would
like
to
select
your
sample.
How
is
this
popula9on
dependent
on
9me?
Answer:
The
popula9on
is
the
set
of
voter
preferences
for
all
voters
in
the
state.
Voter
preference
may
change
over
9me.
A
popula1on
is
the
set
of
all
measurements
of
interest
to
the
inves9gator.
A
sample
is
a
subset
of
measurements
selected
from
the
popula9on
of
interest.
1.8
Cancer
Survival
Times
A
medical
researcher
wants
to
es9mate
the
survival
9me
of
a
pa9ent
a\er
the
onset
of
a
par9cular
type
of
cancer
and
a\er
a
par9cular
regimen
of
radiotherapy.
a.
What
is
the
variable
of
interest
to
the
medical
researcher?
b.
Is
the
variable
in
part
a
qualita9ve,
quan9ta9ve
discrete,
or
quan9ta9ve
con9nuous?
c.
Iden9fy
the
popula9on
of
interest
to
the
medical
researcher.
d.
Describe
how
the
researcher
could
select
a
sample
from
the
popula9on.
e.
What
problems
might
arise
in
sampling
from
this
popula9on?
Answer:
a:
survival
9me
of
a
pa9ent
having
par9cular
type
of
cancer
b:
Quan9ta9ve
con9nuous
c:
Due
onset
of
par9cular
cancer
type
,
pa9ents
have
just
started
a
par9cular
regimen
of
radiotherapy
d:
calculate
mean
survival
9me
of
pa9ents(x);
y>x,
y=x,
y<x.
e:
It
depends
on
many
factors:
gender,
age,
co-‐morbidity
and
physiological
reason.
1.9
New
Teaching
Methods
An
educa9onal
researcher
wants
to
evaluate
the
effec9veness
of
a
new
method
for
teaching
reading
to
deaf
students.
Achievement
at
the
end
of
a
period
of
teaching
is
measured
by
a
student’s
score
on
a
reading
test.
a.
What
is
the
variable
to
be
measured?
What
type
of
variable
is
it?
b.
What
is
the
experimental
unit?
c.
Iden9fy
the
popula9on
of
interest
to
the
experimenter.
Answer:
a) Score
on
the
reading
test;
quan9ta9ve
b) Students
c)Set
of
scores
for
all
the
deaf
students
who
hypothe9cally
might
take
the
test
A
popula1on
is
the
set
of
all
measurements
of
interest
to
the
inves9gator.
A
sample
is
a
subset
of
measurements
selected
from
the
popula9on
of
interest.
BASIC TECHNIQUES
BASIC
TECHNIQUES
1.10
1.10Fi\y
Fiftypeople
people are
grouped
are groupedinto
four
into categories—
four categories— A,
B,
C,
and
A, B, D—and
C, and the
D—and
number
othe
f
people
number who
offall
peopleinto
each
who category
fall
is
into shown
the
table:
is shown in the table:
eachin
category
a.
What
is
the
experimental
unit?
Category Frequency b.
What
is
the
variable
being
measured?
Is
it
A 11 qualita9ve
or
quan9ta9ve?
B 14 c.
Construct
a
pie
chart
to
describe
the
data.
C 20 d.
Construct
a
bar
chart
to
describe
the
data.
D 5 e.
Does
the
shape
of
the
bar
chart
in
part
d
change
depending
on
the
order
of
presenta9on
of
the
four
a. What is the experimental categories?
unit? Is
the
order
of
presenta9on
important?
f.
What
propor+on
of
the
people
are
in
category
B,
C,
or
b. What is the variable being measured? Is it
D?
qualitative or quantitative? g.
What
percentage
of
the
people
are
not
in
category
c. Construct a pie chart B?
to
describe the data.
d.Answer:
Construct a bar
a)
category
chart to describe the data.
e.b)
fDoes
requency,
the qshape
uan9ta9ve
of the bar chart in part d change
d)
B ar
chart
will
on
depending be
ythe
-‐axis
order :
frequency
and
x-‐axis
category
of presentation of the four A,
B,
C
and
D
categories? Is the order of presentation important?
Qualita9ve
variables
produce
data
that
can
be
categorized
according
to
similari9es
or
differences
in
kind;
hence,
they
are
o\en
called
categorical
data.
A
bar
chart
shows
the
same
distribu9on
of
measurements
among
the
categories,
with
the
f. What proportion of the people are in category B, C,
height
of
the
bar
measuring
how
o\en
a
par9cular
category
was
observed.
or D?
BASIC TECHNIQUES
BASIC
TECHNIQUES
1.10
1.10Fi\y
Fiftypeople
people are
grouped
are grouped into
four
into categories—
A,
B,
C,
four categories—
and
A, B, D—and
C, and the
D—and
number
othe
f
people
number who
of fall
into
each
people who category
fall
is
into shown
the
table:
is shown
eachin
category a.
What
in ithes
the
table:
experimental
unit?
b.
What
is
the
variable
being
measured?
Is
it
Category Frequency qualita9ve
or
quan9ta9ve?
A 11 c.
Construct
a
pie
chart
to
describe
the
data.
B 14 d.
Construct
a
bar
chart
to
describe
the
data.
C 20 e.
Does
the
shape
of
the
bar
chart
in
part
d
change
depending
on
the
order
of
presenta9on
of
the
four
categories?
D 5 Is
the
order
of
presenta9on
important?
f.
What
propor+on
of
the
people
are
in
category
B,
C,
or
D?
a. What is the experimental unit?
g.
What
percentage
of
the
people
are
not
in
category
B?
b. What is the variable being
measured? Is it
qualitative or Frequency
Catagory
quantitative? Rela1ve
Percent
Angle
c. Construct a pie chart to describe Frequency
the data.
d.A
Construct a bar
11
chart to describe
11/50=
0.22
the data. 22
%
0.22
x360
=
79.2°
what
Too
much
pressure
20
other types20/100=0.2
of questions might
20
you want
72
to
investigate?
Too
much
15
15/100=0.15
15
54
work
Would
not
be
good
at
14
14/100
Fanatics
1.14 Facebook =
0.14
14
social 50.4
The network-
it
EX0114
Too
much
ing site called Facebook has grown quickly since
arguing
5
5/100
=
0.05
5
18
its
Others
inception in 2004. In fact, Facebook’s United States
user base 6
grew from6/100
=
0.06
users6
to 103 million
42 million 21.6
users
Total
between 100
1
2009 and 2010. The table below 100
shows 360
the age
distribution of Facebook users (in thousands) as it
Opini Frequency
Rela1ve
Percent
Angle
(°)
on
Frequency
(%)
b)Would
you
use
a
pie
chart
or
a
Poll
bar
chart
to
graphically
describe
Other
career
40
40/100=0.4
40
144
the
data?
Why?
plans/
no
interest
Too
20
20/100=0.2
20
72
much
pressur
e
Too
15
15/100=0.15
15
54
much
work
Would
14
14/100
=
0.14
14
50.4
not
be
good
at
it
Too
5
5/100
=
0.05
5
18
much
arguing
Others
6
6/100
=
0.06
6
21.6
Total
100
1
100
360
A
pie
chart
is
the
familiar
circular
graph
that
shows
how
the
measurements
are
distributed
among
the
categories.
A
bar
chart
shows
the
same
distribu9on
of
measurements
among
the
categories,
with
the
height
of
the
bar
measuring
how
o\en
a
par9cular
category
was
observed.
Frequency
45
40
35
30
25
20
15
Frequency
10
5
0
Other
Too
much
Too
much
Would
Too
much
Others
career
pressure
work
not
be
arguing
plans/no
good
at
it
interest
Frequency
Other
career
plans/no
interest
a.
Define
the
variable
that
has
been
measured
in
this
table.
b.
Is
the
variable
quan9ta9ve
or
qualita9ve?
c.
What
do
the
numbers
represent?
d.
Construct
a
pie
chart
to
describe
the
age
distribu9on
of
Facebook
users
as
of
January
4,
2009.
e.
Construct
a
pie
chart
to
describe
the
age
distribu9on
of
Facebook
users
as
of
January
4,
2010.
f.
Refertopartsdande.Howwouldyoudescribethe
changes
in
the
age
distribu9ons
of
Facebook
users
during
this
1-‐year
period?
Age As of 1/04/2009 As of 1/04/2010
13–17 5675 10,680
18–24 17,192 26,076
25–34 11,255 25,580
35–54 6989 29,918
55! 955 9764
Unknown 23 1068
Total 42,089 103,086
a. Define
the
variable
that
has
been
measured
in
this
table.
Answer:
Number
of
facebook
users
at
a
age
group
b.
Is
the
variable
quan9ta9ve
or
qualita9ve?
Answer:
Quan9ta9ve
c.
What
do
the
numbers
represent?
Quan9ta9ve,
Discrete
d.
Construct
a
pie
chart
to
describe
the
age
distribu9on
of
Facebook
users
as
of
January
4,
2009.
e.
Construct
a
pie
chart
to
describe
the
age
distribu9on
of
Facebook
users
as
of
January
4,
2010.
f.
Refer
to
parts
d
and
e.
How
would
you
describe
the
changes
in
the
age
distribu9ons
of
Facebook
users
during
this
1-‐year
period?
Age
Frequency
Rela1ve
Percent
(%)
Angle
(°)
Frequency
13-‐17
5675
5675/42089
=
0.134833329
13.48333294
48.53999857
18-‐24
17192
0.408467771
40.84677707
147.0483974
25-‐34
11255
0.267409537
26.74095369
96.2674333
35-‐54
6989
0.166052888
16.60528879
59.77903965
55+
955
0.022690014
2.269001402
8.168405046
Unknown
23
0.000546461
0.054646107
0.196725985
Total
42089
1
100
360
25-‐34
35-‐54
55+
Unknown
Age
Frequency
Rela1ve
Frequency
Percent
(%)
Angle
(°)
0.103602817 10.36028171 37.29701414
13-‐17
10680
100 360
Total
103086
1
Angle
(°)
e.
Construct
a
pie
chart
to
describe
13-‐17
the
age
distribu9on
of
Facebook
18-‐24
users
as
of
January
4,
2010.
25-‐34
35-‐54
55+
Unknown
f.
Refer
to
parts
d
and
e.
How
would
you
describe
the
changes
in
the
age
distribu9ons
of
Facebook
users
during
this
1-‐year
period?
2009
2010
Angle
Angle
(°)
13-‐17
18-‐24 13-‐17
18-‐24
25-‐34
25-‐34
35-‐54 35-‐54
55+
55+
Unknown
Unknown
Facebook
usage
in
these
age
groups
(
13-‐17,
35-‐54,
55+)
have
increased
and
this
age
group
(18-‐24)
has
decreased.
1.15
d. BConstruct
ack
to
Work
a pieHow
chart long
does
it
tthe
to describe ake
age
you
distribution
to
adjust
to
your
b. nIsormal
the barwork
chart drawn accurate
of aFacebook users as of January 4, 2009. the three bars in the correct pro
rou9ne
e. Construct
\er
coming
a pieochart
back
from
vaca9on?
A
b ar
g raph
with
d ata
from
the
other?
Snap-‐shots
sec9on
f
USA
toToday
describe the agebelow:
is
shown
distribution
c. Use a pie chart to describe the
a.
Are
ofall
Facebook users asaof
of
the
opinions
Januaryf4,
ccounted
or
2010.
in
the
table?
graph is more interesting to look
Add
f. aRefer
nother
tocparts ategory
d and if
ne.ecessary.
How would you describe the
b.
Is
tchanges in thedrawn
he
bar
chart
age distributions
accurately?
ofThat
Facebook is,
are
users
the
three
bars
in
the
correct
propor9on
duringtthis 1-year
o
each
period?
other?
c.
U se
a
pBack
1.15 ie
chart
to
describe
to Work How tlong he
opinions.
does it take Which
you gtoraph
is
more
interes9ng
to
look
adjustat?
to your normal work routine after coming back
One
Day 40%
A Few
Days 34%
No
Time 19%
No
9me
No
9me
19
0.19
68.4
Other