Download as pdf or txt
Download as pdf or txt
You are on page 1of 10

CCMR Educational Programs

Title: Should We Count the Beans…one at a time?


Date Created: August 6, 2006
Author(s): Joan Erickson
Appropriate Level: Introductory statistics or AP Statistics.
Pre or co-requisite with general chemistry
Abstract: We use hands-on activities to demonstrate that sampling with the
appropriate sample size can be a very effective way to make inferences
of a population when it is impossible to measure the entire the
population. We use real-life questions to invoke the students’ thoughts
on how and why we use statistics. We also learn to apply statistical
analyses using the correct vocabulary such as statistics and sampling
distribution, parameters in population, spread, sampling deviation, etc.
Time Requirement: 2 lecture periods, depending on your intended course coverage.
NY Standards Met: • Standard 2 – Information Systems: Students will access, generate,
process, and transfer information using appropriate technologies.
• Standard 3 – Mathematics: Students will understand mathematics
and become mathematically confident by communicating and
reasoning mathematically, by applying mathematics in real-world
settings, and by solving problems through the integrated study of
number systems, geometry, algebra, data analysis, probability, and
trigonometry.
• Standard 4 – Science: Students will understand and apply scientific
concepts, principles, and theories pertaining to the physical setting
and living environment
• Standard 7 – Interdisciplinary Problem Solving: Students will apply
the knowledge and thinking skills of mathematics, science, and
technology to address real-life problems and make informed
decisions.
Equipment/Materi • A see-through jar, colored beads (any color combination as long as
als List you know the number of beads for each color) A paddle with 40
holes. The jar & paddle demo is available through many
commercial teaching resources. Similar jar & paddle lesson plans
are readily available on the internet.
• Graph Paper or poster board for frequency distribution chart
Resources/Credits • Cartoon clips http://www.fotosearch.com/illustration/bean-
Web links counting.html
• Similar lesson concept
http://www.amstat.org/publications/jse/v5n1/schwarz.supp/sampling
bowl.html
• Resources on New York Learning Standards
http://www.emsc.nysed.gov/ciai/mst/math.html
• National Council of Teachers of Mathematics Learning Standards
(NCTM) http://standards.nctm.org/document/chapter7/data.htm
• MALDI-TOF mass spectrometer
http://www.pslc.ws/mactest/maldi.htm
Statistics Lesson Plan
Random Sampling in Action

http://www.fotosearch.com/illustration/bean-counting.html
Lesson Plan:

Notes to Teachers:

• This lesson plan is designed to provide ideas for hands-on demonstrations. The lecture
outline is laid out for the teacher. It is your choice to interject the lecture with certain
activities (See Demos). I use an example from another discipline (chemistry) to probe how
well the students understand the difference between a population and a sampling process,
how to interpret the various shapes of the density curves, and the very fundamental concept
of CLT (central Limit Theorem).
• A printable activity worksheet is included at the end of the lesson plan.
• Keep in mind that it takes a while to build up the frequency table when demonstrate CLT.
I do random sampling and sample mean distribution for CLT in lab, where the students can
do random sampling on Minitab.
• I usually spend another 3-4 lecture periods on CLT after this lesson and get heavily into
σ
the calculations.
n

Recapping Previous Knowledge:

• By now the students have learned words such as population, mean (µ), variance, and sigma
(σ). They also know the meaning of observation, frequency, spread, center, etc.

New concepts and vocabulary Needed for This Lesson:

• Randomness and the need to watch out for biases.


• Sampling processes and sample size.
• How to relate sampling outcome to the population.
• Sample mean distribution and Central Limit Theorem. (If time allows)

Lecture Outline:

• What happens when we can’t capture EVERY object in a population?


• Introduce the concept of random sampling and necessary terminology.
• If we don’t know much about the population, what can we learn by taking random
samples? (Making inferences based on sound information)
• Introduce Central Limit Theorem. I talk about the conditions necessary for CLT to work
even though the population distribution may not be Normal.
• Explain that CLT is all about sampling the population and how we can use CLT to help us
make estimations about an entire population. Examples may vary (see Demo ideas).

Demo #1(Hook):
• Coin-toss. Let heads = 0 and tails = 1. If the coin is fair, ask the students to toss the coin
20 times and get an average of the 20 outcomes. What they think the most probable
outcome should be? (They should know it is about 0.5)
• Verify it by building a frequency table. The peak should be near 0.5.
• (TI-83/84 has a coin toss simulator under APPS. But the degree of fairness is
predetermined by the user.)

Demo #2 (Hands-on, see Activity Worksheet):

• Rolling a die. If the die is fair, what is the most probable average outcome? (I use a
weighted die so the students can’t just guess the average is 3.5)
• Go to the activity worksheet.

Demo #3 (Hands-on, see Activity worksheet):

• Show the jar of beads in class. Don’t tell the students that the orange beads make up X %
of the content. DO tell the students that there are Y beads total in this jar.
• Can we find out how many orange beads there are without having to count the beads one-
by-one?
• Find the most probable average % in the sampling distribution. Call it X %. X ⋅ Y = # of
orange beads.

Question (Probe):

• We know that in chemistry, a polymer is a long chain of a certain molecule. For example,
polystyrene is a polymer that is made of many styrene molecules linked together. Think of
the polystyrene molecules in terms of n ⋅ (C8 H 8 ) where n = integer. Depending on the
value of n, the polystyrene molecules can have various weights. A mass spectrometer can
show us the molecular weight distribution of polystyrene molecules.

Single Styrene n = 1

Styrene dimer n = 2
Polystyrene n= 3

Polystyrene (C8H8)n

• Think of the population of ALL the polystyrenes where 0≤ n < ∞ . Is there a most probable
value of n? In other words, we know polystyrene likes to grow in length, but is there a
finite length for the polymer chain? What do you think the density curve of polystyrenes
looks like?

Is it a symmetric curve? The peak of the curve indicates the most probable
weight of the polystyrene molecule?

Or

Is it a skew- curve where the most probable value of n is on the high side
but there are a few short chains (low molecular weights)?

Or

Is it a skew+ curve where the value of n is on the low side with a few long
chains?

Or

The polystyrene molecules of various weights are equally likely to be


present?
• MALDI-TOF mass spectrometer uses a very simple physics principle F=ma to determine
the masses of the molecules. Namely, the lighter the molecule, the “faster” it flies across
the fixed distance. MALDI gives a frequency distribution of the various weights of the
polymer.
C o rn e ll U n iv e rsity B B A 0 4 8 E3 P u ls e V = 2 1 0 0
Laser = 150
LinearP S _250 0_01Jul2006 8 (0.267) S b (99,10.00 ); S m (S G , 2x15.00); C m (1 :21) TOF LD+
24 56 .4 2.8 7e3
100

2 3 52 .4
2 5 60 .5
2 4 55 .4
2 2 48 .4

2 24 7 .4
2 55 9 .5

2 45 7.4
26 6 4.5

25 6 1.5
Although the actual MALDI spectrometer plot
2 14 4 .4
21 4 3.4

22 4 9.4
23 53 .4
2 6 63 .5
27 68 .5

2 76 9.5
28 7 2.5
is Normalized by energy focusing, it is a good
2 7 67 .5

2 0 40 .3

22 46 .4
2 3 50 .4

24 5 4.4
28 71 .5
2 8 73 .5 example of a sampling distribution when the
25 58 .5

1 93 6.3
2 1 42 .4

2 45 8.4 2 5 62 .4 2 66 6 .5 29 7 6.5
2 9 77 .5 population is large.
%

2 0 38 .3 2 04 1 .3 2 7 70 .5
2 35 4 .4
27 6 6.5 2 8 74 .5
3 08 0 .5
18 31 .2 22 5 0.4
1 93 4.3 29 78 .5
1 9 37 .3 2 87 0 .5
2 24 5 .4 2 34 9.4
21 4 1.3 3 18 4.6
2 4 53 .5
1 7 27 .2 2 03 7.3 2 97 4.5
25 57 .5 25 6 3.4
19 33 .3 20 42 .3 31 8 3.6
23 55 .4 2 6 61 .5
3 2 89 .6
1 72 6.2 3 2 88 .6
18 2 9.2
2 2 51 .4
1 62 3.1 19 3 8.3
1 7 25 .2 1 72 9 .2 33 9 3.6
1 62 2 .1
1 51 9 .0 1 8 34 .2 2 04 3 .3
15 20 .0 16 2 5.1 3 49 7.7

0 m /z
1400 16 00 1 800 2000 2200 240 0 26 00 2800 3000 320 0 3400 3600 3800 400 0 4200

Conclusion:

• Wrap up the activity. Discuss the noticeable trends in the sampling distribution and draw
inferences about the population.
• Discuss the possible biases in the activity. Discuss the concept of “randomness”.
• Where else can we apply CLT to study quantitative data in real life? Ask the students to
give examples and specific ways of measuring the statistics.

Follow-up:

• Assign homework problems. Assign real-life quantitative survey project if there is enough
time in the semester
Random Sampling in Action

http://www.fotosearch.com/illustration/bean-counting.html

Is the Die Fair? Activity---How to Build a Frequency Table

Part One: Concept Check

a.) You have rolled a die before. You know that there are _______ possible outcomes each
time you roll a die.

b.) If the die is fair, the probability of getting a “4” is the same as getting any other number,
which is ________ % chance.

c.) If you rolled the die twice and got a “2” and “5”, the mean (average) of the two outcomes
would be _________.

d.) Assume the die is fair, if you roll it six times and get a different number each time, the
mean of these outcomes will be ___________

e.) What do you suppose the frequency table of the outcome mean values will look like? Draw
a rough sketch.
Part Two: Let’s Roll It!

a.) Roll the die 10 times, record the number you get each time, then calculate the mean.
b.) Repeat the process 3 more times.
c.) Now go up to the chart on the wall, mark an X for each of the four average outcome values.
d.) Does the chart on the wall look like the one you predicted? How or how not so?

_________________________________________________________________________

Beads & Paddle Activity---Random Sampling in Action

Part One: Vocabulary/Concept Review

We are trying to find how many percent of the beads in the jar are orange. Once we have an idea
how what percent it is for the orange beads, with the fact there is a total number of ____________
beads in the jar (ask me), we can estimate how many orange beads there are.
Before you start sampling the beads, think about how the statistical terms are applied in the beads
& paddle activity.

a.) In the case of the beads & paddle activity, what is the population? ____________

b.) The population size = ___________

c.) What is a “sample” in this activity? ______________________________

d.) The sample size n =________

e.) Briefly explain the sampling process in this activity

_________________________________________________________________________

___________________________________________________________

f.) We conduct the sampling process because we want to know

__________________________________________________________________

g.) Each time you sample the jar, what is the lowest possible number of the orange beads that
can be included in the sample? ____________, which is equivalent to _______ %

h.) Similarly, what is the highest possible number of the orange beads you can bring up in each
sample? __________, which is equivalent to __________ %

i.) What you think the sampling distribution of the beads & paddle activity should look like?
Draw a rough sketch.
j.) Use any method to make a guess what percent of the orange beads there are in the jar.
_______%. Compare your guessed value with the value obtained in Part Three a).
Part Two: Building the Sample Mean Distribution

a.) Each person will “dip” into the jar 3 times. Record the number of the orange beads you see
in your sample. Return the beads back in the jar after sampling each time.
Dip #1 Dip #2 Dip #3 Mean %
# of orange beads

# of orange beads

# of orange beads

# of orange beads

# of orange beads

b.) The mean (average # of orange beads) of the 10 samples is _______

c.) Convert it to % by dividing the mean by 40, then multiply by 100 = ________ %

d.) Repeat steps a). through c). 4 more times. Write down the 4 average percents.

Now go up to the chart on the wall, mark an X for each of the five average % values.

If you have more time, sample the beads as many times as you wish, mark more X’s on our chart.
Remember, the more samples averages are taken, the better!

Part Three: Estimate the number of white beads in the jar.

a.) Based on the frequency distribution chart, the peak is at __________ %

b.) Therefore, we estimate the total number of orange beads is about _______________

c.) So, to answer the question asked in the title of this worksheet “Should we count the
beans...one at a time?” Let’s think about what possible factors may prevent us from counting
every object in the population? How does taking samples help us understand the whole
population? Justify your reasons.

______________________________________________________________________________

______________________________________________________________________________

You might also like