9 - Data Analysis
Random sampling vs. random assignment
Random sampling means creating your study sample randomly, by chance. Random selection
results in a representative sample, so you can make generalizations and predictions about a
population's behavior based on your sample.
Random assignment is where study participants are randomly assigned to a study group.
In a single-blind study, the participant does not know whether they are in the experimental
group or the control group. In a double-blind study, neither the participant nor the
researcher knows.
• Then, once you have collected a sample of subjects, you randomly assign them to groups.
• So, sampling happens first, and assignment happens second.
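The order described here (sample first, then assign) can be sketched in Python. This is only an illustration and not part of the worksheet: the population of 1,000 numbered subjects, the sample size of 20, the fixed seed, and the function name `sample_then_assign` are all assumptions made for the example.

```python
import random


def sample_then_assign(population, sample_size, seed=0):
    """Randomly sample subjects, then randomly assign them to two groups.

    All names and sizes are hypothetical; the seed only makes the
    sketch reproducible.
    """
    rng = random.Random(seed)
    # Step 1: random sampling (who takes part in the study at all).
    sample = rng.sample(population, sample_size)
    # Step 2: random assignment (which group each sampled subject joins).
    shuffled = sample[:]
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    experimental, control = shuffled[:half], shuffled[half:]
    return experimental, control


population = list(range(1, 1001))  # hypothetical subject IDs 1..1000
experimental, control = sample_then_assign(population, sample_size=20)
print(len(experimental), len(control))
```

Random sampling decides who is studied, which supports generalizing to the population. Random assignment decides who gets the treatment, which supports cause-and-effect conclusions.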
use a “random sample,” where respondents are chosen entirely by chance from the
population at large.
Margin of error: A percentage that tells you how far your survey results may differ from
the views of the overall population. The smaller the margin of error, the closer you are to
having the exact answer at a given confidence level.
For example, a 60% “yes” response with a margin of error of 5% means that between 55% and
65% of the general population think that the answer is “yes.”
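As a small sketch of the arithmetic in this example, the range is the survey result minus and plus the margin of error, both measured in percentage points. The function name `margin_of_error_range` is made up for illustration.

```python
def margin_of_error_range(estimate_pct, margin_pct_points):
    """Return (low, high) for an estimate and a margin, both in percentage points."""
    return estimate_pct - margin_pct_points, estimate_pct + margin_pct_points


low, high = margin_of_error_range(60, 5)
print(f"between {low}% and {high}%")  # between 55% and 65%
```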
What is 'Standard Deviation'?
Standard deviation is a measure used to quantify the amount of variation or dispersion of a
set of data values. A standard deviation close to 0 indicates that the data points tend to be
very close to the mean (also called the expected value) of the set, while a high standard
deviation indicates that the data points are spread out over a wider range of values.
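To make the idea concrete, here is a short Python sketch comparing two made-up data sets with the same mean. Both lists are hypothetical, and `statistics.pstdev` (the population standard deviation) is one reasonable choice among several.

```python
import statistics

tight = [49, 50, 50, 51]   # hypothetical values close to their mean of 50
spread = [10, 30, 70, 90]  # hypothetical values with the same mean, spread out

sd_tight = statistics.pstdev(tight)    # population standard deviation
sd_spread = statistics.pstdev(spread)

print(statistics.mean(tight), statistics.mean(spread))  # both means are 50
print(sd_tight, sd_spread)  # the second value is much larger
```

Both sets have a mean of 50, but the spread-out set has a far larger standard deviation. This is the comparison that SAT questions usually ask you to make by looking at a table or graph.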
If the true value is x and it has a margin of error of r%, then the confidence interval is
( x(1 − r%) , x(1 + r%) )
Ex: x has a 5% margin of error. So the confidence interval is between x(1 − 0.05) and
x(1 + 0.05), that is, between 0.95x and 1.05x.
Ex: The number of students who came today is 200, with a margin of error of 6%.
200 × (1 − 0.06) = 188 and 200 × (1 + 0.06) = 212, so the confidence interval is between
188 and 212.
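The interval formula ( x(1 − r%), x(1 + r%) ) can be checked with a few lines of Python. This is a minimal sketch, and the function name `relative_interval` is invented for the example.

```python
def relative_interval(x, r_percent):
    """Return (x(1 - r%), x(1 + r%)) for a value x and a relative margin r%."""
    delta = x * r_percent / 100  # size of the margin in the units of x
    return x - delta, x + delta


print(relative_interval(200, 6))  # (188.0, 212.0)
```

Note that here the margin is a percentage of the value x itself (6% of 200 is 12), which is different from a survey margin given in percentage points.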
Mode:
Most repeated data value.
Mean: (Average) → Mean = Total ÷ No. of individual data values
Outliers
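A quick sketch of how an outlier affects these measures, using Python's `statistics` module. The data values are invented for illustration. The point is that one extreme value pulls the mean toward it while the median barely moves.

```python
import statistics

scores = [4, 5, 5, 6, 7]      # hypothetical data set
with_outlier = scores + [60]  # same data plus one extreme value

print(statistics.mode(scores))          # 5   (most repeated value)
print(statistics.mean(scores))          # 5.4 (total / number of values)
print(statistics.median(scores))        # 5
print(statistics.mean(with_outlier))    # 14.5, pulled up by the outlier
print(statistics.median(with_outlier))  # 5.5, barely changed
```

This is why a mean that is much larger than the median suggests a few unusually large values, and a mean that is much smaller than the median suggests a few unusually small values.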
Ms. Mai is one of the music teachers at a high school, which has 1200 students. She selected
50 of her music students at random and asked each whether they play an instrument. Of the 50
students surveyed, 39 play an instrument. Based on the design of the study, which of the
following is the largest group of students to whom the results of Ms. Mai’s survey can be
generalized?
A) All of Ms. Mai’s music students at the high school
B) All of the music students at the high school
C) All of the students at the high school
D) All of the music students in the city

To determine if gender and number of siblings are related to whether 10-year-old children
like to read, Salma selected a random sample of 60 boys with fewer than 2 siblings and a
random sample of 50 girls with 2 or more siblings from the 10-year-old children in her city.
For each child, she recorded the child's age, gender, and whether the child reported liking
to read. Why is it inappropriate for Salma to draw a conclusion from her study?
A) The two samples are not of equal size.
B) The two samples should have been chosen from different cities.
C) There is no upper bound on the number of siblings of the 50 girls.
D) Salma will not be able to tell whether a difference in liking to read is related to the
difference in gender or to the difference in number of siblings.
2 AmrBoard SAT Helper
An athletic trainer wants to determine whether a new exercise regimen improves the
performance of members of high school athletic teams. To test whether the regimen improves
performance, the trainer arranges for all the members of the boys' track team at a local
high school to use it for one season. The trainer will then compare this season’s
performances to performances in previous years. Which of the following would NOT improve
the quality of the study?
A) Randomly assigning half of the track team to use the new regimen while the other half
uses the current regimen
B) Including members of all the boys’ athletic teams at the high school in the study
C) Including members of the girls' track team at the high school in the study
D) Including students who are not on athletic teams at the high school in the study
4 AmrBoard SAT Helper
A survey was given to a random sample of 100 high school students in Cairo. The results of
this survey should be representative of which of the following populations?
A) All high school students in Egypt.
B) All high school students in Cairo.
C) All students in Cairo.
D) All students in Egypt.
reported that they recycle regularly. If the reported percentage is used as an estimate for
the proportion of all residents in the town who recycle regularly, the margin of error is
17%. Which of the following statements is appropriate based on the data provided?
recycle regularly.
C) Between 43% and 77% of all the town residents recycle regularly.
D) Approximately 16.7% of the surveyed residents misstated how often they recycle.
7 AmrBoard SAT Helper
What is the median number of children for all the families surveyed?
A) 0
B) 1
C) 2
D) 3
6 AmrBoard SAT Helper
A survey was taken of the weight of students in the school, and it was found that the mean
weight was 56 kg and the median weight was 65 kg. Which of the following situations could
explain the difference between the mean and median weight in the school?
A) The students have weights that are close to each other.
B) There are a few students whose weights are less than the rest
D) Many students have weights between 56 kg and 65 kg
8 AmrBoard SAT Helper
Based on the survey data, which of the following most accurately compares the expected
total number of
A) The total number of families with 2 children is expected to be equal at the two towns
B) The total number of families with 2 children at Town B is expected to be 3 more than at
Town A
D) The total number of families with 2 children at
168 cm and the mean height was 175 cm. Which of the following situations could explain the
difference between the mean and median height in the school?
other
B) There are a few students whose heights are less than the rest
C) There are a few students whose heights are much more than the rest
D) Many students have heights between 168 cm and 175 cm
There are a total of 300 people at Street A and 200 people at Street B
10 AmrBoard SAT Helper
What is the median number of pets for all the people surveyed?
A) 0
B) 1
C) 2
D) 3
11 AmrBoard SAT Helper
B) The total number of people with 2 pets at Street A is expected to be 25 more than at
Street B
the following is closest to the average (arithmetic mean) number of residents per apartment?
A theater owner wanted to determine whether local residents were more interested in seeing
operas or symphonies. The theater owner asked 85 people who were in a shopping mall on a
Sunday, and 5 people declined to respond. Which of the following factors is the greatest
flaw in the theater owner’s methodology in reaching a reliable conclusion about the local
residents’ performance-viewing preferences?
A) The size of the sample
B) The location in which the survey was given
C) The population of the area
D) The residents who declined to respond
A medical study was conducted in order to determine whether product K could help people
with hearing loss improve their hearing. The administrators of the study selected 200
subjects at random from a large group of people who had severe hearing loss. Half of the
subjects were randomly assigned to be given product K and half were not. The resulting data
demonstrated that subjects who were given product K had significantly improved hearing
compared to those who were not given product K. Based on this study, which of the following
conclusions is most appropriate?
A) Product K will enable all people who take it to significantly improve their hearing.
B) Product K is more effective than all other
The tables below show the distribution of scores of recent quizzes in English and Physics
given to the same 33 students of a particular class.
Which of the following is true about the data provided for the 33 students?
A) The standard deviation of the scores on the English quiz is larger.
B) The standard deviation of the scores on the Physics quiz is larger.
C) The standard deviation of the scores on the English quiz is the same as that of the
Physics quiz.
Half the subjects were randomly selected to consume beverage C and the rest did not consume
beverage C. The results of the study showed that the subjects who consumed beverage C slept
less than those who did not consume beverage C. Based on the design and results of the
study, which of the following statements is the best conclusion?
A) Beverage C will cause more loss in sleep than all other caffeinated beverages.
B) Beverage C will cause a substantial loss in sleep.
C) Beverage C is likely to reduce the amount of sleep of people without sleep disorders.
D) Beverage C will reduce sleep of anyone who consumes it.
17 AmrBoard SAT Helper
The histogram above shows the distribution of the scores of 22 students on a recent biology
test. Which of the following could be the median score
A) 68
B) 71
C) 77
called 5,000 random people between 12 P.M. and 4 P.M. on a Thursday. Of the 5,000 people
called, 3,000 did not answer, and 250 refused to participate. Which of the following was
the biggest flaw in the design of the survey?
A) The time the survey was taken
B) Population size
C) Sample size
D) The fact that the survey was done by telephone
19 AmrBoard SAT Helper
A group of scientists designed a study to test the effectiveness of pesticide P at
eradicating aphids from rose gardens. From a large group of botanists, 400 participants
with aphid-infested rose gardens were randomly selected to participate in the study. Half
of the 400 botanists sprayed their rose gardens with pesticide P, and the other half did
not. The data showed that the rose gardens sprayed with pesticide P had significantly fewer
aphids as compared to those that were not sprayed with pesticide P. Which of the following
is an
A) Pesticide P will decrease the number of aphids in any rose garden.
B) Pesticide P is the best pesticide available for
C) Pesticide P will likely decrease the number of aphids in aphid-infested rose gardens.
D) Pesticide P will kill substantial numbers of
marine biologists. The majority of the sample group were in favor of featuring an aquatic
exhibit in the new wing. Which of the following is true about the board's survey?
A) The sample group should have included more marine biologists.
B) It concludes that a majority of the scientists are in favor of featuring an aquatic
exhibit in the new wing.
C) The sample group is biased because it is not representative of all scientists.
D) The sample group should have consisted only of scientists who are not marine biologists.
There are 35 U.S. Presidents who were 60 years old or younger when they were inaugurated,
as shown in the table above. Based on the table, what was the median age for these 35
presidents?
A) 51
B) 52
C) 54
D) 55
23 AmrBoard SAT Helper
A community advocacy group recently polled 500 people who were selected at random from a
small town and asked each person, “Are you in favor of the referendum to increase the
property tax rate?” Of those surveyed, 67 percent stated that they were opposed to the
property tax referendum. Which of the following statements must be true based on the
results of the poll?
referendum.
III. Of all the people in the town, 67 percent are
21 AmrBoard SAT Helper
A) The average circumference of all the trees in the forest is approximately 35 inches.
Ahmed 6 min 11 sec
Maya 6 min 30 sec
Mohammed 8 min 12 sec
Five students ran a mile, and their times are shown in the table above. If Mohammed’s time
is removed from the data set, which of the following measures will change the least?
A) Mean
B) Median
C) Maximum
D) Range
in which the survey was completed. The survey question and the survey results are shown
below. To what extent do you agree or disagree with the following statement: “I would
recommend this restaurant to others." (Select only one of the five choices.)
Survey Question Results
Response Number of
Total 50
A) The majority of the customers would recommend the restaurant to others.
B) The majority of the customers would not recommend the restaurant to others.
C) The majority of the customers would neither recommend nor not recommend the restaurant
to others.
response choice to calculate a mean rating for the survey question.
Strongly agree 2
Agree 1
Neither agree nor disagree 0
Disagree -1
Strongly disagree -2
What is the mean rating for all 50 responses?
A) 0.24
B) 0.26
C) 0.44
D) 0.92
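A mean rating of this kind is a weighted mean: multiply each response's point value by the number of people who chose it, add the products, and divide by the total number of responses. The sketch below uses the point values from the scale above, but the response counts are HYPOTHETICAL (the worksheet's counts are not reproduced here), so its result is not the answer to the question.

```python
# Point values from the rating scale.
POINTS = {
    "Strongly agree": 2,
    "Agree": 1,
    "Neither agree nor disagree": 0,
    "Disagree": -1,
    "Strongly disagree": -2,
}

# HYPOTHETICAL counts, chosen only so that they add up to 50 responses.
counts = {
    "Strongly agree": 10,
    "Agree": 20,
    "Neither agree nor disagree": 10,
    "Disagree": 5,
    "Strongly disagree": 5,
}


def mean_rating(points, counts):
    """Weighted mean: sum(points * count) / total number of responses."""
    total_responses = sum(counts.values())
    total_points = sum(points[choice] * n for choice, n in counts.items())
    return total_points / total_responses


print(mean_rating(POINTS, counts))  # 0.5 for these made-up counts
```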
According to a study done by the Cairo of labor Statistics on a typical day between the
years 2009 and 2012, about 8% of Egyptians aged 12 and older exercised. The pie chart below
shows the distribution of the length of time those people spent exercising each day.
Based on the data shown, which of the following could be the median length of time spent
exercising each day for those people who exercised?
A) 36 minutes
A community advocacy group recently polled 500 people who were selected at random from a
small town and asked each person, “Are you in favor of the referendum to increase the
property tax rate?” Of those surveyed, 67 percent stated that they were opposed to the
property tax referendum. Which of the following statements must be true based on the
results of the poll?
I. If another 500 people selected at random from the town were polled, 67 percent of them
would state they are opposed to the property tax referendum.
II. If 500 people selected at random from a different town were polled, 67 percent of them
would report they are opposed to the property tax referendum.
III. Of all the people in the town, 67 percent are opposed to the property tax referendum.
A) I only
B) I and III only
C) II and III only
D) None
29 AmrBoard SAT Helper
Ticket Prices by Row Number
Row number    Ticket price
1-2           $25
3-10          $20
her old tires is 0.30 meter, and the radius of each of her new tires is 11% larger than the
radius of one of her old tires. What is the circumference of each new tire, to the nearest
tenth of a meter?
automobiles in the group are listed below.
Automobile    Weight (in pounds)
A             1,950
B             3,250
C             3,350
D             8,550
The removal of which of these automobiles from the group will have the largest impact on
the mean weight of the group of automobiles?
A) Automobile A
B) Automobile B
C) Automobile C
D) Automobile D
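One way to reason about a question like this is to remove each value in turn and see how far the mean moves. The sketch below does that in Python, assuming (since the start of the question is cut off) that the four listed automobiles make up the whole group.

```python
weights = {"A": 1950, "B": 3250, "C": 3350, "D": 8550}  # pounds, from the table


def mean(values):
    values = list(values)
    return sum(values) / len(values)


overall = mean(weights.values())

# For each automobile, measure how much the mean changes when it is removed.
impact = {}
for name in weights:
    remaining = [w for other, w in weights.items() if other != name]
    impact[name] = abs(mean(remaining) - overall)

largest = max(impact, key=impact.get)
print(overall)  # 4275.0
print(largest)  # D
```

The value farthest from the mean (the outlier) has the largest effect, so you can usually answer without computing every case.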
33 AmrBoard SAT Helper
to the value of the mean of the data set?
A) The mean will remain the same.
B) The mean will decrease.
C) The mean will increase.
D) There is not enough information to determine how the mean will change.
35 AmrBoard SAT Helper
The tallest student in Class X is 52 inches tall, and the tallest student in Class Y is 49
inches tall. Each class has 27 students, and the shortest student in each class is the same
height. Which of the following statements must be true?
I. The range of the heights for students in Class X is greater than that in Class Y.
II. The median of the heights for students in Class X is greater than that in Class Y.
Of the 1,600 residents selected at random to be polled, 53% were in favor of the tax
increase. The poll had a margin of error of 4 percentage points. Which of the following
conclusions is most appropriate about all residents of the community, based on the results
of the poll?
A) The percent of all residents who favor the tax increase is 53%.
B) The percent of all residents who favor the tax increase is likely less than 50%.
C) The percent of all residents who favor the tax increase is likely between 49% and 57%.
D) The percent of all residents who favor the tax increase is likely either less than 49%
or greater than 57%.
37 AmrBoard SAT Helper
Employee Absences
Number of days    Number of employees
0                 8
1                 4
2                 3
A) 1
B) 2
C) 4
D) 5
randomly assigned to taste tea in mugs that differed only by color, some white and some
clear. The same type of tea was used in both mugs. The researchers concluded that the mean
flavor intensity rating was significantly higher for those who drank tea in a white mug
than for those who drank tea in a clear mug. Based on this study, which of the following
statements is correct?
A) The color of the mug was the cause of the difference in mean intensity rating for these
volunteers, and this conclusion can be generalized to all tea drinkers.
B) The color of the mug was the cause of the difference in mean intensity rating for these
volunteers, but it is not reasonable to generalize this conclusion to all tea drinkers.
C) It is not reasonable to conclude that the color of the mug was the cause of the
difference in mean intensity rating for these volunteers.
D) It is not possible to draw any conclusions from this experiment because volunteers were
used.
Andria and Marram each collected six rocks, and the masses of the rocks are shown in the
table above. The mean of the masses of the rocks Marram collected is 0.1 kilogram greater
than the mean of the masses of the rocks Andria collected. What is the value of h?
39 AmrBoard SAT Helper
It is often possible to donate money to a charity by mail or by cell phone. The amounts of
5 mail donations and 5 cell phone donations are given in the table below. What
A) 0
B) 10
C) 25
D) 50
41 AmrBoard SAT Helper
The members of a city council wanted to assess the opinions of all city residents about
converting an open field into a dog park. The council surveyed a sample of 500 city
residents who not own dogs. The survey showed that the majority of those sampled were in
favor of the dog park. Which of the following is true about the city council’s survey?
A) It shows that most city residents are in favor of the dog park.
[Bar graph: Number of guests (vertical axis, 0 to 8000) by Age-group (horizontal axis, from
under 1 year to 65 and over)]
Which of the following age groups is closest to 40% smaller in number than the 1 to 4
group?
a) 35 to 39
b) 40 to 44
c) 45 to 49
d) 50 to 54
42 AmrBoard SAT Helper
Of the following, which is closest to the ratio of the number aged 65 and over to the
number aged 30 to 34?
a) 2 to 5
b) 1 to 4
c) 2 to 1
c) 29
d) 22
which sampling method would be the best in estimating this proportion?
a) Research the 200 houses most recently sold in the city
b) Randomly select a small area within the city and research the houses in that area
c) Randomly select 50 houses from all the houses in the city.
d) Randomly select 200 houses from all the houses in the city.
Tuesday - $12.00
Wednesday - $10.00
Thursday - $8.00
Friday - $10.00
Based on the menu above, by how much does the price, in dollars, of the lunch special on
Tuesday exceed the average (arithmetic mean) price of the lunch special for the 5 days
shown? (Disregard the $ sign when gridding your answer. For example, if your answer is
$1.37, grid 1.37)
A radar gun in Abu Dhabi was used to measure the speed of vehicles travelling along a
highway. After all the data were collected, it was found that the
48 AmrBoard SAT Helper
A random sample of 300 popcorn consumers was
likely to represent all potential car buyers?
A) 500 first-time car buyers selected at random
B) 500 potential car buyers selected at random
C) The first 500 first-time car buyers who enter a certain car dealership
D) The first 500 potential car buyers who enter a certain car dealership
deviation for the data were found. The horse with the lowest reported weight was found to
actually weigh 20 kg less than its reported weight. What value remains unchanged if the
four values are reported using the corrected weight?
A) Mean
B) Median
C) Range
D) Standard deviation
Ages of Participants
37 54 41
24 26 15
38 32 48
44 39 68
14 36 76
32 29 73
An article reported the mean age of participants
52 AmrBoard SAT Helper
Near the end of a US cable news show, the host invited viewers to respond to a poll on the
show’s website that asked, "Do you support the new federal policy discussed during the
show?” At the end of the show, the host reported that 28% responded “Yes,” and 70%
responded “No.” Which of the following best explains why the results are unlikely to
represent the sentiments of the population of the
According to a local ordinance, no household in this area may own more than 3 dogs. What is
the approximate percentage of households in this area that are in violation of the
ordinance?
A) 9%
B) 17%
C) 26%
D) 83%
54 AmrBoard SAT Helper
the median number of dogs per household and the average number of dogs per household for
this suburban area. Based on the graph, which of the following statements must be true?
number of dogs.
D) The median could be greater than or less than the average, depending on the population
of the suburban area.
55 AmrBoard SAT Helper
The graph above shows the average daily temperature for City A (which is in the Northern
Hemisphere) and City B (which is in the Southern Hemisphere) over the course of a year.
Which of the following is the most accurate statement about this information?
A) The average annual temperature for City A is greater than the average annual temperature
for City B.
B) The standard deviation of the average daily temperatures for City A is greater than the
standard deviation of the average daily
warmer than the warmest average daily temperature for City A.
D) The warmest average daily temperature for City B is approximately the average annual
temperature for City A.
information
the sake of this study, "high" cholesterol is defined as overall levels higher than 269
mg/dl.
According to the data, approximately how many of the current smokers in the study had
cholesterol levels below 210 mg/dl?
A) 48
B) 72
C) 120
D) 160
57 AmrBoard SAT Helper
Do these graphs provide strong evidence in support of the tested hypothesis?
D) No, because the percentage of ex-smokers with "high" cholesterol levels is not
significantly different than the percentage of current smokers with "high" cholesterol
levels.
The graph above shows the percentage of 5 groups of adults, by age group and employment
level, that live below the poverty line. The data were gathered from a survey of 1,000
randomly chosen adults. Which of the following statistics about the 1,000 surveyed adults
can be most accurately determined from the graph?
A) The percentage of unemployed adults ages 18
The table above shows data for the 2012 campaigns of the incumbents in 6 Congressional
districts.
59 AmrBoard SAT Helper
How much campaign money, in dollars, did the District 5 incumbent spend for each vote
received?
0.27
A researcher is studying the eating habits of all adults in a large city and is interested
particularly in how often those adults eat fast food rather than prepare their own meals.
The researcher asked 350 adult customers at a major chain restaurant about how often they
ate fast food and how often they prepared their own meals. Of these respondents, 120 were
unmarried. Which of the following changes in the survey method would best improve the
reliability of the results?
restaurant
B) Conducting the survey at a farmer's market rather than at a chain restaurant
C) Excluding the results from the unmarried respondents
D) Giving the survey to a group of adults selected at random from public records
A botanist hypothesizes that, in a certain patch of forest, the median diameter of the 360
white oak trees is 42 inches. The histograms below show the data collected on the diameters
of 360 white oak trees. Which graph is consistent with the botanist's
62 AmrBoard SAT Helper
The mean value of all the cars on a used car lot is $11,000 and the median value of these
cars is $7,000. Which of the following offers the best explanation for the discrepancy
between the mean car value and the median car value?
D) There are a few cars that cost much more than the other cars do.
The graph above shows the distribution of Richter scale magnitudes for 200 recent
earthquakes in northwest California. What is the median magnitude of these 200 earthquakes?
A) 1.0
B) 1.5
C) 2.0
D) 2.5
An agriculture class harvested 18 potatoes from the school garden and compiled the weights
of the potatoes in the table above. If the 2-ounce measurement is removed from the data,
which of the following statistical measures of the values listed will change the least?
A) The mean
B) The median
C) The range
D) The total
67 AmrBoard SAT Helper
also take the zinc supplement
B) A group of 200 children with cold symptoms who
number of seedlings that are at least 2 inches high in the entire lawn?
the manhole covers on Street B?
Robert and Louis each have five rose bushes, and the heights of the bushes are shown in the
table above. The mean of the heights of Robert’s rose bushes is 0.3 feet greater than the
mean of the heights of Louis’s rose bushes. What is the value of h?
cover are added to Street B, and if the mean weight, in
that of her daughter. What is the youngest age her daughter could be, in years?
The table above shows the recommended daily intake of calcium, in milligrams (mg), by age
and gender
73 AmrBoard SAT Helper
the least?
A) Mean
B) Median
C) Maximum
D) Range
The table above shows the number of international tourist arrivals, rounded to the nearest
tenth of a million, to the top nine tourist destinations in both
destinations in 2013 than the median number in 2012, to the nearest tenth of a million?
76 AmrBoard SAT Helper
The number of international tourist arrivals in Russia in 2012 was 13.5% greater than in
2011. The number of international tourist arrivals in Russia was k million more in 2012
than in 2011. What is the value of k to the nearest integer?
B) Approximately 40% of all the trees in the forest have circumferences less than 35
inches.
C) Approximately 40% of all the red maple trees in the forest have circumferences less than
35 inches.
D) The majority of all the trees in the forest have
Burj Khalifa 2717
Shanghai Tower 2073
Makkah Royal Clock Tower 1971
's tallest buildings. If a new building is built, what is one possible height, rounded to
the nearest foot, that would make the average height of the four buildings greater than
2210 feet but less than 2211 feet?
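The building-height question is an inequality on a mean: the sum of the four heights, divided by 4, must lie strictly between 2210 and 2211. A brute-force sketch in Python is below. It assumes the new height is a whole number of feet, and the search range of 1 to 4999 feet is an arbitrary choice for the example.

```python
heights = [2717, 2073, 1971]  # feet, the three buildings listed in the table

# Try whole-number heights for the new building and keep those for which
# the mean of the four heights is strictly between 2210 and 2211.
candidates = [
    h for h in range(1, 5000)
    if 2210 < (sum(heights) + h) / 4 < 2211
]
print(candidates)  # [2080, 2081, 2082]
```

The same result follows by algebra: the four heights must total more than 8840 and less than 8844, and the three known heights total 6761, so the new height is between 2079 and 2083.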
1 A 26 A 51 B 76 3
2 D 27 D 52 B 77 C
3 D 28 D 53 B
4 B 29 20 54 C
5 C 30 D 55 B
6 B 31 88 56 C
7 C 32 2.1 57 D
8 A 33 3,4,5 58 C
9 C 34 B 59 2.60
10 D 35 A 60 .27
11 B 36 C 61 D
12 B 37 B 62 D
13 B 38 B 63
14 D 39 B 64
C
15 A 40 2.6 65 C
16 C 41 D 66 B
17 D 42 A 67 D
18 A 43 B 68 5.8
19 C 44 D 69 14
20 C 45 D 70 277,287,297
21 C 46 D 71 800
22 C 47 2.2 72 4
23 D 48 B 73 B
24 B 49 B 74 2081,…
25 C 50 A 75 1.3