Professional Documents
Culture Documents
معدل سايمون
معدل سايمون
معدل سايمون
Previous questions:
1. If the scores on a variable are 11, 14, 18, 20, and 25, the median is?
A. 11 and 14.
B. 19.
C. 23.
D. 18.
9. A researcher is preparing a report and wants to select a measure of central tendency that shows
the most common score in a particular distribution. Which statistic should she select?
A. Mode.
B. None of the above.
C. Median.
D. Mean.
10. A researcher asked a sample of dual career families about the percentage of the family budget
contributed by the wife's job (Y) and the total number of children (X). Pearson's r for this
relationship is -0.34. Which of the following is an appropriate interpretation of these results?
A. For every dollar contributed by the wife, the number of children increases by 0.34.
B. For every additional child, the wife must work longer hours.
C. Every additional child lowers the economic wellbeing of the family.
D. As number of children increase, the percentage of the budget contributed by the wife
decreases.
11. A researcher tested the relationship between age and level of vitamin B12 in blood and found a
coefficient of correlation equal to r = 0.05 This relationship is:
A. Strong.
B. Positive.
C. Weak.
D. Significant but not important.
12. A researcher tested the relationship between age and level of vitamin B12 in blood and found the
coefficient is equal to r = - 0.05 This relationship is:
A. Significant but not Important.
B. Strong.
C. Positive.
D. Weak.
13. When conducting hypothesis tests for paired t test two sample means, the test statistic is?
A. The difference in sample means.
B. Alpha.
C. The degrees of freedom.
D. The difference in the population means.
14. In the Data View in the SPSS Data Editor, each row represents what?
A. A variable.
B. A case.
C. A data point.
GH & FJ. 2 نسألكم الدعاء
15. Contingency table in which one variable had three categories and the other variable had four
categories would have:
A. Nine cells.
B. Six cells.
C. Four cells.
D. Twelve cells.
18. Your score on the test is the same as the third quartile (Q₃). You may conclude that :
A. Your score is ‘typical’ since it is the same value as the median.
B. You scored higher than 25% of the people who took the test.
C. The distribution of the score is skewed.
D. You scored higher than 75% of the people who took the test.
24. The higher the alpha level, the more likely we will:
A. Be unable to decide whether to reject or fail to reject the null hypothesis.
B. Commit a Type II error.
C. Commit a Type I error.
D. Conduct a two-tailed test.
28. If a regression line is parallel to the horizontal axis of the scattergram , the slope ( b ) will be:
A. 0.45.
B. 0.00.
C. 1.00.
D. (-1.00).
29. For a single variable measured at the categorical nominal level, an appropriate graph would be:
A. Bivariate table.
B. Frequency polygon.
C. Pie chart.
D. Histogram.
GH & FJ. 4 نسألكم الدعاء
30. Researcher summarizes the age of 1,000 people by calculating the average age , she is using :
A. A qualitative technique.
B. An ordinal categorical variable.
C. A numerical variable.
D. A categorical variable.
31. To conduct a chi square test , the variables must first be organized into a:
A. Univariate table.
B. Frequency distributions.
C. Bivariate table.
D. Contingency table.
33. In order to conduct a test of hypothesis with means or proportions with parametric analysis. the
sampling distribution...:
A. Bimodal.
B. Large.
C. Normal.
D. Skewed.
35. In the Data View in the SPSS Data Editor, each column represents what?
A. A variable.
B. A case.
C. A data point.
36. A study of traffic safety shows a correlation of 0.57 between average speed of traffic and number
of total accident stretch of highway. This means that:
A. Fatalities tend to increase an average speed increases.
B. Drivers should speed up to get through the areas as quickly as possible.
C. As speed increases, fatalities decrease.
39. The distances (spread) between the scores and the mean are called:
A. Deviation
B. Summations
C. Ranges.
D. Differences.
40. Can be used to examine how a relationship between two variables might be 'explained' by one or
more additional (potentially confounding) variables:
A. Correlation Coefficient.
B. Partial Correlation.
C. Rank Correlation.
D. None of these.
41. Which of the following values could not represent a correlation coefficient?
A. r = 0.99.
B. r = 1.05.
C. r = -0.98.
D. r = 0.
43. Comparing one- and two-tailed tests (with a constant alpha level and sample size), the probability
of rejection will be higher for:
A. The two-tailed test.
B. The one-tailed test, if you have correctly predicted the direction of the difference.
C. Neither, the probability of rejection does not change.
D. The test with the most conservative test statistic.
46. A researcher has analyzed differences in average college board scores for random samples of
students from four different colleges. The obtained F score is 0.45. what can be concluded about
null hypothesis ?
A. Nothing, more information is needed.
B. Reject the null hypothesis, differences did not occur by random chance.
C. Fail to reject the null hypothesis, differences are not statistically significant.
D. Fail to reject the null hypothesis, differences are statistically significant.
50. If you wanted to measure the association between the age of people (in years) and the number of
arrests for driving while intoxicated, the most appropriate measure of association would be.
A. Gamma.
B. Pearson's r.
C. Chi.
D. Lambda.
51. What would be the most appropriate test of significance for r ...... variable and an independent
variable with three or more categories?
A. ANOVA.
B. Chi square.
C. The t test.
53. For the relationship between social class and movie attendance. a researcher found a r of -0.65.
This relationship is:
A. Strong and negative.
B. Moderate and weak.
C. Strong and positive.
D. Negative and weak.
54. The relationship between two variables patriating out the effect that a third variable has on both
variables can be explained using a correlation:
A. Liner.
B. Impartial.
C. Descriptive.
D. Partial.
55. The mean on a standardized test is 100 and the standard deviation is 35. Your score is 65. What
percentage of the scores were higher than yours?
A. About 16%.
B. About 84%.
C. No more than 50%.
D. Incorrect: about 34%.
57. For a normal distribution of 1000 cases, how many cases will be farther away from the mean than
+3 standard deviations?
A. 327.
B. At least 500.
C. About 5.
D. Incorrect: it is impossible to estimate.
60. The ANOVA test uses means and standard deviations to compare the amount of variation ------
----- with the amount of variation ------ --.
A. Within categories - between categories.
B. Above categories - below categories.
C. Within sample means - between sample means.
D. Within-sample standard deviations - between sample standard deviations.
61. The most appropriate measure of central tendency for the variable age (measured in years)
would be the:
A. Mode.
B. Mean.
C. Proportion.
D. Median.
63. When variables vary in opposite directions, the association between variables is said to be:
A. Negative.
B. Positive.
C. Neutral.
D. Contradictory.
68. What measure of central tendency would be most appropriate to summarize data about the
colour of movie stars eye?
A. None of the above.
B. Mean.
C. Standard deviation.
D. Median.
69. A researcher can demonstrate a strong association between age and income. Which variable is
independent (explanatory)?
A. Age.
B. Both.
C. Income.
D. Neither.
71. Chi-square has been used to test the relationship between gender and smoke consumption. The
null hypothesis has been rejected ( p = 0.001 ) What may be concluded:
A. Males are more smokers.
B. Smoke consumption depends on gender.
C. Females are more smokers.
D. Gender and smoke are not related.
72. Application of the linear regression: Vit D and age are associated. can we predict the vitamin D
level of one student of UOS with age 20 years old? Beta (b) = 2.3 constant (a) = 1. ( Y = a + bX )
A. None of them.
B. The Vit D level will be 47.
C. The Vit D level will be 20.
D. The Vit D level will be 34.
75. If a researcher wants to temporarily divide a data file into subgroups that share one or more
common characteristics, which SPSS procedure should he or she use?
A. Split File.
B. Weight Cases.
C. Select Cases.
D. Divide File.
76. If the skewness and kurtosis statistics for a distribution of data are both close to zero, which of
the following is probably true?
A. The normality assumption will probably be violated.
B. The distribution is flat (i.e. rectangle-shaped).
C. The standard deviation of the distribution will be 1.
D. The data are normally distributed.
77. On the variable Cholesterol, the range is 45 and the minimum value is 130. What is the
maximum value of this variable?
A. 85.
B. 200.
C. 175.
78. Value labels are most likely to be necessary for which type(s) of variables?
A. Nominal variables.
B. Ordinal variables.
C. Value labels are always necessary for all types of variables.
D. Scale variables.
81. This statistic reflects the average amount by which scores in a distribution differ from the
mean of the distribution.
A. Standard deviation.
B. Median.
C. Shapiro-Wilk.
D. Range.
86. The variance of the variable (AGE) is 77.01. Define the level of standard deviation.
A. 8.77.
B. Same as variance.
C. Less than 0.05.
D. 5.24.
89. Zainab wants to make comparisons between participants in her sample who drive SUVs, and
those who drive other vehicles However, the vehicle type' variable in her data file lists about 25
individual vehicle models. Seven of these models are SUVs whilst the rest are not. Which SPSS
option will Sophia need to use to create a new variable that distinguishes between SUV and
non-SUV drivers?
A. Compute Variable.
B. Recode into Same Variables.
C. Recode into Different Variables.
D. Any of these SPSS options can be used.
90. In this tab, rows represents individual cases and columns represent variables in your data.
A. Variable view.
B. Data editor.
C. Data view.
91. An analysis of the relationship between income and education found a correlation of 0.43.
After controlling for age, the partial correlation was 0.43. The relationship between income
and education seems to be:
A. Direct.
B. Spurious.
C. Intervening.
D. Interactive.
92. The Leven’s test is used for testing the equality of the variances in
A. Independent t-test.
B. Correlation.
C. Regression.
D. Partial correlation.
95. Frequencies (Analyze > Descriptive Statistics > Frequencies) is primarily used for:
A. To create frequency tables, bar charts, and pie charts for a single categorical variable.
B. None of them.
C. To capture the dependent nature of this data-and that's where multiple response sets come in.
D. To create contingency tables, which describe the interaction between two categorical variables
this tutorial covers the descriptive statistics aspects of the crosstabs procedure including and
row. Column, and total percents.
96. Smokers with lower level with vitamin D less than 30 are obese in which %? What is your
strategy of analysis?
A. Select cases: vitamin D less than 30 split file: BMI obese and not obese descriptive analysis,
frequencies, table of frequencies, check the prevalence of smokers with obesity.
B. Select cases: vitamin D more than 30 split file: BMI obese and not obese descriptive analysis,
frequencies, table of frequencies, check the prevalence of smokers with obesity.
C. Select cases: vitamin D less than 30 split file: smokers and no smokers descriptive analysis,
mean of BMI in smokers.
D. Select cases: vitamin D less than 30 split file: smokers and no smokers transforming the
numerical variable BMI in categorical variables and creating the variables (BMI highes 30 =
obese) descriptive analysis, frequencies, table of frequencies, check the prevalence of obesity
in smokers.
97. Which measure of central tendency is affected by every score in the distribution?
A. The mode.
B. The mean.
C. The median.
D. All of the above.
98. The sum of the differences between the scores of a distribution and the mean of the scores is
always:
A. 0.
B. None of the above.
C. 1.
D. The same as the number of cases.
102. For testing the difference between two sample means, the level of measurement is assumed to
be:
A. Nominal.
B. Ordinal.
C. Interval-ratio.
D. Any of the above.
103. If two variables are independent, the cell frequencies will be:
A. Exactly the same.
B. Determined by random chance.
C. Concentrated in only two of the cells.
D. Less than the expected frequencies.
104. The distribution which describes a range of data where the scores at either end of the
distribution are the same distance to the mean is the
A. Gamma-distribution.
B. Exponential distribution.
C. Normal distribution.
D. Chi-square distribution.
105. For a normal distribution of 1000 cases, how many cases will be farther away from the mean
than +3 standard deviations?
A. 327.
B. About 5.
C. At least 500.
D. It's impossible to estimate.
106. The one-way ANOVA is used to test statistical hypotheses concerning which of the following?
A. Proportions.
B. Standard deviations.
C. Means.
D. Variances.
GH & FJ. 15 نسألكم الدعاء
107. What does ANOVA calculate?
A. Chi square.
B. F-ratio.
C. T-ratio.
D. Z-ratio.
108. The area beyond + 2 standard deviations contains approximately what % of the area under the
normal curve?
A. 50%.
B. 75%.
C. 99%.
D. 5%.
110. A researcher measured people’s physiological reactions while watching a horror film and
compared them to when watching a comedy film, and a documentary about wildlife. Different
people viewed each type of film. The resulting data were normally distributed and the variances
across groups were similar. What test should be used to analyse the data?
A. Repeated-measures analysis of variance.
B. Kruskal–Wallis test.
C. Friedman’s ANOVA.
D. Independent analysis of variance.
111. A researcher was interested in stress levels of lecturers during lectures. She took the same
group of 8 lecturers and measured their anxiety (out of 15) during a normal lecture and again in
a lecture in which she had paid students to be disruptive and misbehave. What test is best used to
compare the mean level of anxiety in the two lectures?
A. Independent samples t-test.
B. Paired-samples t-test.
C. One-way independent ANOVA.
D. Mann–Whitney test.
112. Which of the following values could not represent a correlation coefficient?
A. r = 0.99.
B. r = 1.05.
C. r = -.98.
D. r=0
114. What does the error bar on an error bar chart represent?
A. The confidence interval around the mean.
B. The standard error of the mean.
C. The standard deviation of the mean.
D. It can represent any of these.
115. A researcher measured the same group of people’s physiological reactions while watching
horror films and compared them to when watching erotic films. The resulting data were skewed.
What test should be used to analyse the data?
A. Independent t-test.
B. Wilcoxon signed-rank test.
C. Dependent (related) t-test.
D. Mann–Whitney test.
116. A researcher measured the same group of people’s physiological reactions while watching
horror films and compared them to when watching erotic films, and a documentary about
wildlife. The resulting data were skewed. What test should be used to analyse the data?
A. Independent analysis of variance.
B. Repeated-measures analysis of variance.
C. Friedman’s ANOVA
D. Kruskal–Wallis test.
117. Assuming the assumptions of parametric tests are met, non-parametric tests, compared to their
parametric counterparts:
A. Are all of these.
B. Are more conservative.
C. Are less likely to accept the alternative hypothesis.
D. Have less statistical power.
118. Which post hoc test would be most suitable for use with non-equal groups?
A. Bonferroni.
B. Duncan.
C. Tukey.
D. LSD.
119. How many levels must there be in one independent variable for an ANOVA to be used?
A. 3.
B. 2.
C. 1.
D. 4.
GH & FJ. 17 نسألكم الدعاء
120. To determine whether the test statistic of ANOVA is statistically significant, it can be
compared to a critical value. What two pieces of information are needed to determine the critical
value?
A. Sample size, number of groups.
B. Mean, sample standard deviation.
C. Expected frequency, obtained frequency.
D. MSTR, MSE.
121. Determine whether the following scenario should use a one-way ANOVA test or two-way
ANOVA test:
A nutritionist is studying the effects of diet on cholesterol in men and women. She has data which
reports cholesterol levels for men and women for three different diets (low-fat low calorie, Adkins
diet, Mediterranean diet).
A. one-way ANOVA test.
B. two-way ANOVA test.
122. Determine whether the following scenario should use a one-way ANOVA test or two-way
ANOVA test:
A researcher want to determine whether there is an interaction between physical activity level
123. Determine whether the following scenario should use a one-way ANOVA test or two-way
ANOVA test:
A group of psychiatric patients are trying three different therapies: counseling, medication and
biofeedback. They want to see if one therapy is better than the others.
124. Suppose you want to study the effect of sugar on memory for words. You have three groups in
your experiment, sugar, a little sugar and no sugar. Each participant only participates in one
condition of the experiment. The data collected from each group are as shown above.
Calculate the degree of freedom for the test.
A. 12, 2.
B. 2, 12.
C. 3, 15.
D. 15, 3.
127. How many independent and dependent variables are there in a One-Way Within Groups
ANOVA?
A. 1 independent variable and 1 dependent variable.
B. 2 independent variables and 1 dependent variable.
C. 1 independent variable and 3 dependent variables.
D. 1 independent variable and 2 dependent variable.
129. You are analyzing a scatter plot that has a negative correlation. What is true of the of the x and
y values?
A. As x increases, y decreases.
B. As x increases, y increases.
C. As x decreases, y decreases.
132. What would the correlation be between math grades and the time it takes to run a mile?
A. Positive Correlation.
B. Negative Correlation.
C. No Correlation.
D. Cannot Be Determined.
135. There is no linear relationship between two interval ratio variables when the regression:
A. Is parallel to the horizontal axis.
B. Does not touch all of the scores.
C. Is not an equal distance from all the scores.
D. Is bimodal.
136. What does Levene's test examine?
A. Differences between one group and the population.
B. Differences in variance scores across two within-group conditions.
C. Homogeneity of variances between two groups.
D. Differences in dependent variable scores between two groups.
137. When would you use the levene’s test?
A. To test for homogeneity of variance.
B. To test independence of errors.
C. To test for normality.
D. To test for differences in pairs of means.
138. What does a paired t-test measure?
A. Outcomes from two parametric DVs, measured across a single group
B. Outcomes from a parametric DV, measured across three conditions of a single group
C. Outcomes from a non-parametric dependent variable (DV), measured across two conditions
of a single group.
D. Outcomes from a parametric DV. measured across two conditions of a single group.
142. When the null hypothesis in the chi square test for independence is true, there should be:
A. Little difference between the observed frequencies and the expected frequencies.
B. No difference between the observed frequencies and the marginals.
C. Large difference between the observed frequencies and the expected frequencies.
D. No difference between the row and the column marginals.
146. An analysis on the race of patients who visited an emergency reveals that 4096 are White, 2596
are black, 20% are Native, 996 Americans and 1596 Asian. These data can best be represented
graphically by:
A. Normal Curve.
B. Histogram.
C. Cumulative frequency Curve.
D. Pie Chart.
147. For a normal distribution of 1000 cases, how many cases will be within +1 standard deviations?
A. About 680.
B. About 5.
C. no more than 500.
D. It's impossible to estimate because all normal curves are different.
149. If in a discrete series 25% values are greater than 75, then
A. Q1 > 75.
B. Q3 = 75.
C. Q1 = 75.
D. Q3 > 75.
150. The measurement of spread of a set of data around the central point
A. Measures of kurtosis.
B. Measures of variation.
C. Measures of central tendency.
D. Measures of skewness.
151. What symbol represents the test statistic for the Mann–Whitney test?
A. Ws.
B. T.
C. U.
D. H.
156. Which of the following assumptions is not made for the F test for comparing three of more
means?
A. The populations from which the samples were obtained must be normally distributed.
B. The samples must be independent of each other.
C. The sample sizes must be equal.
D. The variances of the populations must be equal.
158. Which of the following is an assumption that must be made in Step 1 of the ANOVA test?
A. Independent random samples.
B. Dependent random samples.
C. Level of measurement is nominal.
D. Stratified random samples.
159. The direction of a relationship between variables can be detected on a scatter plot by
considering the angle of the regression line. The relationship is
A. Positive when the line slopes upward from left to right.
B. Positive when the line slopes downward from left to right.
C. Negative when the line slopes upward from left to right.
D. Negative when the line slopes downward form right to left.
163. For a relationship involving education and library use, pearson’s r was + 0.37. This indicates
the:
A. Library use increases with education.
B. The well educated are more likely to purchase books.
C. As education goes up, people are less likely to go to the library.
D. People with limited funds have greater need for free public libraries.
166. A researcher is analyzing regional differences in family size. She has information on number of
children for samples of families from two regions. Which of the following would be an
appropriate statistical test?
A. ANOVA.
B. t test for difference in means.
167. A researcher has computed a Pearson's r of -0.45 between martial happiness and number of
children. What can she conclude?
A. Families with more children are happier. This is a moderate relationship.
B. Children have a strong, positive influence on levels of martial happiness.
C. There is a moderate, negative relationship between number of children and martial
happiness.
D. There is a weak, negative relationship between the variables.
168. An analysis of variance (ANOVA) differs from a test for independent samples t-test in that an
ANOVA :
A. is used to compare the means of three or more groups whereas an independent samples t-test
is used to compare two groups.
B. is usually used to compare the means of two groups but an independent samples t-test is used
to compare three groups.
C. figures variances as part of the analysis whereas an independent samples t-test does not figure
variances.
D. is conducted before the experiment whereas an independent samples t-test is conducted after
the experiment.
GH & FJ. 24 نسألكم الدعاء
169. A post hoc test is:
A. A test to compare two or more means in one overall test.
B. A follow-up test to the analysis of variance when there are three or more groups.
C. Standard error.
D. A follow-up test to the independent t-test.
174. Which of the following would be used to check that the variances were equal?
A. Independent-samples t-test.
B. Kolmogorov Smirnov test.
C. Paired t-test.
D. Levene's test.
178. In the Chi square test, expected frequencies are computed by:
A. Adding the observed frequency to N and subtracting the number of ..
B. Multiplying the observed frequencies by the row margins.
C. Multiplying the proper row and column marginals for each …
D. Adding the proper row and column marginals together and …
179. Which assumption about level of measurement is made for the Chi square test?
A. All variables are nominal in level of measurement.
B. At least one variable must be ordinal in level of measurement.
C. All variables are at least ordinal in level of measurement.
D. All variables are at least interval-ratio in level of measurement.
180. When you are going to analyze one database with variables normally distributed, which
statistical analysis are you going to apply?
A. Non-parametric tests.
B. Frequencies.
C. Only descriptive statistics.
D. Parametric test.
185. Which of the following correctly states the relationship between SST (the total sum of squares),
SSB (the sum of the) (incomplete question)
A. SST= SSB + SSW.
B. SST = SSB – SSW.
C. SSW = SSB + SST.
D. SSB + SST – SSW.
186. Which measure of central tendency is more representative of the typical observation if the
graph of the data is skewed to the right? [ Median ]
187. When data contains extreme values, which of the following is the poorest measure of central
tendency? [ Mean ].
188. For a set of data which contains values, the best measure of a central tendency among the
following is the [ Mean ].
189. If in a discrete series 15% values are greater than 70, then: [ P85 = 70 ].
190. When there is the possibility to estimates the marginal means of PCR in 3 different vaccines
groups. we are applying the [ Estimates Dependent Variable: PCR 1 ].
191. How is a variable label different from a variable name? [ It is longer and more detailed ].
192. How is a variable name different from a variable label? [ It is shorter and less detailed ].
193. Most of students in a class are underweight. what is the shape of this distribution? [ Positively
skewed ].
194. Most of students in a class are overweight. What is the shape of distribution? [Negatively
skewed]
196. Which of the following is mainly used to explore the relationship between two numerical
variables [ Scatter plot ].
197. Which statistical test is used to identify whether there is a relationship between two numerical
variables? [ Pearson's correlation test ].
199. Which statistical test is used to identify whether there is a relationship between two categorical
variables? [ chi-square test ].
200. A frequency distribution in which high scores frequent is said to be: [ Negatively skewed ].
201. A frequency distribution in which low scores are most frequent is said to be [Positively skewed]
203. Analyze the prevalence of diabetes (glicCAT) in Men with vitamin D deficiency (vitD < 20).
Form select cases should [ vitD ].
204. Analyze the prevalence of diabetes (glicCAT) in Men with vitamin D deficiency (vitD < 20).
Form this question, spilt file should be done for which variable? [ Gender ].
205. Analyze the prevalence of diabetes (glicCAT) in Men with vitamin D deficiency (vitD < 20).
Form this question, frequency table should be done for which variable? [ glicCat ].
206. Analyze the prevalence of overweight patients (BMICAT)who are smokers with vitamin D
deficiency (vitD0<20).Form this question, spilt file should be done for which variable? [ Smoker ].
207. Analyze the prevalence of overweight patients (BMICAT) who are smokers with vitamin D
deficiency (vitD0<20). Form this question, select cases should be done for which variable?[ vitD ].
208. Analyze the prevalence of overweight patients (BMICAT) who are smokers with vitamin D
deficiency (vitD0<20). Form this question, frequency table should be done for which variable?
[ BMICat ].
209. What is an outlier ? [ Outliers are values very different from the rest of the data ].
210. Which of the following is used for entering and viewing data? [ Data view ]
211. John has a test score of 64, which is at the 90th percentile. This means that: [ 90% of people
score lower than 64 ]
212. If all the scores on examination cluster around the mean, the dispersion is said to be: [ Small ]
213. A graph that shows the pattern of the relation of two variables is a [ Scatter plot ].
214. The interquartile range is. [ The difference between the third quartile and the first quartile ]
215. The possible relationship between pairs of data values could be examined from a [Scatter plot].
217. In SPSS, what is the ”data viewer”? [ A spreadsheet into which data can be entered ].
218. The descriptive measure of dispersion that is:[ the standard deviation ]
221. A standard normal distribution has which of the following properties?[ The mean is equal to 0
and the variance is equal to 1 ].
222. Which test may be used if the conditions of the chi-square test are not true? [ Fisher's exact
test ]
223. If a test was generally difficult, except for a few students who had very high scores, then the
distribution of scores would be: [ Positively skewed ].
224. If a test was generally very easy, except for a few students who had very low scores, then the
distribution of scores would be [ Negatively skewed ].
225. What does the operation "Recode Into Different Variables" do to the data? [ Redistributes a
range of values into a new set of categories and creates a new variable ]
226. The measure of variation that is not influenced by extreme values is: [ the interquartile range ].
227. The relationship between two variables controlling for the effect that a third variable has on
one of those variables can be expressed using a: [ Semi-partial correlation ].
228. Which central values can be used in the case of ordinal categorical variables?
[ Only the mode and the median can be used ].
229. How would you use the drop-down menus in SPSS to generate a frequency table?
[ Click on: Analyze; Descriptive Statistics: Frequencies ].
230. Why might you tell SPSS to represent the "slices" of a pie chart in different patterns?
[ If you do not have a colour printer, it makes the differences between the slices clearer. ]
231. Which measure of central tendency may not exist for all numeric data set ? [ MODE ].
232. The measure of central tendency which is the most likely to be influenced by extreme values in
the data set is the [ Mean ].
234. Post hoc tests [tests you do after ANOVA when you didn't know what to expect, Tukey test].
235. Which statement about the relationship between effect size and statistical significance is true?
[The smaller the effect size, the larger the sample has to be in order to have a statistical…]
236. can be used to examine how a relationship between two variables might be 'explained by
237. What are the four levels of measurement? [ Nominal, ordinal, interval, ratio. ]
– 77 – 85 – 90 – 95 ]. [ 73. ]
240. What term is used to describe the science of organizing and analyzing information to make the
241. Which of the following would be used to describe the type of statistical methods used to
242. Which of the following would be used to describe the type of statistical methods used to make
243. The mode measures central tendency in terms of which of these? The most common score.
244. Which of the following is the easiest way to describe data? Average.
245. Which of the following measures of central tendency is the midpoint for a set of scores?
Median.
247. To identify the point in a distribution at which 50% of scores fall above and 50% fall below a
given score, which measure of central tendency would you report? Median.
248. If you wanted to identify the most frequently occurring score in a distribution, which measure
of central tendency would you report? Mode.
249. The median is the score of the middle case when the number of cases in the data set is which of
the following? An odd number.
250. In the formula for computing the mean, what does the letter X represent? Individual scores.
251. In the formula for computing the mean, what does the letter "n" represent? Sample size.
252. Which of the following is correct formula for calculating the mean? ZX/n.
253. The average and which of the following can be used to describe the characteristics of a
distribution and show how distributions differ from one another? Variability.
254. Variability is a measure of how much each score in a group of scores differs from what? Mean.
255. The three measures of variability commonly used to reflect the spread or dispersion in a group
of scores are the standard deviation, the variance, and what? Range.
256. When subtracting the largest number in a distribution from the smallest, what type of
variability is being calculated? Range.
258. In the formula for computing the range, what does the letter "h" represent? Highest score.
GH & FJ. 30 نسألكم الدعاء
259. When calculating the standard deviation, what must be done in order to obtain an unbiased
estimate of the population? Subtract 1 from n.
261. What will the sum of the deviations from the mean always equal? 0.
263. Which of the following is one way to represent variance? s^2 (s to the power of 2).
264. What is the most frequently used measure of variability? Standard deviation.
266. In testing for the significance of the difference between two sample means when the sample size
is small, we may assume equal population variances if: Sample sizes are approximately equal.
267. If a difference between random samples is not statistically significant, it is probably On the
268. Measures of central tendency (like the mean) and measures of dispersion (like the range)
269. The index of qualitative variation (IQV) is the ratio of the actual variation in the scores to:
270. The index of qualitative variation (IQV) varies from 0.00 to 1.00. Which of the IQV's below
271. The second quartile (Q2) is equal in value to: The median.
272. If a test score lies at the first quartile, it is: Higher than 25% of the scores.
273. The average deviation (AD) is based on: Ignoring the signs of the differences between the
276. The tails of the theoretical normal curve: Never touch the horizontal axis.
277. Unlike empirical distribution, the theoretical normal curve is: Perfectly symmetrical.
278. On all normal curves the area between the mean and +-1 standard deviation will be: About
total area.
280. Converting scores into Z scores standardizes the original distribution to units of the: Standard
deviation.
281. The standardized normal distribution (or Z distribution) has: A mean of 0 and a standard
deviation of 1.
282. If a z score is +1.00 then the value of the corresponding raw score would be: Equal to the mean
283. Social scientists gather data from samples instead of populations because: Populations are
populations.
286. In the research process, theory: Attempts to explain the relationship between phenomena.
288. A hypothesis states, in part, that "income increases as education increases". In this statement,
289. In addition to saying that one case is different from another, the ordinal level of measurement
allows us to: Order categories from high to low. Say that one case is more or less than another.
290. In terms of the research process, a hypothesis is derived from and leads to .
Theory, observation
293. Which of the following questions would generate a continuous variable? How old are you?
294. Which of the following is a continuous variable? Time spent watching TV.
296. Choose the nominal level variable below: Eye color of students in statistics class.
4. A pie chart is an alternative to a bar graph providing a display of the frequency distribution of a
scale variable [ False ].
5. When we reject the null hypothesis for a chi-square test of independence, we are concluding
that there is a relation between the two variables [ True ].
6. When we reject the null hypothesis for a chi-square test of independence, we are concluding that
there is no relation between the two variables [ False ].
7. When we reject the null hypothesis for a chi-square test of independence, we conclude that the two
variables are related [ True ].
8. When we reject the null hypothesis for a chi-square test of independence, we conclude that the two
variables are not related [ False ].
9. When we reject the null hypothesis for a chi-square test of independence, we conclude that the two
variables are dependent [ True ].
10. Non-parametric statistics describe procedures that assume the population from which the sample
is drawn follows a normal distribution [ False ].
11. Non-Parametric statistics describe procedures that assume that the population from which the
sample is drawn follows a non-normal distribution [ True ].
12. Parametric statistics describe procedures that assume that the population from which the sample
is drawn follows a normal distribution [ True ].
13. Parametric statistics describe procedures that assume that the population from which the sample
is drawn follows a non-normal distribution [ False ].
14. A frequency distribution is a table that shows the number of data observations that fall into
specific categories [ True ].
16. Histograms displaying continuous data have gaps between their bars [ False ].
17. The null hypothesis for a chi-square test of independence always states that there is no association
between the two variables [ True ].
18. The null hypothesis for a chi-square test of independence always states that the two variables are
independent of one another [ True ].
19. The null hypothesis for a chi-square test of independence always states that the two variables are
not independent of one another [ False ].
20. Continuous data is often the result of measuring observations rather than counting them [ True ].
21. Continuous data are values based on observations that can be counted and are typically
represented by whole numbers [ False ].
22. The stem-and-leaf display is an appropriate method for graphing numerical data [ True ].
23. The stem-and-leaf display is an appropriate method for graphing categorical data [ False ].
24. A health and fitness club surveys 40 randomly selected members and found that the mean weight
of those questioned is 157 lb. It is an example of a statistic [ True ].
25. The average height of all freshmen entering college in a certain year is 68.4 inches. It is an example
of statistic [ False ].
26. A summary table indicates the frequency of items in a set of categories [ True ].
27. The variance of the birth weights of all babies born in Manama is an example of a parameter
[True].
28. The mean age of all people in Saudi Arabia is an example of a parameter [ True ].
29. Given that the 95% confidence interval of the mean of folic acid is (9.104,13.604) then the null
hypothesis H0: mean of folic acid =11 mcg cannot be rejects at 5% level of significance [ True ].
30. Following confidence of intervals 95%: (-13.34 : -1.23) are indicative of p value <0.05? [ True ].
31. A researcher is curious about the effect of sleep on students' test performances. He chooses 60
students and gives each two tests: one given after two hours' sleep and one after eight hours' sleep.
The test the researcher should use would be a related samples test [ True ].
34. A statistic is a value that describes a population characteristic while a parameter is computed from
a sample [ False ].
35. Responses to the question, How old are you? A represent a quantitative discrete variable [ False ].
36. One of the advantages of a bar chart is that it clearly shows that the total of all the categories of
GH & FJ. 50 نسألكم الدعاء
the bar chart adds to 100% [ False ].
37. Quantitative data are values that are categorical, describing a characteristic such as gender or
level of education [ False ].
39. You can use the Tukey-Kramer procedure to test for differences in pairs of means.
40. Suppose that the height is normally distributed among male teenagers. The independent-sample t-
test should be used to test the null hypothesis H1: mean height = 170 cm [ False ].
41.Suppose that the values of hemoglobin are not normally distributed among patients. If the
researcher would like to conduct a statistical test the median should be used instead of the mean
for the statistical test. [ True ].
42. Suppose you conduct a significance test for the population proportion and your p value is 0.184.
Given a 0.05 level of significance, then the condition is "Fail to accept Ho”. [ False ]
43. The central area under the normal curve within 3 standard deviation of the mean represents 75%
of the data. [ False ]
44. For a sample of 100 patients of which 65 are non- smokers the odds of smoking is given by 35/65. [
True ].
45. In a pilot study with a limited number of data you should use a non-parametric tests. [ True ].
46. A repeated measures test does not fit if you want to compare the performance of 2 particular
groups of athletes during 3 different follow up. [ False ]
48. If there is a small overlap between two groups, the effect size (d) will be relatively small. [ False ]
49. Each category of a pie chart occupies a slice of the pie that represents the cumulative relative
frequency of that category. [ False ].
50. The variance is used as a statistical measure to represent the center of normally distributed data
[ False ]
51. Analysis of variance is a technique used to conduct a hypothesis test to compare three or more
population propo(incomplete question) [ True ].
52. Hypothesis testing and estimation are the two key branches of the field of inferential statistics?
[ True ]
54. When p>05 is reported in a journal article that you read for an observed relationship, it means
that the author has rejected the null hypothesis (assuming that the author is using a significance or
alpha level of .05). [ False ]
55. Assuming innocence until “proven” guilty, a Type I error occurs when an innocent person is found
guilty. [ True ]
When the null hypothesis in the chi square test for independence is true, there
should be:
A. Little difference between the observed frequencies and the expected frequencies.
The mean on a standardized test is 100 and the standard deviation is 35. Your
score is 65. What percentage of the scores were higher than yours?
A. About 84%.
A dichotomous variable:
A. Is binary with 2 categories.
Which of the following would be used to describe the type of statistical methods
used to organize and describe the characteristics of a collection of data?
A. Statistics.
Which of the following would be used to describe the type of statistical methods
used to make inferences about the population based on a sample?
A. Inferential statistics.
Which of the following measures of central tendency is the midpoint for a set of
scores?
A. Median.
What is the most common type of average reported?
A. Mean.
To identify the point in a distribution at which 50% of scores fall above and
50% fall below a given score, which measure of central tendency would you
report?
A. Median.
The median is the score of the middle case when the number of cases in the data
set is which of the following?
A. An odd number.
What is the term associated with scores that are at the extreme ends of the
distribution?
A. Outliers.
In the formula for computing the mean, what does the letter X represent?
A. Individual scores.
In the formula for computing the mean, what does the letter "n" represent?
A. Sample size.
The average and which of the following can be used to describe the
characteristics of a distribution and show how distributions differ from one
another?
A. Variability.
Variability is a measure of how much each score in a group of scores differs
from what?
A. Mean.
When subtracting the largest number in a distribution from the smallest, what
type of variability is being calculated?
A. Range.
In the formula for computing the range, what does the letter "h" represent?
A. Highest score.
When calculating the standard deviation, what must be done in order to obtain
an unbiased estimate of the population?
A. Subtract 1 from n.
What will the sum of the deviations from the mean always equal?
A. 0.
which of the following are among the things to remember when creating
figures?
A. Maintain the scale in the graph.
Maintain the scale in the graph What is the most basic method of illustrating
data?
A. Frequency distribution.
When the left tail of a distribution's curve is longer than the right, what is this
called?
A. Negative skew.
When the right tail of a distribution's curve is longer than the left, what is this
called?
A. Positive skew.
What is the term associated with how flat or peaked a distribution appears?
A. Kurtosis.
Which of the following refers to a distribution’s curve that is relatively peaked
in comparison to a normal curve?
A. Leptokurtic.
If the mean of a set of scores is greater than the median, what can be said about
the distribution?
A. Positively skewed.
If the median of a set of scores is greater than the mean, what can be said about
the distribution?
A. Negatively skewed.
If you were interested in tracking your GPA during the time you are in
graduate school, which of the following might you use?
A. Line graph.
A correlation can range in value. Which of the following illustrates this range?
A. -1.0 to 1.0.
If variables change in the same direction, what type of correlation is this called?
A. Positive correlation.
When data points group together in a cluster from the lower left-hand side of
the xy axis to the upper right-hand side, what is this?
A. Positive slope.
If the coefficient of determination between two variables is. 0.81, what is the
Pearson correlation coefficient?
A. 0.90.
If you correlate scores from your test with some other valid measure that
assesses the same set of abilities, what type of validity evidence are you
collecting?
A. Criterion validity.
If I can say that my weekly statistics quiz fairly assesses the material covered,
what source of validity evidence should I have collected?
A. Content validity.
A score that you would actually record is an example of which of the following?
A. Observed score.
If the measures associates with a test are said to be consistent, you might
conclude that the measure is which of the following?
A. Reliable.
When we calculate reliability, we know the observes score. What are the two
unknown components of the reliability equation?
A. True and error scores.
Which of the following is the Greek letter associated with Cronbach's alpha,
sometimes known as coefficient alpha?
A. a.
What would the interrater reliability be for a 50-item measure in which the
number of agreements between Rater 1 and Rater 2 was 45?
A. 0.90.
When testing for the significance of the difference between two samples, which
is the proper assumption for step 1?
A. Samples are independent as well as random.
When random samples are drawn so that the selection of a case for one sample
has no effect on the selection of cases for another sample, the samples are:
A. Independent.
When testing for the significance of the difference between two sample means,
the null hypothesis is that?
A. The populations from which the sample was drawn are different.
When testing for the significance of the difference between two samples, the
null hypothesis states that the ___________ are the same.
A. Population means.
Rejection of the null hypothesis in the two-sample case implies that the:
A. Populations from which the samples are drawn are different on the trait of interest.
When conducting hypothesis tests for two sample means, the test statistic is:
A. The difference in sample means.
A pooled estimate:
A. Combines information from two samples.
For testing the difference between two sample means, the level of measurement
is assumed to be:
A. Interval-ratio.
In testing for the significance of the difference between two sample means when
the sample size is small, we may assume equal population variances if:
A. Sample sizes are approximately equal.
The ANOVA test is designed for dependent variables that have been measured
at:
A. The interval-ratio level.
What would be the most appropriate test of significance for research situations
in which we have an interval-ratio dependent variable and an independent
variable with three or more categories?
A. ANOVA.
Stated generally, the null hypothesis for the ANOVA test is:
A. μ1 = μ2 = μ3 = ... =μk.
We have rejected the null hypothesis in an ANOVA test. This means that the
variation _________ the samples is significantly greater than the variation
_________ the samples:
A. Between, within.
One limitation of ANOVA is that, when the null hypothesis is rejected, the test:
A. Does not tell us which sample mean(s) is/are different.
In an ANOVA test, when the null hypothesis is rejected, we know that at least
one of the means is significantly different from the others. In order to find out
which mean(s) are significantly different, we must conduct a:
A. Post hoc test.
Which assumption about level of measurement is made for the Chi square test?
A. All variables are nominal in level of measurement.
Unlike other tests of significance, Chi square easily handles situations in which?
A. The variables of interest have more than two categories or scores.
To calculate the expected frequencies for a chi square analysis, you need all but
which of the following?
A. Observed frequencies.
In the Chi square test for independence, the null hypothesis and the research
hypothesis:
A. Always contradict each other.
For the chi square goodness-of-fit test, the null hypothesis would be that:
A. The distribution of the variable is determined by random chance.
One limitation of the Chi square test (and all tests of hypothesis) is that they
cannot tell us if relationships between variables are:
A. Important.
Measures of central tendency (like the mean) and measures of dispersion (like
the range) provide information about distributions of scores which is:
A. Complementary.
The index of qualitative variation (IQV) is the ratio of the actual variation in
the scores to:
A. The maximum variation possible.
The index of qualitative variation (IQV) varies from 0.00 to 1.00. Which of the
IQV's below shows the greatest degree of homogeneity?
A. 0.25.
The distances between the scores and the mean are called:
A. Deviations.
On all normal curves the area between the mean and +-1 standard deviation
will be:
A. About 68% of the total area.
On all normal curves between the mean and +-2 standard deviations will be:
A. About 95% of the total area.
Assuming a normal distribution of 1000 cases, how many cases will be farther
away from the mean than +-3 standard deviations?
A. About 3.
If a z score is +1.00 then the value of the corresponding raw score would be:
A. Equal to the mean of the empirical distribution plus one standard deviation.
A repeated measures test does not fit if you want to compare the performance
of 2 particular groups of athletes during 3 different follow up.
A. False.
In addition to saying that one case is different from another, the ordinal level
of measurement allows us to:
A. Order categories from high to low.
B. Say that one case is more or less than another.
In terms of the research process, a hypothesis is derived from _______ and leads
to_______.
A. Theory, observation.
A hypothesis differs from a theory in that:
A. It is "testable".
Which of the following is NOT a nominal level variable? level of education zip
code occupation make of auto.
A. Zip code.
Data is the same thing as:
A. Information collected in numerical form.
The variable socioeconomic status ranges from upper class to lower class and
is an example of:
A. Ordinal level of measurement.
What data must be included in the columns of a frequency table for nominal
data?
A. Category and frequency.
When organizing a frequency table for a variable measured at the ordinal level,
categories should be ordered as:
A. Lowest to highest or highest to lowest.
The ________ and the ________are two of the most popular and useful methods
of standardizing for size and comparing distributions.
A. Proportion; percentage.
If there are 20 girls and 10 boys in a class, the ratio of boys to girls is calculated
as:
A. 10/20.
A comparison of the actual number of people who violate the speed limit to the
total number of drivers is an example of a:
A. Rate.
The ratio of men to women in a karate class is 3.3:1. If there are 100 women,
how many men are there?
A. 330.
City A (population = 1567 people) had 34 auto thefts last year. City B
(population = 34,567) had 40 auto thefts in the same time period and City C
(population = 156,980) had 70 auto thefts. Which city had the highest RATE of
auto theft?
A. City A.
Histograms and line charts or frequency polygons are used with data measured
at the:
A. Interval-ratio level.
For a single variable at the interval-ratio level, an appropriate graph would be:
A. A histogram.
A group of friends enjoys playing miniature golf together. Here are a set of final
scores from a recent outing: 39, 41, 44, 50, 54, 56, 56, 56, 61 What is the mode?
A. 56.
A group of friends enjoys playing miniature golf together. Here are a set of final
scores from a recent outing: 39, 41, 44, 50, 54, 56, 56, 56, 61 What is the median?
A. 54.
A group of friends enjoys playing miniature golf together. Here are a set of final
scores from a recent outing: 39, 41, 44, 50, 54, 56, 56, 56, 61 What is the mean?
A. 50.78.
The following values represent the amount of money (in dollars) 10 people have
in their wallets: 10, 12, 19, 5, 6, 8, 1, 13, 4, 7 What is the median?
A. 7.5.
The following values represent the amount of money (in dollars) 10 people have
in their wallets: 10, 12, 19, 5, 6, 8, 1, 13, 4, 7 What is the mean?
A. 8.5.
The following values represent the amount of money (in dollars) 10 people have
in their wallets: 10, 12, 19, 5, 6, 8, 1, 13, 4, 7 The sum of all deviations from the
mean is equal to:
A. 0 (always zero).
A distribution with a mean of 25, a median of 22.5, and a mode of 20, is said to
be:
A. Positively skewed.
During which stage does the Wheel of Science begin for a researcher
undertaking a new study?
A. Hypothesis.
In terms of the "Wheel of Science", statistics are central to the research process.
A. Only between the observation phase and the empirical generalization phase.
A public opinion poll that gauges the popularity of the President of the United
States is an example of:
A. Inferential statistics.
In addition to saying that one case is different from another, the ordinal level
of measurement allows us to:
A. Say that one case is more or less than another.
When using interval-ratio data, the distance between the scores is:
A. Exactly defined.
Which of the following can be treated as an interval-ratio variable?
A. Age.
Addition and subtraction are completely justified only when variables are?
A. Interval-ratio.
A researcher has numbered all 50 states from 1 to 50 and has calculated a mean
of 17.43 for the variable "state of birth".
A. Since the variable is nominal, the mean makes no sense.
A researcher has calculated the mean for a variable that is ordinal in level of
measurement.
A. This violation of level of measurement criterion is common and results should be
treated with caution.
Proportions and percentages, ratios and rates are all ways of expressing?
A. Relative frequencies.
To calculate proportion, the number of cases in any category (f) is divided by:
A. The number of cases in all categories (N).
If class intervals overlap with one another, there will be issues of:
A. Categories not being mutually exclusive.
Open-ended intervals:
A. Can be useful when there are very high or very low scored in a distribution.
For a single variable at the interval-ratio level, an appropriate graph would be:
A. A histogram.
The graphical presentation method that uses midpoints rather than real limits
is a:
A. Line chart.
The mode should be used to measure central tendency when the level of
measurement of the variable is:
A. Nominal.
For ordinal level variables, the most appropriate measure of central tendency
is generally:
A. The median.
To locate the 6th decile, we would first multiple the number of cases (N) by:
A. 0.60.
If you subtracted the mean from each score in a distribution, squared the
differences, and then added the squared differences, the sum would be:
A. A minimum.
The sum of the differences between the scored of a distribution and the mean
of the scores is always:
A. 0.
If you subtracted the mean from each score in a distribution and added the
results, the sum would be:
A. Zero.
In a campaign debate over the level of prosperity in the state, the incumbent
governor says: "average income is $53,000." His opponent responds "the
average citizen earns only $38,000." Both statements are true. This is possible
because:
A. The governor is using the mean.
To compute a mean for scores that have been grouped into frequency
distribution, we assume that the scores in each interval are:
A. Located at the midpoint of the interval.
To compute a median for scores that have been grouped into a frequency
distribution, we assume that the scores in each variable are:
A. Spaced evenly across the interval.
If one has data measured at the interval-ratio level, they can use which measure
of central tendency?
A. You can use any measure on interval-ratio data.
When interval-ratio data are badly skewed, the appropriate measure of central
tendency is the:
A. Mean.
Income distributions almost always have a positive skew (since there are only a
very few wealthy people). Therefore, the preferred measure of central tendency
for this variable would be the:
A. Median.
When you subtract the value of the mean from each score in a distribution and
then add the results, the sum will be:
A. 0.
If the variance of a distribution is 16, the mean is 12, and the number of cases
is 24, the standard deviation is:
A. 4.
The sum of the squared deviations around the mean for a distribution of 10
scores is 1000. The standard deviation:
A. Is 10.
As the distribution of scores becomes more variable, the value of the standard
deviation:
A. Increases.
A box plot uses all but which of the following to present dispersion?
A. Standard deviation.
The lower limit for the standard deviation is 0; the upper limit is:
A. Undefined; there is no upper limit.
If a Z score is 0, then the value of the corresponding raw score would be:
A. The same as the mean of the empirical distribution.
If a Z score is +1.00, then the value of the corresponding raw score would be:
A. Equal to the mean of the empirical distribution plus one standard deviation.
The Z score table gives the area between a score and the mean. For a Z score of
-100, that area (in percentages) is:
A. 34.13%.
Column c in the normal curve table lists "areas beyond Z". This is the area:
A. Above a positive Z score.
In a distribution of 150 test scores, the mean grade was an 82 and the standard
deviation was 8. If a student scored a 93, what would their equivalent Z score
be?
A. 1.38.
The area between the mean and a Z score of +1.50 is 43.32%. This score is
higher than ____________ of the scores in the distribution.
A. 93.32%.
The mean score on a final chemistry exam was 75, and the standard deviation
of the scores was 5. If the distribution is normal and your score was 70, what
percentage of the scores was lower than yours?
A. 15.87%.
To find the area above a positive Z score or below a negative Z score you would:
A. Use the "Area Beyond Z" column of the Z score table.
To obtain the area below a positive Z score or above a negative Z score you
would:
A. Add the area between the Z score and the mean to 50%.
As used in the social sciences, probabilities are a type of __________ which can
vary from.
A. Proportion, 0.00 to 1.00.
To obtain the area below a positive Z score or above a negative Z score you
would:
A. Add the area between the Z score and the mean to 50%.
The Z scores of two tests are +1.2 and +1.5. To obtain the area between these
scores:
A. Find the area between each score and the mean in the Z score table and then subtract
the smaller area from the larger area.
The area between a negative Z score and a positive Z score can be found by:
A. Adding the areas between each Z score and the mean.
The Z scores of two test score are -1.17 and +2.38. To find the total area between
these two scores:
A. Add the column c areas.
If a case is randomly selected from a normal distribution, the score of the case
will most likely be:
A. Close to the mean in value.
When a list of the population does not exist, the probability sampling technique
most commonly used is:
A. Cluster.
The sampling distribution links the _________ to the __________.
A. Sample, population.
Unlike the sample and population distributions, the sampling distribution is:
A. Theoretical.
If you drew all possible samples from some population, calculated the mean
for each of the samples, and constructed a line graph (showing the shape of
the distribution) based on all of those means, what would you have?
A. A sampling distribution
The analysis of variance is a statistical test that is used to compare how many
group means?
A. Two or more
What is the standard deviation of a sampling distribution called?
A. Standard error
Hypothesis testing and estimation are the two key branches of the field of
inferential statistics?
A. True
When p<.05 is reported in a journal article that you read for an observed
relationship, it means that the author has rejected the null hypothesis
(assuming that the author is using a significance or alpha level of .05).
A. True
When p>05 is reported in a journal article that you read for an observed
relationship, it means that the author has rejected the null hypothesis
(assuming that the author is using a significance or alpha level of .05).
A. False
_________ are the values that mark the boundaries of the confidence interval.
A. Confidence limits
_____ results if you fail to reject the null hypothesis when the null hypothesis
is actually false.
A. Type II error
The car will probably cost about 16,000 dollars; this number sounds more like
a(n):
A. Point estimate
A ________ is a range of numbers inferred from the sample that has a certain
probability of including the population parameter over the long run.
A. Confidence interval
When the researcher rejects a true null hypothesis, a ____ error occurs.
A. Type I
A. A follow-up test to the analysis of variance when there are three or more groups
The use of the laws of probability to make inferences and draw statistical
conclusions about populations based on sample data is referred to as
___________.
A. Inferential statistics
The cutoff the researcher uses to decide whether to reject the null hypothesis
is called the:
Which ____ percent confidence interval will be the widest (i.e., the least
precise) for a particular data set that includes exactly 500 cases?
A. 99%
As sample size goes up, what tends to happen to 95% confidence intervals?
A. Both a and b (They become more precise, they become more narrow)
t-test
A. The t-test is a statistical hypothesis test in which the test statistic follows a student's t
distribution under the null hypothesis (usually used when there is a normal
distribution
Population mean x̄
A. Test value
Independent t-test
A. Comparing two groups
Homogeneity of variance
A. An assumption underlying both t-tests and f-tests in which the population varies or
two or more samples are equal
Imagine you conduct a t-test using IBM SPSS and the output reveals that
Levene’s test for equality of variance is significant. What should you do?
(Hint: Levene’s test tests the assumption that variances indifferent groups are
approximately equal.)
A. Interpret the figures in the row labelled 'equal variances not assumed'.
For what is the 'variable view' in IBM SPSS's data editor used?
A. Defining characteristics of variables.
Which of the following best describes the variable 'Gender'?
A. All of the possible answers are correct.
What should Levene's test be in order to use the 'equal variance assumed' row
of a t-test?
A. It should be non-significant (P > 0.05)
A poll is done to estimate the proportion of adult Americans who like their jobs.
The poll is based on a random sample of 400 individuals. What is the
"conservative" margin of error of this poll?
A. 0.05
QUESTION 2
1. In a study of the relationship between geographical mobility (number of times a person
has changed residences) and number of friends, Pearson's r² is reported as .40. Which of
the following would be a correct interpretation?
QUESTION 3
1. In order to conduct a test of hypothesis with means or proportions with parametric
analysis, the sampling distribution must be
large
skewed
bimodal
normal
QUESTION 4
1. define the correct statement:
the box plot defines that smokers have identical mean of age than no smokers
the box plots define that smokers have a lower mean of age than no smokers
smokers are less than no smokers in number.
none of them
QUESTION 5
1. A researcher is preparing a report and wants to select a measure of central tendency that
shows the most common score in a particular distribution. Which statistic should she
select?
median
mean
none of the above
mode
QUESTION 6
1. A contingency table in which one variable had three categories and the other variable
had four categories would have
four cells
six cells
nine cells
twelve cells
QUESTION 7
1. The sampling distribution for the ANOVA test is
the Z distribution
the F distribution
the t distribution
QUESTION 8
1. “As the temperature rises, the crime rate increases.”
If the temperature would remain stable, the crime rate would stay the same
QUESTION 9
1. A researcher asked a sample of dual career families about the percentage of the family
budget contributed by the wife's job (Y) and the total number of children (X). Pearson's r
for this relationship is -0.34. Which of the following is an appropriate interpretation of
these results? (This model assumes part time job for the wife)
For every additional child, the wife must work longer hours
QUESTION 10
1. In according to this figure and the available analysis,
there is normality
QUESTION 11
1. A study of traffic safety show a correlation of 0.57 between average speed of traffic and
number of fatal accidents for a particular stretch of highway. This means that
every 1 mile per hour increase in average speed increases the number of traffic
fatalities by .57.
QUESTION 12
1. If a researcher summarizes the age of 1,000 people by calculating the average
age, she is using _______-
an ordinal categorical variable
a qualitative technique
a categorical variable
a numerical variable
QUESTION 13
1. The three commonly used measures of central tendency are
mode, median, and mean
QUESTION 14
1. In according to this figure
there is a positive association
none of them
QUESTION 15
1. If you wanted to measure the association between the age of people (in years) and the
number of arrests for driving while intoxicated, the most appropriate measure of
association would be
phi
gamma
Pearson's r
lambda
QUESTION 16
1. What would be the most appropriate test of significance for research situations in which
we have an interval-ratio dependent variable and an independent variable with three or
more categories?
the t test
ANOVA
regression
Chi square
QUESTION 17
1. In according to this analysis, what is the age estimated in one patient with WBC values of
1
81.56
-0.093
0.
81.65
QUESTION 18
1. If a = 2 and b = 4, what would the value of Y be when X
= 1?
QUESTION 19
1. Levene's test is used to test if 2- samples have equal variances in which contest?
in chi square
in independent t test
in descriptive data
QUESTION 20
1. In according to the following analysis, select the correct statement:
there is a statistically significant difference in age, between in gender (female and
male
there is a statistically significant difference in gender, for mean of female
numerical variable
another % variable
none of them
QUESTION 22
1. When conducting hypothesis tests for paired t test two sample means, the test
statistic is
alpha
QUESTION 23
1. If the scores on a variable are 11, 14, 18, 20, and 25, the median is
23
19
11 and 14
18
QUESTION 24
1. The PIE charts are used for showing
correlation
frequencies
QUESTION 25
1. If a regression line is parallel to the horizontal axis of the scattergram, the
slope (b) will be
-1.00
1.00
0.00
0.45
QUESTION 26
1. If a = 5 and b = 10, what would the value of Y be when X = 100?
5001
1000
1005
QUESTION 27
1. From a university population, random samples of 145 men and 237 women have been
asked if they have ever cheated in a college class. 8% of the men and 6% of the women
said that they have. What is the appropriate test to assess the significance of this
difference?
test for the significance of the difference between two sample proportions, matched
samples
test for the significance of the difference between two sample proportions, large
samples (chi square)
test for the significance of the difference between two sample proportions, small
samples
test for the significance of the difference between two sample means, large samples
QUESTION 28
1. When would you use the Tukey-Kramer procedure?
QUESTION 29
1. The relationship between two variables partialling out the effect that a third variable has
on both variables can be expressed using a ------------- correlation
linear
partial
descriptive
impartial
QUESTION 30
1. For a single variable measured at the categorical nominal level, an appropriate
graph would be a
frequency polygon
pie chart
bivariate table
histogram
QUESTION 31
1. To conduct a chi square test, the variables must first be organized into a________-
frequency distributions
univariate table
contingency table
QUESTION 32
1. In according to the following tables, which test has been applied?
before...
after....
INDEPENDENT T TEST
REPEATED MEASURES
ANCOVA
ANOVA
QUESTION 33
1. What measure of central tendency would be most appropriate to summarize data about
the color of movie star's eyes?
median
mean
mode
QUESTION 34
1. A researcher tested the relationship between age and level of vitamin B12 in blood
and found a coefficient of correlation equal to r= - 0.05. This relationship is
weak
strong
positive
QUESTION 35
1. Suppose in Kolmogorov-Smirnov (KS) test, the p-value is computed to be 0.034.
Then the test is significant if p-value is less than 0.05.
QUESTION 36
1. Random samples of 1546 men and 1678 women have been given a scale that measures
support of legal abortion. Men average 12.45 and women average 12.46 and the
difference is significant at the 0.05 level. What can we conclude?
Because of the large sample sizes, these results may be statistically significant but
trivial.(not completely)
The difference should be re-tested at a higher alpha level
QUESTION 37
1. with the command "select cases" in SPSS, we can________
select the percentage of people to enroll
split file
QUESTION 38
1. In according to this table below, define the RANGE
62
834
80.92
QUESTION 39
1. The distances (spread) between the scores and the mean are called
differences
ranges
summations
deviations
QUESTION 40
1. For the relationship between social class and movie attendance, a
researcher found a r Pearson correlation coefficient of - 0.45. This
relationship is_____________
a) Quantitative continuous
b) Quantitative discrete Statistics
c) Qualitative nominal AGE
d) Qualitative ordinal N Valid 20
The most frequent age of the patients is Missing 0
Mean 4.70
a) 3.
b) 4.5. Median 4.50
c) 5. Mode 5.00
d) 4.6. Std. Deviation 2.23
Percentiles 25 3.00
The balance age of the patients is
50 4.50
a) 3. 75 7.00
b) 4.5.
c) 6.
d) 4.7.
The median age of the patients is
a) 3.
b) 4.5.
c) 6.
d) 4.6.
The variance is
a) 2.23.
b) 4.97.
c) 4.7.
d) 1.49.
What can we say about the shape of this distribution by looking at the output?
1
3) The population mean μ is called:
5) The midpoint of the values after they have been ordered from the smallest to
the largest or the largest to the smallest is called:
a) 40 b) 50 c) 60 d) (60 – 40) / 2
a) Q1 = 50 b) Q1 < 50 c) Q1 > 50 d) Q1 ≠ 50
13) If the values of mean, median and mode coincide in a unimodel distribution, then the distribution
will be:
2
14) A curve that tails off to the right end is called:
15) If a set of data has one mode and its value is less than mean, then the distribution is called:
16) Determine whether the approximate shape of the distribution in the histogram is
17) A survey question which asks whether or not the respondent is “for” or “against” a proposal
represents
a) a continuous variable
b) a discrete variable
c) a numerical variable
d) a categorical variable
18) Use the box-and-whisker plot below to determine which statement is accurate.
a) One half of the cholesterol levels are between 180 and 211.
b) One half of the cholesterol levels are between 180 and 197.5.
c) About 25% of the adults have cholesterol levels of at most 211.
d) About 75% of the adults have cholesterol levels less than 180.
3
20) The branch of statistics that uses sample data to draw conclusions about an entire population is
known as:
21) Which measure of central tendency can be used for both numerical and categorical variables?
22) Which of the following statements about the median is NOT true?
23) It is possible that a distribution of data may not actually contain one of the measures of central
tendency. Which one might that be?
24) The smaller the spread or variation of scores around the mean:
27) Suppose that, starting with the original data set, the smallest measurement was removed. Which
measures of central tendency would be change from those of the original data set?
a) Mean
b) Median
c) Mode
d) None of these measures
28) The smaller the variance the less variation of the data around the mean
True / False
4
29) Here is a stem and leaf plot of the scores of Bob the Bowler’s last 18 bowling games, made by
SPSS.
Stem-and-leaf of Bowling N = 18
Leaf Unit = 1.0
1 9 4
1 10
1 11
2 12 8
4 13 12
7 14 346
(6) 15 147799
5 16 01445
30) In case there are too many outliers in the data set, the most representative average value is
a) Mean
b) Median
c) Mode
d) None of these measures
31) Since mode is the most frequently occurring score, it can be determined directly from a frequency
distribution or a histogram
True / False
5
32) The relationship among the mean, median and mode for the above boxplot .
a) is that they are all equal. b) is that the mean is the largest value.
c) is that the mean is the smallest value. d) Can't be determined.
a) The scores are not widely distributed, and the median would be an unreliable measure of central
tendency.
b) All of the measures of central tendency would be reliable
c) Scores are not widely distributed, and the mean is a reliable measure of central tendency
d) Scores are widely distributed and that the mean may not be reliable measure of central
tendency
e) Scores are not normally distributed
34) What type of distribution is observed when most of the scores cluster around the lower end of the
scale?
a) A flat distribution
b) A related distribution
c) A negatively skewed distribution
d) A positively skewed distribution
e) A normal distribution
35) Which measure is the most unreliable indicator of central tendency if data are Skewed Distribution?
a) Median
b) Mode
c) Range
d) Mean
36) In meteorology, the intensity of UV-B radiation is classified as follows: weak, moderate, strong,
very strong, extreme. What is the type of this data?
a) Categorical ordinal.
b) Discrete numerical.
c) Categorical nominal.
d) Continuous numerical.
37) Which central values can be used in the case of nominal categorical variables?
38) Which central tendency fits for characterizing random variables measured on ordinal scale?
6
39) The variable is a
7
46) The ordinal variable is a
50) The y-axis in the graph of the histogram may represents the
a) Frequency.
b) Relative frequency.
c) Frequency percent.
d) either a), b) or c)
a) Qualitative variable.
b) Variable with no mode.
c) Variable takes on values within intervals.
d) Variable with a specific number of values.
e) Variable which cannot be measured.
8
53) The nominal variable is a
a) Descriptive statistics
b) Numerical measures
c) Statistical charts
d) Inferential statistics
56) Below is the boxplot for the number of hours. The distribution of the number of hours is
a) Bimodal
b) right skewed
c) symmetrical
d) left skewed
a) A bar chart typically has gaps between the bars while a histogram has no gaps
b) There is no difference between these two graphical displays.
c) The bars in a bar chart may be of various widths while the bars of a histogram are all the
same width.
d) The bars in a bar chart are all the same width while the bars of a histogram may be of various
widths.
a) A continuous variable
b) A discrete variable
c) A ordinal variable
d) A categorical variable
9
59) If the graph of a distribution of data shows that the graph is skewed to the right, then the
a) Variance
b) Standard deviation
c) Mode
d) Range
61) Which measure of central tendency can be used for both numerical and categorical variables?
a) Median
b) Mean
c) Mode
d) First quartile
a) standard deviation
b) third quartile
c) skewness
d) kurtosis
a) Variance
b) Median
c) Standard deviation
d) Range
64) The cholesterol levels of a group of adults the day after Eid is
a) discrete variable
b) qualitative variable
c) independent variable
d) continuous variable
65) A statistic is a value that describes a population characteristic while a parameter is computed
from a sample.
False / True
66) As a rule, a value is considered an extreme value if its Z score is greater than 3.
True / False
10
67) In right-skewed distributions, which of the following is the Correct statement?
73) Biostatistics 's data obtained from medicine and biological science
False / True
11
75) We can classify the qualitative variable to discrete and continuous variable.
False / True
76) How would you use the drop-down menus in SPSS to generate a frequency table?
a) Open the Output Viewer and click: Save As; Pie Chart
b) Click on: Analyze; Descriptive Statistics; Frequencies
c) Click on: Graphs; Frequencies; Pearson
d) Open the Variable Viewer and recode the value labels
77) Why might you tell SPSS to represent the "slices" of a pie chart in different patterns?
78) How would you print a bar chart that you have just produced in SPSS?
a) In Output Viewer, click File, Print, select the bar chart and click OK
b) In Variable Viewer, open bar chart, click File, Print, OK
c) In Chart Editor, click Descriptive Statistics, Print, OK
d) In Data Editor, open Graphs dialog box, click Save, OK
12