End Term Question Paper - BA1 - Term V Batch 2020-22
All questions are multiple-choice and carry 2 marks each.
1. Binary logistic regression is used when your dependent variable is
a) two class categorical variable
b) continuous variable
c) three class categorical variable
d) multiclass categorical variable
3. If df is a pandas DataFrame, what will be the output of the df.head() command in Python?
a) It will show the top 5 rows of the dataframe
b) It will show the first 5 columns of the dataframe
c) It will show the last 5 rows of the dataframe
d) It will show the last 5 columns of the dataframe
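For reference, the behaviour question 3 asks about can be checked with a minimal sketch. The 8-row frame below is hypothetical and invented only for illustration; it assumes head() is called with its default argument.

```python
import pandas as pd

# Hypothetical 8-row frame, invented for illustration only.
df = pd.DataFrame({"A": range(8), "B": list("abcdefgh")})

# With no argument, head() uses n=5 and keeps every column.
top = df.head()
print(top)
```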
4. The dependent variable in a classification model should be of which of the following data types?
a) Numeric
b) Factor
c) Both a) and b)
d) None of the above
5. If you have a DataFrame df and have imported the pandas module as pd in Python, what will be the output of the following command?
pd.describe(df)
a) It will show the summary statistics of all the variables
b) It will show the summary statistics of numeric variables
c) It will show the summary statistics of categorical variables
d) None of the above
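The command in question 5 can be explored with a short sketch. The frame and its column names are hypothetical; the sketch only checks whether pandas offers a module-level describe and shows what the DataFrame method of the same name covers by default.

```python
import pandas as pd

# Hypothetical frame with one numeric and one text column, invented for illustration.
df = pd.DataFrame({"age": [21, 35, 48], "city": ["Pune", "Delhi", "Pune"]})

# Check whether the pandas module itself exposes a describe function.
has_top_level = hasattr(pd, "describe")

# describe() as a DataFrame method; by default it summarises numeric columns only.
summary = df.describe()

print(has_top_level)
print(list(summary.columns))
```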
6. Which of the following commands is correct in Python if the numpy module is already installed?
a) import numpy as np
b) imports numpy as np
c) library(numpy)
7. A DataFrame named df has 5 columns named A, B, C, D and E, in that order. What is the output of the following command?
df[df.columns[3]]
a) A b) B c) C d) D e) E
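The selection in question 7 can be reproduced with a minimal sketch. The single row of values is hypothetical; only the column names come from the question. Note that df.columns is indexed from zero.

```python
import pandas as pd

# Column names follow the question; the row of values is invented for illustration.
df = pd.DataFrame([[1, 2, 3, 4, 5]], columns=list("ABCDE"))

# df.columns[3] is the label at zero-based position 3; df[...] returns that column as a Series.
selected = df[df.columns[3]]
print(selected.name)
```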
8. If your data has outliers, which of the following methods can you use to handle them?
i) Remove the variable from the table which contains the outlier
ii) Replace the outlier value with the mean value of the variable
iii) Replace lower outliers (<5th percentile) with the value at the 5th percentile, and higher outliers (>95th percentile) with the value at the 95th percentile.
a) i & ii
b) i, ii & iv
c) ii, iii & iv
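Method iii) in question 8 is commonly called percentile capping (winsorizing). A minimal sketch of it follows, assuming NumPy and a hypothetical sample invented for illustration; the 5th and 95th percentile cut-offs are the ones named in the question.

```python
import numpy as np

# Hypothetical sample with one low and one high extreme value.
values = np.array([1.0, 12.0, 13.0, 14.0, 15.0, 16.0, 17.0, 18.0, 19.0, 200.0])

# Cut-offs at the 5th and 95th percentiles of the sample.
low, high = np.percentile(values, [5, 95])

# Values below `low` become `low`; values above `high` become `high`.
capped = np.clip(values, low, high)
print(capped)
```

The number of observations is unchanged; only the extreme values move to the cut-offs.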
a) Scatter plot
b) Bar graph
c) Boxplot
d) Pie chart
11. You can check whether a Python object has missing values with the _________ function.
a) pandas.is.nullobj()
b) pandas.isna()
c) pandas.missing()
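As an illustration of missing-value detection in pandas, the sketch below applies pandas.isna to a scalar, to None, and to a small hypothetical Series invented for this example.

```python
import numpy as np
import pandas as pd

# Scalars: both NaN and None count as missing.
scalar_missing = pd.isna(np.nan)
none_missing = pd.isna(None)

# Array-like input returns an element-wise boolean mask.
mask = pd.isna(pd.Series([1.0, None, 3.0]))
print(mask.tolist())
```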
a) Hello Earth
b) hello earth
c) Hello earth
d) None of the above
a) d = {}
b) d = {"john":40, "peter":45}
c) d = {40:"john", 45:"peter"}
d) d = (40:"john", 45:"peter")
15. Mean imputation is not the correct missing-value imputation method for which of the following cases?
16. The regression equation for predicting number of speeding tickets (Y) from information about
driver age (X) is Y = 0.065(X) + 5.57. How many tickets would you predict for a 20-year-old driver?
A. 6.87
B. 4.27
C. 5.57
D. 1
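The prediction in question 16 is a direct substitution of X = 20 into the stated equation. A minimal sketch of that arithmetic follows; the function name predicted_tickets is hypothetical and chosen only for this example.

```python
def predicted_tickets(age: float) -> float:
    """Evaluate the question's equation Y = 0.065 * X + 5.57 for a given driver age."""
    return 0.065 * age + 5.57


# Substitute X = 20: 0.065 * 20 = 1.3, then add the intercept 5.57.
tickets = predicted_tickets(20)
print(round(tickets, 2))
```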
a) TRUE
b) FALSE
19. When data are normally distributed, which missing-value imputation technique is considered better?
a) Median imputation
b) Mode imputation
c) Mean imputation
20. Using regression analysis for forecasting, an R-squared of 0.15 suggests that we can: