Fine DataClassification

How many classes (target classes in the dataset) will the following command return?

classes=list(iris['species'].unique())
3
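
The count can be checked on a small stand-in for the iris data. This is only a sketch: the toy frame below is an assumption, built by hand with a 'species' column holding the three usual labels, not the real dataset.

```python
import pandas as pd

# Toy stand-in for the iris data (assumption: a 'species' column
# with three distinct labels, as in the usual iris dataset).
iris = pd.DataFrame({
    "sepal_length": [5.1, 7.0, 6.3, 4.9, 6.4, 5.8],
    "species": ["setosa", "versicolor", "virginica",
                "setosa", "versicolor", "virginica"],
})

# unique() keeps each label once, in order of first appearance.
classes = list(iris["species"].unique())
n_classes = len(classes)
```

Here n_classes is the number of target classes, and classes lists them by name.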

A process used to identify data points that are simply unusual


Anomaly detection

Ensemble learning is used when you build component classifiers that are more
accurate and independent from each other.
True

Select pre-processing techniques from the options (check box)


All the options

Ordinal variables have


clear logical order

Which command is used to select all NUMERIC types in the dataset?


select_dtypes(include=[numpy.number])
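
A minimal sketch of selecting numeric columns with pandas, assuming a hand-made frame (the column names are illustrative, not taken from the quiz dataset):

```python
import numpy as np
import pandas as pd

# Hypothetical frame with two numeric columns and one text column.
df = pd.DataFrame({
    "sepal_length": [5.1, 7.0],
    "petal_width": [0.2, 1.4],
    "species": ["setosa", "versicolor"],
})

# Keep only columns whose dtype is a NumPy numeric type.
numeric = df.select_dtypes(include=[np.number])
numeric_columns = list(numeric.columns)
```

The text column is dropped, and the numeric columns keep their original order.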

What does the command iris['species'].value_counts() return?


The count of each unique value in the iris['species'] column
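
As a small illustration, assuming a toy 'species' column with deliberately unequal class sizes (not the real iris data):

```python
import pandas as pd

# Toy column: 3 setosa, 2 versicolor, 1 virginica (made-up counts).
iris = pd.DataFrame({
    "species": ["setosa", "setosa", "setosa",
                "versicolor", "versicolor", "virginica"],
})

# value_counts() returns one row per distinct label with its frequency,
# sorted from most to least frequent.
counts = iris["species"].value_counts()
counts_as_dict = counts.to_dict()
```

The same output is a quick way to spot class imbalance, since unequal counts show up directly.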

Imputing is a strategy to handle


Missing values
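
One common imputing strategy is to replace each missing entry with the column mean. The sketch below assumes a made-up 'age' column. Mean imputation is only one option among several (median, most frequent value, model-based).

```python
import pandas as pd

# Hypothetical column with one missing entry.
df = pd.DataFrame({"age": [20.0, None, 40.0]})

# Mean of the observed values (missing entries are skipped by default).
mean_age = df["age"].mean()

# Fill the gap with that mean.
filled = df["age"].fillna(mean_age)
```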

Is there a class imbalance problem in the given data set?


Yes

A classifier that can compute using numeric as well as categorical values is


Decision Tree Classifier

Which type of cross-validation is used for an imbalanced dataset?


Stratified Shuffle Split
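
A sketch of how stratification preserves the class ratio, using scikit-learn's StratifiedShuffleSplit on synthetic labels (the 90/10 split and the parameter values are assumptions chosen for illustration):

```python
import numpy as np
from sklearn.model_selection import StratifiedShuffleSplit

# Synthetic imbalanced labels: 90 samples of class 0, 10 of class 1.
X = np.arange(100).reshape(-1, 1)
y = np.array([0] * 90 + [1] * 10)

splitter = StratifiedShuffleSplit(n_splits=3, test_size=0.2, random_state=0)

# Count how many minority samples land in each 20-sample test fold.
minority_in_test = []
for train_idx, test_idx in splitter.split(X, y):
    minority_in_test.append(int(y[test_idx].sum()))
```

Each test fold keeps the 10% minority share, so no fold ends up without minority samples.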

To view the first 3 rows of the dataset, which of the following commands is used?
iris.head(3)

Imagine you have just finished training a decision tree for spam classification and
it is showing abnormally bad performance on both your training and test sets. Assume
that your implementation has no bugs. What could be the reason for this problem?
All the options

Let's assume you are solving a classification problem with highly imbalanced
classes. The majority class is observed 99% of the time in the training data. Which of
the following is true when your model has 99% accuracy after taking the predictions
on test data?
Accuracy is not a good metric for imbalanced class problems
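
A small sketch of why 99% accuracy can be uninformative in this setting. It assumes a trivial model that always predicts the majority class, with made-up labels in a 99:1 ratio:

```python
# 99 majority samples (label 0) and 1 minority sample (label 1).
y_true = [0] * 99 + [1]

# A "model" that ignores its input and always predicts the majority class.
y_pred = [0] * 100

# Fraction of predictions that match the true label.
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Recall on the minority class: detected minority cases / actual minority cases.
true_positives = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
recall = true_positives / sum(y_true)
```

Accuracy comes out at 0.99 while minority-class recall is 0.0, which is why metrics such as recall, precision, or F1 are usually preferred for imbalanced data.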

Which of the following is not a technique to process missing values


none

Model Tuning helps to increase the accuracy


True

Choose the correct sequence for classifier building from the following:
initialize - train - predict - evaluate

Email spam detection is an example of
supervised classification
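
As a rough sketch, spam detection learns from labelled examples and can follow the initialize - train - predict - evaluate sequence. The features below (count of the word "free", number of links) and all values are made up for illustration:

```python
from sklearn.metrics import accuracy_score
from sklearn.tree import DecisionTreeClassifier

# Hypothetical labelled training data: [count of "free", number of links].
X_train = [[0, 0], [1, 0], [0, 1], [8, 5], [9, 4], [7, 6]]
y_train = ["ham", "ham", "ham", "spam", "spam", "spam"]

# Held-out examples with known labels.
X_test = [[1, 1], [8, 6]]
y_test = ["ham", "spam"]

# 1. Initialize the classifier.
model = DecisionTreeClassifier(random_state=0)

# 2. Train on the labelled data.
model.fit(X_train, y_train)

# 3. Predict labels for unseen data.
predictions = list(model.predict(X_test))

# 4. Evaluate against the true labels.
score = accuracy_score(y_test, predictions)
```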

A technique used to depict the performance in a tabular form that has 2 dimensions
namely “actual” and “predicted” sets of data.
confusion matrix
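
A minimal hand-rolled sketch of such a table, assuming made-up spam/ham labels. Rows are actual classes and columns are predicted classes:

```python
from collections import Counter

# Made-up actual and predicted labels for six messages.
actual = ["spam", "spam", "ham", "ham", "ham", "spam"]
predicted = ["spam", "ham", "ham", "ham", "spam", "spam"]

labels = ["ham", "spam"]

# Count each (actual, predicted) pair.
pair_counts = Counter(zip(actual, predicted))

# matrix[i][j] = number of samples with actual labels[i], predicted labels[j].
matrix = [[pair_counts[(a, p)] for p in labels] for a in labels]
```

The diagonal holds correct predictions and the off-diagonal cells hold the two kinds of error.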

The commonly used package for machine learning in Python is


sklearn

Which classifier converges easily with less training data?


naive bayes classifier

Which of the given hyperparameter(s), when increased, may cause random forest to
overfit the data?
Depth of the trees

Images, documents are examples of


unstructured data

Pruning is a technique associated with


decision trees

How many new columns does the following command return?

iris_series = pd.get_dummies(iris['Species'])
3
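
The column count can be checked on a toy 'Species' column with three distinct labels (an assumption standing in for the real data):

```python
import pandas as pd

# Toy column with three distinct species labels.
iris = pd.DataFrame({
    "Species": ["setosa", "versicolor", "virginica", "setosa"],
})

# Default behaviour: one indicator column per distinct label.
iris_series = pd.get_dummies(iris["Species"])
n_default = iris_series.shape[1]

# With drop_first=True the first level is dropped, leaving k - 1 columns.
n_drop_first = pd.get_dummies(iris["Species"], drop_first=True).shape[1]
```

With three species, the default call yields three columns. Two columns result only when drop_first=True is passed.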

Supervised learning differs from unsupervised learning in that supervised learning requires


labelled data

Cross-validation causes over-fitting


False

Can we consider sentiment classification as a text classification problem?


yes

Cross-validation technique will provide accurate results when the training set and
the testing set are from two different populations
False
